Ab Initio Quiz

Ab Initio Beginner Quiz

This online quiz covers beginner topics in Ab Initio, the ETL tool from Ab Initio Software LLC. Answer letters refer to the options in order: A is option 1, B is option 2, and so on.



1) What is the syntax for a local variable declaration?

  1. Let int x;
  2. let int x = 0
  3. Int x;
  4. Int x=0
Answer : B

2) Which component allows a change in input and output port structures?

  1. Reformat
  2. Redefine
  3. Replicate
  4. Both a and b
Answer : D

3) Input Table unloads table records into the Ab Initio graph

  1. True
  2. FALSE
Answer : A

4) Lookup file represents one or more serial files or a multifile of data records small enough to be held in main memory

  1. True
  2. FALSE
Answer : A

5) Ceiling Returns the smallest integer value greater than or equal to a specified value

  1. True
  2. FALSE
Answer : A
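
The ceiling rule in question 5 can be illustrated with a short Python sketch. It uses Python's `math.ceil` as a stand-in rather than the Ab Initio function itself, and the sample values are made up for the example.

```python
import math

# Hypothetical sample values showing the "smallest integer >= value" rule.
samples = [2.1, 2.0, -2.1]

# math.ceil is Python's ceiling; it only stands in for the function in the question.
ceilings = [math.ceil(v) for v in samples]

print(ceilings)  # [3, 2, -2]
```

Note that a value that is already an integer is returned unchanged, and negative values round toward zero.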

6) Broadcast combines the records arbitrarily into a single flow.

  1. True
  2. FALSE
Answer : B

7) Read from and write to the same file in a graph is possible

  1. True
  2. FALSE
Answer : A

8) What is the functionality of the ddl_to_dml function?

  1. Converts a database column into a near-equivalent DML type
  2. Converts a column definition to the corresponding DML type definition
Answer : B

9) Which component will change the order of fields

  1. Reformat
  2. Redefine
  3. sort
  4. Both a & b
Answer : A

10) What does is_blank("") return?

  1. 1
  2. 0
  3. Both a & b
  4. None of the above

Answer : A



11) What does the layout in a graph mean to the Co>Operating System?

  1. The location of files
  2. The number and locations of the partitions of multifiles
  3. The number of the partitions of program components and the locations where they execute
  4. All of the above
Answer : D

12) What would be the output of printf("%f", 12345678.90)?

  1. 12345678.90
  2. “12345678.90”
  3. “12345678.900000”
  4. None of the above
Answer : C
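
The answer to question 12 follows the C `printf` convention, where `%f` prints six digits after the decimal point unless a precision is given. The Python sketch below shows that convention; it assumes the DML function behaves the same way, which is what the quiz answer implies.

```python
# Python's "%f" conversion follows the C printf convention:
# six digits after the decimal point when no precision is specified.
formatted = "%f" % 12345678.90

print(formatted)  # 12345678.900000
```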

13) How do you change a serial file to an MFS file?

  1. copy file from serial file path to mfs file path
  2. Use Partition component
  3. All of the above
  4. None of the above
Answer : B

14) Combining multiple flow partitions of data records into a single flow is known as

  1. re-partitioning
  2. departitioning
  3. partitioning
  4. None of the above
Answer : B

15) When the data is divided into small chunks and processed on different components simultaneously we call it

  1. Component parallelism
  2. Data parallelism
  3. Pipeline parallelism
  4. None of the above
Answer : B

16) When multiple program components execute simultaneously on different branches of a graph it is called

  1. Pipeline parallelism
  2. Component parallelism
  3. Data parallelism
Answer : B

17) When several connected program components on the same branch of a graph execute simultaneously, we call it

  1. Data Parallelism
  2. Pipeline parallelism
  3. Component parallelism
Answer : B

18) What is the output DML format of Trash?

  1. no output port/format
  2. same as input format
  3. blank
  4. All of the above
Answer : A

19) What port does the Run SQL component have?

  1. input port
  2. output port
  3. log port
  4. All of the above
Answer : C

20) Phasing

  1. separates work load
  2. sets sequence of operation
  3. Both a & b
  4. None of the above

Answer : C



21) Skew is

  1. load imbalance
  2. a partitioning component
  3. load balancer
  4. All of the above
Answer : A

22) Propagate from neighborhood, component, url, custom, hosts, database: These are

  1. ab initio connectors
  2. layout options
  3. input components
  4. All of the above
Answer : B

23) XFR is

  1. abinitio dml file
  2. redundant file
  3. user written transformation functions for reusability
  4. All of the above
Answer : C

24) Error port

  1. will have error records
  2. will have error message
  3. will have both
  4. doesn’t collect both
Answer : B

25) Reject Port

  1. will have reject error message
  2. will have rejected records
  3. will have both
  4. doesn’t collect both
Answer : B

26) deselect Port

  1. will have unselected error message
  2. will have unselected records
  3. will have both
  4. doesn’t collect both
Answer : B

27) How do you truncate a table

  1. Use Truncate Table component
  2. By using access tab of input/output component
  3. All of the above
  4. None of the above
Answer : C

28) What are the different types of joins

  1. right, left and full
  2. inner, full outer and explicit
  3. Product, full outer and implicit
  4. All of the above
Answer : B

29) What are the different types of parallelism in Ab Initio?

  1. Process parallelism, Data parallelism, Component parallelism
  2. Phase parallelism, Component parallelism, Pipeline parallelism
  3. Component parallelism, Data parallelism, Pipeline parallelism
  4. None of the above
Answer : C

30) What is component parallelism?

  1. A graph with multiple processes running simultaneously on separate data uses component parallelism.
  2. A graph that deals with data divided into segments and operates on each segment simultaneously uses component parallelism
  3. A graph with multiple phases running simultaneously on separate data uses component parallelism.
  4. A graph with multiple components running simultaneously on the same data

Answer : A



31) What is the notation to specify the URL of a layout.

  1. hostname/path/
  2. protocol://hostname/path
  3. path
  4. None of the above
Answer : B

32) Which of the following components need a sorted input?

  1. merge
  2. interleave
  3. gather
  4. All of the above
Answer : A

33) Why is a Gather component used?

  1. To reduce data parallelism
  2. To reduce component parallelism
  3. both of the above
  4. None of the above
Answer : C

34) What is the maximum number of input ports a Join component can have?

  1. only one
  2. 2
  3. 4
  4. 20
Answer : D

35) What does the max-core parameter refer to

  1. The maximum memory the component can use
  2. The maximum CPU utilization the component can use
  3. The maximum time limit the component can use
  4. The maximum number of rejects the component can allow
Answer : A

36) What does layout of a component mean?

  1. The place where the component works
  2. The depth of parallelism
  3. Number of partitions
  4. All of the above
Answer : D

37) Which component among the following is the most efficient

  1. Concatenate
  2. interleave
  3. Gather
  4. Merge
Answer : C

38) Which component saves the status information among the following components

  1. Phases
  2. Check points
  3. Phase breaks
  4. Commit
Answer : B

39) When complex joining expressions are required which component among these is preferred

  1. look up
  2. Join
  3. Merge
  4. Gather
Answer : A

40) Which component gives sorted output

  1. merge
  2. sort
  3. rollup
  4. All of the above

Answer : D



41) Which parameter specifies the component's tolerance for reject events?

  1. reject port
  2. reject-threshold parameter
  3. max-core parameter
  4. log-reject parameter
Answer : B

42) What is the use of merge component?

  1. Joins Different Flows both sorted & unsorted
  2. Sorts Data
  3. Join Different sorted data Flows and maintains Sort order
  4. None Of The Above
Answer : C

43) Explain maxcore parameter?

  1. Minimum Amount Of Memory allocated To a Component
  2. Amount Of Main Memory Allocated To An Instruction
  3. Maximum Amount Of Main Memory Allocated To Store And Manipulate Data Temporarily
  4. Amount Of Main Memory Allocated To Store And Manipulate Data Permanently
Answer : C

44) What component will you use to undo the effect of Partition by Round Robin ?

  1. Partition by Expression
  2. Partition by key and Sort
  3. Interleave
  4. Partition by Range
Answer : C
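
Why Interleave reverses a round-robin partition (question 44) is easier to see in a small Python sketch. The function names here are hypothetical, and the sketch assumes records are dealt out one at a time; it models the idea, not the Ab Initio components.

```python
def partition_round_robin(records, n):
    """Deal records out one at a time to n partitions, in rotation."""
    partitions = [[] for _ in range(n)]
    for i, rec in enumerate(records):
        partitions[i % n].append(rec)
    return partitions


def interleave(partitions):
    """Take one record from each partition in turn until all are exhausted."""
    out = []
    longest = max((len(p) for p in partitions), default=0)
    for i in range(longest):
        for p in partitions:
            if i < len(p):
                out.append(p[i])
    return out


records = list(range(1, 8))
parts = partition_round_robin(records, 3)
restored = interleave(parts)

print(parts)     # [[1, 4, 7], [2, 5], [3, 6]]
print(restored)  # [1, 2, 3, 4, 5, 6, 7]
```

Because both steps visit the partitions in the same rotation, the original record order comes back.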

45) Which component is used to reduce the volume of a data flow because of narrow bandwidth or a lack of disk space to store the data?

  1. Compress
  2. Compress,Gzip
  3. Compress,Gzip,Gunzip
  4. None of the above
Answer : B

46) For which component connecting the ports is not mandatory

  1. Run SQL
  2. Intermediate file
  3. Output file
  4. Broadcast
Answer : A

47) Which is the generally used departitioning component?

  1. Concatenate
  2. Gather
  3. Merge
  4. All of the above
Answer : D

48) What do you call the file that can treat several serial files having the same record format as a single graph component?

  1. Summary File
  2. Event Log File
  3. Adhoc Multifile
  4. Custom File
Answer : C

49) What is the control partition?

  1. It is a partition where the same data is run simultaneously in different components
  2. Location of the multifile’s data partitions
  3. A partition which divides the control information
  4. none of these
Answer : B

50) Replicate component supports which type of Parallelism?

  1. Data Parallelism
  2. Component Parallelism
  3. Pipeline Parallelism
  4. All of the above

Answer : B



51) If there are 100 records and only 10 percent rejection can be permitted, then what are the values of limit and ramp?

  1. limit=100;ramp=10
  2. limit=0;ramp=0.1
  3. limit=0;ramp=10
  4. limit=0;ramp=0
Answer : B
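
The arithmetic behind question 51 can be sketched in Python. The sketch assumes the commonly described rule that the number of tolerated rejects is `limit + ramp * records processed`; the function names are hypothetical and are not Ab Initio APIs.

```python
def reject_allowance(limit, ramp, records_processed):
    """Rejects tolerated so far, assuming allowance = limit + ramp * records processed."""
    return limit + ramp * records_processed


def should_abort(rejects, limit, ramp, records_processed):
    """True once the reject count exceeds the allowance."""
    return rejects > reject_allowance(limit, ramp, records_processed)


allowance = reject_allowance(limit=0, ramp=0.1, records_processed=100)

print(allowance)                      # 10.0
print(should_abort(10, 0, 0.1, 100))  # False
print(should_abort(11, 0, 0.1, 100))  # True
```

With limit=0 and ramp=0.1, 100 records tolerate 10 rejects, which is the 10 percent the question asks for.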

52) Which component does not have a finalize transform function?

  1. Dedup sorted
  2. Denormalize Sorted
  3. Rollup
  4. Scan
Answer : A

53) What are the main programs required for Ab Initio software?

  1. only GDE
  2. OS and GDE
  3. Co>Operating System and GDE
  4. None of these
Answer : C

54) Which partition component balances the workload between CPUs?

  1. partition by round robin
  2. partition by percentage
  3. partition by load balance
  4. none of these
Answer : C

55) The component which does not have any ports

  1. look up
  2. Run SQL
  3. Scan
  4. Fuse
Answer : A

56) Which of the following components does not require a sorted input?

  1. match sorted
  2. join
  3. merge
  4. dedup sorted
Answer : B

57) Which of the following components has an ordered input port?

  1. concatenate
  2. gather
  3. partition by key
  4. Reformat
Answer : A

58) Which component or components are recommended when the incoming data records are unsorted and need to be joined with a small set of records?

  1. sort and join(input needs to be sorted or grouped)
  2. join(in memory, inputs need not be sorted or grouped)
  3. lookup file with reformat
  4. sort and match sorted
Answer : C

59) Which of the following components requires a sorted input?

  1. rollup
  2. join
  3. concatenate
  4. merge
Answer : D

60) delimiter “\001” refers to

  1. space delimiter
  2. NULL delimiter
  3. double space delimiter
  4. All of the above

Answer : B



61) What does varstring in the drop-down menu in the GDE’s Record Format Editor refer to?

  1. variable length string
  2. fixed length string
  3. length prefixed string
  4. none of the above
Answer : C

62) Which lookup function is recommended when one wants to check whether a given key value exists in a serial lookup file?

  1. lookup
  2. lookup_count
  3. lookup_count_local
  4. lookup_match
Answer : D

63) Which of the following components does not have an ordered output port?

  1. partition by key
  2. concatenate
  3. interleave
  4. partition by Round Robin
Answer : A

64) .dbc files contain

  1. database backup configuration details
  2. database configuration details
  3. database credentials
  4. All of the Above
Answer : B

65) Lookup files are faster when

  1. small enough to store in memory; avoids I/O calls
  2. Too large to store in memory; avoids I/O calls
  3. No I/O calls
  4. None of the above
Answer : A

66) Phases are used to

  1. Logically divide work loads
  2. setting sequence of operation
  3. show different process separately
  4. A and C
Answer : B

67) .rec file is

  1. record changes to data
  2. used to recover point in time
  3. a recovery file
  4. None of the above
Answer : C

68) Which of the following are compound data types?

  1. vector
  2. union
  3. record
  4. All of the Above
Answer : D

69) A package is a collection of

  1. Functions and Variables
  2. Procedures and functions
  3. Procedures and variables
  4. None of the above
Answer : A

70) Which of the following are layout options?

  1. propagate from neighborhood
  2. component
  3. url
  4. All of the Above

Answer : D



71) Valid layout option?

  1. custom
  2. hosts
  3. database
  4. All of the Above
Answer : D

72) Can a resource on the internet be given as a layout?

  1. Yes
  2. No
  3. it has to be mentioned in dbc file
  4. None of the above
Answer : A

73) How do you set phases implicitly?

  1. By using staging
  2. By Using intermediate files
  3. By using logically divided work spaces
  4. All of the Above
Answer : B

74) Access tab of input/output table component can be used to

  1. truncate table
  2. collect error records
  3. collect error messages
  4. All of the Above
Answer : A

75) Replicate

  1. Non-partition
  2. makes a copy of records
  3. Both
  4. None of the above
Answer : C

76) The types of join are

  1. inner
  2. full outer
  3. explicit
  4. All of the Above
Answer : D

77) What is ramp and limit?

  1. error tolerance limit
  2. ramp is expressed in %
  3. limit is in no of records
  4. All of the Above
Answer : D

78) KEY Partitioning

  1. hash code is computed using the Key and same keys go to same partitions
  2. As many partitions as keys are created and same keys go to same partitions
  3. Both
  4. None of the above
Answer : A
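
Key partitioning as described in question 78 can be sketched in Python. The record layout, the `cust` field, and the use of `zlib.crc32` as the hash are assumptions made for the example; the hash Ab Initio actually uses is not stated in this quiz.

```python
import zlib


def partition_by_key(records, key, n):
    """Send each record to partition hash(key value) mod n."""
    partitions = [[] for _ in range(n)]
    for rec in records:
        h = zlib.crc32(str(rec[key]).encode("utf-8"))  # deterministic hash of the key
        partitions[h % n].append(rec)
    return partitions


# Hypothetical records: four distinct keys spread over three partitions.
records = [
    {"cust": "A", "amt": 1}, {"cust": "B", "amt": 2},
    {"cust": "A", "amt": 3}, {"cust": "C", "amt": 4},
    {"cust": "D", "amt": 5}, {"cust": "B", "amt": 6},
]
parts = partition_by_key(records, "cust", 3)

# For each key value, collect the set of partitions it landed in.
placement = {}
for idx, part in enumerate(parts):
    for rec in part:
        placement.setdefault(rec["cust"], set()).add(idx)

print(all(len(p) == 1 for p in placement.values()))  # True
```

The number of partitions is fixed at 3 even though there are four keys, which is why option 2 does not describe key partitioning.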

79) Departitioning requires

  1. Fan in flow
  2. Fan out flow
  3. both
  4. None of the above
Answer : A

80) gather

  1. Not key based
  2. reads records as soon as they are available and order is unpredictable
  3. useful for repartitioning
  4. All of the Above

Answer : D



81) Merge

  1. result is sorted if each input is sorted
  2. result is sorted even if input is not sorted
  3. result is not sorted
  4. None of the above
Answer : A
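
The condition in question 81 can be illustrated with Python's `heapq.merge`, which is a k-way merge of already-sorted inputs. It is used here as an analogy for the Merge component, not as its implementation, and the flows are made-up data.

```python
import heapq

# Two flows that are each sorted on the key.
flow_a = [1, 4, 9]
flow_b = [2, 3, 10]
merged = list(heapq.merge(flow_a, flow_b))
print(merged)  # [1, 2, 3, 4, 9, 10]

# If one input is not sorted, the merged result is not sorted either.
unsorted_flow = [5, 1, 7]
bad_merge = list(heapq.merge(unsorted_flow, flow_b))
print(bad_merge == sorted(bad_merge))  # False
```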

82) Concatenate

  1. is used to add headers to records sets
  2. is used to add trailers to records sets
  3. used to bring non-parallel data flows together
  4. All of the Above
Answer : D

83) Layout is

  1. serial
  2. parallel
  3. Either
  4. None of the above
Answer : C

84) Layout

  1. determines what runs where
  2. propagates in all directions
  3. the computer/directory are unknown till runtime
  4. All of the Above
Answer : D

85) Which is a DML validation function?

  1. is_valid
  2. is_null
  3. is_defined
  4. All of the Above
Answer : D

86) max-core parameter is found in

  1. Sort
  2. Join
  3. Rollup
  4. All of the Above
Answer : D

87) Control filename is mandatory for

  1. DBC file
  2. Serial File
  3. MFS file
  4. Database
Answer : C

88) Ab Initio is case sensitive

  1. FALSE
  2. TRUE
Answer : B

89) Run SQL has ___ port(s)

  1. One input one output one log
  2. one log
  3. One Input One Output
  4. One Output One Log
Answer : B

90) Error tolerance limit is achieved through

  1. Ramp
  2. Limit
  3. Skew
  4. A or B

Answer : D



91) Lookup files

  1. Are small enough to store in memory
  2. Avoids I/O calls
  3. A & B
  4. B
Answer : C

92) ____ is a save point from which a job can be restarted in case of a crash

  1. Recovery file
  2. Check point
  3. DML
  4. XFR
Answer : B

93) Package is a collection of

  1. Vector
  2. Union
  3. Record
  4. Functions and Variables
Answer : D

94) Which is NOT a type of Parallelism in Ab Initio

  1. Pipeline Parallelism
  2. Process Parallelism
  3. Component Parallelism
  4. Data Parallelism
Answer : B

95) In which component will you find the “Output Index” parameter?

  1. Rollup
  2. Reformat
  3. Join
  4. Filter by Expression
Answer : B

96) There is a 2-way multifile. Partition 0 has 3 records and partition 1 has 5 records. We connect this file to a broadcast component with multiple flows going out with the same depth. How many records will come in each flow?

  1. 20
  2. 8
  3. 16
  4. 10
Answer : B
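
The count in question 96 can be checked with a small Python model of Broadcast, in which every output partition receives a copy of every input record. The record names and the `broadcast` helper are hypothetical.

```python
# A 2-way multifile: partition 0 holds 3 records, partition 1 holds 5.
input_partitions = [
    ["r1", "r2", "r3"],
    ["r4", "r5", "r6", "r7", "r8"],
]


def broadcast(partitions, n_out):
    """Copy every record from every input partition to each output partition."""
    all_records = [rec for part in partitions for rec in part]
    return [list(all_records) for _ in range(n_out)]


out = broadcast(input_partitions, n_out=2)
counts = [len(p) for p in out]

print(counts)  # [8, 8]
```

Each outgoing flow partition sees 3 + 5 = 8 records, which matches answer B.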

97) What error will we get if the MAX CORE value is set too low?

  1. Record out of order
  2. Paging by OS
  3. Too many open files
  4. Increase MAX CORE
Answer : C

98) Which of the following is NOT a valid command

  1. m_mv
  2. m_wc
  3. m_cat
  4. m_zip
Answer : D

99) What does “skew” mean

  1. Uneven distribution of data among partitions
  2. Choosing incorrect partition key
  3. Partitions are equally divided
  4. Balanced Data
Answer : A

100) If a parameter is defined in multiple projects as follows, what is the correct order of precedence (from lowest to highest)?    (1) Stdenv  (2) Localenv  (3) Private Project

  1. 1,2,3
  2. 3,2,1
  3. 2,1,3
  4. 2,3,1

Answer : A



101) What is the .abinitiorc file used for, and where is it located?

  1. Password file present in home directory
  2. User configuration file present in home directory
  3. User configuration file present in private project
  4. Database configuration file present in common project
Answer : B

102) Which is NOT TRUE about component folding?

  1. It reduces number of processes
  2. When component folding is enabled, the Co>Operating System automatically scans and folds as many components as it can
  3. We can specify some components to be excluded from the foldable list
  4. Decreases graph performance
Answer : D

103) Which of the following is NOT TRUE about Driving Input

  1. It is present only in in-memory join
  2. It is always present in a join component regardless of sorted-input
  3. Ideally largest input is taken as driving input
  4. All non-driving inputs are taken into memory
Answer : B

104) How to go from n_way to m_way partitioning?

  1. Departition
  2. Repartition
  3. Partition
  4. None of the above
Answer : B

105) Which air command is used to check in a project from the command line?

  1. air project export
  2. air project checkin
  3. air project import
  4. air project checkout
Answer : C

106) Which variable is used to set the EME path?

  1. AB_AIR_PATH
  2. AB_AIR_BRANCH
  3. AB_HOME
  4. AB_AIR_ROOT
Answer : D

107) What does MFS stand for?

  1. Multi File System
  2. Multi File Storage
  3. Maximum File Storage
  4. Minimum File Storage
Answer : A

108) Which component breaks pipeline parallelism?

  1. Reformat
  2. Sort
  3. Redefine Format
  4. Filter By Expression
Answer : B

109) What is the difference between Reformat and Redefine Format?

  1. Reformat can actually change the bytes of the data whereas redefine format changes the record format
  2. Reformat changes the record format whereas redefine format changes the bytes of the data
  3. Both are used to change record format as well as bytes of the data
  4. There is no difference between Reformat and Redefine component
Answer : A

110) Which command is used to test a DML expression quickly?

  1. m_dump
  2. m_touch
  3. m_eval
  4. m_expand

Answer : C



111) What do NULLs represent in DML?

  1. A zero-length string
  2. Unknown or missing piece of data
  3. Number zero
  4. A string containing all blanks
Answer : B

112) What is the difference between phasing and checkpointing?

  1. Phasing is used for recovery and checkpoint is used for memory management
  2. Both are used for managing data
  3. Phasing is used for memory management and checkpoint is used for recovery
  4. Phasing & Checkpoints both are same
Answer : C

113) How do you create a surrogate key?

  1. Use Assign Key Component
  2. next_in_sequence()
  3. A&B
  4. using surrogate function
Answer : C

114) Command used to list tags on a particular object

  1. air tag show
  2. air tag ls
  3. air tag view
  4. air tag all
Answer : B

115) How can you find the depth of an MFS?

  1. m_eval
  2. m_wc
  3. m_expand
  4. m_dump
Answer : C

116) What are the files present in MFS directories which store the directory structure

  1. .mfctl
  2. .mdir
  3. .air-project-parameters
  4. .air-project-overrides
Answer : A

117) Which of the following statements is NOT true?

  1. ROLLUP – Generates a final summary record rather than a running cumulative summary for the group.
  2. NORMALIZE – Optionally computes summary fields in the output record for each group.
  3. SCAN – Generates an output record that includes a running cumulative summary for the group to which the input record belongs.
  4. GATHER LOGS – Collects the output from the log ports of components for analysis of a graph after execution.
Answer : B
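
The difference between the Rollup and Scan descriptions in question 117 can be sketched in Python. The sketch assumes input already grouped by key and a simple sum as the aggregate; the data is made up, and the code models the behavior described in the options rather than the components themselves.

```python
from itertools import accumulate, groupby

# Hypothetical (key, amount) records, already grouped by key.
records = [("a", 1), ("a", 2), ("b", 5), ("b", 1), ("b", 4)]

rollup = []  # one final summary record per group
scan = []    # one running-total record per input record

for key, group in groupby(records, key=lambda r: r[0]):
    amounts = [amt for _, amt in group]
    rollup.append((key, sum(amounts)))
    scan.extend((key, running) for running in accumulate(amounts))

print(rollup)  # [('a', 3), ('b', 10)]
print(scan)    # [('a', 1), ('a', 3), ('b', 5), ('b', 6), ('b', 10)]
```

The last running total in each group of the scan output equals that group's rollup value.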

118) Which of the following is NOT a valid m_db command

  1. m_db test
  2. m_db unload
  3. m_db gendml
  4. m_db del
Answer : D

119) What is the best way to change fixed-length data to delimited data?

  1. Use Rollup component
  2. Use Scan Component
  3. Use Filter by Expression
  4. Use Redefine Component
Answer : D

120) Which of the components below cannot separate a file into header/trailer and body records? The file has a record identifier.

  1. Filter by expression
  2. Reformat with Output Index
  3. Partition by Expression
  4. Partition By Key
Answer : D

121) Which component generates multiple output records from each of its input records?

  1. Normalize
  2. Denormalize
  3. Rollup
  4. Filter by Expression

Answer : A
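
The one-to-many behavior in question 121 can be sketched in Python. The record layout and the `normalize` helper are hypothetical; the sketch only shows the idea of emitting one output record per element of a repeating field.

```python
def normalize(record):
    """Emit one output record per element of the record's repeating field."""
    return [
        {"customer": record["customer"], "item": item}
        for item in record["items"]
    ]


# Hypothetical input records, each carrying a vector of items.
inputs = [
    {"customer": "c1", "items": ["x", "y"]},
    {"customer": "c2", "items": ["z"]},
]
outputs = [row for rec in inputs for row in normalize(rec)]

print(len(inputs), len(outputs))  # 2 3
```

Denormalize goes the other way, collapsing several records for a key into one record with a vector.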