Big Data | Testing | Questions and Answers

Big Data | Testing | Mock Test

1
Which of the following is better suited for the use of an RDBMS over Hadoop?
a. Structured data that needs to be read, updated and written multiple times
b. Structured data that needs to be analysed many times but is written to the storage unit occasionally
c. Structured data that is large in volume
d. CSV files
Answer : Structured data that needs to be read, updated and written multiple times

2
Which of the following is NOT an example of a Big Data characteristic?
a. Velocity
b. Variety
c. Vigor
d. Identify business risk
Answer : Vigor

3
Which of the following is an example of Quasi-structured data?
a. XML
b. Web clickstream data
c. JSON
d. 10 TB XML files
Answer : Web clickstream data

4
Which of the following is NOT a possible business driver for performing Big Data Analytics?
a. Understand tools and technologies better
b. Predict new business opportunities
c. Compliance with laws and regulations
d. Transaction data
Answer : Understand tools and technologies better

5
What is NOT a good example of Big Data defined by Volume?
a. Satellite images stored for analysis
b. Number of tweets recorded per second
c. Facebook posts for 100 users over 5 months
d. Hadoop can be used for any type and size of data
Answer : Number of tweets recorded per second

6
Which of the following is an example of Semi-structured data?
a. Web clickstream data
b. CSV files
c. XML files
d. 3
Answer : XML files
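
Note: the difference between semi-structured data (question 6) and quasi-structured data (question 3) can be illustrated with a minimal Python sketch. The XML record and the clickstream URL below are hypothetical examples, not taken from the test. The XML tags name each field, so a generic parser recovers the structure, while the clickstream entry needs format-specific parsing before its fields are usable.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse, parse_qs

# Semi-structured: the tags describe the data, so a generic XML parser
# can recover field names and values without extra knowledge.
xml_record = "<user><id>42</id><city>Pune</city></user>"  # hypothetical record
root = ET.fromstring(xml_record)
xml_fields = {child.tag: child.text for child in root}

# Quasi-structured: a clickstream entry is text with a loose format;
# the fields have to be extracted with format-specific parsing.
click = "http://example.com/search?q=hadoop&page=2"  # hypothetical clickstream URL
parsed = urlparse(click)
click_fields = {"path": parsed.path, "query": parse_qs(parsed.query)}

print(xml_fields)
print(click_fields)
```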

7
Which of the following is NOT a good reason to use Hadoop over an RDBMS?
a. Hadoop is good at storing and processing multiple data forms
b. Hadoop can use commodity hardware that is cheaper to use
c. Hadoop lowers costs by eliminating the need for licenses
d. The Secondary NameNode process is the master process in the HDFS
Answer : The Secondary NameNode process is the master process in the HDFS

8
How many single points of failure does a High Availability HDFS architecture have?
a. 0
b. 1
c. 2
d. The HDFS works with a wide variety of hardware and software platforms
Answer : 0

9
Which of the following database types should not be used when a large number of incremental writes need to be made to the database?
a. Columnar
b. Document
c. Key-Value Stores
d. Graph
Answer : Columnar
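
Note: one way to see why a columnar layout is a poor fit for many small incremental writes is the toy Python sketch below. It uses in-memory lists as a stand-in for storage, which is a simplifying assumption; real columnar databases are far more involved. The names `row_store`, `column_store`, `insert_row` and `insert_columnar` are hypothetical.

```python
# Toy in-memory layouts standing in for on-disk storage (an assumption).
row_store = []                                      # one entry per record
column_store = {"id": [], "name": [], "city": []}   # one list per column

def insert_row(record):
    """Row layout: a single append stores the whole record."""
    row_store.append(record)
    return 1  # number of storage structures touched

def insert_columnar(record):
    """Column layout: every column's storage is touched for each record."""
    for column, value in record.items():
        column_store[column].append(value)
    return len(record)  # number of storage structures touched

record = {"id": 1, "name": "Asha", "city": "Pune"}  # hypothetical record
row_touches = insert_row(record)
col_touches = insert_columnar(record)
print(row_touches, col_touches)
```

In this sketch each new record costs one write in the row layout but one write per column in the columnar layout, which hints at why frequent incremental writes favour the other database types listed.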

10
Which of the following is FALSE about the HDFS?
a. The HDFS is highly fault tolerant
b. By default the HDFS provides high availability which reduces the chances of single points of failure
c. The HDFS uses redundancy and replication of data to prevent data loss
d. The HDFS works with a wide variety of hardware and software platforms
Answer : By default the HDFS provides high availability which reduces the chances of single points of failure
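
Note: option c above refers to redundancy and replication. The arithmetic can be sketched in Python as follows, assuming the commonly used defaults of a 128 MB block size and a replication factor of 3; both are configurable, so treat the numbers as assumptions. The function name `hdfs_footprint` is hypothetical.

```python
import math

def hdfs_footprint(file_size_mb, block_size_mb=128, replication=3):
    """Estimate block count, raw storage and tolerated replica losses.

    block_size_mb and replication are assumed defaults and are configurable.
    """
    blocks = math.ceil(file_size_mb / block_size_mb)  # logical blocks in the file
    raw_storage_mb = file_size_mb * replication       # every block is stored `replication` times
    tolerated_losses = replication - 1                # copies of a block that can be lost
    return blocks, raw_storage_mb, tolerated_losses

blocks, raw_mb, losses = hdfs_footprint(1024)  # a 1 GB file
print(blocks, raw_mb, losses)
```

Replication protects the data blocks themselves; it does not by itself remove the NameNode as a single point of failure, which is why high availability has to be configured separately.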