Big Data Analytics (2180710) MCQs

MCQs of Spark

Showing 21 to 30 out of 31 Questions
21.
Is it possible to mitigate stragglers in RDD?
(a) Yes
(b) No
Answer:

Option (a)

22.
Fault Tolerance in RDD is achieved using
(a) Immutable nature of RDD
(b) DAG (Directed Acyclic Graph)
(c) Lazy-evaluation
(d) None of the above
Answer:

Option (b)

23.
What is action in Spark RDD?
(a) The ways to send result from executors to the driver
(b) Takes RDD as input and produces one or more RDD as output.
(c) Creates one or many new RDDs
(d) All of the above
Answer:

Option (a)

24.
The shortcomings of Hadoop MapReduce was overcome by Spark RDD by
(a) Lazy-evaluation
(b) DAG
(c) In-memory processing
(d) All of the above
Answer:

Option (d)

25.
Spark is developed in which language
(a) Java
(b) Scala
(c) Python
(d) R
Answer:

Option (b)

26.
Which of the following is not a component of the Spark Ecosystem?
(a) Sqoop
(b) GraphX
(c) MLlib
(d) BlinkDB
Answer:

Option (a)

27.
Which of the following algorithm is not present in MLlib?
(a) Streaming Linear Regression
(b) Streaming KMeans
(c) Tanimoto distance
(d) None of the above
Answer:

Option (c)

28.
Which of the following is not the feature of Spark?
(a) Supports in-memory computation
(b) Fault-tolerance
(c) It is cost-efficient
(d) Compatible with other file storage system
Answer:

Option (c)

29.
Which of the following is the reason for Spark being Speedy than MapReduce?
(a) DAG execution engine and in-memory computation
(b) Support for different language APIs like Scala, Java, Python and R
(c) RDDs are immutable and fault-tolerant
(d) None of the above
Answer:

Option (a)

30.
Which of the following is true for RDD?
(a) RDD is a programming paradigm
(b) RDD in Apache Spark is an immutable collection of objects
(c) It is a database
(d) None of the above
Answer:

Option (b)

Showing 21 to 30 out of 31 Questions