MCQs of Spark (Big Data Analytics - 2180710)

Showing 21 to 30 out of 31 Questions

21.

Is it possible to mitigate stragglers in RDD?

(a)	Yes
(b)	No

22.

Fault Tolerance in RDD is achieved using

23.

What is action in Spark RDD?

(a)	The ways to send result from executors to the driver
(b)	Takes RDD as input and produces one or more RDD as output.
(c)	Creates one or many new RDDs
(d)	All of the above

24.

The shortcomings of Hadoop MapReduce was overcome by Spark RDD by

25.

Spark is developed in which language

26.

Which of the following is not a component of the Spark Ecosystem?

27.

Which of the following algorithm is not present in MLlib?

(a)	Streaming Linear Regression
(b)	Streaming KMeans
(c)	Tanimoto distance
(d)	None of the above

28.

Which of the following is not the feature of Spark?

(a)	Supports in-memory computation
(b)	Fault-tolerance
(c)	It is cost-efficient
(d)	Compatible with other file storage system

29.

Which of the following is the reason for Spark being Speedy than MapReduce?

(a)	DAG execution engine and in-memory computation
(b)	Support for different language APIs like Scala, Java, Python and R
(c)	RDDs are immutable and fault-tolerant
(d)	None of the above

30.

Which of the following is true for RDD?

(a)	RDD is a programming paradigm
(b)	RDD in Apache Spark is an immutable collection of objects
(c)	It is a database
(d)	None of the above

Showing 21 to 30 out of 31 Questions