Spark Dataset with example

Spark dataset is distributed collection of typed objects.This was introduced in Spark 1.6 version.

It consolidates features of both RDD and Data frame with fast execution response and memory optimization of data processing .The methodology used in DataSet is Encoder which enhances execution and processing response of data over the network compared to others which depends on JAVA or Kryo Serializer .

Dataset concept came into feature with the following advantages.