
Learn Apache Spark: Tutorial for Beginners - This Apache Spark tutorial documentation will introduce you to Apache Spark programming in Scala. You will learn about Scala programming, dataframe, RDD, Spark SQL, and Spark Streaming with examples and finally prepare yourself for Spark interview questions and answers.
What is Apache Spark?
Apache Spark is an analytics engine for big data processing.
It runs 100 times faster than Hadoop and gives you full freedom to process large-scale data in real time, run analytics and apply machine learning algorithms.
Navigation menu
1. Apache Spark and Scala Installation
1.1 Spark installation on Windows
2. Getting Familiar with Scala IDE
2.1 Hello World with Scala IDE
3. Spark data structure basics
3.1 Spark RDD Transformations and Actions example
4. Spark Shell
4.1 Starting Spark shell with SparkContext example
5. Reading data files in Spark
5.1 SparkContext Parallelize and read textFile method
5.2 Loading JSON file using Spark Scala
5.3 Loading TEXT file using Spark Scala
5.4 How to convert RDD to dataframe?
6. Writing data files in Spark
6.1 How to write single CSV file in Spark
7. Spark streaming
8. Sample Big Data Architecture with Apache Spark
9. What's Artificial Intelligence, Machine Learning, Deep Learning, Predictive Analytics, Data Science?
10. Spark Interview Questions and Answers