Apache Spark
with fully coded examples, cheat sheets, interview questions with answers & more
Say goodbye to lengthy workshops and bootcamps. This comprehensive eBook comes with lifetime access and dives deep into Apache Spark. With this code-rich book you can upskill and lead the way.
Topics Covered:
Chapter 1: Introduction
Chapter 2: Overview of Hadoop
Chapter 3: Hadoop Architecture
Chapter 4: Hadoop Ecosystem
Chapter 5: Setting Up Hadoop
Chapter 6: Installing Hadoop
Chapter 7: Hadoop Configuration
Chapter 8: Hadoop Distributed File System (HDFS)
Chapter 9: DataNodes and NameNodes
Chapter 10: Block Placement in HDFS
Chapter 11: Introduction to MapReduce
Chapter 12: Writing MapReduce Jobs
Chapter 13: Understanding MapReduce Workflow
Chapter 14: Overview of Apache YARN
Chapter 15: Resource Management in YARN
Chapter 16: Scheduling in Apache YARN
Chapter 17: Introduction to Apache Pig
Chapter 18: Writing Pig Latin Scripts
Chapter 19: Pig Latin Commands
Chapter 20: Overview of Apache Hive
Chapter 21: Working with HiveQL
Chapter 22: Hive Partitioning and Bucketing
Chapter 23: Advanced Queries in Hive
Chapter 24: Introduction to Apache HBase
Chapter 25: Table Management in HBase
Chapter 26: Reads and Writes in HBase
Chapter 27: Integration of HBase with Hive
Chapter 28: Introduction to Apache Spark
Chapter 29: Working with RDDs in Spark
Chapter 30: DataFrames in Spark
Chapter 31: Spark SQL for Data Processing
Chapter 32: Introduction to Apache Sqoop
Chapter 33: Transferring Data with Sqoop
Chapter 34: Overview of Apache Flume
Chapter 35: Collecting and Aggregating Data with Flume
Chapter 36: Overview of Apache Oozie
Chapter 37: Workflow Management in Oozie
Chapter 38: Introduction to Apache Kafka
Chapter 39: Real-Time Data Streaming with Kafka
Chapter 40: Security in Hadoop