BIG DATA ANALYTICS
Functional Programming With Scala – Overview
- Scala Immutable and Mutuable
- Variable Declaration Values (concept of val and var)
Scala Functions
- Create normal functions or Anonymous Functions
- Applying Functions Partially Curried Functions
- Var-args Support in Functions
Scala Classes, Traits
- Classes in Scala
- Constructors
- Expressions
- Functions vs Methods Inheritance
- Overloading Methods
- Abstract Class,
- Traits Inheritance in Traits
- When to use Trait instead of Abstract Class Using Generics/Types with Traits
Exception Handling
- try-catch-finally
- Expression-oriented programming
Case Classes & Collections
- Case classes
- Objects, packages, apply, update
- Functions are Objects (uniform access principle),
- Pattern matching Collections : Lists, Maps, functional combinators (map, foreach, filter, zip, folds)
- Pattern matching & functional composition
BigData Overview and Apache Hadoop
- What is Big Data & Hadoop
- Hadoop Architecture : Yarn , MapReduce and HDFS
- Overview Use Cases of Hadoop
Apache Spark Overview
Spark Core
- Introduction to Apache Spark and its Core Architecture
- What are Spark RDD and how they internally works?
- Exploring various functions of Spark RDD API's (map, flatMap, filter, reduce , reduceByKey etc
Spark SQL, DataFrames and DataSets
- The DataFrames/Datasets API
- Data Aggregation
- Column Operations
- The Functions API: date/time, string manipulation, aggregation Caching and caching storage levels
Structured Spark Streaming
- Need of Real-time Stream Processing?
- Sources and sinks
- Structured Streaming
- APIs Windowing & Aggregation
Machine Learning with Apache Spark MLlib
- Build machine learning pipelines for both supervised and unsupervised learning
- Transformer/Estimator/Pipeline API
Graph Processing with Apache Spark
- Basic Graph Analysis
- GraphFrames API
- GraphFrames ETL
Project