4 months Bigdata Bootcamp to switch to Data Engineering.
Python Programming:
- Python Basics
- Python Datatypes (Primitive & Non-Primitive)
- Python Operators (Arithmetic, Assignment, Logical)
- Python Package & Modules
- Python List
- Python Dictionary
- Python Identity & Membership Operators
- Python Loops
- Python String Operations
- Python Objects & Class
- File Handling in Python
Spark:
- Spark Architecture
- Spark Execution
- Spark Name Node & Data Node Concepts
- Spark DAGs
- Spark Transformation & Actions
- Spark-submit
- Spark Memory Management
- Spark Optimizations
PySpark Coding:
- Spark DataFrame Operations
- DataFrame Select
- DataFrame NULL Value & Blank Values Handle
- DataFrame Joins
- DataFrame Broadcast
- Repartition & Coalesce
- Cashing & Persisting
- Window Functions
Spark SQL:
- Aggregation &Group by
- Joins (Inner, Outer, Left, Right, Semi, Anti)
- Window Functions
- Partitioning & Bucketing
- Hive query execution
- SQL Handson
Unix Knowledge:
- Important Unix Commands
- File handling
- Basic Shell Scripting
- Job execution
- Log monitoring
Git Knowledge:
- Git Repository
- Git Branch
- Git Commit
- Git Pull
- Pull Request & Merge code
- Conflict Management
- Git Handson
Bigdata Technologies:
- Hive
- HDFS
- Scoop
- Kafka Streaming
Interview Preparation:
- CV Preparation
- Mock Interview
- Live session with DE for interview prep tips