Soutir Sen

profile
Bigdata Bootcamp - 4 Months
profile
16 sessions

4 months Bigdata Bootcamp to switch to Data Engineering.


Python Programming:

-       Python Basics

-       Python Datatypes (Primitive & Non-Primitive)

-       Python Operators (Arithmetic, Assignment, Logical)

-       Python Package & Modules

-       Python List

-       Python Dictionary

-       Python Identity & Membership Operators

-       Python Loops

-       Python String Operations

-       Python Objects & Class

-       File Handling in Python

Spark:

-       Spark Architecture

-       Spark Execution

-       Spark Name Node & Data Node Concepts

-       Spark DAGs

-       Spark Transformation & Actions

-       Spark-submit

-       Spark Memory Management

-       Spark Optimizations

PySpark Coding:

-       Spark DataFrame Operations

-       DataFrame Select

-       DataFrame NULL Value & Blank Values Handle

-       DataFrame Joins

-       DataFrame Broadcast

-       Repartition & Coalesce

-       Cashing & Persisting

-       Window Functions

Spark SQL:

-       Aggregation &Group by

-       Joins (Inner, Outer, Left, Right, Semi, Anti)

-       Window Functions

-       Partitioning & Bucketing

-       Hive query execution

-       SQL Handson

Unix Knowledge:

-       Important Unix Commands

-       File handling

-       Basic Shell Scripting

-       Job execution

-       Log monitoring

Git Knowledge:

-       Git Repository

-       Git Branch

-       Git Commit

-       Git Pull

-       Pull Request & Merge code

-       Conflict Management

-       Git Handson

Bigdata Technologies:

-       Hive

-       HDFS

-       Scoop

-       Kafka Streaming

Interview Preparation:

-       CV Preparation

-       Mock Interview

-       Live session with DE for interview prep tips

6,000