Big Data Engineering
Essentials

This foundational Essentials course is the first stepping stone on your learning path. You are mentored through finely tailored course content that introduces one of the most popular Big Data tech stacks, Hadoop. Because it is equally important to understand the roles and responsibilities of a Data Engineer, these are part of the curriculum too. The course also gives a clear introduction to Data Pipelines, Data Processing, HDFS, Resource Management, Data Access, and Pipeline Automation.

You will develop a real-world, hands-on Data Engineering pipeline using a publicly available dataset, together with its business context. If your system does not have enough resources to run the installations, a hands-on session on setting up the server on AWS Cloud is included as an add-on in this course.

Pre-requisites (Free)

SQL Foundation

Shell/Bash Scripting for Beginners

System Requirements

CPU: Quad-core, Intel i5 or better, or Apple M1

Memory: 16GB

OS: Windows/macOS

Do not worry if your system does not have enough capacity. Towards the end of this course, you will be guided on procuring an AWS Cloud server for your practice.

Modes of Training

Online Interactive Sessions

Recorded Video Sessions – From the latest Online batch

Resources

Approximate number of sessions: 25 (varies across batches)

Lifetime access to the recorded videos will be given, along with all supporting documents, logs, references, and software, if any.

Placements

This course alone does not make you ready for the job market. Complete the next-level Booster course to become eligible for our placement support.

Chapter 1: INTRODUCTION

Responsibilities
ETL/ELT
Data Sources
Batch Processing
Stream Processing
Data Lake
Data Warehouse
Data Marts
Data Staging
Data Integration
Administration
Data Optimizations
Required Skills

Chapter 2: DATA PIPELINES

Pipelines
Automation & Scheduling
Handling Exceptions
Logging

Chapter 3: INSTALLING HADOOP

Hadoop vs RDBMS
System requirements
Installation Modes
Pre-requisites
Installation
Real-world Installations
Questions & Answers

Chapter 4: HADOOP INTRODUCTION

Hadoop Ecosystem
Hadoop Distributions
Evolution
Storage
Resources
Processing
Data Access
Applications

Chapter 5: HDFS STORAGE

HDFS Intro & Architecture
Nodes & File System
Data blocks
Racks & Replications
High Availability
Space Reclamation
Story in short
Hands-on
Questions & Answers

Chapter 6: RESOURCE MANAGEMENT

YARN Intro
Architecture
Resource Manager
Resource Manager HA
Node Manager
Application Master & Containers
Workflow
Zookeeper
Story in short
Hands-on

Chapter 7: DATA PROCESSING

Processing Engines
MapReduce Intro
MapReduce Architecture
Mappers
Reducers
Spark Intro
Spark Architecture
Spark Workflow
Spark Terms
Spark vs MapReduce
Story in short

Chapter 8: DATA ACCESS

Hive Intro
Hive Architecture
Hive Hands-on
Pig Intro
Pig Architecture
Pig Latin
Pig Hands-on

Chapter 9: SCHEDULING JOBS

Oozie Intro
Architecture
Scheduling
Hands-on

Chapter 10: ESSENTIAL REALTIME PROJECT

Business & Dataset
Data Dictionary
Dump Data
Design Pipeline
Pipeline Development
Oozie Workflow
Conclusion

Chapter 11: AWS CLOUD SETUP

AWS Account
EC2 Instance
Setup & Login
Port Forwarding
Docker & Verify

Chapter 12: QUESTIONS & ANSWERS

Frequently Asked Questions (FAQs)

What are the modes of training available?

There are two modes of training: Online Instructor-Led Sessions and Recorded Video Sessions. You can purchase the latter at any time; for the former, look out for the schedule on this page.

Is this course enough to become a Data Engineer?

This is the foundational course towards becoming a Data Engineer. You also need to complete the Booster course to be market ready.

What are the pre-requisites for this course?

Basic SQL, Python, and Shell scripting skills are the pre-requisites. Do not worry, we offer free courses that you can enroll in.

Do I get any assistance if I enroll in the Recorded Video Sessions?

You will be part of the professional community, where you can get assistance with your blockers.

Will there be any placement assistance?

After this foundational course, you will have to complete the next level to be market ready. You will be assisted and guided in profile building and mock interviews.