Introduction to Python and Big Data Analysis

CDIP offers the certificate course on Introduction to Python and Big Data Analysis. Nowadays, Data Science is the most demanding profession in the software industry. To be a Data Scientist one should have a vast knowledge of Big Data and Machine Learning. For both cases, python provides best packages and libraries. In this course, a student will not only learn the basic of python but also the data analysis on Big Data.

Batch 5 class has already been started on 11th October.

Name: Sifatur Rahim

Designation: Lead BigData DevOps Engineer

Company: Telenor Health

Experience: 12 Years

Linkedin: https://bd.linkedin.com/in/sifatur-rahim-59930a33

Name: Rakib Hasan

Designation: Sr. Software Engineer

Company: Telenor Health

Experience: 7 Years

Linkedin: https://bd.linkedin.com/in/rakib-hasan-amiya-a7700a66

Basic python Part 1 

  • Overview 
  • Environment Setup 
  • Basic Syntax 
  • Variable Types 

Basic python Part 2 

  • Basic Operators 
  • Decision Making 
  • Loops 
  • Numbers 
  • Strings 
  • Basic Operators 
  • Decision Making 
  • Loops 
  • Numbers 
  • Strings 

Basic python Part 3 

  • Arrays 
  • Matrix 
  • Lists 
  • Tuples 
  • Dictionary 
  • Images 
  •  Tables 
  •  Forms 

Basic Python Part-4 

  • Date & Time 
  • Functions 
  • Modules 
  • Files I/O 
  • Exceptions 

Advance python Part 1 

  • Classes/Objects 
  • Reg Expressions 
  • Database Access 

Advance python Part 2 

  • Sending Email 
  • Multithreading 
  • JSON Processing 
  • Logging 
  • Unit testing 

Big data 

Data Concept 

  •    Structure concept (Hadoop, Spark) 
  •    When to choose what 
  •    Data collection (Open dataSets) 
  •    Kaggle, stat-computing.org etc. 

Big Data Part 2 (PySpark) 

Implementation 

  •      AWS EMR (and related basics of AWS like ec2, S3) 
  •      Run example with Hive, Hue 
  •      Example with PySpark 

Big Data Part 3 (Pandas) 

Pandas (intro) 

  • Understanding dataframe 
  • Database connection and execute query 
  • Dataframe filtering and storing result to CSV, Excel 

Big Data Part 3 (NumPy) 

NumPy(Intro) 

  • Why numpy 
  • Basic operations 

  Student’s personal Data Analysis(If have)

  • Students can gain proper idea about Script writing using Python.
  • Student can Identify Proper objective types and can build Python Modules for reusability.
  • Exception & Error handeling in Python.
  • Data Analysis using Python scripts.
  • Students will learn different types of open-source relational database management system.
  • Big Data Analysis Using Pyspark, Numpy & Pandas.
  • Student will expert to find the result using Data analysis.