Data Science Master's Program

BIT's Data Science Masters Program online training course makes you proficient in tools and systems used by Data Science Professionals. It includes training on Statistics, Data Science, Python, Apache Spark & Scala, SAS and Tableau etc.

  • 170000
  • 200000
  • Course Includes
  • Live Class Practical Oriented Training
  • 200 + Hrs Instructor LED Training
  • 150 + Hrs Practical Exercise
  • 100 + Hrs Project Work & Assignment
  • Timely Doubt Resolution
  • Dedicated Student Success Mentor
  • Certification & Job Assistance
  • Free Access to Workshop & Webinar
  • No Cost EMI Option


Have Query ?

What you will learn

  • Learn the data science process, including how to build effective data visualizations, and how to communicate with variou...
  • Develop software engineering skills that are essential for data scientists, such as creating unit tests and building cla...
  • Learn to work with data through the entire data science process, from running pipelines, transforming data, building mod...
  • Learn to design experiments and analyze results. Explore approaches for building recommendation systems.
  • Leverage what you’ve learned throughout the program to build your own open-ended Data Science project. This project will...
  • MapReduce and HDFS. Real-time analytics with Spark. Data Scientist roles and responsibilities. Prediction and analysis t...
  • Deploying the recommender system. SAS advanced analytics and R programming. Linear and logistic regression
  • Making sense of NoSQL data. Deep Learning model in AI

Requirements

  • Basic understanding of Computer Programming Languages.

Description

|| About Data Sceince Master's Program Course

BIT’s Data Science Master’s Program online training course lets you gain proficiency in Data Science. Data Science with Python, R, Hadoop Dev, Admin, Test and Analysis, Apache Spark, Scala, Deep Learning, Tableau, Data Science with SAS, SQL, MongoDB and more. You’ll master the skills necessary to become a successful Data Scientist. You’ll work on projects designed by industry experts, and learn to run data pipelines, design experiments, build recommendation systems, and deploy solutions to the cloud.

 

Masters Program is a structured learning path recommended by leading industry experts and ensures that you transform into Data Scientist. This immersive Data Scientist program starts with Data Science training to master important Data Extraction, Exploration Techniques, and Machine Learning Algorithms, then helps you gain expertise on Python for dealing with Big Data, followed by becoming adept at Apache Spark and it's machine learning capabilities and become proficient in trending skills about AI & Deep learning using Tensorflow and finally ends at Data Visualization using Tableau. Data Scientist Masters Program has been curated after thorough research and recommendations from industry experts. It will help you master concepts of Data Management, Statistics, Machine Learning and Big Data together with hands-on experience of tools & systems used by Data Scientists including Data Visualisation using Tableau.

 

Course Content

Introduction Data Science

Interested in learning more about data science, but don’t know where to start? This course will provide you with the key foundational skills any data scientist needs to prepare you for a career in data science or further advanced learning in the field.

 

This Specialization will introduce you to what data science is and what data scientists do. You’ll discover the applicability of data science across fields, and learn how data analysis can help you make data driven decisions. You’ll find that you can kickstart your career path in the field without prior knowledge of computer science or programming languages: this Specialization will give you the foundation you need for more advanced learning to support your career goals.

 

You’ll grasp concepts like big data, statistical analysis, and relational databases, and gain familiarity with various open source tools and data science programs used by data scientists, like Jupyter Notebooks, RStudio, GitHub, and SQL. You'll complete hands-on labs and projects to learn the methodology involved in tackling data science problems and apply your newly acquired skills and knowledge to real world data sets.

 

Topics Includes: 

Part 1 Advance Excel

Part 2 Statistics & Probability

Part 3 Data Science with Python

Part 4 Data Science with R

Part 5 Data Science with SAS

Part 6 Deep Learning & Artificial Intelligence

Part 7 Mongo DB

Part 8 MS SQL

Part 9 Big Data Hadoop & Spark

Part 10 Tableau

Lectutre-1 Entering Data

·      Introduction to Excel spreadsheet

·      Learning to enter data

·      Flling of series and custom fill list

·      Editing and deleting field

·      Practical Exercise              

 

Lecture-2 Referencing in Formulas

·      Learning about relative and absolute referencing

·      The concept of relative formulae

·      The issues in relative formulae

·      Creating of absolute and mixed references

·      Practical Exercise              

Lecture-3 Name Range

·      Creating names range

·      Using names in new formulae

·      Working with the name box

·      Selecting range

·      Names from a selection

·      Pasting names in formulae

·      Selecting names

·      Working with Name Manager

·      Practical Exercise              

Lecture-4 Understanding Logical Functions

·      The various logical functions in Excel

·      The If function for calculating values and displaying text

·      Nested If functions

·      VLookUp and IFError functions

·      Practical Exercise              

Lecture-5 Getting started with Conditional Formatting

·      Learning about conditional formatting

·      The options for formatting cells

·      Various operations with icon sets

·      Data bars and color scales

·      Creating and modifying sparklines

·      Practical Exercise              

Lecture-6 Advanced-level Validation

·      Multi-level drop down validation

·      Restricting value from list only

·      Learning about error messages and cell drop down

·      Practical Exercise              

Lecture-7 Important Formulas in Excel

·      Introduction to the various formulae in Excel

·      Sum, SumIF & SumIFs

·      Count, CountA, CountIF and CountBlank

·      Networkdays, Networkdays International

·      Today & Now function

·      Trim (Eliminating undesirable spaces)

·      Concatenate (Consolidating columns)

·      Practical Exercise              

Lecture-8 Working with Dynamic table

·      Introduction to dynamic table in Excel

·      Data conversion

·      Table conversion

·      Tables for charts

·      VLOOKUP

·      Practical Exercise              

Lecture-9 Data Sorting

·      Sorting in Excel

·      Various types of sorting

·      Alphabetical

·      Numerical

·      Row

·      Multiple column

·      Working with paste special

·      Hyperlinking

·      Using subtotal

·      Practical Exercise              

Lecture-10 Data Filtering

·      The concept of data filtering

·      Understanding compound filter and its creation

·      Removing of filter

·      Using custom filter and multiple value filters

·      Working with wildcards

·      Practical Exercise              

Lecture-11 Chart Creation

·      Creation of Charts in Excel

·      Performing operations in embedded chart

·      Modifying

·      Resizing

·      Dragging of chart

·      Practical Exercise              

Lecture-12 Various Techniques of Charting

·      Introduction to the various types of charting techniques

·      Creating titles for charts

·      Axes

·      Learning about data labels

·      Displaying data tables

·      Modifying axes

·      Displaying gridlines and inserting trendlines

·      Textbox insertion in a chart

·      Creating a 2-axis chart

·      Creating combination chart

·      Practical Exercise              

Lecture-13 Pivot Tables in Excel

·      The concept of Pivot tables in Excel

·      Report filtering

·      Shell creation

·      Working with Pivot for calculations

·      Formatting of reports

·      Dynamic range assigning

·      The slicers

·      Creating of slicers

·      Practical Exercise              

Lecture-14 Ensuring Data and File Security

·      Data and file security in Excel

·      Protecting row, column, and cell

·      Different safeguarding techniques

·      Practical Exercise              

Lecture-15 Getting started with VBA Macros

·      Learning about VBA macros in Excel

·      Executing macros in Excel

·      The macro shortcuts

·      Applications

·      The concept of relative reference in macros

·      Practical Exercise              

Lecture-16 Core concepts of VBA

·      In-depth understanding of Visual Basic for Applications

·      The VBA Editor

·      Module insertion and deletion

·      Performing action with Sub

·      Ending Sub if condition not met

·      Practical Exercise              

Lecture-17 Ranges and Worksheet in VBA

·      Learning about the concepts of workbooks & worksheets in Excel Protection of macro codes

·      Range coding

·      Declaring a variable

·      The concept of Pivot Table in VBA

·      Introduction to arrays

·      User forms

·      Getting to know how to work with databases within Excel

·      Practical Exercise              

Lecture-18 IF condition

·      Learning how the If condition works

·      How to apply it in various scenarios

·      Working with multiple Ifs in Macro

·      Practical Exercise              

Lecture-19 Loops in VBA

·      Understanding the concept of looping

·      Deploying looping in VBA Macros

·      Practical Exercise              

Lecture-20 Debugging in VBA

·      Studying about debugging in VBA

·      The various steps of debugging

·      Understanding breakpoints and way to mark it

·      The code for debugging and code commenting

·      Practical Exercise              

Lecture-21 Messaging in VBA

·      The concept of message box in VBA

·      Learning to create the message box

·      Various types of message boxes

·      The IF condition as related to message boxes

·      Practical Exercise              

Lecture-22 Practical Projects in VBA

·      Mastering the various tasks and functions using VBA

·      Understanding data separation

·      Auto filtering

·      Formatting of report

·      Combining multiple sheets into one

·      Merging multiple files together

·      Practical Exercise              

Lecture-23 Best Practices of Dashboards Visualization

·      Introduction to powerful data visualization with Excel Dashboard

·      Loading the data

·      Managing data and linking the data to tables and charts

·      Creating Reports using dashboard features

·      Practical Exercise              

Lecture-24 Principles of Charting

·      Learning to create charts in Excel

·      The various charts available

·      The steps to successfully build a chart

·      Personalization of charts

·      Formatting and updating features

·      Various special charts for Excel dashboards

·      Understanding how to choose the right chart for the right data

·      Practical Exercise              

Lecture-25 Getting started with Pivot Tables

·      Creation of Pivot Tables in Excel

·      Learning to change the Pivot Table layout

·      Generating Reports

·      The methodology of grouping and ungrouping of data

·      Practical Exercise              

Lecture-26 Creating Dashboards

·      Learning to create Dashboards

·      The various rules to follow while creating Dashboards

·      Creation of dynamic dashboards

·      Knowing what is data layout

·      Introduction to thermometer chart and its creation

·      How to use alerts in the Dashboard setup

·      Practical Exercise              

Lecture-27 Creation of Interactive Components

·      How to insert a Scroll bar to a data window

·      Concept of Option buttons in a chart

·      Use of combo box drop-down

·      List box control Usage

·      How to use Checkbox Control

·      Practical Exercise              

Lecture-28 Data Analysis

·      Understanding data quality issues in Excel

·      Linking of data

·      Consolidating and merging data

·      Working with dashboards for Excel Pivot Tables

·      Practical Exercise              

Lecture -1 Information of Statistics

·      What is statistics?

·      How is this useful

·      What is this course for

·      Practical Exercise              

Lecture -2 Data Conversion

·      Converting data into useful information

·      Collecting the data

·      Understand the data

·      Finding useful information in the data

·      Interpreting the data

·      Visualizing the data

·      Practical Exercise              

Lecture -3 Terms of Statistics

·      Descriptive statistics

·      Let us understand some terms in statistics

·      Variable

·      Practical Exercise              

Lecture -4 Plots

·      Dot Plots

·      Histogram

·      Stemplots

·      Box and whisker plots

·      Outlier detection from box plots and Box and whisker plots

·      Practical Exercise              

Lecture -5 Statistics & Probability

·      What is probability?

·      Set & rules of probability

·      Bayes Theorem

·      Practical Exercise              

Lecture -6 Distributions

·      Probability Distributions

·      Few Examples

·      Student T- Distribution

·      Sampling Distribution

·      Student t- Distribution

·      Poison distribution

·      Practical Exercise              

Lecture -7 Sampling

·      Stratified Sampling

·      Proportionate Sampling

·      Systematic Sampling

·      P – Value

·      Stratified Sampling

·      Practical Exercise              

Lecture -8 Tables & Analysis

·      Cross Tables

·      Bivariate Analysis

·      Multi variate Analysis

·      Dependence and Independence tests ( Chi-Square )

·      Analysis of Variance

·      Correlation between Nominal variables

·      Practical Exercise              

Lecture 1-22 Python Programming

·      Python Environment Setup and Essentials

·      The Installation of Anaconda Python Distribution

·      What Is A Programming Language

·      Python Interpreter

·      How To Run Python Code

·      Our First Python Program

·      Learning Python

·      Python Data Types

·      Numbers

·      Math Functions

·      Operator Precedence

·      Optional: bin() and complex

·      Variables

·      Expressions vs Statements

·      Augmented Assignment Operator

·      Strings

·      String Concatenation

·      Type Conversion

·      Escape Sequences

·      Formatted Strings

·      String Indexes

·      Immutability

·      Built-In Functions + Methods

·      Booleans

·      Lists

·      List Slicing

·      Matrix

·      List Methods

·      Common List Patterns

·      List Unpacking

·      Dictionaries

·      Practical Exercise              

Lecture 23-24 Pandas

·      Pandas Introduction

·      Series, Data Frames and CSVs

·      Data from URLs

·      Describing Data with Pandas

·      Selecting and Viewing Data with Pandas

·      Manipulating Data

·      Practical Exercise              

Lecture 25-27 NumPy

·      Mathematical Computing with Python (NumPy)

·      NumPy Introduction

·      NumPy DataTypes and Attributes

·      Creating NumPy Arrays

·      NumPy Random Seed

·      Viewing Arrays and Matrices

·      Manipulating Arrays

·      Standard Deviation and Variance

·      Reshape and Transpose

·      Dot Product vs Element Wise

·      Store Sales

·      Comparison Operators

·      Sorting Arrays

·      Turn Images Into NumPy Arrays

·      Practical Exercise              

Lecture-28 Introduction to Machine Learning

·      What Is Machine Learning?

·      AI/Machine Learning/Data Science

·      Practical Exercise              

Lecture 29-30 Machine Learning and Data Science Framework

·      Machine Learning Framework

·      Types of Machine Learning

·      Types of Data

·      Types of Evaluation

·      Features In Data

·      Modelling - Splitting Data

·      Modelling - Picking the Model

·      Modelling - Tuning

·      Modelling – Comparison

·      Practical Exercise              

Lecture-31 Data Science Environment Setup

·      Introducing Our Tools

·      Windows Environment Setup

·      Linux Environment Setup

·      Jupyter Notebook Walkthrough

·      Practical Exercise              

Lecture 32-35 Matplotlib + Seaborn: Plotting and Data Visualization

·      Matplotlib Introduction

·      Importing And Using Matplotlib

·      Anatomy Of A Matplotlib Figure

·      Scatter Plot And Bar Plot

·      Histograms And Subplots

·      Subplots Option 2

·      Plotting From Pandas DataFrames

·      Customizing Your Plots

·      Saving And Sharing Your Plots

·      Practical Exercise              

Lecture 36-40 Scikit-learn: Creating Machine Learning Models

·      Scikit-learn Introduction

·      Scikit-learn Cheatsheet

·      Typical scikit-learn Workflow

·      Debugging Warnings In Jupyter

·      Splitting Your Data

·      Clean, Transform, Reduce

·      Convert Data To Numbers

·      Handling Missing Values With Pandas

·      Handling Missing Values With Scikit-learn

·      Choosing The Right Model For Your Data

·      Practical Exercise              

Lecture 41-45 Scikit-learn - Regression Model

·      Types of Regression Algorithms

·      Simple Linear Regression

·      Multiple Linear Regression

·      Logistic Regression

·      Polynomial Regression

·      Support Vector Regression

·      Ridge Regression

·      Lasso Regression

·      ElasticNet Regression

·      Bayesian Regression

·      Decision Tree Regression

·      Random Forest Regression

·      Practical Exercise              

Lecture 46-48 Scikit-learn – Classification

·      Types of Classification Algorithms

·      Logistic Regression/Classification

·      K-Nearest Neighbours

·      Support Vector Machines

·      Kernel Support Vector Machines

·      Naive Bayes

·      Decision Tree Classification

·      Random Forest Classification

·      Case Studies

·      Practical Exercise              

Lecture 49-50 K Means Clustering

·      K-Means Clustering

·      A Simple Example of Clustering

·      Clustering Categorical Data

·      How to Choose the Number of Clusters

·      Pros and Cons of K-Means Clustering

·      To Standardize or not to Standardize

·      Relationship between Clustering and Regression

·      Market Segmentation with Cluster Analysis

·      Market Segmentation with Cluster Analysis

·      Species Segmentation with Cluster Analysis

·      Advanced Statistical Methods - Other Types of Clustering

·      Types of Clustering

·      Dendrogram

·      Heatmaps

·      Practical Exercise              

Lecture 51-53 Text Mining

·      The concepts of text-mining

·      Use cases

·      Text Mining Algorithms

·      Quantifying text

·      TF-IDF

·      Beyond TF-IDF

·      Practical Exercise              

Lecture 54-58 Basic Time Series Forecasting

·      What is time series?

·      Techniques and applications

·      Time series components

·      Moving average

·      Smoothing techniques

·      Exponential smoothing

·      Univariate time series models

·      Multivariate time series analysis

·      Sentiment analysis in Python (Twitter sentiment analysis)

·      Text analysis

·      Rolling Mean For Detecting Temporal Variation

·      Simple Exponential Smoothing (SES)

·      Holt extended simple exponential smoothing

·      Holt Winters

·      Auto Regression Model (AR): Consider Previous Time Steps

·      Implement a Basic ARIMA Model

·      Automated ARIMA & Account for Seasonality (SARIMA)

·      Practical Exercise              

Lecture 59-60 Principal Component Analysis (PCA)

·      Primer

·      Measurement

·      Assumptions

·      Applied PCA work flow

·      Analysis of performance

·      Dimensionality reduction with (PCA)

·      Practical Exercise              

Lecture-1 An Introduction to R

·      History of  R

·      Introduction to R

·      The R environment

·      What is Statistical Programming?

·      Why use a command line? 

·      Your first R session

·      Practical Exercise              

Lecture-2 Starting and quitting R

·      Recording your work 

·      Basic features of R

·      Calculating with R 

·      Named storage

·      Functions 

·      Exact or approximate? 

·      R is case-sensitive 

·      Listing the objects in the workspace 

·      Vectors

·      Extracting elements from vectors 

·      Vector arithmetic 

·      Simple patterned vectors 

·      Missing values and other special values

·      Character vectors 

·      Factors 

·      More on extracting elements from vectors 

·      Matrices and arrays 

·      Data frames

·      Dates and times

·      Practical Exercise              

Lecture-3 Import and Export data in R

·      Importing data in to R

·      CSV File

·      Excel File

·      Import data from text table

·      SAS and SPSS datasets

·      Exporting Data from R

·      CSV File

·      Text Table

·      Excel File

·      SAS dataset

·      Practical Exercise              

Lecture-4 Merge / Join

·      Inner Join

·      Left Join

·      Right Join

·      Full Join

·      Anti Join

·      Semi Join

·      Practical Exercise              

Lecture-5 Programming statistical graphics

·      High-level plots

·      Bar charts and dot charts

·      Pie charts

·      Histograms

·      Box plots

·      Scatterplots

·      QQ plots

·      Density Plot

·      Choosing a high-level graphic

·      Low-level graphics functions

·      The plotting region and margins

·      Adding to plots

·      Setting graphical parameters

·      Practical Exercise              

Lecture-6 Programming with R

·      Flow control

·      The for() loop

·      The if() statement

·      The while() loop

·      The repeat loop, and the break and next statements

·      Apply

·      Sapply

·      Lapply

·      Managing complexity through functions • What are functions? 

·      Scope of variables

·      Practical Exercise              

Lecture-7 Data in R

·      Modes and Classes

·      Data Storage in R 

·      Testing for Modes and Classes

·      Structure of  R Objects

·      Conversion of Objects

·      Missing Values 

·      Working with Missing Values

·      Practical Exercise              

Lecture-8 Reading and Writing Data

·      Reading Vectors and Matrices

·      Data Frames: read.table

·      Comma- and Tab-Delimited Input Files

·      Fixed-Width Input Files 

·      Extracting Data from R Objects 

·      Connections 

·      Reading Large Data Files

·      Generating Data

·      Sequences

·      Random Numbers 

·      Permutations 

·      Random Permutations

·      Enumerating All Permutations 

·      Working with Sequences  v Spreadsheets 

·      The RODBC Package on Windows 

·      The gdata Package (All Platforms)

·      Saving and Loading R Data Objects

·      Working with Binary Files 

·      Writing R Objects to Files in ASCII Format 

·      The write Function 

·      The write.table function

·      Reading Data from Other Programs 

·      Practical Exercise              

Lecture-9 Dates

·      as.Date

·      The chron Package 

·      POSIX Classes

·      Working with Dates

·      Time Intervals

·      Time Sequences

·      Current time

·      Present date

·      Practical Exercise              

Lecture-10 Factors

·      Using Factors

·      Numeric Factors  vs.  Manipulating Factors 

·      Creating Factors from Continuous Variables

·      Practical Exercise              

Lecture-11 Subscripting

·      Basics of Subscripting 

·      Numeric Subscripts 

·      Character Subscripts 

·      Logical Subscripts

·      Subscripting Matrices and Arrays

·      Specialized Functions for Matrices 

·      Lists

·      Subscripting Data Frames

·      Practical Exercise              

Lecture-12 Character Manipulation

·      Basics of Character Data

·      Displaying and Concatenating Character 

·      Working with Parts of Character Values

·      Regular Expressions in R

·      Basics of Regular Expressions

·      Breaking Apart Character Values

·      Using Regular Expressions in R

·      Substitutions and Tagging

·      Practical Exercise              

Lecture-13 Reshaping Data

·      Modifying Data Frame Variables 

·      Recoding Variables 

·      The recode Function

·      Reshaping Data Frames 

·      The reshape Package

·      Combining Data Frames

·      Practical Exercise              

Lecture-14 Data Manipulation

·      Random Selection of rows and columns

·      Summarization

·      Sort, Arrange

·      Group by

·      Filter

·      Practical Exercise              

Lecture-15 Missing Value and Outlier

·      Identify Missing values

·      Impute missing values

·      Identify Outliers

·      Capping outliers

·      Practical Exercise              

Lecture-16 Introduction to Statistics:

·      Types of Statistics

·      Types of Data

·      Practical Exercise              

Lecture-17 Descriptive Statistics

·      Measures of Central Tendency

·      Measures of Central Tendency – Usage Chart

·      Measures of Dispersion / Variability

·      Measures of Shape

·      Application of Variance/Std Deviation

·      Practical Exercise              

Lecture-18 Hypothesis Testing

·      Applications of Hypothesis Testing (Called T Test or Z Test)

·      Steps in Hypothesis Testing

·      Practical Exercise              

Lecture-19 Anova (Analysis of Variance)

·      What is Anova

·      Anova Steps

·      Simple One-Way Anova

·      Simple Two-Way Anova With Multiple Variables

·      Practical Exercise              

Lecture-20 Chi Square Tests

·      What is Chi-Square

·      Applications of Chi-Square

·      Practical Exercise              

Lecture-21 Correlation

·      Types of Correlation

·      Properties of Correlation

·      Methods of Calculating Correlation

·      Steps to Calculate Correlation

·      Practical Exercise              

Lecture-22 Regression Analysis

·      What is Regression

·      Types of Regression Analysis

·      Properties of The Regression Line

·      Validating the Model

·      Regression Assumptions

·      Data Transformation for Regression

·      Practical Exercise              

Lecture-23 Variable Selection Procedure for Regression

·      Forward Selection Procedure

·      Backward Elimination Procedure

·      Stepwise Regression Method

·      Dummy Variable Analysis

·      Practical Exercise              

Lecture-24 Logistic Regression

·      Likelihood Profiling

·      Assumption

·      Variable Selection Method :- Woe And Iv

·      Model Validation

·      Model Performance

·      Prediction

·      Practical Exercise              

Lecture-25 Cluster Analysis
Lecture-26 Decision Tree

·      What is decision Tree

·      How decision tree works

·      Cart

·      Pruning

·      Overfitting

·      Underfitting

·      Model validation

·      Model performance

·      Practical Exercise              

Lecture-27 Market Basket Analysis

·      What is MBA

·      Application of MBA

·      Support

·      Confidence

·      Lift

·      Rules

·      Practical Exercise              

Lecture-28 Random Forest

·      What is random forest

·      Application of random forest

·      Tune parameters

·      How to tune parameters

·      Model validation

·      Model performance

·      Practical Exercise              

Lecture-29 Support Vector Machine

·      What is support vector machine

·      Why to use SVM

·      Hyperplane

·      Kernel

·      Cost

·      Gamma

·      Model validation

·      Model performance

·      Practical Exercise              

Lecture-30 Naïve bayes

·      What is Naïve bayes

·      Bayes theorem

·      Conditional probability

·      Prior probability

·      Posterior probability

·      Application of Naïve bayes

·      Model validation

·      Model performance

·      Practical Exercise              

Lecture-31 ARIMA

·      What is time series

·      What is Arima

·      Stationary

·      Seasonality

·      Trend

·      How to find p,d,q

·      What are p,d,q

·      Find best model

·      Forecasting

·      Practical Exercise              

Lecture 1: Analytics overview

·      An introduction to the business analytics

·      The types of analytics

·      The areas of analytics

·      About analytical tools

·      About analytical techniques

·      Practical Exercise              

Lecture 2: Introduction to sas

·      An overview of the sas

·      Installation and introduction to sas,

·      Understanding different sas windows,

·      How to work with data sets,

·      Various sas windows like output,

·      Search, editor, log and explorer

·      Understanding the sas functions,

·      Which are various library types & programming files

·      The navigation in the sas console

·      About the sas language input files

·      What is data step? 

·      The proc step and data step

·      All about data step processing

·      The sas libraries

·      How to import and export raw data files,

·      How to read and subset the data sets       

·      Practical Exercise  

Lecture 3: Combining and modifying datasets

·      Necessity of combining or modifying the data

·      Concatenating the datasets

·      About interleaving method

·      What is data manipulation?

·      Modifying the variable attributes  

·      Practical Exercise  

Lecture 4: Sas operators and functions

·      Logical operator

·      Comparison operator

·      Arithmetic operator

·      Deploying different sas

·      Conditional statements

·      If/else, do while, do until  

·      Practical Exercise  

Lecture 5 Compilation and execution

·      Input buffer,

·      Pdv (backend)

·      Missover                

·      Practical Exercise

Lecture 6: Creation and compilation of sas data sets

·      Understanding the delimiter,

·      Dataline rules,

·      Dlm,

·      Delimiter dsd,

·      Raw data files and execution

·      List input for standard data

·      Practical Exercise              

Lecture 7: Proc SQL

·      What is proc sql?

·      How to retrieve data from a table?

·      How to select columns in a table?

·      Retrieving the data from multiple tables

·      Selecting the data from multiple tables

·      Concatenating query results                      

·      Practical Exercise

Lecture 8: Input statement and formatted input

·      Reading standard and non-standard numeric inputs

·      Column pointer controls,

·      Controlling while a record loads,

·      line pointer control/absolute line pointer control,

·      Single trailing,

·      Multiple in and out statements,

·      Dataline statement and rules,

·      List input method

·      Comparing single trailing

·      Double trailing      

·      Practical Exercise  

Lecture 9: Sas format

·      Sas format statements: standard and user-written,

·      Associating a format with a variable,

·      Working with sas format,

·      Deploying it on proc data sets

·      Comparing attrib

·      Format statements 

·      Practical Exercise  

Lecture 10: Sas graphs

·      Understanding proc gchart,

·      Various graphs,

·      Bar charts pie,

·      Bar and 3d and plotting variables with proc gplot           

·      Practical Exercise

Lecture 11: Sas macros

·      Need for sas macros

·      Macro functions

·      Sql clauses for macros

·      The % macro statement

·      The conditional statement             

·      Practical Exercise

Lecture 12: Basics of statistics

·      Introduction to the statistics

·      The statistical terms

·      Procedures in the sas for descriptive statistics

·      About hypothesis testing

·      About variable types

·      The hypothesis testing - process

·      The parametric and non - parametric tests

·      What are parametric tests?

·      What are non - parametric tests?

·      Parametric tests – the advantages and the disadvantages            

·      Practical Exercise

Lecture 13: Statistical procedures

·      The statistical procedures

·      What do proc means?

·      What is proc freq?

·      About proc univariate

·      About proc corr

·      About proc corr options

·      About proc reg

·      The proc reg options

·      The proc anova      

·      Practical Exercise  

Lecture 14: Data exploration

·      Data exploration: an overview

·      What is data preparation? 

·      The general comments and observations on data cleaning

·      Data type conversion

·      Character functions

·      What is scan function?

·      About date/time functions

·      Missing value treatment

·      Various functions to handle missing value

·      Data summarization          

·      Practical Exercise  

Lecture 15: Advanced statistics

·      An introduction to the advanced statistics

·      Introduction to the cluster

·      The clustering methodologies

·      What is k means clustering?

·      About the decision tree

·      The regression

·      The logistic regression       

·      Practical Exercise  

Lecture 16: Working with time series data

·      An introduction to the working with time series data

·      Need for time series analysis

·      About the time series analysis — options

·      Reading date and datetime values

·      What is the white noise process?

·      The stationarity of a time series

·      The plot transform transpose and interpolating time series data   

·      Practical Exercise  

Lecture 17: Designing optimization models

·      Introduction to the designing optimization models

·      About the need for the optimization

·      About optimization problems

·      What is proc optmodel?                

·      Practical Exercise

Lecture-1 Introduction to Deep Learning and Neural Networks

·      Field of machine learning, its impact on the field of artificial intelligence

·      The benefits of machine learning w.r.t. Traditional methodologies

·      Deep learning introduction and how it is different from all other machine learning methods

·      Classification and regression in supervised learning

·      Clustering and association in unsupervised learning, algorithms that are used in these categories

·      Introduction to ai and neural networks

·      Machine learning concepts

·      Supervised learning with neural networks

·      Fundamentals of statistics, hypothesis testing, probability distributions, and hidden markov models

·      Practical Exercise              

Lecture-2 Multi-layered Neural Networks

·      Multi-layer network introduction, regularization, deep neural networks

·      Multi-layer perceptron

·      Overfitting and capacity

·      Neural network hyperparameters, logic gates

·      Different activation functions used in neural networks, including relu, softmax, sigmoid and hyperbolic functions

·      Back propagation, forward propagation, convergence, hyperparameters, and overfitting

·      Practical Exercise              

Lecture-3 Artificial Neural Networks and Various Methods

·      Various methods that are used to train artificial neural networks

·      Perceptron learning rule, gradient descent rule, tuning the learning rate, regularization techniques, optimization techniques

·      Stochastic process, vanishing gradients, transfer learning, regression techniques,

·      Lasso l1 and ridge l2, unsupervised pre-training, xavier initialization.

·      Practical Exercise              

Lecture-4 Deep Learning Libraries

·      Understanding how deep learning works

·      Activation functions, illustrating perceptron, perceptron training

·      multi-layer perceptron, key parameters of perceptron;

·      Tensorflow introduction and its open-source software library that is used to design, create and train

·      Deep learning models followed by google’s tensor processing unit (tpu) programmable ai

·      Python libraries in tensorflow, code basics, variables, constants, placeholders

·      Graph visualization, use-case implementation, keras

·      Practical Exercise              

Lecture-5 Keras API

·      Keras high-level neural network for working on top of tensorflow

·      Defining complex multi-output models

·      Composing models using keras

·      Sequential and functional composition, batch normalization

·      Deploying keras with tensorboard, and neural network training process customization

·      Practical Exercise              

Lecture-6 TFLearn API for TensorFlow

·      Using tflearn api to implement neural networks

·      Defining and composing models, and deploying tensorboard

·      Practical Exercise              

Lecture-7 Dnns (deep neural networks)

·      Mapping the human mind with deep neural networks (dnns)

·      Several building blocks of artificial neural networks (anns)

·      The architecture of dnn and its building blocks

·      Reinforcement learning in dnn concepts, various parameters, layers, and optimization algorithms in dnn, and activation functions

·      Practical Exercise              

Lecture-8 Cnns (convolutional neural networks)

·      What is a convolutional neural network?

·      Understanding the architecture and use-cases of cnn

·      ‘What is a pooling layer?’ how to visualize using cnn

·      How to fine-tune a convolutional neural network

·      What is transfer learning?

·      Understanding recurrent neural networks, kernel filter, feature maps, and pooling, and deploying convolutional neural networks in tensorflow

·      Practical Exercise              

Lecture-9 Rnns (recurrent neural networks)

·      Introduction to the rnn model

·      Use cases of rnn, modeling sequences

·      Rnns with back propagation

·      Long short-term memory (lstm)

·      Recursive neural tensor network theory, the basic rnn cell, unfolded rnn,  dynamic rnn

·      Time-series predictions

·      Practical Exercise              

Lecture-10 Gpu in deep learning

·      Gpu’s introduction, ‘how are they different from cpus?,’ the significance of gpus

·      Deep learning networks, forward pass and backward pass training techniques

·      Gpu constituent with simpler core and concurrent hardware

·      Practical Exercise              

Lecture-11 Autoencoders and restricted boltzmann machine (rbm)

·      Introduction  rbm and autoencoders

·      Deploying rbm for deep neural networks, using rbm for collaborative filtering

·      Autoencoders features and applications of autoencoders.

·      Practical Exercise              

Lecture-12 Deep learning applications

·      Image processing

·      Natural language processing (nlp) – Speech recognition, and video analytics

·      Practical Exercise              

Lecture-13 Chatbots

·      Automated conversation bots leveraging

·      Ibm Watson

·      Microsoft’s luis

·      Open–closed domain bots

·      Generative model

·      The sequence to sequence model (lstm)

·      Practical Exercise              

Lecture-1 Introduction to NoSQL and MongoDB

·      RDBMS

·      Types of relational databases

·      Challenges of RDBMS

·      NoSQL database

·      Its significance

·      How NoSQL suits Big Data needs

·      Introduction to MongoDB and its advantages

·      MongoDB installation

·      JSON features

·      Data types

·      Practical Exercise              

Lecture-2 MongoDB Installation

·      Installing MongoDB

·      Basic MongoDB commands and operations

·      MongoChef (MongoGUI) installation and MongoDB data types

·      Install MongoDB and install MongoChef (MongoGUI)

·      Practical Exercise              

Lecture-3 Importance of NoSQL

·      The need for NoSQL,

·      Types of NoSQL databases

·      OLTP

·      OLAP

·      Limitations of RDBMS

·      ACID properties

·      CAP Theorem

·      Base property

·      Learning about JSON/BSON

·      Database collection and documentation

·      MongoDB uses

·      MongoDB write concern—acknowledged, replica acknowledged, unacknowledged, journaled—and Fsync

·      Write a JSON document

·      Practical Exercise              

Lecture-4 CRUD Operations

·      Understanding CRUD and its functionality

·      CRUD concepts

·      MongoDB query and syntax and read and write queries and query optimization

·      Use insert query to create a data entry

·      Use find query to read data

·      Use update and replace queries to update and use delete query operations on a DB file

·      Practical Exercise              

Lecture-5 Data Modeling and Schema Design

·      Concepts of data modelling

·      Difference between MongoDB and RDBMS modelling

·      Model tree structure

·      Operational strategies

·      Monitoring and backup

·      Use shard key and hashed shard keys

·      Perform backup and recovery of a dummy dataset

·      Import data from a CSV file and export data to a CSV file

·      Write a data model tree structure for a family hierarchy

·      Practical Exercise              

Lecture-6 Data Management and Administration

·      MongoDB® Administration activities

·      Health check

·      Backup

·      Recovery

·      Database sharding and profiling

·      Data import/export

·      Performance tuning

·      Use shard key and hashed shard keys

·      Perform backup and recovery of a dummy dataset

·      Import data from a CSV file and export data to a CSV file

·      Practical Exercise              

Lecture-7 Data Indexing and Aggregation

·      Concepts of data aggregation and types

·      Data indexing concepts, properties and variations

·      Do aggregation using pipeline, sort, skip and limit and create index on data using single key and using multi-key      

·      Practical Exercise  

Lecture-8 MongoDB security

·      Understanding database security risks

·      MongoDB security concept

·      Security approach

·      MongoDB integration with Java and Robomongo

·      Practical Exercise              

Lecture-9 Working with Unstructured Data

·      Implementing techniques to work with variety of unstructured data

·      Understanding GridFS MongoDB file system for storing data

·      Practical Exercise              

Lecture-1 Introduction to SQL

·      Various types of databases

·      Introduction to Structured Query Language

·      Distinction between client server and file server databases

·      Understanding SQL Server Management Studio

·      SQL Table basics

·      Data types and functions

·      Transaction-SQL

·      Authentication for Windows

·      Data control language

·      The identification of the keywords in T-SQL, such as Drop Table

·      Practical Exercise

Lecture-2 Database Normalization and Entity Relationship Model

·      Data Anomalies

·      Update Anomalies

·      Insertion Anomalies

·      Deletion Anomalies

·      Types of Dependencies

·      Functional Dependency

·      Fully functional dependency

·      Partial functional dependency

·      Transitive functional dependency

·      Multi-valued functional dependency

·      Decomposition of tables

·      Lossy decomposition

·      Lossless decomposition

·      What is Normalization?

·      First Normal Form

·      Second Normal Form

·      Third Normal Form

·      Boyce-Codd Normal Form(BCNF)

·      Fourth Normal Form

·      Entity-Relationship Model

·      Entity and Entity Set

·      Attributes and types of Attributes

·      Entity Sets

·      Relationship Sets

·      Degree of Relationship

·      Mapping Cardinalities, One-to-One, One-to-Many, Many-to-one, Many-to-many

·      Symbols used in E-R Notation

·      Practical Exercise

Lecture-3 SQL Operators

·      Introduction to relational databases

·      Fundamental concepts of relational rows, tables, and columns

·      Several operators (such as logical and relational), constraints, domains, indexes, stored procedures, primary and foreign keys

·      Understanding group functions

·      The unique key

·      Practical Exercise

Lecture-4 Working with SQL: Join, Tables, and Variables

·      Advanced concepts of SQL tables

·      SQL functions

·      Operators & queries

·      Table creation

·      Data retrieval from tables

·      Combining rows from tables using inner, outer, cross, and self joins

·      Deploying operators such as ‘intersect,’ ‘except,’ ‘union,’

·      Temporary table creation

·      Set operator rules

·      Table variables

·      Practical Exercise

Lecture-5 Deep Dive into SQL Functions

·      Understanding SQL functions – what do they do?

·      Scalar functions

·      Aggregate functions

·      Functions that can be used on different datasets, such as numbers, characters, strings, and dates

·      Inline SQL functions

·      General functions

·      Duplicate functions

·      Practical Exercise

Lecture-6 Working with Subqueries

·      Understanding SQL subqueries, their rules

·      Statements and operators with which subqueries can be used

·      Using the set clause to modify subqueries

·      Understanding different types of subqueries, such as where, select, insert, update, delete, etc.

·      Methods to create and view subqueries

·      Practical Exercise

Lecture-7 SQL Views, Functions, and Stored Procedures

·      Learning SQL views

·      Methods of creating, using, altering, renaming, dropping, and modifying views

·      Understanding stored procedures and their key benefits

·      Working with stored procedures

·      Studying user-defined functions

·      Error handling

·      Practical Exercise

Lecture-8 Deep Dive into User-defined Functions

·      User-defined functions

·      Types of UDFs, such as scalar

·      Inline table value

·      Multi-statement table

·      Stored procedures and when to deploy them

·      What is rank function?

·      Triggers, and when to execute triggers?

·      Practical Exercise

Lecture-9 SQL Optimization and Performance

·      SQL Server Management Studio

·      Using pivot in MS Excel and MS SQL Server

·      Differentiating between Char, Varchar, and NVarchar

·      XL path, indexes and their creation

·      Records grouping, advantages, searching, sorting, modifying data

·      Clustered indexes creation

·      Use of indexes to cover queries

·      Common table expressions

·      Index guidelines

·      Practical Exercise

Lecture-10 Managing Data with Transact-SQL

·      Creating Transact-SQL queries

·      Querying multiple tables using joins

·      Implementing functions and aggregating data

·      Modifying data

·      Determining the results of DDL statements on supplied tables and data

·      Constructing DML statements using the output statement

·      Practical Exercise

Lecture-11 Querying Data with Advanced Transact-SQL Components

·      Querying data using subqueries and APPLY

·      Querying data using table expressions

·      Grouping and pivoting data using queries

·      Querying temporal data and non-relational data

·      Constructing recursive table expressions to meet business requirements

·      Using windowing functions to group

·      Rank the results of a query

·      Practical Exercise

Lecture-12 Programming Databases Using Transact-SQL

·      Creating database programmability objects by using T-SQL

·      Implementing error handling and transactions

·      Implementing transaction control in conjunction with error handling in stored procedures

·      Implementing data types and NULL

·      Practical Exercise

Lecture-13 Designing and Implementing Database Objects

·      Designing and implementing relational database schema

·      Designing and implementing indexes

·      Learning to compare between indexed and included columns

·      Implementing clustered index

·      Designing and deploying views

·      Column store views

·      Practical Exercise

Lecture-14 Implementing Programmability Objects

·      Explaining foreign key constraints

·      Using T-SQL statements

·      Usage of Data Manipulation Language (DML)

·      Designing the components of stored procedures

·      Implementing input and output parameters

·      Applying error handling

·      Executing control logic in stored procedures

·      Designing trigger logic, DDL triggers

·      Practical Exercise

Lecture-15 Managing Database Concurrency

·      Applying transactions

·      Using the transaction behavior to identify DML statements

·      Learning about implicit and explicit transactions

·      Isolation levels management

·      Understanding concurrency and locking behavior

·      Using memory-optimized tables

·      Practical Exercise

Lecture-16 Optimizing Database Objects

·      Accuracy of statistics

·      Formulating statistics maintenance tasks

·      Dynamic management objects management

·      Identifying missing indexes

·      Examining and troubleshooting query plans

·      Consolidating the overlapping indexes

·      The performance management of database instances

·      SQL server performance monitoring

·      Practical Exercise

Lecture-17 Advanced Topics

·      Correlated Subquery, Grouping Sets, Rollup, Cube

·      Implementing Correlated Subqueries

·      Using EXISTS with a Correlated subquery

·      Using Union Query

·      Using Grouping Set Query

·      Using Rollup

·      Using CUBE to generate four grouping sets

·      Perform a partial CUBE

·      Practical Exercise

Lecture-18 Microsoft Courses: Study Material

·      Performance Tuning and Optimizing SQL Databases

·      Querying Data with Transact-SQL

·      Practical Exercise

Lecture-1 Hadoop Installation and Setup

·      The architecture of Hadoop cluster

·      What is High Availability and Federation?

·      How to setup a production cluster?

·      Various shell commands in Hadoop

·      Understanding configuration files in Hadoop

·      Installing a single node cluster with Cloudera Manager

·      Understanding Spark, Scala, Sqoop, Pig, and Flume

·      Practical Exercise

Lecture-2 Introduction to Big Data Hadoop and Understanding HDFS and MapReduce

·      Introducing Big Data and Hadoop

·      What is Big Data and where does Hadoop fit in?

·      Two important Hadoop ecosystem components, namely, MapReduce and HDFS

·      In-depth Hadoop Distributed File System – Replications, Block Size, Secondary Name node, High Availability and in-depth YARN – resource manager and node manager

·      HDFS working mechanism

·      Data replication process

·      How to determine the size of the block?

·      Understanding a data node and name node

·      Practical Exercise

Lecture-3 Deep Dive in MapReduce

·      Learning the working mechanism of MapReduce

·      Understanding the mapping and reducing stages in MR

·      Various terminologies in MR like Input Format, Output Format, Partitioners, Combiners, Shuffle, and Sort

·      How to write a WordCount program in MapReduce?

·      How to write a Custom Partitioner?

·      What is a MapReduce Combiner?

·      How to run a job in a local job runner

·      Deploying a unit test

·      What is a map side join and reduce side join?

·      What is a tool runner?

·      How to use counters, dataset joining with map side, and reduce side joins?

·      Practical Exercise

Lecture-4 Introduction to Hive

·      Introducing Hadoop Hive

·      Detailed architecture of Hive

·      Comparing Hive with Pig and RDBMS

·      Working with Hive Query Language

·      Creation of a database, table, group by and other clauses

·      Various types of Hive tables, HCatalog

·      Storing the Hive Results, Hive partitioning, and Buckets

·      Database creation in Hive

·      Dropping a database

·      Hive table creation

·      How to change the database?

·      Data loading

·      Dropping and altering table

·      Pulling data by writing Hive queries with filter conditions

·      Table partitioning in Hive

·      What is a group by clause?

·      Practical Exercise

Lecture-5 Advanced Hive and Impala

·      Indexing in Hive

·      The ap Side Join in Hive

·      Working with complex data types

·      The Hive user-defined functions

·      Introduction to Impala

·      Comparing Hive with Impala

·      The detailed architecture of Impala

·      How to work with Hive queries?

·      The process of joining the table and writing indexes

·      External table and sequence table deployment

·      Data storage in a different table

·      Practical Exercise

Lecture-6 Introduction to Pig

·      Apache Pig introduction and its various features

·      Various data types and schema in Hive

·      The available functions in Pig, Hive Bags, Tuples, and Fields

·      Working with Pig in MapReduce and local mode

·      Loading of data

·      Limiting data to 4 rows

·      Storing the data into files and working with Group By, Filter By, Distinct, Cross, Split in Hive

·      Practical Exercise

Lecture-7 Flume, Sqoop and HBase

·      Apache Sqoop introduction

·      Importing and exporting data

·      Performance improvement with Sqoop

·      Sqoop limitations

·      Introduction to Flume and understanding the architecture of Flume

·      What is HBase and the CAP theorem?

·      Working with Flume to generate Sequence Number and consume it

·      Using the Flume Agent to consume the Twitter data

·      Using AVRO to create Hive Table

·      AVRO with Pig

·      Creating Table in HBase

·      Deploying Disable, Scan, and Enable Table

·      Practical Exercise

Lecture-8 Writing Spark Applications Using Scala

·      Using Scala for writing Apache Spark applications

·      Detailed study of Scala

·      The need for Scala

·      The concept of object-oriented programming

·      Executing the Scala code

·      Various classes in Scala like getters, setters, constructors, abstract, extending objects, overriding methods

·      The Java and Scala interoperability

·      The concept of functional programming and anonymous functions

·      Bobsrockets package and comparing the mutable and immutable collections

·      Scala REPL, Lazy Values, Control Structures in Scala, Directed Acyclic Graph (DAG), first Spark application using SBT/Eclipse, Spark Web UI, Spark in Hadoop ecosystem

·      Practical Exercise

Lecture-9 Spark framework

·      Detailed Apache Spark and its various features

·      Comparing with Hadoop

·      Various Spark components

·      Combining HDFS with Spark and Scalding

·      Introduction to Scala

·      Importance of Scala and RDD

·      Practical Exercise

Lecture-10 RDD in Spark

·      Understanding the Spark RDD operations

·      Comparison of Spark with MapReduce

·      What is a Spark transformation?

·      Loading data in Spark

·      Types of RDD operations viz. transformation and action

·      What is a Key/Value pair?

·      Practical Exercise

Lecture-11 Data Frames and Spark SQL

·      The detailed Spark SQL

·      The significance of SQL in Spark for working with structured data processing

·      Spark SQL JSON support

·      Working with XML data and parquet files

·      Creating Hive Context

·      Writing Data Frame to Hive

·      How to read a JDBC file?

·      Significance of a Spark data frame

·      How to create a data frame?

·      What is schema manual inferring?

·      Work with CSV files, JDBC table reading, data conversion from Data Frame to JDBC, Spark SQL user-defined functions, shared variable, and accumulators

·      How to query and transform data in Data Frames?

·      How data frame provides the benefits of both Spark RDD and Spark SQL?

·      Deploying Hive on Spark as the execution engine

·      Practical Exercise

Lecture-12 Machine Learning Using Spark (MLlib)

·      Introduction to Spark MLlib

·      Understanding various algorithms

·      What is Spark iterative algorithm?

·      Spark graph processing analysis

·      Introducing Machine Learning

·      K-Means clustering

·      Spark variables like shared and broadcast variables

·      What are accumulators?

·      Various ML algorithms supported by MLlib

·      Linear regression, logistic regression, decision tree, random forest, and K-means clustering techniques

·      Practical Exercise

Lecture-13 Integrating Apache Flume and Apache Kafka

·      Why Kafka?

·      What is Kafka?

·      Kafka architecture

·      Kafka workflow

·      Configuring Kafka cluster

·      Basic operations

·      Kafka monitoring tools

·      Integrating Apache Flume and Apache Kafka

·      Practical Exercise

Lecture-14 Spark Streaming

·      Introduction to Spark streaming

·      The architecture of Spark streaming

·      Working with the Spark streaming program

·      Processing data using Spark streaming

·      Requesting count and DStream

·      Multi-batch and sliding window operations

·      Working with advanced data sources

·      Features of Spark streaming

·      Spark Streaming workflow

·      Initializing StreamingContext

·      Discretized Streams (DStreams)

·      Input DStreams and Receivers

·      Transformations on DStreams

·      Output Operations on DStreams

·      Windowed operators and its uses

·      Important Windowed operators and Stateful operators

·      Practical Exercise

Lecture-15 Hadoop Administration – Multi-node Cluster Setup Using Amazon EC2

·      Create a 4-node Hadoop cluster setup

·      Running the MapReduce Jobs on the Hadoop cluster

·      Successfully running the MapReduce code

·      Working with the Cloudera Manager setup

·      Practical Exercise

Lecture-16 Hadoop Administration – Cluster Configuration

·      Overview of Hadoop configuration

·      The importance of Hadoop configuration file

·      The various parameters and values of configuration

·      The HDFS parameters and MapReduce parameters

·      Setting up the Hadoop environment

·      The Include and Exclude configuration files

·      The administration and maintenance of name node, data node directory structures, and files

·      What is a File system image?

·      Understanding Edit log

·      Practical Exercise

Lecture-17 Hadoop Administration

·      How to go about ensuring the MapReduce File System Recovery for different scenarios

·      JMX monitoring of the Hadoop cluster

·      How to use the logs and stack traces for monitoring and troubleshooting

·      Using the Job Scheduler for scheduling jobs in the same cluster

·      Getting the MapReduce job submission flow

·      FIFO schedule

·      Getting to know the Fair Scheduler and its configuration

·      Practical Exercise

Lecture-18 ETL Connectivity with Hadoop Ecosystem (Self-Paced)

·      How ETL tools work in Big Data industry?

·      Introduction to ETL and data warehousing

·      Working with prominent use cases of Big Data in ETL industry

·      End-to-end ETL PoC showing Big Data integration with ETL tool

·      Connecting to HDFS from ETL tool

·      Moving data from Local system to HDFS

·      Moving data from DBMS to HDFS,

·      Working with Hive with ETL Tool

·      Creating MapReduce job in ETL tool

·      Practical Exercise

Lecture-1 Data Visualization and Power of Tableau

·      What is data visualization?

·      Comparison and benefits against reading raw numbers

·      Real use cases from various business domains

·      Some quick and powerful examples using Tableau without going into the technical details of Tableau

·      Installing Tableau

·      Tableau interface

·      Connecting to DataSource

·      Tableau data types

·      Data preparation

·      Practical Exercise              

Lecture-2 Tableau Architecture

·      Installation of Tableau Desktop

·      Architecture of Tableau

·      Tableau Layout

·      Tableau Toolbars

·      Tableau Data Pane

·      Tableau Analytics Pane

·      How to start with Tableau

·      The ways to share and export the work done in Tableau

·      Practical Exercise              

Lecture-3 Tableau Metadata and Data Blending

·      Connection to Excel

·      Cubes and PDFs

·      Management of metadata and extracts

·      Data preparation

·      Joins and Union

·      Dealing with NULL values

·      Cross-database joining

·      Data extraction

·      Data blending

·      Refresh extraction

·      Incremental extraction

·      How to build extract

·      Practical Exercise              

Lecture-4 Creation of Sets and Using Filters

·      Mark

·      Highlight

·      Sort

·      Group, and use sets

·      Creating and editing sets

·      IN/OUT

·      Sets in hierarchies

·      Constant sets

·      Computed sets

·      Bins

·      Filters

·      Filtering continuous dates

·      Dimensions, and measures

·      Interactive filters

·      Marks card

·      Hierarchies

·      How to create folders in Tableau

·      Sorting in Tableau

·      Types of sorting

·      Filtering in Tableau

·      Types of filters

·      Filtering the order of operations

·      Practical Exercise              

Lecture-5 Organizing Data and Visual Analytics

·      Using Formatting Pane to work with menu, fonts, alignments, settings, and copy-paste

·      Formatting data using labels and tooltips

·      Edit axes and annotations

·      K-means cluster analysis

·      Trend and reference lines

·      Visual analytics in Tableau

·      Forecasting

·      Confidence interval

·      Reference lines

·      Bands

·      Practical Exercise              

Lecture-6 Working with Mapping, Calculations, Expressions and Parameters

·      Working on coordinate points

·      Plotting longitude and latitude

·      Editing unrecognized locations

·      Customizing geocoding, polygon maps

·      WMS: web mapping services

·      Working on the background image, including add image

·      Plotting points on images and generating coordinates from them

·      Map visualization

·      Custom territories

·      Map box

·      WMS map

·      How to create map projects in Tableau

·      Creating dual axes maps and editing locations

·      Calculation syntax and functions in Tableau

·      Various types of calculations, including Table, String, Date, Aggregate, Logic, and Number

·      LOD expressions, including concept and syntax

·      Aggregation and replication with LOD expressions

·      Nested LOD expressions

·      Fixed level

·      Lower level

·      Higher level

·      Quick table calculations

·      The creation of calculated fields

·      Predefined calculations

·      How to validate

·      Creating parameters

·      Parameters in calculations

·      Using parameters with filters

·      Column selection parameters

·      Chart selection parameters

·      How to use parameters in the filter session

·      How to use parameters in calculated fields

·      How to use parameters in the reference line

·      Practical Exercise              

Lecture-7 Introduction of Charts, Graphs, Dashboards and Stories

·      Dual axes graphs

·      Histograms

·      Single and dual axes

·      Box plot

·      Motion Charts

·      Pareto Charts

·      Funnel Charts

·      Pie Charts

·      Bar Charts

·      Line Charts

·      Bubble Charts

·      Bullet Charts

·      Scatter Charts

·      Waterfall charts

·      Tree Maps

·      Heat Maps

·      Market basket analysis (MBA)

·      Using Show me

·      Text table and highlighted table

·      Building and formatting a dashboard using size, objects, views, filters, and legends

·      Best practices for making creative as well as interactive dashboards using the actions

·      Creating stories, including the intro of story points

·      Creating as well as updating the story points

·      Adding catchy visuals in stories

·      Adding annotations with descriptions; dashboards and stories

·      What is dashboard?

·      Highlight actions, URL actions, and filter actions

·      Selecting and clearing values

·      Best practices to create dashboards

·      Dashboard examples; using Tableau workspace and Tableau interface

·      Learning about Tableau joins

·      Types of joins

·      Tableau field types

·      Saving as well as publishing data source

·      Live vs extract connection

·      Various file types

·      Practical Exercise              

Lecture-8 Tableau Prep

·      Introduction to Tableau Prep

·      How Tableau Prep helps quickly combine join, shape, and clean data for analysis

·      Creation of smart examples with Tableau Prep

·      Getting deeper insights into the data with great visual experience

·      Making data preparation simpler and accessible

·      Integrating Tableau Prep with Tableau analytical workflow

·      Understanding the seamless process from data preparation to analysis with Tableau Prep

·      Practical Exercise              

Fees

Offline Training @ Vadodara

  • Classroom Based Training
  • Practical Based Training
  • No Cost EMI Option
235000 200000

Online Training preferred

  • Live Virtual Classroom Training
  • 1:1 Doubt Resolution Sessions
  • Recorded Live Lectures*
  • Flexible Schedule
200000 170000

Corporate Training

  • Customized Learning
  • Onsite Based Corporate Training
  • Online Corporate Training
  • Certified Corporate Training

Certification

  • Upon the completion of the Classroom training, you will have an Offline exam that will help you prepare for the Professional certification exam and score top marks. The BIT Certification is awarded upon successfully completing an offline exam after reviewed by experts
  • Upon the completion of the training, you will have an online exam that will help you prepare for the Professional certification exam and score top marks. BIT Certification is awarded upon successfully completing an online exam after reviewed by experts.
  • This course is designed to clear SAS, Tableau, Microsoft, Cloudera Certifications: Spark component of Cloudera Spark and Hadoop Developer Certification (CCA175), Tableau Desktop Qualified Associate Exam. SAS Certified Base Programmer Exam, C100DEV: MongoDB Certified Developer Associate Exam, Microsoft 70-761 SQL Server Certification Exam, Microsoft 70-762 SQL Server Certification Exam