Become a Data Scientist

The Sexiest Job of the 21st Century

A data science program for everyone – Get Started with Data Science Foundations

FLAT 20% ON THE COURSE FREE

KNOW HOW

It pays to have friends!

Ask a colleague to join the course with you and GET 20% OFF - BOTH OF YOU.
Know More

45 Days

Course Duration

Introductory

For beginners

Course Delivery

Classroom or Online

Course Structure

Session 1: Basic statistical concepts : This session introduces you with how statistics is used in business with basic statistical concepts like levels of data and measures of central tendencies.

  • Measures of central tendencies
  • Measures of variability
  • Measures of shape
  • Introduction to probability

Session 2: Inferential Statistics : You will start making statistical inferences about populations from samples

  • Sampling
  • Estimating the population Mean Using Z-statistic and T- statistic
  • Hypothesis testing
  • Confidence Intervals

Session 1: Basic concepts of R programming

This session covers practical issues in statistical computing which includes programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, organizing, and commenting R code.

Session 2: Continuation of R concepts and execute statistical concepts using R

  • Pre-processing techniques: Binning, Filling missing values, Standardization & Normalization, type conversions, train-test data split, ROCR1
  • Other R concept
  •  Exploratory Data Analysis

Session 3: Introduction to ML Algorithms : Preparing data as an input for machine learning algorithms

  • Case Study
  •  Assignment on R understandings

Session 4: Execute all machine algorithms in R

Session 1: Linear Regression

  • Simple Linear Regression
  • Coefficient of determination
  • Significance Tests
  • Residual Analysis
  • Confidence & Prediction intervals
  • Multiple linear regression
  • Coefficient of Determination
  • Interpretation of regression coefficients
  • Categorical variables in regression
  • Heteroscedasticity, Multi-co linearity outliers
  • R-square and goodness of fit
  • Hypothesis testing of Regression Model
  • Transformation of variables
  • Polynomial Regression

Followed by a Case Study

Session 2: Logistic Regression

Logistic regression is a method for classifying data into discrete outcomes. For example, we might use logistic regression to classify an email as spam or not spam. In this module, we introduce the notion of classification, the cost function for logistic regression, and the application of logistic regression to multi-class classification.

  • Logistic function
  • Estimation of probability using logistic regression
  • Model Evaluation
  • Confusion Matrix

Followed by a Case Study

Session 3: Time series data : The focus is on analyzing and understanding Time Series with financial markets as the case study

  • Trend Analysis
  • Cyclical & Seasonal Analysis
  • Smoothing; Moving Averages
  • Auto-Correlation
  • ARIMA; ARIMAX
  • Applications of Time Series in Financial Markets

Followed by a Case Study

  • Session 1: Clustering
  • What is Clustering   
  • Clustering examples in Business Verticals
  • Solution Strategies for Clustering
  • Finding pattern and Fixed Pattern Approach
  • Limitations of Fixed Pattern Approach
  • Machine Learning Approaches for Clustering
  • Iterative based K-Means & K-Medoid Approaches
  • Hierarchical Agglomerative Approaches
  • Density based DB-SCAN Approach       
  • Evaluation Metrics for Clustering
  • Cohesion, Coupling Metric       
  • Correlation Metric

Followed by a Case Study

Session 2: Decision Tree (In Python & R)

  • Introduction
  • Building a Decision Tree
  • Entropy, Information Gain
  • Regression using Decision Tress
  • Bias- Variance trade off
  • Limitations

Followed by a Case Study

Session 3: Support Vector machines (In Python)

  • Loss function based  interpretation
  • Linear svm
  • Non linear svm and kernel function

Followed by a Case Study

Session 4: KNN (In Python & R)

  • KNN learning
  • Limitation
  • KNN regression
  • Applying KNN and parameter  tuning

Session 5: Neural Networks (In Python)

  • Introduction
  • Perceptrons
  • Self organizing maps
  • Auto encoders
  • Back propagation and typical feed forward algorithm
  • Vanishing gradient problem

Followed by a Case Study

Session 6: Association Rules (In R)

  • Apriori Model
  • Pros and cons of the model
  • Recommemder Systems
    • User-user
    • Item-item
    • Content based

Followed by a Case Study

Session 7:Feature Engineering (In Python & R)

  • Dimensionality Reduction
  • PCA and EDA
  • Eigen values and eigen vectors

Know your Trainer

Nitin Singh is an analytics professional having around 7 years of experience in data science and machine learning. He has worked with companies like Amazon and Deloitte in their core analytics wing. Currently he is working with Prime hospitals (a US based hospital chain) in building advance healthcare solutions using machine learning and A.I. Academically, he has completed his Bachelors in Engineering from Osmania university in 2011 and was part of the founding batch of business analytics program from the Indian school of business in 2013-14. He is currently pursuing a 4-month advance course in deep learning and AI from fellowship.ai under the guidance of top data scientists across the globe.Lets connect

Have a Question?

Today the most precious thing on the world is no more oil but data and here at TechnoExcel we try to leverage it and use it at its full potential.

Connect with Us

Contact

TechnoExcel

connect@technoexcel.in

8801912200

Above Allahabad Bank,

Lal Bahadur Colony,

Trimulgherry, Hyderabad

Open chat
1
Let's Chat
Hi,
Let's Learn Excel-lent way.