Open in app
Home
Notifications
Lists
Stories

Write
Sanjay Nandakumar
Sanjay Nandakumar

Home

Published in Towards Data Science

·Mar 29

Faker library in python - An intriguing expedient for data scientists

Installation, usage, and application of the “Faker” library in python that can generate prodigious dummy datasets in a short span of time for data science experiments — “A neat and orderly laboratory is unlikely. It is, after all, so much a place of false starts and multiple attempts” Isaac Asimov Table of contents Introduction Installation of Faker library Generating basic data points (Name, Address, Job, Dates, etc.)

Python

9 min read

Faker library in python - An intriguing expedient for data scientists
Faker library in python - An intriguing expedient for data scientists

Published in Towards Data Science

·Aug 11, 2020

The enigma of Adjusted R Squared in regression analysis

The real instigation behind the trustworthiness of Adjusted R Squared over R squared among data scientists — “Truth does not consist in minute accuracy of detail; but in conveying a right impression.” Henry Alford Introduction Regression analysis is one of the most fundamental but commanding machine learning techniques which is still predominant and made a way for many advanced kinds of research in the industry. Although there are…

R Squared

10 min read

The enigma of Adjusted R Squared
The enigma of Adjusted R Squared

May 31, 2020

Time series analysis — Complete tutorial for beginners (Part 4)

The intuition behind Exponential smoothing, Holt linear, Holt winters, Various versions of ARIMA, and Performance evaluation metrics — “The future is something which everyone reaches at the rate of sixty minutes an hour, whatever he does, whoever he is” C.S. Lewis Hey learners, Welcome to the part -4 article of the time series analysis !!! Hope you had a great reading through my previous articles. In the previous…

Vector Auto Regression

9 min read

Time series analysis — Complete tutorial for beginners (Part 4)
Time series analysis — Complete tutorial for beginners (Part 4)

May 31, 2020

Time series analysis — Complete tutorial for beginners (Part 3)

The intuition behind the ARIMA algorithm for Time series forecasting — “If time travel is possible, where are the tourists from the future?” Stephen Hawking Welcome to Part-3 of my article!!! Hope you have gone through the previous 2 parts of this article so that you didn’t miss any knowledge. If you have missed those, here is the URL – Time…

Arima

5 min read

Time series analysis — Complete tutorial for beginners (Part 3)
Time series analysis — Complete tutorial for beginners (Part 3)

May 31, 2020

Time series analysis — Complete tutorial for beginners(Part 2)

The concept of Autoregression and Moving average — In my previous article, I tried to give a brief introduction to the characteristics of time series analysis with some real-world examples. As a learning phase of continuation, we will discuss the concepts behind Autoregression and Moving average. Please have a look through part 1 of this article in case…

Autoregressive

7 min read

Time series analysis — Complete tutorial for beginners(Part 2)
Time series analysis — Complete tutorial for beginners(Part 2)

May 31, 2020

Time series analysis — Complete tutorial for beginners (Part 1)

An introduction to the Stationary process and Time-series characteristics — “Scars have the strange power to remind us that our past is real ” Cormac McCarthy Introduction Time series data is a collection of observations obtained through repeated measurements over time. As the world is becoming more technology-oriented, the collection of data dependent on time is becoming very easy. The…

Time Series Analysis

8 min read

Time series analysis — Complete tutorial for beginners (Part 1)
Time series analysis — Complete tutorial for beginners (Part 1)

Published in Dev Genius

·May 25, 2020

Why does lift have a bigger role than confidence in Association rules?

An important data science interview question you will be facing (probably) — Life does not proceed by the association and addition of elements, but by dissociation and division. Henri Bergson Hey learners, If you are a person who worked on Association rules, you would have probably gone through the same question in your mind — “Why does lift have a bigger role…

Association Rule

5 min read

Why lift has bigger role than confidence in Association rules?
Why lift has bigger role than confidence in Association rules?

Apr 30, 2020

Types of variables in machine learning

A real-life demonstration of various types of machine learning variables — “In God we trust. All others must bring data” — W. Edwards Deming Introduction The understanding of types of variables is very important in the machine learning process to conduct and customize the data processing procedures efficiently. I have often seen some amount of confusion in understanding the grass-root meaning of…

Confounding Variable

7 min read

Types of variables in machine learning
Types of variables in machine learning

Published in Dev Genius

·Apr 29, 2020

The intuition behind probability distributions for machine learning beginners

Real-life examples of various probability distribution techniques — “The 50–50–90 rule: anytime you have a 50–50 chance of getting something right, there’s a 90% probability you’ll get it wrong.” ― Andy Rooney Introduction All of the parametric tests expect data to be in a certain type of data distribution to apply the statistical tests. However, the shape of the…

Probability

10 min read

Intuitions behind probability distributions
Intuitions behind probability distributions

Published in Dev Genius

·Apr 28, 2020

Skewness and Kurtosis in data science

How close is your data to the normal distribution? — “Reality is partial to symmetry and slight anachronisms” Jorge Luis Borges Introduction Data skewness is one of the important challenges that data scientists often face in real-time case studies. Apart from certain business scenarios, most real-time experiments need data in any predefined data distribution and that is very rare without undergoing…

Skewness

7 min read

Skewness and Kurtosis in data science
Skewness and Kurtosis in data science
Sanjay Nandakumar

Sanjay Nandakumar

Data scientist | ML Engineer | Statistician | https://www.quora.com/profile/Sanjay-Kumar-563 | https://www.linkedin.com/in/sanjay-nandakumar-8278229b/

Following
  • Sofien Kaabar, CFA

    Sofien Kaabar, CFA

  • TDS Editors

    TDS Editors

  • Michał Oleszak

    Michał Oleszak

  • Nathan Rosidi

    Nathan Rosidi

  • Ng Wai Foong

    Ng Wai Foong

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Knowable