This covid era has definitely changed the way people buy stuff. Top players in the e commerce industry like Amazon, Ebay, Flipkart, etc are the major benefactors.

Knowing what a customer wants is the key to any business and knowing if a customer is going to buy a product is a game changer.

And this is what we are going to see in this article.

Data Sampling


Knowing the customer churn rate is a key indicator for any business. According to a study by Bain & Company, improving the customer retention rate for existing customers by just 5 percent can improve a company’s profitability by 25 to 95 percent.

In this article, we are going to look at the following:

  • Initial Exploratory Data Analysis
  • Predicting the churn rate for a customer and classify them by learning about different classification algorithms.
  • Comparing and evaluating different algorithms based on its performance.
  • And once we have our best model, we would perform optimization.

Happy learning!

In this dataset, we have…


© Twitter

Data costs money right? Large corporate companies buy data from dedicated agencies. It can be in the form of surveys or customer feedback questionnaire. But what if you want the very same data ‘for free’ . Yes, you heard that right.

Social media Data Analysis is a very popular methodology that many organizations and independent personalities/ celebrities use for various decision making.

In this article, we will be seeing on how to setup twitter analysis and carry out a short analysis using some NLP techniques on Pepsi.

Initial Setup

To get started, you will need to apply for a twitter developer account…


© kdnuggets.com

For the next few minutes, imagine that you own a shop and you need to improve the sales. So as the manager what are you likely to do?

  • Give discounts on products
  • Encourage buyers to new offers
  • Increase customer service experience
  • Maybe some marketing

Every point is right, but what’s most important is the placement and grouping of products, be it on shelves or offers.

Remember seeing chocolates, mint, water bottles, crisps, mini snacks and other similar products while waiting in the queue to pay? Yes, those are items that you want (but don’t necessarily need) at the last minute…


For my 2nd article, I’ll be showing you on how to build a Multiple linear regression model to predict the price of cars and later comparing it with the accuracy of Random Forest along with some visuals.

Let’s get started!

Before we go on to do the coding part, we need to first understand the two questions.

What is Linear Regression?

LR is basically a model that explains the relationship between a response variable and one or more explanatory variable aka features. For example, demand and supply, higher the demand higher should the supply be.

The Equation of a LR…


Here is where I start my journey as a Data Science blogger. The purpose of getting into this is to support young Data Analysts and Data Science enthusiasts like me. Hope you enjoy!

Amazon Top 50 books (2009–2019)

Over the years e- commerce has taken over the world and big players such as Amazon, Flipkart, ebay, etc. have gained enormous amount of consumers.

I have taken this dataset from kaggle to explore and provide various insights using CRISP- DM approach.

Some of the questions that I’ll be addressing:

  1. Are there any correlations between the variables?
  2. Popularity of genres by year
  3. Top Authors of Best Selling…

Ashwath Paul

A professional data storyteller with a perfect blend of Statistics and computational Science.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store