Handling missing data in pandas data frame python

In this post we are going to discuss how to handle missing data from a pandas data frame. Find total number of missing data in the data frame Find number of missing data in each column in a data frame Investigate patterns in the amount of missing data in each column. Percentage of missing data…

Read More

Print ROC AUC Receiver Operating Characteristic Area Under Curve

The receiver operating characteristic area under curve is a way to measure the performance of a classification model, may be created using algorithms like Logistic Regression. ROC-AUC is basically a graph where we plot true positive rate on y-axis and false positive rate on x-axis. If a model is good the AUC will be close to 1. Area…

Read More

What is Confusion Matrix in Machine Learning

Not only human beings but also the machine learning models may get confused !! After all Artificial Intelligence mimics a human brain, isn’t it?(pun intended). Imagine yourself as a machine learning engineer and suppose you trained a machine learning classification model successfully today. After the model is trained you checked the accuracy which is 93.0%. Wow…

Read More