As a machine learning engineer, you may have built a Random Forest model, but have you ever tried to visualize it? If not, this is the post for you. In this post we will learn how to visualize or print a Random Forest model in a Jupyter notebook. Let us import the required library: …
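As a rough illustration of printing a Random Forest, the sketch below fits a small forest and prints one of its trees as text with scikit-learn's `export_text`. The iris dataset and the parameter values are assumptions chosen for brevity, not the code from the original post.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import export_text

# Assumption: the iris dataset stands in for whatever data the post uses.
iris = load_iris()

# A tiny, shallow forest keeps the printed output short.
forest = RandomForestClassifier(n_estimators=5, max_depth=2, random_state=0)
forest.fit(iris.data, iris.target)

# A fitted forest exposes its individual decision trees via estimators_.
first_tree = forest.estimators_[0]

# export_text renders one tree as indented if/else style rules.
tree_text = export_text(first_tree, feature_names=list(iris.feature_names))
print(tree_text)
```

For a graphical rendering in a notebook, `sklearn.tree.plot_tree` can be applied to the same `first_tree` object, provided matplotlib is installed.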

# Sklearn

# Random Forest Classification in Python in 10 Lines

Random Forest is an ensemble algorithm that builds its model from more than one decision tree. Each tree is a set of tree-like conditional control statements, and because the model combines many such trees it is named Random Forest. The Random Forest machine learning algorithm can be used to solve both …
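A minimal sketch of Random Forest classification with scikit-learn might look like the following. The dataset (iris), the split ratio, and the number of trees are assumptions for illustration only.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Assumption: iris is used here only as a convenient built-in dataset.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

# n_estimators is the number of decision trees in the ensemble.
model = RandomForestClassifier(n_estimators=100, random_state=42)
model.fit(X_train, y_train)

# Each prediction is a vote across all trees in the forest.
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"Test accuracy: {accuracy:.3f}")
```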

# Decision Tree Classification in Python in 10 lines

The decision tree machine learning algorithm can be used to solve not only regression but also classification problems. The algorithm builds its model as a tree of conditional control statements, hence the name decision tree. In this post we will implement a simple decision tree classification model using Python and sklearn. First …
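A short sketch of decision tree classification, under the assumption that the iris dataset and a depth limit of 3 are acceptable stand-ins for the post's own setup:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Assumption: iris data and max_depth=3 are illustrative choices.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

clf = DecisionTreeClassifier(max_depth=3, random_state=0)
clf.fit(X_train, y_train)

# score() returns mean accuracy on the given data for classifiers.
test_accuracy = clf.score(X_test, y_test)

# Depth and leaf count describe the learned tree of conditions.
print("depth:", clf.get_depth())
print("leaves:", clf.get_n_leaves())
print("test accuracy:", round(test_accuracy, 3))
```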

# Decision Tree Regression in Python in 10 lines

Figure: Decision Tree Regression model visualization.
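As a hedged sketch of decision tree regression, the example below fits a shallow tree to noisy sine data. The synthetic data, the depth limit, and the query points are assumptions made for illustration and are not taken from the post.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Assumption: a noisy sine curve serves as toy regression data.
rng = np.random.default_rng(0)
X = rng.uniform(0, 5, size=(80, 1))
y = np.sin(X).ravel() + rng.normal(0, 0.1, size=80)

# A depth-3 tree has at most 8 leaves, so it predicts at most 8 distinct values.
reg = DecisionTreeRegressor(max_depth=3, random_state=0)
reg.fit(X, y)

# Predict for a few new inputs (hypothetical query points).
X_new = np.array([[1.0], [2.5], [4.0]])
y_new = reg.predict(X_new)
print(y_new)
```

Because each leaf predicts a constant, the fitted function is a step function; raising `max_depth` gives finer steps at the risk of fitting the noise.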

# Print ROC AUC Receiver Operating Characteristic Area Under Curve

ROC AUC (receiver operating characteristic, area under the curve) is a way to measure the performance of a classification model, such as one built with Logistic Regression. The ROC curve plots the true positive rate on the y-axis against the false positive rate on the x-axis, and the AUC is the area under that curve. If a model is good, the AUC will be close to 1. Area …
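The following sketch computes the ROC curve points and the AUC for a Logistic Regression model using scikit-learn. The synthetic dataset from `make_classification` and its parameters are assumptions for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score, roc_curve
from sklearn.model_selection import train_test_split

# Assumption: a synthetic binary classification problem.
X, y = make_classification(n_samples=500, n_features=5, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# ROC analysis needs scores, not hard labels: use P(class == 1).
scores = model.predict_proba(X_test)[:, 1]

# fpr is the x-axis, tpr the y-axis of the ROC curve.
fpr, tpr, thresholds = roc_curve(y_test, scores)
auc = roc_auc_score(y_test, scores)
print(f"ROC AUC: {auc:.3f}")
```

The `fpr` and `tpr` arrays can be passed to any plotting library to draw the curve itself.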

# Scaling Data Range using Min Max Scaler

Suppose you have a dataset of float values, all in the range 0 to 1, and you want to convert them to integers in the range 10 to 20. In this post we will learn how to do this using MinMaxScaler. Data before scaling. Now let us scale the data as …
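One way this rescaling could be done is sketched below with `MinMaxScaler(feature_range=(10, 20))`, followed by rounding to integers. The sample values are made up for illustration, and the rounding step is an assumption about how the integer conversion is performed.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Assumption: a single feature column with values spanning 0 to 1.
data = np.array([[0.0], [0.2], [0.5], [1.0]])

# feature_range sets the target minimum and maximum after scaling.
scaler = MinMaxScaler(feature_range=(10, 20))
scaled = scaler.fit_transform(data)

# MinMaxScaler returns floats; round and cast to get integers.
scaled_int = np.rint(scaled).astype(int)
print(scaled_int.ravel())
```

Note that the scaler maps the observed minimum and maximum of each column to 10 and 20, so data that does not actually reach 0 and 1 will still be stretched to the full target range.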

# Creating Synthetic Data for Logistic Regression

Many times we want to try Logistic Regression on a certain kind of data but cannot find such data online. In that case we can generate synthetic data for our problem. In this post we will see how to generate typical synthetic data for a simple Logistic Regression. Import the required …
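One possible way to generate such data is sketched below with NumPy: draw a feature, pass a linear function of it through the sigmoid, and sample binary labels from the resulting probabilities. The names `true_weight` and `true_bias` and their values are hypothetical, and this is not necessarily the approach taken in the post.

```python
import numpy as np

rng = np.random.default_rng(42)
n_samples = 200

# One numeric feature drawn from a standard normal distribution.
x = rng.normal(0, 1, n_samples)

# Hypothetical "true" parameters of the underlying logistic model.
true_weight, true_bias = 2.0, -0.5

# Sigmoid turns the linear score into a probability of class 1.
p = 1 / (1 + np.exp(-(true_weight * x + true_bias)))

# Sample a 0/1 label for each row from its own probability.
y = rng.binomial(1, p)

print(x[:5])
print(y[:5])
```

Fitting a Logistic Regression to `x` and `y` should recover parameters roughly near the chosen ones, which makes this kind of data convenient for checking an implementation.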

# Visualize and Print Confusion Matrix

In many cases you would like to print the confusion matrix in a better format, with a better look and feel, than what scikit-learn provides by default. The default look when printing a confusion matrix using scikit-learn (figure: scikit-learn default confusion matrix print). However, in many cases you may like to print the confusion matrix …
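As a small sketch of a more readable printout, the example below computes the matrix with `confusion_matrix` and then formats it with row and column labels using plain string formatting. The labels and predictions are made-up sample values, and the layout is just one possible choice.

```python
from sklearn.metrics import confusion_matrix

# Made-up true and predicted labels for illustration.
y_true = [0, 1, 1, 0, 1, 1, 0, 0]
y_pred = [0, 1, 0, 0, 1, 1, 1, 0]
labels = [0, 1]

# Rows are true classes, columns are predicted classes.
cm = confusion_matrix(y_true, y_pred, labels=labels)

# Build a labelled text table from the raw counts.
lines = [" " * 10 + "".join(f"pred {label:<5}" for label in labels)]
for label, row in zip(labels, cm.tolist()):
    lines.append(f"true {label:<5}" + "".join(f"{count:<10}" for count in row))
table = "\n".join(lines)
print(table)
```

For a graphical version, scikit-learn also provides `ConfusionMatrixDisplay`, which requires matplotlib.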

# What is Root Mean Squared Error or RMSE

Root mean squared error, or RMSE, is a measure of the difference between the actual values and the values predicted by a machine learning model such as Linear Regression. It indicates how well the model performs: the lower the RMSE, the better the model. RMSE is always non-negative, and …
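The definition can be sketched in a few lines of plain Python: square each error, average the squares, and take the square root. The helper name `rmse` and the sample values are illustrative.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up values: the squared errors are 1, 0, 4 and 1, so the mean is 1.5.
actual = [3.0, 5.0, 2.0, 7.0]
predicted = [2.0, 5.0, 4.0, 8.0]
value = rmse(actual, predicted)
print(value)
```

A perfect prediction gives an RMSE of exactly 0, and the result is expressed in the same units as the target variable.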

# What is R Squared for Linear Regression

For Linear Regression, R-squared is a statistical measure that indicates how close the data are to the fitted regression line. R-squared is also known as the coefficient of determination. R-squared = Explained variation in data / Total variation in data. R-squared = 1 - (RSS/TSS). RSS = Sum of squares of difference between predicted value and …
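The formula R-squared = 1 - (RSS/TSS) can be sketched in plain Python as follows. Here RSS is taken as the sum of squared differences between actual and predicted values, and TSS as the sum of squared differences between actual values and their mean, which is the standard definition. The helper name `r_squared` and the sample values are illustrative.

```python
def r_squared(actual, predicted):
    """Coefficient of determination computed as 1 - RSS / TSS."""
    mean_actual = sum(actual) / len(actual)
    # RSS: squared differences between actual and predicted values.
    rss = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # TSS: squared differences between actual values and their mean.
    tss = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - rss / tss


# Made-up values for illustration: RSS = 6 and TSS = 14.75.
actual = [3.0, 5.0, 2.0, 7.0]
predicted = [2.0, 5.0, 4.0, 8.0]
r2 = r_squared(actual, predicted)
print(r2)
```

A value of 1 means the predictions match the data exactly, while a model that always predicts the mean of the actual values scores 0.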