Machine Learning: Pima Indians Diabetes

Visualise the Dataset

Visualising the data is an important step of the data analysis. With a graphical visualisation of the data we have a better understanding of the various features values distribution: for example we can understand what’s the average age of the people or the average BMI etc…We could of course limit our inspection to the table visualisation, but we could miss important things that may affect our model precision.

import matplotlib.pyplot as plt
dataset.hist(bins=50, figsize=(20, 15))

Source: Machine Learning: Pima Indians Diabetes

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.