I just finished reading “Head First Data Analysis” by Michael Milton, the first book I’ve ever read from the “Head First” series.

The book was very approachable, introducing concepts only when they were needed to make a valid, practical point. Its whole structure revolves around practice and “real life” examples (greatly simplified, though) that show how methodical, logical steps can naturally lead to a sound analysis.

Correlation ≠ causation

But causation ⇒ correlation

Correlation vs causation

In my earlier post I explained how certain types of machine learning models, specifically neural networks, find the correlations between two sets of values. For predictive models, we feed correlated variables to train our models. Sometimes, however, we don’t know whether or how the variables correlate, and part of what machine learning does is find that out.
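To make "finding out whether variables correlate" a bit more concrete, here is a minimal sketch in Python that computes the Pearson correlation coefficient by hand. This is only one common way to measure linear correlation, not necessarily the one the book or a neural network uses. The function name `pearson_correlation` and the sample data (`hours_studied`, `exam_score`, `shoe_size`) are my own, made up for illustration.

```python
import math


def pearson_correlation(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")

    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)

    # Sum of products of deviations (unnormalized covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (unnormalized variances).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)

    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")

    return cov / math.sqrt(var_x * var_y)


# Hypothetical data, invented purely for this example.
hours_studied = [1, 2, 3, 4, 5]
exam_score = [52, 58, 65, 70, 79]
shoe_size = [40, 44, 38, 43, 41]

# Close to 1: the two variables move together.
print(pearson_correlation(hours_studied, exam_score))
# Close to 0: no obvious linear relationship.
print(pearson_correlation(hours_studied, shoe_size))
```

A value near 1 or -1 only says the two variables move together in a straight-line fashion. It says nothing about which one, if either, causes the other.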

