Before drawing charts or running a complex model, we need to pause and ask a powerful question: What does this data look like, and what might be worth visualizing?

In this lesson, we dive into exploratory data analysis (EDA), our first real conversation with the data. It’s how we start making sense of things. We’re not cleaning anymore; we’ve done that work. We’re here to notice what stands out, connects, and hides beneath the surface.

Think of it like meeting the data for the first time. We’re curious, observant, and open to what it might reveal. As we explore, we begin asking:

  • What variables seem to move together?

  • What patterns are worth a second look?

  • What’s the overall shape and structure of the data?

We’re not rushing to conclusions. We’re training our eyes to spot what matters, so we can tell clearer, sharper stories later. Let’s begin exploring with intent and see what the data starts to tell us.

What is EDA?

Every dataset holds a story, but that story isn’t always immediately clear. Before we build models or make predictions, we need to understand the data’s structure, quirks, and signals. That’s where exploratory data analysis (EDA) comes in.

Think of EDA like opening the first chapter of a mystery novel. We’re not solving the case yet—we’re getting familiar with the characters (our variables), checking for surprises (like missing values or strange outliers), and trying to understand the setting (how the data is shaped).

Statistician John Wilder Tukey introduced exploratory data analysis (EDA) in the 1970s. Before building models, he believed we should explore our data using simple summaries and visualizations to understand what it’s telling us.

Get hands-on with 1400+ tech skills courses.