Lesson 1 Setting Up Your Data Analysis Environment
This project is purely Python-based, unless explicitly stated otherwise.
Before we begin exploring data, you need a simple and reliable workspace so you can run code smoothly and follow every lesson without issues.
1.1 Recommended Setup
1.1.1 Option 1 — Easiest (Recommended)
Use Jupyter Notebook via the Anaconda distribution.
Why this option?
Anaconda installs Python, Jupyter Notebook, and essential data-science libraries in one streamlined setup.
Steps:
- Download Anaconda: https://www.anaconda.com/download
- Install it (default settings are fine)
- Open Anaconda Navigator
- Launch Jupyter Notebook
1.2 Installing Required Libraries
Run the following command in a Jupyter Notebook cell.
Note:
If you are using Anaconda, these libraries are usually preinstalled.
The command above ensures your environment is up to date.
1.3 Practice Dataset
We will use the Iris dataset in the early lessons of this course.
You do not need to download it manually at this stage.
- Lesson 01 will automatically generate
data/iris.csv
- This lesson focuses only on ensuring your environment is ready
1.4 Test Your Setup
You may run the test below after completing Lesson 01, once the dataset exists.
sepal_length sepal_width petal_length petal_width species
0 5.1 3.5 1.4 0.2 setosa
1 4.9 3.0 1.4 0.2 setosa
2 4.7 3.2 1.3 0.2 setosa
3 4.6 3.1 1.5 0.2 setosa
4 5.0 3.6 1.4 0.2 setosa
1.5 Troubleshooting
If something does not work as expected:
- Restart your notebook and try again
- Confirm the file path is
data/iris.csv
- Ensure the
data/folder exists
- Re-run the installation command if needed
Setup issues are common — take your time.
1.6 Exercise
- Confirm that Jupyter Notebook opens successfully
- Run the library installation command without errors
- Verify that you can run a simple Python cell
1.7 Summary
- You set up a Python-based data analysis environment
- You installed essential data science libraries
- You verified that your notebook can execute Python code
- You prepared your system for working with real datasets
Continue to Lesson 02 — Load and Explore a Dataset.