First Data Analysis Project

Bravin MuregaBravin Murega
2 min read

Started a course on data analysis from code academy as I consider data analysis to be the best field to practice and use Python. The first lessons were on using SQL for data analysis and I just finished the lessons so I decided to do a project to test my abilities in SQL.

I got a project in one of the Codecademy community forums that included a database(chinook.db) which you can find in my GitHub repository. The database contains data about songs from tv shows and other genres, information about customers who bought the tracks, employees that attended to the customers, and invoices about the purchases.

After downloading the database, I used DB browser for SQLite to analyze the database and find out its structure and the datatypes for columns for the multiple tables that were stored in the database.

In the project, I used vs code as an editor because of the jupyter notebook extension, which made it easy to visualize outputs from the SQL queries. I used a Python database engine to connect to the database and then configured MySQL into the Jupyter Notebook using the SQL magic extension, which allows executing SQL queries directly in Jupyter Notebook cells.

While working on the project I was able to understand clearly how to work with multiple tables in a database and how to combine tables with SQL, how to properly implement aggregate functions in SQL queries, and how to perform math operations in SQL while also handling different data types while performing the operations. You can find the project here

Starting the Python fundamentals for data analysis next so hopefully by next week I'll be able to work on a data analysis project while using Python.

0
Subscribe to my newsletter

Read articles from Bravin Murega directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Bravin Murega
Bravin Murega