Building an NBA Data Lake project
data:image/s3,"s3://crabby-images/e1d34/e1d34534f31debeeee22027b1b20363cb18c8db9" alt="Olugbenga Ajayi"
data:image/s3,"s3://crabby-images/a29b8/a29b8045ed8c916f1b72a7f0829ea0d109b8ea14" alt=""
Introduction
This is the Day 3 of DevopsAllStarsChallenge
This project automates the creation of a data lake for NBA analytics using AWS services. A Python script integrates Amazon S3, AWS Glue, and AWS Athena, and sets up the infrastructure needed to store and query the NBA-related data.
Architectural Diagram
Prerequisites
SportsData - Create an account and get an API Key
An IAM credentials are needed for the AWS CLI and to connect to the AWS services from the terminal.
Project Setup
Get the API KEY from SportsData above
Create the file directory as shown below
Go to the GitHub account, copy the content of setup_nba_data_lake.py
The
.env
file should have the following:
SPORTS_DATA_API_KEY=your_sportsdata_api_key #YOUR API KEY
NBA_ENDPOINT=https://api.sportsdata.io/v3/nba/scores/json/Players
- We are working with VS code, hence, the AWS CLI should be configured in the terminal. Once the AWS CLI is installed, the IAM should be configured in the terminal using the
aws configure
Add the Access key and the Secret Access Key. You should have something as shown below using,
aws configure list
Create the virtual environment to run the Python program. The requirements.txt file should be in the root of the directory.
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
- The
.gitignore
file should have these files. This will ensure that the API Key is not exposed on Git and the dependencies are not added to the GitHub repository.
.env
.venv
Run the Script
S3
Glue
Before you can query Athena, you must configure the S3 bucket with Athena.
Athena result
Delete the resources to avoid unnecessary charges from AWS. This can be done by running the delete_aws_resources.py script.
Thank you for reading. Please follow, like, comment, and share for a wider reach and more DevOps-related articles.
Subscribe to my newsletter
Read articles from Olugbenga Ajayi directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
data:image/s3,"s3://crabby-images/e1d34/e1d34534f31debeeee22027b1b20363cb18c8db9" alt="Olugbenga Ajayi"
Olugbenga Ajayi
Olugbenga Ajayi
I am a DevOps engineer committed to life-long learning and continuous growth.