Python Basics for Data Science: A Code-Focused Guide
Introduction
Python has earned its reputation as the preferred programming language for data science due to its simplicity and versatility. In this blog, we'll dive into the fundamentals of Python programming for data science with a hands-on approach, exploring data types, control flow, and essential functions and libraries that form the backbone of data analysis.
I. Data Types and Variables:
In Python, data types play a crucial role in defining the nature of our data. Let's explore the core data types and variables commonly used in data science:
# Integers, floats, and basic arithmetic operations
age = 25
height = 1.75
weight = 65.5
bmi = weight / (height ** 2)
# Strings and string operations
name = "John Doe"
greeting = "Hello, " + name + "!"
length_of_name = len(name)
# Booleans and logical operations
is_student = True
has_job = False
is_adult = age >= 18
# Lists, tuples, dictionaries, and sets
fruits = ['apple', 'banana', 'orange']
coordinates = (10, 20)
person = {'name': 'Alice', 'age': 30}
unique_numbers = {1, 2, 3, 4, 5}
II. Control Flow:
Control flow structures allow us to make decisions and iterate over data in Python. Let's explore if-else statements and loops:
# If-else statements
if age >= 18:
print("You are an adult.")
else:
print("You are a minor.")
# For loop
for fruit in fruits:
print("I like", fruit)
# While loop
count = 0
while count < 5:
print("Count:", count)
count += 1
III. Functions and Libraries:
Functions in Python allow us to encapsulate code for reusability. Additionally, we'll introduce essential data science libraries:
# Defining functions
def calculate_bmi(weight, height):
bmi = weight / (height ** 2)
return bmi
# Importing data science libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
# Utilizing libraries
data = np.array([1, 2, 3, 4, 5])
mean_value = np.mean(data)
df = pd.DataFrame({'Name': ['John', 'Alice', 'Bob'], 'Age': [25, 30, 22]})
plt.plot(data)
plt.xlabel('Index')
plt.ylabel('Values')
plt.title('Sample Data Plot')
plt.show()
sns.barplot(x='Name', y='Age', data=df)
plt.title('Age Distribution')
plt.show()
Conclusion
Python's data types, control flow structures, functions, and powerful data science libraries form the foundation of data analysis. Armed with this hands-on guide, you're now equipped to manipulate data, make decisions based on conditions, and unleash the power of data science with Python. Embrace the versatility and simplicity of Python to embark on exciting data-driven journeys, uncovering insights and solving complex real-world problems with ease. Happy coding and data exploring! ๐๐ฌ
Subscribe to my newsletter
Read articles from Ayesha Irshad directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
Ayesha Irshad
Ayesha Irshad
I am a Developer Program Member at GitHub, where I collaborate with a global community of developers and contribute to open source projects that advance the field of Artificial Intelligence (AI). I am passionate about learning new skills and technologies, and I have completed multiple certifications in Data Science, Python, and Java from DataCamp and Udemy. I am also pursuing my Bachelor's degree in AI at National University of Computer and Emerging Sciences (FAST NUCES), where I have gained theoretical and practical knowledge of Machine Learning, Neural Networks, and Data Analysis. Additionally, I have worked as an AI Trainee at Scale AI, where I reviewed and labeled data for various AI applications. Through these experiences, I have developed competencies in Supervised Learning, Data Science, and Artificial Neural Networks. My goal is to apply my skills and knowledge to solve real-world problems and create positive impact with AI.