What Is a Cache? And Why It Matters Before Understanding Caching


A cache is a high-speed data storage layer located between the CPU and RAM (main memory). It stores frequently used data and instructions to enable faster access than fetching them directly from RAM.
Or you can say:
Cache memory is a small, fast memory placed between the CPU and main memory (RAM) to store frequently accessed data.
It’s generally built using SRAM (Static RAM).
The main purpose of cache memory is to provide data and instructions to the CPU at a much faster rate, allowing the CPU to complete its operations more efficiently.
🔎 Why is Cache Needed?
The CPU operates much faster than any other kind of memory in the system.
To do useful work, the CPU constantly needs a stream of data and instructions.
In early systems, the CPU often had to wait for data to arrive from the hard disk (HDD), which made them very slow.
Then came RAM, which was much faster than an HDD. This improved performance considerably.
But as CPUs kept getting faster and evolved into multi-core architectures (dual-core, quad-core, octa-core, etc.), even RAM couldn't keep up with their demand for data and instructions.
That's when cache memory was introduced: a faster intermediary between the CPU and RAM.
⚙️ How Does Cache Work?
When the CPU needs data, it first checks the cache.
- If the data is found → ✅ Cache Hit → use it directly.
- If not found → ❌ Cache Miss → the CPU fetches the data from RAM and stores it in the cache for future use.
- If the data isn't in RAM either, it's retrieved from secondary storage (SSD/HDD).
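The hit/miss logic above can be sketched in a few lines. This is an illustrative model only: a small dict stands in for the cache and another dict stands in for RAM.

```python
# Illustrative stand-ins: a small dict as the cache, a larger dict as RAM.
ram = {"x": 10, "y": 20, "z": 30}
cache = {}

def read(key):
    """Return (value, "hit" or "miss"); on a miss, copy the value into the cache."""
    if key in cache:
        return cache[key], "hit"   # Cache Hit: serve directly from the cache
    value = ram[key]               # Cache Miss: fall back to RAM
    cache[key] = value             # store it for future use
    return value, "miss"

print(read("x"))  # first access: a miss, fetched from "RAM"
print(read("x"))  # second access: a hit, served from the cache
```

The first read of any key is a miss; every later read of the same key is a hit, which is exactly the speedup a cache provides.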
Benefits of Using Cache
Greatly increases CPU performance
Reduces the waiting time for fetching data from slower memory
Types of Cache Memory
There are 3 main types of CPU cache, categorized by levels:
| Cache Level | Speed | Location | Shared? | Size Range |
| --- | --- | --- | --- | --- |
| L1 | Fastest | Inside each core | Not shared (per core) | 32KB–128KB |
| L2 | Slower than L1 | Inside or near each core | Usually not shared | 256KB–1MB+ |
| L3 | Slower than L2 | On the CPU die, outside the cores | Shared by all cores | 2MB–50MB+ |
Summary of cache types:
L1 Cache:
Smallest and fastest
Located inside the CPU
Stores very frequently used data
L2 Cache:
Larger than L1 but slower
May be located inside or just outside the CPU
L3 Cache:
Shared by multiple CPU cores
Larger and slower than L2, but still faster than RAM
Process of Data Fetching
The CPU follows a specific path to fetch data, starting from the fastest and closest storage:
CPU → L1 Cache → L2 Cache → L3 Cache → RAM → SSD/HDD
It first checks in L1 cache
If not found → goes to L2, then L3
If not found in any cache → fetches from RAM
If it’s not even in RAM → finally fetches from secondary storage like SSD/HDD
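The lookup order above can be modeled as a walk down a list of storage levels, fastest first. This is a simplified sketch (a real CPU would also copy the found block back into the upper cache levels, which is omitted here):

```python
# Illustrative model of the fetch path: each level is just a named dict.
levels = [
    ("L1", {"a": 1}),
    ("L2", {"b": 2}),
    ("L3", {"c": 3}),
    ("RAM", {"d": 4}),
    ("SSD/HDD", {"e": 5}),
]

def fetch(key):
    """Check each level from fastest to slowest; return (value, level it was found in)."""
    for name, store in levels:
        if key in store:
            return store[key], name
    raise KeyError(key)

print(fetch("a"))  # found immediately in L1
print(fetch("d"))  # missed every cache level, found in RAM
```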
✍️ Write Policies
When the CPU updates data, there are two main strategies to decide where the update should go:
Write-Through:
Data is written to both the cache and RAM at the same time.
Keeps data in sync.
Slightly slower due to double writes.
Write-Back:
Data is written only to the cache initially.
RAM is updated later when that cache block is replaced.
Faster write performance.
Requires more control logic and tracking.
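The two write policies can be contrasted in a short sketch. The class names and the dict used as a stand-in for RAM are illustrative, not a real hardware interface:

```python
class WriteThroughCache:
    """Every write goes to both the cache and the backing store immediately."""
    def __init__(self, backing):
        self.backing = backing
        self.cache = {}

    def write(self, key, value):
        self.cache[key] = value
        self.backing[key] = value   # the double write keeps both copies in sync


class WriteBackCache:
    """Writes land only in the cache; 'dirty' entries reach the backing store later."""
    def __init__(self, backing):
        self.backing = backing
        self.cache = {}
        self.dirty = set()          # the extra tracking write-back requires

    def write(self, key, value):
        self.cache[key] = value
        self.dirty.add(key)         # remember what still needs writing back

    def flush(self):
        """Write dirty entries back (in hardware: when the block is replaced)."""
        for key in self.dirty:
            self.backing[key] = self.cache[key]
        self.dirty.clear()
```

Write-through pays the cost on every write; write-back defers it, which is faster per write but needs the dirty-set bookkeeping shown above.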
Cache Replacement Policies
When the cache is full, we need to decide which block to remove to make space for new data. These are common strategies:
LRU (Least Recently Used):
Replaces the block that hasn’t been used for the longest time.
⏳ Smart but slightly complex to track.
FIFO (First In, First Out):
Replaces the oldest block, regardless of usage.
📦 Easy to implement, but may not always be optimal.
Random:
Replaces a random block.
🎲 Very simple, low overhead, but unpredictable.
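To see why LRU needs extra tracking, here is a minimal sketch of LRU eviction built on Python's `collections.OrderedDict`, which remembers insertion order and lets us move a key to the end on each access:

```python
from collections import OrderedDict

class LRUCache:
    """Fixed-capacity cache that evicts the least recently used entry when full."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()

    def get(self, key):
        if key not in self.data:
            return None                      # miss
        self.data.move_to_end(key)           # mark as most recently used
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)    # evict the least recently used entry

cache = LRUCache(2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")         # "a" is now the most recently used
cache.put("c", 3)      # capacity exceeded, so "b" (least recently used) is evicted
print(cache.get("b"))  # None
```

FIFO, by contrast, would have evicted "a" here (the oldest insertion) even though it was just used, which is why LRU usually makes better eviction choices at the cost of the extra bookkeeping.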
Simple Real-Life Analogy
Let’s say you're a student preparing for exams:
L1 Cache = notebook on your desk (quick to access)
L2 Cache = notes in your bag (still close but slower)
L3 Cache = notes in your hostel room
Main Memory (RAM) = library shelf
Hard Disk = archive room in a different building
The closer it is to you (CPU), the faster you can get it.
Caching:
Caching is a general technique used in computing to store frequently accessed data or results in a temporary storage (called a cache) so that future requests for the same data can be served much faster.
✅ Caching = A strategy
✅ Cache = The storage (hardware or software) where data is kept
Why Use Caching? 🤔
When we request data (e.g., from a web app), that request usually goes to the backend server and then to the database.
The database processes the request, fetches the data, and sends it back to the client (browser/app).
This round trip is time-consuming and costly. It involves:
- An API call
- A database read
- Response generation
Each of these adds latency and cost (API calls, DB reads, processing time, etc.).
That’s where caching helps:
✅ It stores frequently used data (like user profile info) in temporary memory (cache).
📦 So, when the same data is requested again, it can be served directly from cache, without calling the backend or database again.
Result:
⚡ Faster response
🔁 Less load on backend/database
📉 Reduced latency
Real-World Example
Let’s say your app shows a user’s profile:
GET /user/profile
First request → goes to database → result is stored in cache
Next request (same data) → comes from cache directly
⚡ Much faster than hitting the DB again!
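This request flow is known as the cache-aside pattern, and it can be sketched in a few lines. `fetch_profile_from_db` here is a hypothetical stand-in for the real database query:

```python
profile_cache = {}

def fetch_profile_from_db(user_id):
    """Hypothetical stand-in for the slow database query."""
    return {"id": user_id, "name": "Satyendra"}

def get_user_profile(user_id):
    # First request: cache miss, so hit the "database" and store the result.
    if user_id not in profile_cache:
        profile_cache[user_id] = fetch_profile_from_db(user_id)
    # Every later request for the same user is served straight from the cache.
    return profile_cache[user_id]
```

In a real app the cached entry would also be given an expiry time and invalidated when the profile changes; otherwise the cache can serve stale data.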
Types of Caching
Local In-Memory Cache
Each server keeps frequently accessed data in its own memory (in-process, or via a locally running store like Memcached).
Fast, but not shared across servers, so the same data may be duplicated on each one.
Distributed Cache (e.g., Redis)
A shared cache system accessible by all application servers.
Scales better for large, distributed systems.
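Whatever the deployment, software caches usually attach a time-to-live (TTL) to each entry so stale data eventually expires. Here is a minimal in-process sketch using only the standard library (a production system would use a client for Redis or Memcached instead; the class name is illustrative):

```python
import time

class TTLCache:
    """In-process cache whose entries expire after `ttl` seconds."""
    def __init__(self, ttl):
        self.ttl = ttl
        self.data = {}   # key -> (value, expiry timestamp)

    def set(self, key, value):
        self.data[key] = (value, time.monotonic() + self.ttl)

    def get(self, key):
        entry = self.data.get(key)
        if entry is None:
            return None                # never cached
        value, expires_at = entry
        if time.monotonic() >= expires_at:
            del self.data[key]         # expired: behave like a miss
            return None
        return value
```

A short TTL trades some extra backend calls for fresher data; a long TTL does the opposite. Picking it is one of the main tuning knobs in any caching setup.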
Caching in Different Contexts
| Aspect | Cache Memory | Caching |
| --- | --- | --- |
| Type | Hardware-level concept | Software-level strategy |
| Scope | Between the CPU and RAM | Anywhere (DB, web, OS, etc.) |
| Purpose | Speed up CPU data access | Reduce latency and backend load |
| Example | L1, L2, L3 caches | Redis, browser cache, CDN, etc. |
📌 Simple Analogy
🧠 Cache memory is like keeping your books on your desk (CPU) instead of going to the library (RAM).
🌐 Caching in software is like remembering answers to frequently asked questions, so you don’t have to Google them every time.
🙏 Final Words
Thanks for reading!
I hope you now have a clear understanding of the difference between cache (the storage) and caching (the strategy), and why they play such an important role in system performance. 🚀
Written by

Satyendra Gautam
Full-Stack Developer | React & Django Enthusiast | DSA Enthusiast Passionate about building scalable web apps with modern tech. Currently exploring Django for backend magic and crafting sleek UIs with React. Writing about what I learn to help others on the same journey.