What Happens When Your App Gets Famous Overnight?


When you open your laptop or phone and run an app, something is powering it in the background; a computer somewhere is doing the work. In AWS, we call that “Compute.” It’s simply the brainpower your app needs to run, stored not in your home, but in Amazon’s massive global network of data centers.
Think of Compute Like a Food Stall
Imagine you have a small food stall. You can only handle 5 customers at a time. When the lunchtime rush comes, you’re overwhelmed; customers wait longer, and some leave. That’s like your app’s server getting overloaded.
In the cloud, you can “hire” more chefs instantly when the line gets too long (adding servers) and send them home when it’s quiet (removing servers). You only pay for the time they work.
Elastic Load Balancing: The Friendly Waiter
If you have two chefs, you need someone to tell customers which chef is free so they’re served quickly. That’s what a Load Balancer does; it spreads incoming requests so no one chef is drowning in work.
Why it’s essential: Without a Load Balancer, one chef could be swamped while the other is idle. Worse, if a chef gets sick (server crash), customers would have no one to serve them. With Load Balancing, work is evenly spread, and failures are hidden from customers.
Auto Scaling: The Magic Hiring Button
Now, imagine your food stall has a magic hiring button. If the line grows too long, new chefs appear within minutes. When things calm down, extra chefs disappear to save costs.
Why it’s essential: Traffic is unpredictable; think lunch rushes, sudden social media mentions, or flash sales. Auto Scaling ensures you can handle demand spikes without paying for unused servers when it’s quiet.
Scalability vs Elasticity
Scalability: The ability of your system to handle more work by adding more resources.
- Example: Expanding your food stall from 2 chefs to 5 permanently because business is growing.
Elasticity: The ability to automatically adjust resources up or down based on demand.
- Example: Hiring extra chefs only during the lunch rush and letting them go afterward.
In AWS, Auto Scaling gives you elasticity, while the cloud’s flexible infrastructure supports scalability.
Why CPU Usage Matters
CPU is like a chef’s energy level. If they’re working at 95% capacity for too long, mistakes happen, orders get delayed, food gets burned, and customers leave.
For example, in a simple To-Do app, if thousands of people add tasks at the same time, the server has to handle all those “add” requests. If it’s too busy, you might click “Save” and wait… and wait… before it responds. Auto Scaling and Load Balancing help prevent that.
Questions That Keep Me Curious
If the head chef walks out mid-dinner rush, what’s your move?
Would you rather pay extra to have a full team standing by every day… or only bring in help when the line is out the door?
And if the entire kitchen went up in flames, how fast could you get cooking again somewhere else?
Subscribe to my newsletter
Read articles from Racheal Kuranchie directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
