AWS Lambda Silent Crash – A Platform Failure, Not an Application Bug

David LyonDavid Lyon
3 min read

Over a seven-week investigation, I uncovered and proved a silent, platform-level crash in AWS Lambda — affecting Node.js functions in a VPC making outbound HTTPS calls. The failure occurred mid-execution, after the function had returned a success response. No logs. No errors. No telemetry. No way to catch it.

From day one, I did what AWS claims to value in a partner.

  • I stripped the function down to minimal reproducible code.
  • I tested across runtimes, regions, and infrastructure baselines.
  • I rebuilt on EC2 and proved that the issue vanished entirely.
  • I shared logs, traces, metrics, and internal observations.

I escalated through every official channel:

  • Support dismissed it.
  • My Account Executive ignored it.
  • Formal complaints were met with silence.
  • Internal re-escalations led nowhere.
  • AWS Activate — the startup programme — refused to engage.
  • And executive outreach yielded nothing but a two-line response weeks later.

At every stage, I remained professional. I kept the tone restrained. I offered AWS every opportunity to engage constructively.

Instead, they claimed the bug was in my code — despite the function crashing after returning a 201. They claimed I had forgotten a reject() — despite the error occurring deep inside https.request(), and their own reproduction missing the handler. They suggested I move to EC2 — by then, I already had. I asked for Lambda engineering — they gave me sales. Then silence.

Even AWS Activate, whose sole purpose is to support startups like ours, refused to take part. Their response wasn’t technical — it was procedural. A polite copy-paste directing us back to the same failing support system we were already trapped in.

This wasn’t just a Lambda bug. It was a platform-level failure, misdiagnosed through a broken support process, and left to rot in plain sight.

AWS’s internal systems failed. Their support model failed. Their startup engagement model failed. And above all, their cultural commitment to ownership — the thing they claim defines them — was nowhere to be found.

So, we left. We migrated everything. Lambda was decommissioned. Critical services were refactored for Azure. And our engineering culture now lives on a platform that still understands trust has to be earned — not assumed.

If you're building on Lambda: know this. It may fail silently. And if it does — AWS may blame you, even after they reproduce the failure themselves.


This is just the beginning of the story.

For the full deep dive into the silent AWS Lambda crash, our complete diagnostic process, AWS's contradictory responses, and why we ultimately decided to migrate our entire infrastructure, please read the full article on our website:

Read the Full Post: AWS Lambda Silent Crash – A Platform Failure, Not an Application Bug


1
Subscribe to my newsletter

Read articles from David Lyon directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

David Lyon
David Lyon