The Long Tail of the AWS Outage

A sprawling Amazon Web Services cloud outage that began early Monday morning illustrated the fragile interdependencies of the internet as major communication, financial, health care, education, and government platforms around the world suffered disruptions. As the day wore on, AWS diagnosed and began working to correct the issue, which stemmed from the company’s critical US-EAST-1 region based in northern Virginia. But the cascade of impacts took time to fully resolve.

Researchers reflecting on the incident particularly highlighted the length of the outage, which started around 3 am ET on Monday, October 20. AWS said in status updates that by 6:01 pm ET on Monday “all AWS services returned to normal operations.” The outage directly stemmed from Amazon’s DynamoDB database application programming interfaces and, according to the company, “impacted” 141 other AWS services. Multiple network engineers and infrastructure specialists emphasized to WIRED that errors are understandable and inevitable for so-called “hyperscalers” like AWS, Microsoft Azure, and Google Cloud Platform, given their complexity and sheer size. But they noted, too, that this reality shouldn’t simply absolve cloud providers when they have prolonged downtime.

“The word hindsight is key. It’s easy to find out what went wrong after the fact, but the overall reliability of AWS shows how difficult it is to prevent every failure,” says Ira Winkler, chief information security officer of the reliability and cybersecurity firm CYE. “Ideally, this will be a lesson learned, and Amazon will implement more redundancies that would prevent a disaster like this from happening in the future—or at least prevent them staying down as long as they did.”

AWS did not respond to questions from WIRED about the long tail of the recovery for customers. An AWS spokesperson says the company plans to publish one of its “post-event summaries” about the incident.

“I don’t think this was just a ‘stuff happens’ outage. I would have expected a full remediation much faster,” says Jake Williams, vice president of research and development at Hunter Strategy. “To give them their due, cascading failures aren’t something that they get a lot of experience working with because they don’t have outages very often. So that’s to their credit. But it’s really easy to get into the mindset of giving these companies a pass, and we shouldn’t forget that they create this situation by actively trying to attract ever more customers to their infrastructure. Clients don’t control whether they are overextending themselves or what they may have going on financially.”

The incident was caused by a familiar culprit in web outages—“domain name system” resolution issues. DNS is essentially the internet’s phonebook mechanism to direct web browsers to the right servers. As a result, DNS issues are a common source of outages, because they can cause requests to fail and keep content from loading.

#Long #Tail #AWS #Outage

  • Related Posts

    ASUS TUF-BE6500 Review: A Good Dual-Band Wi-Fi 7 Router

    The TUF Gaming BE6500 Wi-Fi 7 router (model TUF-BE6500) is the fourth dual-band Wi-Fi 7 router from ASUS, and the first officially designated as a gaming router. In a way,…

    Network Switches 101: The Variety and Quick Shopping Tips

    When you look to expand your home network, especially for wired clients, you will likely need a couple of network switches. This post will explain in simple terms what they…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    10 Best Apps That Pay You To Walk

    • By VO Tech
    • October 21, 2025
    • 33 views
    10 Best Apps That Pay You To Walk

    Epson EcoTank ET-2950 — A small printer with a big capacity

    • By VO Tech
    • October 21, 2025
    • 25 views
    Epson EcoTank ET-2950 — A small printer with a big capacity

    Historians Don’t Think a US Civil War Is Likely—but They’re Still Nervous

    • By VO Tech
    • October 20, 2025
    • 22 views
    Historians Don’t Think a US Civil War Is Likely—but They’re Still Nervous

    Suunto Race 2 & Suunto Wing 2 — Australian Review

    • By VO Tech
    • October 14, 2025
    • 24 views
    Suunto Race 2 & Suunto Wing 2 — Australian Review

    RAN, AI, AI-RAN and Open RAN

    • By VO Tech
    • October 13, 2025
    • 23 views
    RAN, AI, AI-RAN and Open RAN

    How To Sell Replicas Legally In 2025? 8 Steps Guide

    • By VO Tech
    • October 11, 2025
    • 28 views
    How To Sell Replicas Legally In 2025? 8 Steps Guide