AWS is down

crime [she/her, any] · edit-2 3 years ago

AWS is down

TheOwlReturns [comrade/them] · 3 years ago

This is the result of critical infrastructure privatization. The people's internet services would never go down!

effervescent [they/them] · 3 years ago

99.99999% uptime ~~just like China-funded hexbear.net~~

Mother [any] · 3 years ago

:kim:

crime [she/her, any] · 3 years ago

:red-fist: :cyber-lenin:

thethirdgracchi [he/him, they/them] · 3 years ago

It's so funny to me that every time this happens it turns out Amazon hosts like 80% of its own services on us-east-1 and it's just us-east-1 that's down. Why they have not diversified their own infrastructure to even just us-east-2 is beyond me.

hexaflexagonbear [he/him] · 3 years ago

It's probably diversified, there's just a single point of failure that's difficult to get rid of, probably a network load balancer or something.

thethirdgracchi [he/him, they/them] · 3 years ago

But like, why not move that outside of us-east-1? That's literally always the one that goes down. Just move that single point of failure outside of the most used region by a large margin on AWS. Seems like a basic reliability engineering practice.

hexaflexagonbear [he/him] · edit-2 3 years ago

Each site has a network load balancer, I mean if it was an AWS wide one, it probably wouldn't be a single region that goes down. And again, this is just my guess. Having a hardware load balancer is how the place I worked at managed traffic between several server rooms, and it was usually the main culprit for downtime. It's a very handy device so your network doesn't get overwhelmed by traffic spikes, but afaik really hard to make redundant.

thethirdgracchi [he/him, they/them] · 3 years ago

Oh I see what you mean. Yeah I guess it would make sense that us-east-1 goes down often since it's the most trafficked and it does have to have physical hardware that can fail.

all2well [he/him] · 3 years ago

It’s more that AWS customers build stuff overwhelmingly in us-east-1

eduardog3000 [he/him] · edit-2 3 years ago

Why do they get a choice in a specific server? Sure they could specify us-east, but why let them pick us-east-1 specifically?

all2well [he/him] · 3 years ago

That sort of thing already exists, as availability zones. Things like EC2 instances live in us-east-1a, us-east-1b, etc, which are comprised of separate data centers. In theory that should provide resilience to even a large-scale outage, but evidently that's not foolproof.

The reason why they let you be very precise with how you provision servers is because some applications require that servers be physically close together, especially high bandwidth stuff.

hexaflexagonbear [he/him] · edit-2 3 years ago

deleted by creator

Speaker [e/em/eir] · 3 years ago

In particular, the control plane for Route53 (their DNS product) lives entirely in us-east-1, so the blast radius of a bad outage in that region is enormous.

BlueMagoo [comrade/them] · edit-2 3 years ago

deleted by creator

red_stapler [he/him] · 3 years ago

Taking AWS down in solidarity with the striking Kellogs workers is pretty lit, ngl.

crime [she/her, any] · 3 years ago

Hell yes :sicko-yes: we really need a militant software operator's union

Alex_Jones [he/him] · 3 years ago

Programmers really do hold a lot of power compared to the bosses that pay them. They just need some solidarity and they're good.

crime [she/her, any] · 3 years ago

Oh yeah, at my last job one time during a casual after work chat like a half dozen of my coworkers and I were just talking about what a hypothetical "disgruntled employee" could do to cripple the company if they wanted. Some of my favorites:

delete every IAM policy, role, and user so no one could log in to AWS and nothing internally could talk to anything else, but the company would still rack up their usual hosting bill
write a bunch of realistic junk data to the database, scramble all the foreign keys, touch every record, delete the backups
sell the company's domains

I mean, shit, I don't know a single devops engineer who hasn't thought about the myriad of ways they could literally shut all operations down in a way that would take their coworkers weeks to recover and would be impossible for anyone without working knowledge of the systems to fix.

Aryuproudomenowdaddy [comrade/them] · 3 years ago

Amazon warehouses all over NA are down, they usually just make everyone sit on their thumbs for several hours until it's resolved.

crime [she/her, any] · 3 years ago

Glad the warehouse workers are finally getting a break :sicko-wholesome:

Aryuproudomenowdaddy [comrade/them] · 3 years ago

When I worked at one it happened a couple times, was like Christmas vacation.

TillieNeuen [she/her] · 3 years ago

:deeper-sadness: For once, my role on my current project doesn't require AWS, or I'd be celebrating with you.

crime [she/her, any] · 3 years ago

Taking an extra-long scrolling break on your behalf :rosa-salute:

TillieNeuen [she/her] · 3 years ago

:sicko-crab:

LGOrcStreetSamurai [he/him] · 3 years ago

Libre/Open-source self-hosting alternatives gang RISE UP! THIS IS WHY A FEDERATED INTERNET IS COOL AND GOOD

:praise-it:

blobjim [he/him] · 3 years ago

that's not how that works but okay

LGOrcStreetSamurai [he/him] · edit-2 3 years ago

You're not wrong. I'm a street Samurai not a decker. I know very little.

blobjim [he/him] · 3 years ago

I think paid hosting is more because it's more cost-effective, simple, and most importantly allows companies to have servers around the world without having to own internet-connected property everywhere in the world.

Rem [she/her] · 3 years ago

What is that :meow-floppy:

crime [she/her, any] · 3 years ago

Amazon web services, Amazon's cash cow — they host like 1/3 of the internet lol

Rem [she/her] · 3 years ago

Lol lmao

cresspacito [he/him] · 3 years ago

Really? Is that why I've not been made to take calls today?

crime [she/her, any] · 3 years ago

Yeah, their status page has been green but a lot of their services are down so depending on how your calls are wired up it's v possible

layla · 3 years ago

It's been down for over 30 minutes but the status page is still all green

https://status.aws.amazon.com/

crime [she/her, any] · 3 years ago

Pretty sure that status page will be green well after the extinction of humanity

Grownbravy [they/them] · 3 years ago

Down? Idk what you could mean :shrug-outta-hecks:

PootrKrobuttkin [any] · edit-2 2 years ago

deleted by creator

crime [she/her, any] · 3 years ago

Glad you're getting some respite from the onslaught of angry and confused Karens :mao-clap:

emizeko [they/them] · edit-2 3 years ago

Putin is at it again!

effervescent [they/them] · 3 years ago

Smdh Russian trolls stopping my treats

marxisthayaca [he/him,they/them] · 3 years ago

fb timeline is not loading. I AM FREEEEE

HntrKllr [he/him] · 3 years ago

Muh treats!! :frothingfash:

CrimsonSage [any] · 3 years ago

I had some anarchist friends who used to work there, they had some real funny horror stories. They quit around the time the pandemic began, hope they are doing alright.

BlueMagoo [comrade/them] · edit-2 3 years ago

deleted by creator

happybadger [he/him] · 3 years ago

:fedposting: :stalin-gun-1: :lord-bezos-amused:

crime [she/her, any] · 3 years ago

Ive been on call through day-long outages in us-east-1 but I'm impressed with how much we've discovered is broken vs what they say is having issues lol. Feel like they haven't acked like half the problems they have right now

bodywithoutorgans [he/him] · edit-2 3 years ago

deleted by creator

crime [she/her, any] · 3 years ago

That checks out and also sounds extremely unhealthy and counterproductive to fixing and preventing outages. Capitalist efficiency strikes again!

I've heard that the CEO of AWS has to personally approve all red-x statuses for core services like EC2 and S3, which also sounds counterproductive and toxic as fuck lol

bodywithoutorgans [he/him] · edit-2 3 years ago

deleted by creator

eduardog3000 [he/him] · 3 years ago

When your ideology is definitely conducive to efficient operations.

BlueMagoo [comrade/them] · edit-2 3 years ago

deleted by creator