Cloudflare was down

814 points by mektrik 20 hours ago|518 comments

•

pm90 20 hours ago

This is not good. One major outage? Something exceptional. Several outages in a short time? As someone thats worked in operations, I have empathy; there are so many “temp havks” that are put in place for incidents. but the rest of the world won’t… they’re gonna suffer a massive reputation loss if this goes on as long as the last one.

•

berkes 20 hours ago

At least this warrants a good review of anyone's dependency on cloudflare.

If it turns out that this was really just random bad luck, it shouldn't affect their reputation (if humans were rational, that is...)

But if it is what many people seem to imply, that this is the outcome of internal problems/cuttings/restructuring/profit-increase etc, then I truly very much hope it affects their reputation.

But I'm afraid it won't. Just like Microsoft continues to push out software, that, compared to competitors, is unstable, insecure, frustrating to use, lacks features, etc, without it harming their reputation or even bottomlines too much. I'm afraid Cloudflare has a de-facto monopoly (technically: big moat) and can get away with offering poorer quality, for increasing pricing by now.

•

zelphirkalt 19 hours ago

Microsoft's reputation couldn't be much lower at this point, that's their trick.

The issue is the uninformed masses being led to use Windows when they buy a computer. They don't even know how much better a system could work, and so they accept whatever is shoved down their throats.

•

coffeebeqn 20 hours ago

Vibe infrastructure

The crowdstrike incident taught us that no one is going to review any dependency whatsoever.

•

ezst 17 hours ago

Yep, that's what late stage capitalism leaves you with: consolidation, abuse, helplessness and complacency/widespread incompetence as a result

•

bluerooibos 17 hours ago

I'm quite sure the reputational damage has already been done.

How do they not have better isolation of these issues, or redundancy of some sort?

Sometimes it's not worth it. Your plan is just to accept you'll be off for a day or two, while you switch to a competitor.

•

creamyhorror 15 hours ago

If there's a fitting competitor worth switching to.

Plus most people don't get blamed when AWS (or to a lesser extent Cloudflare) goes down, since everyone knows more than half the world is down, so there's not an urgent motivation to develop multi-vendor capability.

•

rvz 17 hours ago

pyuser583 20 hours ago

Lots of big sites are down

•

wooque 17 hours ago

2 days ago they had outage that affected Europe, Cloudflare seems to be going down the drain. I removed it for my personal sites.

•

karmakurtisaani 20 hours ago

Probably fired a lot of their best people in the past few years and replaced it with AI. They have a de-facto monopoly, so we'll just accept it and wait patiently until they fix the problem. You know, business as usual in the grift economy.

•

5d41402abc4b 19 hours ago

>They have a de-facto monopoly

But the nature of a CDN and most other products CF offers, is central by nature.

If you switch from CF to the next CF competitor, you've not improved this dependency.

The alternative here, is complex or even non-existing. Complex would be some system that allows you to hotswap a CDN, or to have fallback DDOS protection services, or to build you own in-house. Which, IMO, is the worst to do if your business is elsewhere. If you sell, say, petfood online, the dependency-risk that comes with a vendor like CF, quite certainly is less than the investment needed- and risk associted with- building a DDOS protection or CDN on your own; all investment that's not directed to selling more pet-food or get higher margins at doing so.

•

agnivade 19 hours ago

You can load-balance between CDN vendors as well

•

otikik 19 hours ago

Then your load balancer becomes the single point of failure.

•

roryirvine 18 hours ago

BGP Anycast will let you dynamically route traffic into multiple front-end load balancers - this is how GSLB is usually done.

mschuster91 19 hours ago

no one loves the need for CDNs other than maybe video streaming services.

the problem is, below a certain scale you can't operate anything on the internet these days without hiding behind a WAF/CDN combo... with the cut-off mark being "we can afford a 24/7 ops team". even if you run a small niche forum no one cares about, all it takes is one disgruntled donghead that you ban to ruin the fun - ddos attacks are cheap and easy to get these days.

and on top of that comes the shodan skiddie crowd. some 0day pops up, chances are high someone WILL try it out in less than 60 minutes. hell, look into any web server log, the amount of blind guessing attacks (e.g. /wp-admin/..., /system/login, /user/login) or path traversal attempts is insane.

CDN/WAFs are a natural and inevitable outcome of our governments and regulatory agencies not giving a shit about internet security and punishing bad actors.

•

koakuma-chan 20 hours ago

My Cloudflare Pages website works fine.

•

inferiorhuman 18 hours ago

  There are many alternatives

Of varying quality depending on the service. Most of the anti-bot/catpcha crap seems to be equivalently obnoxious, but the handful of sites that use PerimeterX… I've basically sworn off DigiKey as a vendor since I keep getting their bullshit "press and hold" nonsense even while logged in.

I don't like that we're trending towards a centralized internet, but that's where we are.

•

luastoned 19 hours ago

From the incident page:

A change made to how Cloudflare's Web Application Firewall parses requests caused Cloudflare's network to be unavailable for several minutes this morning. This was not an attack; the change was deployed by our team to help mitigate the industry-wide vulnerability disclosed this week in React Server Components. We will share more information as we have it today.

https://www.cloudflarestatus.com/incidents/lfrm31y6sw9q

•

reassess_blind 18 hours ago

I’m really curious what their rollout procedure is, because it seems like many of their past outages should have been uncovered if they released these configuration changes to 1% of global traffic first.

•

lima 16 hours ago

They don't appear to have a rollout procedure for some of their globally replicated application state. They had a number of major outages over the past years which all had the same root cause of "a global config change exposed a bug in our code and everything blew up".

I guess it's an organizational consequence of mitigating attacks in real time, where rollout delays can be risky as well. But if you're going to do that, it would appear that the code has to be written much more defensively than what they're doing it right now.

•

JB_Dev 15 hours ago

Yea agree.. This is the same discussion point that came up last time they had an incident.

I really don’t buy this requirement to always deploy state changes 100% globally immediately. Why can’t they just roll out to 1%, scaling to 100% over 5 minutes (configurable), with automated health checks and pauses? That will go along way towards reducing the impact of these regressions.

Then if they really think something is so critical that it goes everywhere immediately, then sure set the rollout to start at 100%.

Point is, design the rollout system to give you that flexibility. Routine/non-critical state changes should go through slower ramping rollouts.

•

franktankbank 14 hours ago

Can't get hacked when you are down.

•

nrhrjrjrjtntbt 5 hours ago

Not sure I buy it. Do 1% for 10 minutes. I mean it must have taken over half a day to code and test a patch. Why not wait another 10 minutes.

•

ethbr1 15 hours ago

For hypothetical conflicting changes (read worst case: unupgraded nodes/services can't interop with upgraded nodes/services), what's best practice for a partial rollout?

Blue/green and temporarily ossify capacity? Regional?

•

cryptonym 13 hours ago

- Push a version with the new logic but not yet enabled, still using legacy logic, able to implement both

- Push a version that enables new logic for 1% of traffic

- Continue rollout until 100%

•

nrhrjrjrjtntbt 5 hours ago

Can also do canary rollout before that. Canary means rollout to endpoints only used by CF to test. Monitor metrics and automated test results.

•

That still shouldn't be a part of post mortem, more of a performance review item.

•

tempaccount420 15 hours ago

They should be performantly removed.

•

turbobrew 13 hours ago

The aviation industry regularly requires certifications, check rides, and re-qualifications when humans mess up. I have never seen anything like that in tech.

Sometimes the solution is to not let certain people do certain things which are risky.

•

Xunjin 17 hours ago

Agree 100%, however using your example, there is no regulatory agency that investigate the issue and demand changes to avoid related future problems. Should the industry move towards this way?

•

> They have blameless post mortems, but maybe "We actually do make mistakes so this practice is not good" wasn't a lesson anybody wanted to hear.

Or they could say, "we want to continue to prioritise speed of security rollouts over stability, and despite our best efforts, we do make mistakes, so sometimes we expect things will blow up".

I guess it depends what you're optimising for... If the rollout speed of security patches is the priority then maybe increased downtime is a price worth paying (in their eyes anyway)... I don't agree with that, but at least it's an honest position to take.

That said, if this was to address the React CVE then it was hardly a speedy patch anyway... You'd think they could have afforded to stagger the rollout over a few hours at least.

•

lima 16 hours ago

It's just poor risk management at this point. Making sure that a configuration change doesn't crash the production service shouldn't take more than a few seconds in a well-engineered system even if you're not doing staged rollout.

•

> That's Next.js, not React.

React seems to think that it was React:

https://react.dev/blog/2025/12/03/critical-security-vulnerab...

•

I think the "argument" is that it's a critical vuln so they can't "go slow".

So now a vuln check for a component deployed on, being generous, 1% of servers causes an outage for 30% of the internet.

The argument is dumb.

•

spiffytech 16 hours ago

To be accurate: React developed server-side capabilities, and that's where the vulnerability exists.

javier2 20 hours ago

Yeah. I only work for a small company, but you can be certain we will not update the status page if only a small portion of customers are affected, and if we are fully down, rest assured there will be no available hands to keep the status page updated

•

s_dev 20 hours ago

>rest assured there will be no available hands to keep the status page updated

That's not how status pages if implemented correctly work. The real reason status pages aren't updated is SLAs. If you agree on a contract to have 99.99% uptime your status page better reflect that or it invalidates many contracts. This is why AWS also lies about it's uptime and status page.

These services rarely experience outages according their own figures but rather 'degraded performance' or some other language that talks around the issue rather than acknowledging it.

It's like when buying a house you need an independent surveyor not the one offered by the developer/seller to check for problems with foundations or rotting timber.

•

redm 19 hours ago

SLA’s usually just give you a small credit for the exact period of the incident, which is arymetric to the impact. We always have to negotiate for termination rights for failing to meet SLA standards but, in reality, we never exercise them.

Reality is that in an incident, everyone is focused on fixing issue, not updating status pages; automated checks fail or have false positives often too. :/

•

korm 18 hours ago

Yep, every SLA I've ever seen only offers credit. The idea that providers are incentivized to fudge uptime % due to SLAs makes no sense to me. Reputation and marketing maybe, but not SLAs.

The compensation is peanuts. $137 off a $10,000 bill for 10 hours of downtime, or 98.68% uptime in a month, is well within the profit margins.

•

laurent123456 19 hours ago

This is weird - at this level contracts are supposed to be rock solid so why wouldn't they require accurate status reporting? That's trivial to implement, and you can even require to have it on a neutral third-party like UptimeRobot and be done with it.

I'm sure there are gray areas in such contracts but something being down or not is pretty black and white.

•

franga2000 19 hours ago

> something being down or not is pretty black and white

This is so obviously not true that I'm not sure if you're even being serious.

Is the control panel being inaccessible for one region "down"? Is their DNS "down" if the edit API doesn't work, but existing records still get resolved? Is their reverse proxy service "down" if it's still proxying fine, just not caching assets?

•

Are the contracts so easy to bypass? Who signs a contract with an SLA knowing the service provider will just lie about the availability? Is the client supposed to sue the provider any time there is an SLA breach?

•

netdevphoenix 19 hours ago

Anyone who doesn't have any choice financially or gnostically. Same reason why people pay Netflix despite the low quality of most of their shows and the constant termination of tv series after 1 season. Same reason why people put up with Meta not caring about moderating or harmful content. The power dynamics resemble a monopoly

•

lucianbr 17 hours ago

Why bother to put the SLA in the contract at all, if people have no choice but to sign it?

Netflix doesn't put in the contract that they will have high-quality shows. (I guess, don't have a contract to read right now.)

•

ozim 19 hours ago

Most of services are not really critical but customers want to have 99.999% on the paper.

Most of the time people will just get by and ignore even full day of downtime as minor inconvenience. Loss of revenue for the day - well you most likely will have to eat that, because going to court and having lawyers fighting over it most likely will cost you as much as just forgetting about it.

onion2k 19 hours ago

if we are fully down, rest assured there will be no available hands to keep the status page updated

There is no quicker way for customers to lose trust in your service than it to be down and for them to not know that you're aware and trying to fix it as quickly as possible. One of the things Cloudflare gets right is the frequent public updates when there's a problem.

>Customers using the Dashboard / Cloudflare APIs are impacted as requests might fail and/or errors may be displayed.

"Might fail"

•

yapyap 20 hours ago

well it does say that now, so…

which datacenter got flooded?

•

rvnx 19 hours ago

> In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary. Dec 05, 2025 - 09:00 UTC

•

headmelted 20 hours ago

It's 1AM in San Francisco right now. I don't envy the person having to call Matthew Prince and wake him up for this one. And I feel really bad for the person that forgot a closing brace in whatever config file did this.

•

Life hack: Announce bug that brings your entire network down as scheduled maintenance.

•

tommek4077 20 hours ago

Yes, it’s really ‘weird’ that they refuse to share any details. Completely unlike AWS, for example. As if being open about issues with their own product wouldn’t be in their best interest. /s

•

timvdalen 20 hours ago

Wow, just plain 500s on customer sites. That's a level of down you don't see that often.

•

ablation 20 hours ago

Yeah that's a hard 500 right? Not even Cloudflare's 500 branded page like last time. What could have caused this, I wonder.

•

mckirk 20 hours ago

"A cable!"

"How do you know?"

ransom1538 20 hours ago

So. I don't understand the 5 nines they promote. One bad day those nines are gone. So they next year you are pushing 2 nines.

•

kingstnap 19 hours ago

Its just fabricated bullshit. It's how all the companies do it. 99.999% over a year is literally 5 minutes. Or under an hour in a decade, that's wildly unrealistic.

Reddit was once down for a full day and that month they reported 99.5% uptime instead of 99.99% as they normally claimed for most months.

There is this amazing combination of nonsense going on to achieve these kinds of numbers:

1. Straight up fraudulent information on status page. Reporting incendents as more minor than any internal monitors would claim.

2. If it's working for at least a few percent of customers it's not down. Degraded is not counted.

3. If any part of anything is working then it's not down. For example with the reddit example even if the site was dead as long as the image server is still at 1% functional with some internal ping the status is good.

•

zelphirkalt 12 hours ago

Funnily enough an hour in a decade on a good hoster, with a stable service running on it, occasionally updated by version number ... it might even be possible. Maybe not quite, but close, if one tries. While it seems completely impossible with cloudflare, AWS, and whatnot, who are having outages every other week these days.

•

jondot 20 hours ago

its like someone-shut-down-the-power 500s

•

madjam002 19 hours ago

Looking forward to the post mortem on this one. We weren't affected (just using the CDN), and people are saying they weren't affected who are using Cloudflare Workers (a previous culprit which we've since moved off), so I wonder what service / API was actually affected that brought down multiple websites with a 500 but not all of them.

Wise was just down which is a pretty big one.

> Looking forward to the post mortem

This is becoming a meme.

•

meandmycode 19 hours ago

This has to be setting off some alarm bells internally, a well written postmortem on an occasional issue, great, but when your postmortem talks about learnings and improvements yet major outages keep happening, it becomes meaningless..

•

kryptn 19 hours ago

was interesting, some of our stuff failed, but some other stuff that used cloudflare indirectly didn't.

•

da_grift_shift 19 hours ago

The excuse:

>A change made to how Cloudflare's Web Application Firewall parses requests caused Cloudflare's network to be unavailable for several minutes this morning.

>The change was deployed by our team to help mitigate the industry-wide vulnerability disclosed this week in React Server Components.

>We will share more information as we have it today.

https://www.cloudflarestatus.com/incidents/lfrm31y6sw9q

•

madjam002 19 hours ago

It's quite an unfortunate coincidence that React has indirectly been the reason for two recent issues at Cloudflare haha

•

brobdingnagians 19 hours ago

halgir 20 hours ago

We have one. But according to Down Detector's Down Detector's Down Detector's Down Detector, that's also down.

•

Dilettante_ 20 hours ago

Well Down Detector's Down Detector isn't down...What we might need is a Down Detector's Down Detector Validator

•

O4epegb 20 hours ago

This is a fake detector that just has frontend logic for mocking realistic data, you can easily see it in the source code.

•

maxlin 20 hours ago

>half the internet is down >downdetector is down >downdetector down detector reports everything is fine

software was a mistake

•

aurareturn 20 hours ago

Ehh, so down detector for down detector is up but it is inaccurate.

•

> Each of these bugs took weeks of real-world usage before they were found. The programmer might have spent a couple of days reproducing the bug in the lab and fixing it. If it’s like a lot of bugs, the fix might be one line of code, or it might even be a couple of characters, but a lot of work and time went into those two characters.

> When you throw away code and start from scratch, you are throwing away all that knowledge. All those collected bug fixes. Years of programming work.

koakuma-chan 12 hours ago

Here is the Java equivalent of what happened in that Cloudflare Rust code:

  try {
    data = loadDataFile();
  } catch (Exception e) {
    LOG.error("Failed to load new data file", e);
    System.exit(1);
  }

So the "bad data load" was trapped, but the programmer decided that either it would never actually occur, or that it is unrecoverable, so it is fine to .unwrap(). It would not be any less idiomatic if, instead of crashing, the programmer decided to implement some kind of recovery mechanism. It is that programmer's fault, and has nothing to do with Rust.

Also, if you use general try-catch blocks like that, you don't know if that try-catch block actually needs to be there. Maybe it was needed in the past, but something changed, and it is no longer needed, but it will stay there, because there is no way to know unless you specifically look. Also, you don't even know the exact error types. In Rust, the error type is known in advance.

•

mike_hearn 7 hours ago

Yes, I know. But nobody writes code like that in Java. I don't think I've ever seen it outside of top level code in CLI tools. Never in servers.

> It is that programmer's fault, and has nothing to do with Rust.

It's Rust's fault. It provides a function in its standard library that's widely used and which aborts the process. There's nothing like that in the stdlibs of Java or .NET

> Also, if you use general try-catch blocks like that, you don't know if that try-catch block actually needs to be there.

domysee 20 hours ago

I'm just realizing how much we depend on Cloudflare working. Every service I use is unreachable. Even worse than last time. It's almost impossible to do any work atm.

•

makkoncept 20 hours ago

https://downdetectorsdowndetector.com/ is up :) but the status is not correct.

•

glimshe 18 hours ago

Not only they make my browsing experience a LOT worse (seconds per site for bot detection and additional "are you human" clicks even without VPNs), now they are bringing the entire Internet down. They don't deserve the position they currently have.

•

gilrain 16 hours ago

> Not only they make my browsing experience a LOT worse

No, I did (metaphorically, for the websites I control). And I did it because otherwise those sites are fully offline or unusable thanks to the modern floods of unfilterable scrapers.

Months of piecemeal mitigations, but Attack Mode is the only thing that worked. Blame the LLM gold rush and the many, many software engineers with no ethics and zero qualms about racing to find the bottom of the Internet.

•

The_President 6 hours ago

The whole “not a bot” prompt every three hours seems like it has potential to get out of the way more often.

•

wrobelda 17 hours ago

You make it sound like the DDoS and Bots are their fault.

•

glimshe 16 hours ago

They make gazillions. I'm sure they can do better than that.

Cloudfront

Probably some more I forgot now. CF is not the only option and definitely not the best option.

> Yeah, now we'll save everyone from DDoS, everything's perfect, we'll speed up your site,

The site is back up, but it feels fairly silly that a platform that has inserted itself as a single point of failure has an architecture that's got single points of failure.

xomiachuna 16 hours ago

https://www.cloudflarestatus.com/incidents/lfrm31y6sw9q

•

xomiachuna 16 hours ago

And so it seems that the cause is close to RSC vulnerability from yesterday: https://www.cloudflarestatus.com/incidents/lfrm31y6sw9q

So much for the react being just a frontend library, amirite

•

> So, what are the alternatives?

Let AI wing it instead.

•

dev0p 20 hours ago

Isn't it happening a little too often now? Did someone .unwrap in production again?

•

erikbye 20 hours ago

This is getting embarrassing.

•

arjie 19 hours ago

How interesting. As of 00:30 or so I could still access Claude but then it went down with a 500 from Cloudflare and I thought I'd nab a quick something off Slickdeals but that's down too. My own blog is on Cloudflare's `cloudflared` tunnel and it's working just fine, even the cache, so it must be something hitting some specific type of configuration or some shard hitting some region.

And they're back before I finished the comment. Such a pity, I was hoping to hog some more Claude for myself through Claude Code.

•

robotfelix 20 hours ago

Our site is fine, including files served by Cloudflare's CDN and Cloudflare Workers, but the Cloudflare dashboard is definitely down.

The Cloudflare status page says that it's the dashboard and Cloudflare APIs that are down. I wonder if the problem is focused on larger sites because they are more dependent on / integrated with Cloudflare APIs. Or perhaps it's only an Enterprise tier feature that's broken.

If it's not everything that is down, I guess things are slightly more resilient than last time?

•

SwedishPerson_A 20 hours ago

https://www.tandfonline.com/doi/full/10.1080/02673843.2023.2... https://www.perplexity.ai/ https://www.researchgate.net/

All give me

"500 Internal Server Error cloudflare.."

So I'm guessing yes.

•

piker 20 hours ago

At least the 500 error announces ownership.

Imagine how productive we'll be now!

•

jazzyjackson 20 hours ago

Is it at all achievable to be fronted by a CDN but fallback to the raw server in case the front falls off? Better to be vulnerable to DDoS than be unreachable altogether

•

koolba 20 hours ago

With CloudFlare specifically probably not. IIRC, they require DNS resolution of your domain to operate so if they’re down, I don’t see how you’d change it to route directly to the underlying site.

looks like a big one. interestingly, our site, which uses a TON of Cloudflare services[0] — yet not their front-line proxy — is doing fine: https://magicgarden.gg.

So it seems like it's just the big ol' "throw this big orange reverse proxy in front of your site for better uptime!" is what's broken...

[0] Workers, Durable Objects, KV, R2, etc

•

reassess_blind 20 hours ago

My sites that use their main proxy are seemingly up and working? Could be a regional PoP issue.

•

bpye 20 hours ago

Moving off of Cloudflare for my personal domain is on my todo list for the holidays...

•

Ueland 20 hours ago

Interestingly enough, also some MS/Azure services are down. For example https://www.office.com/ just returns:

>We are sorry, something went wrong. >Please try refreshing the page in a few minutes. If the problem persists, please visit status.cloud.microsoft for updates regarding known issues.

"Content not available in your region."

Please avoid Imgur.

•

sebzim4500 19 hours ago

Use a vpn or avoid the UK

•

testemailfordg2 13 hours ago

Looks like this post somehow is not even on the front page of HN anymore. CF pulling some strings maybe, they don't have this incident on top of their current list.

•

JeremyJaydan 19 hours ago

I moved away from Cloudflare over a month ago because I didn't understand how they don't have pricing caps for their upgraded plans, they genuinely seem like the mob but I haven't looked any further into it..

Either way it's been interesting to see the bullets I've been dodging.

•

"Scheduled maintenance is currently in progress" I image the maintenance was conducted like this: "fix detroit data center bugs, please be very careful, don't mess up like last time :)" bypass permissions on

•

scirob 19 hours ago

Wonder if supabase auth down is also related https://status.supabase.com/incidents/rgz3dl2rcmq8

•

zwnow 19 hours ago

Will it be down for 10 days again? Who knows. Would've stopped using it after the first 10 day outage anyway.

•

segev608 20 hours ago

Luckly https://downdetectorsdowndetectorsdowndetectorsdowndetector.... is up :)

•

nabla9 20 hours ago

It's configuration error or related to configuration. It always is with this big things.

ednevsky 20 hours ago

Notion is also down (haven't seen a comment on that). It's so funny how the biggest companies literally just have their sites not loading because of Cloudflare.

•

meindnoch 20 hours ago

Maybe they should stop vibe coding and vibe reviewing their PRs?

•

SwedishPerson_A 20 hours ago

https://www.researchgate.net/ https://www.tandfonline.com/ https://www.perplexity.ai/ All give me "500 Internal Server Error cloudflare"

BluSyn 19 hours ago

Perhaps related? My main fiber WAN went out few hrs ago, failing over to Starlink backup. Discovered it’s a cloudflare issue, as my multi-wan setup tests against 1.1.1.1, which suddenly stopped responding (but only from my fiber ISP). Switched to testing 8.8.8.8 to restore.

If it weren’t for recent cloudflare outages, never would have considered this was the problem.

Even until I saw this, I assumed it was an ISP issue, since Starlink still worked using 1.1.1.1. Now I’m thinking it’s a cloudflare routing problem?

•

chaidhat 19 hours ago

For those saying we have an over-reliance on software -- is there a way to use multiple CDNs for the same frontend website?

•

jonathanlydall 20 hours ago

It seems regular reverse proxying and R2 still works, as we use those and seem to be working fine still.

Can't get to the Dashboard though.

•

DirkH 10 hours ago

I feel like all the BS we were taught about architecture design principles multi-AZ, failover strategies, graceful degradation etc was gaslighting us all into thinking any of out work on it actually matters.

This isn't true, but it feels like this when the entire engineering world order seems to actually run on single-point-of-failures where one CEO just messages another when some 3rd party is down. And reputational risk here is completely safeguarded because as long as everyone is down you are fine. Use a service everyone uses and it goes down = no reputational risk. Use a more robust architecture and make some mistake = massive reputational risk and everyone asks why you don't use what everyone else uses.

Blind leading the blind and all that.

•

techguy1954 20 hours ago

I can still visit some websites that use Cloudflare, but other don't work.

Blender Artists works, but DownDetector and Quillbot dont.

•

SherryWong 20 hours ago

LinkedIn and MEdium are also down as a result

•

igleria 20 hours ago

Heads will roll at cloudfare. E-commerce customers must be furious.

Impossible not to feel bad for whoever is tasked to cleanup the mess.

•

zppln 20 hours ago

Especially around christmas. I was about to buy a pair of Birkenstocks. Nope, site is down. Went on to buy a microphone holder, nope, that site is down as well. :) Sure, I'll still get around to it eventually.

•

lousken 5 hours ago

Cloudflare 362

•

MildlySerious 20 hours ago

I can't update DNS entries for my domains with Porkbun, because it's "Powered by Cloudflare".

•

MarcelGerber 20 hours ago

Just started working for me again (in Germany), both on our own CF-hosted page and on cloudflare.com itself.

•

Towaway69 20 hours ago

for me docker is failing with:

    unknown: failed to copy: httpReadSeeker: failed open: unexpected status from GET request to https://production.cloudflare.docker.com/registry-v2/docker/registry/v2/blobs/sha256/....

so coffee time.

•

hnarturpl 19 hours ago

We use workers and dns proxy and I got flooded with pages. We were getting 503s from workers.

•

dynamite-ready 19 hours ago

Some of the sites I maintain, are fine. But I'm guessing it's just a matter of time?

•

skylurk 19 hours ago

> Monitoring - A fix has been implemented and we are monitoring the results.

Ooof, this one looks like a big one!

canva.com

chess.com

claude.com

coinbase.com

kraken.com

linkedin.com

medium.com

notion.so

npmjs.com

shopify.com (!)

and many more I won't add bc I don't want to be spammy.

Edit: Just checked all my websites hosted there (~12), they're all ok. Other people with small websites are doing well.

udarij 19 hours ago

It's ok to fail. but the most frustrating thing ever is... there's no contact point or supporting team easily and directly accessible.. this is bad..

•

sammy2255 20 hours ago

500 internal server error on most things:

One has to wonder how many times or how often proprietary cloud services have to go down before there is a general shift away from using the cloud and "infinite scaling" for everything. For many, many use cases you do not need neither Cloudflare nor Github nor nine nines for everything (which you are clearly not getting anyway). It's obviously not enough with once a year for most businesses, or perhaps once a month. Weekly outages? For how long?

>Go to <social media page> - 500 error from cloudflare >Google is <social media page> down -> click first link - literally the exact same 500 cloudflare error html from downdetector

I thought we were meant to learn something ... ?

•

lionkor 20 hours ago

eval(requestBody).unwrap()

•

elcapithanos 19 hours ago

Shortest damn outage ever

•

ojm 20 hours ago

Turnstile seems up still.

•

tippa123 20 hours ago

Curious to see which big companies were caught flat-footed during the 18 November outage compared with today. In my opinion, if a company was caught out twice, that reflects poor decision-making and urgency. As the saying goes, fool me once, shame on you, fool me twice, shame on me.

Really disappointed that down detectors down detector[1] isn't detecting that down detector[2] is down

[1] https://downdetectorsdowndetector.com/

[2] https://downdetector.com/

•

ammo1662 20 hours ago

"Given Cloudflare's importance in the Internet ecosystem any outage of any of our systems is unacceptable. "

Is this a joke?

And their blog of above statement is also down:

https://blog.cloudflare.com/18-november-2025-outage/

•

epolanski 20 hours ago

I can absolutely accomplish nothing today...can't download npm packages, cannot login to services.

I've been a Cloudflare fan for the longest time, but the more they grow the more they look like the weak link of the internet. This is the second major outage in less than few weeks. Terrible.

•

sandruso 20 hours ago

it's back on

but wow, it must be stressful to deal with this

•

grimblee 18 hours ago

Always has been

•

pprotas 20 hours ago

What a joke of a company. They have the internet in the palm of their hands, and yet let vibe coding ambitions ruin their empire.

And it's on Friday again — never change, Cloudflare.

Gentle reminder that every affected company brought it upon themselves. Very few companies care about making their system resilient to 3rd party failures. This is just another wake-up call for them.

•

someothherguyy 19 hours ago

for all of 20 minutes, the world cried.

•

mercurialsolo 20 hours ago

As is supabase

•

Hashversion 20 hours ago

cloudflare pages seems to be working!

•

LeonenTheDK 20 hours ago

Nice, just got woken up by my outage alarms, just for it to be Cloudflare again. At least it's _my_ problem!

justmarc 20 hours ago

Obligatory song https://www.youtube.com/watch?v=OC06Z6lCB_Q&t=30s

•

pech0rin 19 hours ago

Rewriting in Rust is paying dividends.

•

nromiun 20 hours ago

I wonder if it is another bug , like unwrap, in their rewritten code.

Went to ahref to check a domain, saw 500 and came here to check.

Of course, vibe coding will always find a way to make something horribly broken but pretty.

•

nromiun 20 hours ago

I have noticed LLMs tend to generate very verbose code. What an average human might do in 10 LoC, LLMs will stretch that to 50-60 lines. Sometimes with comments on every line. That can make it hard to see those bugs.

•

0xfedcafe 20 hours ago

Yep, that’s what I wrote. It wasn’t a sarcasm