Annual Cloud Outage Report
2025 Incident Analysis
59 incidents across 11 cloud providers
↑436% vs 2024(11 incidents in 2024)
Executive Summary
Total Incidents
59
Critical
1
High
12
Providers
11
Top Incidents of 2025
Metrics data ingestion delayed and monitor evaluations degraded
Datadog · Dec 9, 2025
Delayed APM metric ingestion
Datadog · Dec 12, 2025
Monitors - Delayed Evaluation of logs monitors
Datadog · Jul 7, 2025
Delayed AWS, GCP, Azure, SaaS integrations Metrics and Logs
Datadog · Oct 14, 2025
Cloud Control Panel and Multiple Services
DigitalOcean · Oct 10, 2025
Delayed Processes data
Datadog · Dec 12, 2025
Multiple products impacted with data delays
Datadog · Oct 20, 2025
Delayed AWS Metrics and Events
Datadog · May 2, 2025
[SSO] Login Errors from Google SSO
Datadog · Sep 18, 2025
Dashboards Not Loading
Datadog · Nov 17, 2025
Monthly Incident Trend
Incidents by Month
Incidents by Company
Monthly Heatmap
| Company | 01 | 02 | 03 | 05 | 06 | 07 | 08 | 09 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Datadog | 3 | 1 | 2 | 3 | 3 | 5 | 2 | 3 | 3 | 3 | |
| Atlassian | 1 | ||||||||||
| DigitalOcean | 3 | 12 | 5 | 10 |
Severity Distribution
Top Failure Causes
Average Resolution Time
| Company | Avg Resolution |
|---|---|
| DigitalOcean | 3h 47m |
| Datadog | 3h 33m |
| Atlassian | 0m |
Reliability Rankings
| Rank | Company | Score | Incidents |
|---|---|---|---|
| #1 | Google Cloud | 100 | 0 |
| #2 | Stripe | 100 | 0 |
| #3 | Atlassian | 66.4 | 34 |
| #4 | AWS | 62.6 | 5 |
| #5 | Datadog | 39.5 | 39 |
| #6 | Vercel | 35.9 | 12 |
| #7 | OpenAI | 32.3 | 38 |
| #8 | DigitalOcean | 27.5 | 46 |
| #9 | GitHub | 18.7 | 38 |
| #10 | Anthropic | 15.3 | 32 |
| #11 | Cloudflare | 13.3 | 58 |
Generated by IncidentHub · API Access · Reliability Rankings