Previous incidents

May 2025
May 06, 2025
1 incident

EU-RO-1 Network Storage is degraded

Degraded

Resolved May 06 at 10:04am PDT

The storage cluster has been restored to nominal operating performance, and we are continuing to monitor performance.

3 previous updates

April 2025
Apr 28, 2025
1 incident

US-NC-1 Network Issue

Resolved Apr 28 at 06:50pm PDT

Our US-NC-1 data center is currently experiencing a network issue. The team is actively investigating.


The network has been restored.

Apr 21, 2025
1 incident

Error rates elevated for Serverless endpoints

Downtime

Resolved Apr 21 at 11:40am PDT

The issue has been resolved and error rates have returned to normal levels.

3 previous updates

Apr 10, 2025
1 incident

RunPod console shows Pods and Serverless endpoints unavailable

Resolved Apr 10 at 12:06pm PDT

Monitoring - all services are returning to normal operating baselines, however we are continuing to monitor overall service recovery.


On April 10, 2025, between 18:26:30 UTC and 18:53:00 UTC, a service disruption occurred due to a software release that was dependent on a database change which had not yet been applied. This caused our primary API to become temporarily non-functional. As a result, customers experienced issues including missing pods and serverless endpoints in the dashbo...

2 previous updates

Apr 07, 2025
1 incident

Billing and Audit Log pages down

Degraded

Resolved Apr 07 at 02:08pm PDT

Resolved - Users were unable to access the Billing and Audit Log pages in User Settings. We rolled out a fix and this issue is now resolved.

2 previous updates

March 2025
Mar 27, 2025
1 incident

EUR-IS-1 Network Issue

Resolved Mar 27 at 03:00pm PDT

Investigating - We are currently experiencing an issue with EUR-IS-1 Data center
We are currently investigating and will post an update as soon as we are able.


Update - This incident requires extended resolution time,
Next update scheduled for 03/27/2025 23:59 UTC


Update - This incident requires extended resolution time,
Next update scheduled for 03/28/2025 01:00 UTC


Update - This incident requires extended resolution time,
Next update scheduled for 03/28/2025 ...

Mar 11, 2025
1 incident

Urgent: Emergency Firmware Update for US-TX-4 at 21:00 UTC (March 11, 2025)

Resolved Mar 11 at 11:59am PDT

Our engineering team has identified a network disruption at our US-TX-4 datacenter, caused by a required firmware update for our router.

To resolve this, we will deploy an emergency fix at 21:00 UTC on March 11, 2025, with a maximum expected downtime of 10-15 minutes.


The update was successfully completed.

Mar 06, 2025
1 incident

US-NC-1 Network Issue

Resolved Mar 06 at 10:44am PST

Our primary ISP circuit for the US-NC-1 data center experienced an outage. The secondary router failed to take over due to a known firmware issue that was scheduled for a later patch. We’ve now upgraded the router to the latest patched version and are running on the secondary circuit.


The issue has been resolved.