Fly.io

Is Fly.io Down Right Now? Check whether there is an ongoing outage.

Fly.io is currently Operational

Last checked from Fly.io's official status page

Historical record of incidents for Fly.io

Report: "IPv6 Connectivity Loss in GDL"

Last update
identified

We have experienced a temporary loss of IPv6 connectivity in Guadalajara, Mexico (GDL) and are currently working with our upstream providers to resolve the issue. IPv4 connectivity is currently unaffected.

Report: "Network issues in LHR"

Last update
investigating

We are observing network issues in the LHR region. Apps continue to run but may experience network issues, and deploying or updating apps may fail.

Report: "Network maintenance in GRU (São Paulo, Brazil)"

Last update
Scheduled

An upstream provider is performing network maintenance in GRU, from 2025-05-30 at 12:00 UTC (9:00am BRT local time) to 14:00 UTC (11:00am BRT local time). You may experience a short total loss of connectivity for up to 5 minutes within the scheduled maintenance window hours.

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Report: "Network maintenance in LHR"

Last update
In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

An upstream provider is performing network maintenance on a subset of our servers in LHR, from 2025-05-29 at 23:00 UTC to 2025-05-30 at 03:00 UTC. You may experience network connectivity disruptions for some time within the maintenance window.

Report: "Network maintenance in CDG (Paris, France)"

Last update
In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

An upstream provider is performing network maintenance in CDG, from 2025-05-29 at 22:00 UTC (2025-05-30 12:00am CEST local time) to 2025-05-30 at 02:00 UTC (4:00am CEST local time). You may experience a short total loss of connectivity for up to 5 minutes within the scheduled maintenance window hours.

Report: "Burst of network related alerts from some servers in LHR"

Last update
resolved

This incident has been resolved.

monitoring

Alerts appear to be related to a network blip caused by an upstream provider's router failover, with no ongoing disruption.

investigating

We've received a flood of networking related alerts from a subset of servers running in LHR. We are not yet sure of the impact on customer workloads.

Report: "Burst of network related alerts from some servers in LHR"

Last update
Monitoring

Alerts appear to be related to a network blip caused by an upstream provider's router failover, with no ongoing disruption.

Investigating

We've received a flood of networking related alerts from a subset of servers running in LHR. We are not yet sure of the impact on customer workloads.

Report: "WireGuard gateway issues"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are investigating issues with WireGuard over websockets (the default connection mode in flyctl). `flyctl ssh`, `flyctl proxy`, `flyctl logs` commands as well as others may fail. If you are on a network that allows UDP connections, running `fly wg websockets disable` may fix the issue as a workaround.
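
For customers hitting this, here is a minimal sketch of the UDP workaround described above (assuming flyctl is installed and your network allows outbound UDP; the app name is a placeholder, and the `enable` subcommand to switch back is assumed to mirror the `disable` form shown in the update):

```
# Switch flyctl's WireGuard tunnel from websockets mode back to plain UDP
fly wg websockets disable

# Retry the affected command, e.g. an SSH session into your app
fly ssh console -a my-app

# Optionally switch back to websockets mode once the incident is resolved
fly wg websockets enable
```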

Report: "WireGuard gateway issues"

Last update
Investigating

We are investigating issues with WireGuard over websockets (the default connection mode in flyctl).`flyctl ssh`, `flyctl proxy`, `flyctl logs` commands as well as others may fail.If you are on a network that allows UDP connections, running `fly wg websockets disable` may fix the issue as a workaround.

Report: "Production database is being migrated"

Last update
resolved

This incident has been resolved.

monitoring

The issue has been resolved

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented, and we're monitoring the results. API performance should be back to normal, although app creates may still be degraded.

identified

We're continuing to work on fully restoring the Machines API. API calls are still taking longer than usual but we're no longer seeing failures.

identified

We are continuing to work on a fix for this issue.

identified

We identified an issue while migrating our production traffic, and have applied a fix to restore dashboard functionality. We're continuing to work on fully restoring the Machines API.

investigating

The Fly Dashboard is also affected, and certain dashboard functionality, like the support portal, may be unavailable. If you're on a paid support plan, please submit tickets using your support email address in the meantime.

investigating

We’re migrating production traffic over to a new production database. GraphQL queries, including flyctl commands, may be slow.

Report: "Production database is being migrated"

Last update
Investigating

We’re migrating production traffic over to a new production database. GraphQL queries, including flyctl commands, may be slow.

Report: "Network maintenance in AMS (Amsterdam, The Netherlands)"

Last update
In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

An upstream provider is performing network maintenance in AMS, from 2025-05-19 22:30 UTC (00:30 local time) to 2025-05-20 04:00 UTC (06:00 local time). No operational impact is expected.

Report: "Network maintenance in BOG (Bogotá, Colombia)"

Last update
Scheduled

An upstream provider is performing network maintenance in BOG on 2025-05-17, from 11:00 UTC (06:00am local time) to 15:00 UTC (10:00am local time). No operational impact is expected.

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Report: "Machines API degraded performance"

Last update
resolved

We identified the problem and deployed a fix.

investigating

We're investigating degraded performance with the Machines API metadata update endpoint.

Report: "Machines API degraded performance"

Last update
Investigating

We're investigating degraded performance with the Machines API metadata update endpoint.

Report: "Network issues in NRT/HKG"

Last update
resolved

This incident has been resolved.

investigating

Machines API requests (including `fly deploy` or `fly machines` commands) may occasionally fail when trying to create/update machines in NRT or HKG regions. We are investigating.

investigating

An upstream provider is investigating a network issue in NRT and HKG regions. Apps continue to run, but requests may occasionally fail.

Report: "Network issues in NRT/HKG"

Last update
Investigating

An upstream provider is investigating a network issue in NRT and HKG regions. Apps continue to run, but requests may occasionally fail.

Report: "Network maintenance in SEA (Seattle, Washington, USA)"

Last update
Scheduled

An upstream provider is performing critical network maintenance in SEA, from 14:00 UTC (07:00am PDT local time) to 16:00 UTC (09:00am PDT local time). You may experience a short total loss of connectivity for up to 15 minutes within the scheduled maintenance window hours.

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Report: "New MPG clusters cannot be provisioned in FRA"

Last update
resolved

This incident has been resolved, all operations in FRA are working as expected.

monitoring

A fix has been implemented and we are seeing MPG creations in FRA succeed again. The MPG tab of the fly.io dashboard is working again for users with clusters in FRA.

identified

New MPG cluster creations in Frankfurt (FRA) region are currently failing. Cluster creation in other MPG regions is working as normal. We are working to restore FRA cluster creation. Existing, running database clusters in FRA are not impacted and continue to work as normal. However the MPG page in the Fly.io dashboard may not load for users with clusters in FRA.

Report: "Errors (5xx, timeouts) in Fly.io dashboard"

Last update
resolved

This incident is resolved, Dashboard, API and CLI operations should be working normally now.

monitoring

We continue to monitor the deployed fix. Dashboard and API/CLI operations should be functional now.

identified

We have identified the troublesome component and a fix has been rolled out. We are monitoring the results and may need to perform further updates to fully stabilize things.

investigating

Our metrics and user reports show that the Fly.io dashboard and portions of the API backend are timing out or returning 5xx errors. All operations in the Fly dashboard and most operations using the fly CLI will fail or time out at this point. Currently-running machines or workloads should not be affected.

Report: "New MPG clusters cannot be provisioned in FRA"

Last update
Identified

New MPG cluster creations in Frankfurt (FRA) region are currently failing. Cluster creation in other MPG regions is working as normal. We are working to restore FRA cluster creation.Existing, running database clusters in FRA are not impacted and continue to work as normal. However the MPG page in the Fly.io dashboard may not load for users with clusters in FRA.

Report: "Errors (5xx, timeouts) in Fly.io dashboard"

Last update
Investigating

Our metrics and user reports show Fly.io/dashboard and portions of the API backend are timing out or returning 5xx errors. All operations in the Fly dashboard and most operations using fly CLI will fail or timeout at this point.Currently-running machines or workloads should not be affected.

Report: "Depot builders experiencing issues"

Last update
resolved

From roughly 11:00AM Pacific to 3:00PM Pacific, Depot builders were unable to complete deploys (https://status.depot.dev/cmafni8la004z9pwuozks8vwx). During this time, deploys defaulted back to our legacy Fly builders, and users may have seen slower-than-usual deploys depending on the size of the build. This has been resolved, and deploys are now defaulting to Depot builders again.

Report: "Depot builders experiencing issues"

Last update
Resolved

From roughly 11:00AM Pacific to 3:00PM Pacific, Depot builders were unable to complete deploys (https://status.depot.dev/cmafni8la004z9pwuozks8vwx). During this time, deploys defaulted back to our legacy Fly builders, and users may have seen slower-than-usual deploys depending on the size of the build.This has been resolved, and deploys are now defaulting to Depot builders again.

Report: "IAD Managed Postgres control plane unavailability"

Last update
resolved

This incident has been resolved.

investigating

We are investigating intermittent unavailability of the Managed Postgres control plane in IAD region. Database clusters continue to run.

Report: "IAD Managed Postgres control plane unavailability"

Last update
Investigating

We are investigating intermittent unavailability of the Managed Postgres control plane in IAD region. Database clusters continue to run.

Report: "Some or all *.fly.dev subdomains are currently returning NXDOMAIN errors in IAD"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

Applications may be inaccessible via DNS.

Report: "WireGuard connectivity into CDG is unavailable"

Last update
resolved

We have re-enabled the CDG gateway for flyctl.

monitoring

A fix has been implemented and we are monitoring the results.

identified

Inbound WireGuard connections to our CDG gateways are currently unavailable due to an upstream networking issue. Any static peers configured in CDG will be unavailable until this is resolved.

Report: "Loss of connectivity in IAD"

Last update
resolved

We experienced an outage with one of our upstream transit providers in IAD for around 10 minutes. Traffic has been re-routed to alternate paths and connectivity should be back to normal.

Report: "Loss of connectivity in IAD"

Last update
Resolved

We are experienced an outage with one of our upstream transit providers in IAD for around 10 minutes. Traffic has been re-routed to alternate paths and connectivity should be back to normal.

Report: "Some or all *.fly.dev subdomains are currently returning NXDOMAIN errors in IAD"

Last update
Investigating

applications may be inaccessible via DNS.

Report: "WireGuard connectivity into CDG is unavailable"

Last update
Identified

Inbound wireguard connections to our CDG gateways is currently unavailable due to an upstream networking issue. Any static peers configured in CDG will be unavailable until this is resolved.

Report: "Upstream network outage in MAD"

Last update
resolved

This incident has been resolved.

monitoring

Power has been brought back online for the region. We're closely monitoring for any further complications.

identified

Our edges in Madrid, Spain are currently affected by an upstream outage caused by ongoing power issues in the region. Regional and static egress IPs may be temporarily unavailable. Access via Anycast IPs is currently unaffected. We are working with our upstream to resolve this situation.

Report: "Upstream network outage in MAD"

Last update
Resolved

This incident has been resolved.

Monitoring

Power has been brought back online for the region. We're closely monitoring for any further complications.

Identified

Our edges in Madrid, Spain are currently affected by an upstream outage caused by ongoing power issues in the region. Regional and static egress IPs may be temporarily unavailable. Access via Anycast IPs is currently unaffected. We are working with our upstream to resolve this situation.

Report: "Fly.io dashboard down"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are continuing to investigate this issue.

investigating

We are currently investigating this issue.

Report: "Fly.io dashboard down"

Last update
Resolved

This incident has been resolved.

Monitoring

A fix has been implemented and we are monitoring the results.

Update

We are continuing to investigate this issue.

Investigating

We are currently investigating this issue.

Report: "Network performance issues in ORD"

Last update
resolved

This incident has been resolved. The issues impacting performance on the affected routes do not seem to have been caused by issues within our network infrastructure.

investigating

We are continuing to investigate this issue.

investigating

We are continuing to investigate this issue.

investigating

Some network paths in a single region (ORD) are slightly slower than expected. You may experience lower network performance for requests in ORD.

Report: "Network performance issues in ORD"

Last update
Resolved

This incident has been resolved. The issues impacting performance on the affected routes do not seem to have been caused by issues within our network infrastructure.

Update

We are continuing to investigate this issue.

Update

We are continuing to investigate this issue.

Investigating

Some network paths in a single region (ORD) are slightly slower than expected. You may experience lower network performance for requests in ORD.

Report: "Degraded performance"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are continuing to investigate this issue.

investigating

We're investigating degraded performance on our web dashboard and GraphQL API. You may notice slower responses as well as occasional 500 errors at this time.

Report: "Degraded performance"

Last update
Resolved

This incident has been resolved.

Monitoring

A fix has been implemented and we are monitoring the results.

Update

We are continuing to investigate this issue.

Investigating

We're investigating degraded performance on our web dashboard and GraphQL API. You may notice slower responses as well as occasional 500 errors at this time.

Report: "Network maintenance in SCL (Santiago, Chile)"

Last update
Completed

The scheduled maintenance has been completed.

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

An upstream provider is performing critical network maintenance in SCL, from 7:00am UTC (3:00am local time) to 9:00am UTC (5:00am local time). You may experience a short total loss of connectivity for up to 25 minutes within the scheduled maintenance window hours.

Report: "Scheduled Maintenance in GIG Region (Rio De Janeiro)"

Last update
Completed

The scheduled maintenance has been completed.

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

We are performing networking upgrades at our GIG data centre from 06:00 - 09:00 UTC (03:00 - 06:00 Local Time). Users with machines in GIG may experience networking downtime of up to 40 minutes within the scheduled maintenance period. We recommend users scale up to nearby regions, such as GRU, if needed.

Report: "Network maintenance in QRO"

Last update
Completed

The scheduled maintenance has been completed.

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

An upstream provider is performing critical network maintenance in QRO. You may experience a short total loss of connectivity for up to 25 minutes within the scheduled maintenance window hours.

Report: "Issues with API"

Last update
resolved

A fix has been deployed and the API is back up.

investigating

We are currently investigating issues with our GraphQL API. You might experience issues connecting to the dashboard and flyctl.

Report: "Organization invites failing on dashboard"

Last update
resolved

This incident has been resolved.

investigating

We are investigating an issue where inviting users to an organization from the web dashboard may fail. As a workaround, inviting users with the flyctl command line (`fly orgs invite`) is working.
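
As a concrete illustration of that CLI workaround, the invocation looks roughly like this (the org slug and email are placeholders; check `fly orgs invite --help` for the exact arguments):

```
# Invite a user to an organization from the command line instead of the dashboard
fly orgs invite my-org-slug teammate@example.com
```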

Report: "Networking issues in HKG"

Last update
resolved

This incident has been resolved.

investigating

We are continuing to investigate this issue.

investigating

We are investigating intermittent network issues in the HKG region. Apps running in the region may have trouble reaching apps in other regions at this time.

Report: "Network issues in GDL"

Last update
resolved

This incident has been resolved.

investigating

We are investigating network issues in the GDL region. Apps running in the region may be unreachable at this time.

Report: "504 Errors from Logs API"

Last update
resolved

Historical logs are back up.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating an issue with looking up historical logs. The `fly logs` command may fail. Streaming logs with NATS is not affected.

Report: "Capacity issues in FRA"

Last update
resolved

This incident has been resolved.

monitoring

New capacity has been added in FRA; we will continue to monitor the region for capacity constraints.

identified

We are continuing to work on a fix for this issue.

identified

We are continuing to work on a fix for this issue.

identified

We are continuing to work on a fix for this issue.

identified

We are actively working to add additional capacity in the FRA region. We'll provide another update in the next 15-30 minutes.

identified

We are experiencing low capacity in FRA. You may see machine launch failures. We are working on adding new capacity to FRA as soon as possible.

Report: "Network issues in SJC"

Last update
resolved

Networking in SJC is working as expected on all hosts. This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

monitoring

We've identified the cause of the issue and have applied a fix. We are seeing improvements and are continuing to monitor for full recovery.

investigating

A small number of hosts in SJC are continuing to experience networking issues after the earlier scheduled maintenance. We are working with our upstream provider to restore full connectivity to these hosts. Machines on impacted hosts may see reduced networking performance connecting to other machines within Fly.io and the broader internet.

investigating

We are investigating network issues resulting from the earlier scheduled maintenance in SJC.

Report: "Capacity issues in LHR region"

Last update
resolved

This incident has been resolved.

monitoring

We've provisioned new host capacity in LHR region, machine/volume creates have been re-enabled and deploys should now be possible again. We are monitoring capacity and will provide updates if the situation changes.

identified

New machine/volume creates in the LHR region are currently unavailable as there is no host capacity available. Any workloads currently running will continue to run; it is also still possible to update existing machines/volumes. Increasing `fly scale count` in the LHR region is not possible. Blue-green deploys are also not possible at the moment, as well as deploys with `release_command`. We expect more capacity to become available in the coming weeks. For the time being, please choose a nearby region for new workloads, such as AMS (Amsterdam, Netherlands) or ARN (Stockholm, Sweden).
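
For apps that need capacity while LHR is constrained, a rough sketch of shifting new Machines to a nearby region (the region codes, count, and Machine ID below are illustrative):

```
# Run two Machines for this app in Amsterdam instead of London
fly scale count 2 --region ams

# Or clone an existing Machine into Stockholm
fly machine clone <machine-id> --region arn
```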

Report: "Management plane for managed postgres in ORD is unavailable"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue.

Report: "Degraded connectivity to Fly Registry"

Last update
resolved

We have identified that transoceanic subsea cable faults resulted in degraded connectivity to some registry instances in AMS, FRA, WAW regions. Our monitoring indicates error rates have improved after cordoning the affected instances at 16:40 UTC.

monitoring

We are continuing to monitor results after cordoning affected registry instances.

investigating

We are investigating timeouts connecting to instances of registry.fly.io in AMS, FRA, WAW regions. Customers may experience slower image pushes and pulls within Fly Machines in the affected regions.

monitoring

We have cordoned the affected registry instances in AMS, FRA, WAW and are seeing timeout errors decrease.

investigating

We are continuing to investigate the cause of increased connection timeouts to instances of our primary registry in AMS, FRA, WAW. Affected customers may be able to work around this by pushing images to an alternate registry, registry2.fly.io: `FLY_REGISTRY_HOST=registry2.fly.io fly deploy`
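
For clarity, that workaround amounts to overriding the registry host for a single deploy, roughly:

```
# Push the image to the alternate registry for this deploy only
FLY_REGISTRY_HOST=registry2.fly.io fly deploy
```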

investigating

We are investigating timeouts connecting to registry.fly.io. Customers may experience slower image pushes and pulls within Fly Machines.

Report: "Capacity issues in IAD and AMS"

Last update
resolved

We have provisioned additional capacity in the affected regions.

monitoring

New machine/volume creates in the IAD region may fail as there is no host capacity available. Any workloads currently running will continue to run; it is also still possible to update existing machines/volumes. Increasing `fly scale count` in these regions may not work. Blue-green deploys may also be unavailable at the moment, as well as deploys with `release_command`. We are provisioning additional capacity in this region.

Report: "Leader Election Issues with PG Flex Clusters close to NA region"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are investigating an issue where postgres flex clusters are unable to elect a new leader.

Report: "Network issues in AMS region"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

Networking on the impacted hosts has been restored. Machines and apps on those hosts will now be reachable. We're continuing to monitor to ensure everything remains stable.

identified

The hardware switchover is complete. We are continuing the process of re-connecting the downed hosts to the network.

identified

Installation of the new hardware has completed and we are starting the switchover process. A networking blip may be observed on Machines in the AMS region during this process.

identified

Installation of the replacement hardware is still ongoing.

identified

Replacement hardware is onsite and is being installed.

identified

The upstream provider has traced this issue to a broken switch and is working to replace it. They expect connectivity to return in ~1 hour.

investigating

Various hosts in AMS region have lost network connectivity. We are investigating this along with our upstream provider.

Report: "Network issues in ARN"

Last update
resolved

Load has subsided on the edge nodes and we are not observing any related errors at this time.

investigating

Our edge nodes in Stockholm are currently experiencing high load. Some incoming connections may fail while we work to address the issue.

Report: "Network outage"

Last update
resolved

This incident has been resolved.

identified

Network connectivity in IAD has been restored. Our APIs should be working again, but might have higher response times.

identified

Network connectivity in IAD has been restored. Our APIs should be working again, but might have higher response times.

identified

We're bringing our platform up in another region and waiting for things to settle. Our upstream provider is also replacing the affected networking devices in IAD.

identified

We're continuing work to move our APIs away from affected regions/providers. Another update will be provided at 13h00 UTC or earlier.

identified

The IAD region is unavailable due to an incident at an upstream provider. Our API is hosted in this region and as such is unavailable.

investigating

We are investigating widespread reports of networking issues. Apps appear to be running correctly but requests made to the apps may fail. The API and dashboard are also unavailable at the moment.

Report: "Edge network issues in GRU and SCL"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are seeing network issues on our edge servers in the GRU and SCL regions. Machines are running correctly, but inbound requests from clients in those regions may fail intermittently.

Report: "Network issues in JNB"

Last update
resolved

This incident has been resolved.

monitoring

We have implemented a workaround for the network issue and are monitoring the situation.

investigating

There is an issue with an upstream network provider in JNB. Apps are still running but may observe network issues. New deploys for apps may fail.

Report: "Depot builders failing with internal error"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented on Depot's side. https://status.depot.dev/cm6zolsn40009f2dj5ss7lrd7

identified

The Depot service is currently degraded due to a database outage. We're continuing to monitor for recovery. Customers can also follow the Depot status page at https://status.depot.dev/ for updates. Customers that need to deploy can use legacy Fly.io hosted builders with `fly deploy --depot=false`

investigating

We are investigating failures when trying to build using the default Depot builders. The recommended workaround is to use `--depot=false` with `fly deploy`. The error from Depot builders is `Error: failed to fetch an image or build from source: error building: input:3: ensureDepotRemoteBuilder {"code"=>"internal", "message"=>"internal error"}`

Report: "SSH failing for newly created machines"

Last update
resolved

This incident has been resolved.

monitoring

This issue has been fixed, newly created machines will have working SSH. Machines created during this incident will need to be updated (`fly machine update --yes <id>`) or deleted/recreated to fix SSH.

investigating

As a workaround, run the `fly ssh console` command with the `--pty --command /bin/sh` flags.
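
Put together, the workaround looks roughly like this (the app name is a placeholder):

```
# Open an SSH session with an explicit PTY and shell, per the workaround above
fly ssh console --pty --command /bin/sh -a my-app
```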

investigating

We are investigating reports that connecting to newly created machines via SSH (`fly ssh console`) may fail.

Report: "Elevated network latency in FRA"

Last update
resolved

Network functionality is fully restored in FRA.

monitoring

We've deployed a fix for this incident and we are monitoring while network latency and bandwidth return to normal. All user apps should start seeing improved and normal response times.

identified

We're addressing elevated network latency and saturation affecting the FRA region. Apps with machines in this region might experience longer response times and possible timeouts (502 errors).

Report: "Capacity Constraints in IAD"

Last update
resolved

This incident has been resolved.

monitoring

We have brought additional IAD capacity online. Customers should see machine creation, deploy, and scaling operations succeed as normal in the region. We're continuing to monitor to ensure full recovery.

identified

We are continuing the process of adding additional machine capacity in the IAD region.

investigating

Machine capacity in the IAD region is currently low. We're working to bring additional capacity online. In the meantime, you may see errors deploying new machines in IAD or increasing the size of existing machines in the region. Customers may want to deploy machines to nearby regions, such as EWR.

Report: "Deploys using Depot Builders failing"

Last update
resolved

This issue has been resolved, deploys using Depot Builders are succeeding as expected.

monitoring

The Depot builder service is partially recovered and we are seeing deploys using Depot builders succeed again. Some customers may still experience degraded performance using Depot builders at this time. We're continuing to monitor for full recovery. Customers can still deploy using Fly.io hosted builders with `fly deploy --depot=false`

identified

The Depot service is currently degraded due to a database outage. We're continuing to monitor for recovery. Customers can also follow the Depot status page at https://status.depot.dev/ for updates. Customers can still deploy using Fly.io hosted builders with `fly deploy --depot=false`

investigating

We are investigating increased error rates when deploying apps using the default Depot Builders. Customers who experience this issue can work around it by using `fly deploy --depot=false` to deploy their image with a Fly.io hosted builder.

Report: "API errors"

Last update
resolved

This incident has been resolved.

investigating

We are investigating 503 errors when making requests to our GraphQL API or running flyctl commands.

Report: "Bluegreen healthchecks not passing"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and bluegreen deploys are succeeding as expected. We're continuing to monitor deploys to ensure stability, but customers should see BlueGreen deploys succeed in all regions.

identified

The issue has been identified and a fix is being implemented.

investigating

We are seeing signs of recovery, with Bluegreen deployments succeeding for many customers. We are continuing to investigate the root cause of the issue. Customers who still experience a Bluegreen deployment failure can retry using the rolling strategy with `fly deploy --strategy rolling`.

investigating

A temporary workaround for new deployments is to use rolling strategy: `fly deploy --strategy rolling`.

investigating

We are still investigating the issue.

investigating

When deploying with the bluegreen strategy, some green machines (new app version) won't pass healthchecks. Temporary workaround: unless bluegreen is a must for your app, you can temporarily deploy using a different strategy with `fly deploy --strategy NAME`.

Report: "Machine creation errors in LHR"

Last update
resolved

We observed several periods where Machine creations in LHR resulted in authentication errors from 11 Jan to 15 Jan 2025. Customers creating new Machines in the region may have seen failures with: `failed to launch VM: permission_denied: bolt token: failed to verify service token: no verified tokens; token <token>: verify: context deadline exceeded`

The disruptions were caused by degraded connectivity to our token creation service from three hosts. We deployed a preventative fix for the network issues on 15 Jan 2025 at 12:58 UTC.

Timestamps of occurrences (UTC):
2025-01-11 03:32 to 2025-01-11 04:11
2025-01-11 17:07 to 2025-01-11 17:54
2025-01-14 11:36 to 2025-01-14 12:14
2025-01-15 07:46 to 2025-01-15 09:49

Report: "Network issues in SJC region"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating inbound network connectivity issues in SJC region. Users routed to SJC may be unable to access apps, or latency may be increased.

Report: "Transient networking issue in FRA"

Last update
resolved

This incident has been resolved.

monitoring

We have noticed a spike in packet loss across the FRA region at around 14:44 UTC caused by an upstream issue. This has recovered since 14:47 UTC, and we are currently monitoring the situation along with our upstream providers.

Report: "IPv6 Networking Issue in SCL"

Last update
resolved

This incident has been resolved.

identified

We are aware of a temporary IPv6 networking issue in SCL when accessing certain IPv6 ranges/providers, caused by upstream maintenance, and are working with our upstream on a fix. IPv6 requests originating from your machines in SCL may see increased error rates.

Report: "Network Instability"

Last update
resolved

This incident has been resolved.

monitoring

We're monitoring the platform, which continues to be stable and working normally. Additionally, we are in the process of deploying the Fly Proxy build that contains the fix for the bug that caused this issue.

monitoring

We have identified the cause of the network blip to be a bug in our Fly proxy and we're applying a fix.

monitoring

We noticed a temporary blip in our upstream network(s) between 16:38-16:40 UTC that affected our platform. This has since resolved, and we are monitoring for any continuing effects.

Report: "Machine Creates and Updates currently failing"

Last update
resolved

All changes have been deployed and Machine Create/Update API operations are healthy.

monitoring

The validation fix has been deployed and our monitoring alert for the API error rate has resolved.

identified

We were alerted to elevated error rates for machine creates and updates. A deploy introduced a validation error; that deploy is now being reverted.

Report: "Networking issues in GDL"

Last update
resolved

This incident has been resolved.

identified

The issue has been identified and a fix is currently being implemented.

Report: "sjc region capacity"

Last update
resolved

This incident has been resolved.

identified

We are currently at capacity in our SJC region. We're actively working on fixing this, however you may wish to deploy to nearby regions (lax or phx) as a workaround.

investigating

We are currently at capacity in our SJC region. We're actively working on fixing this, however you may wish to deploy to nearby regions (lax or phx) as a workaround.

Report: "Elevated API Latency and Timeout Errors"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and both Machines API and GraphQL API performance have returned to normal.

identified

We have identified the cause of the API latency increase and are working to mitigate it.

investigating

We are currently investigating elevated error rates with our Machines and GraphQL APIs. Users may experience slower responses or timeouts using the Machines API and flyctl commands.

Report: "Degraded Connectivity"

Last update
resolved

We have determined that some customers' machines are being throttled due to our full rollout of CPU quotas, separate from the incident yesterday. This in turn caused apparent networking issues. We have now temporarily rolled back these changes while we work with customers to better adapt to CPU quotas.

investigating

We are aware of customer-reported issues with internal networking and are investigating.