CircleCI

Is CircleCI Down Right Now? Check whether CircleCI is currently experiencing an outage.

CircleCI is currently Operational

Last checked from CircleCI's official status page

Historical record of incidents for CircleCI

Report: "Maintenance window for Runner"

Last update
Scheduled

Maintenance window for Runner is scheduled for June 3rd, 2025, at 19:00 PST / 22:00 EST. The maintenance window will last until 19:10 PST / 22:10 EST. During this period:

- Resource management will not be available
- The Runner web UI for inventory and installation will not be available
- Up to a 5-minute delayed start time for Runner jobs

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Report: "Delays to start macOS jobs on m2pro.medium and m2pro.large"

Last update
resolved

Thanks for the patience everyone. Everything back to normal.

monitoring

Job start times have returned to normal. We'll continue to monitor.

identified

We are experiencing delays starting macOS jobs on m2pro.medium and m2pro.large. Thank you for your patience.

Report: "Delays to start macOS jobs on m2pro.medium and m2pro.large"

Last update
Identified

We are experiencing delays starting macOS jobs on m2pro.medium and m2pro.large. Thanks you for your patience.

Report: "Delays in starting Mac Jobs"

Last update
resolved

This is now resolved. Wait times have recovered.

identified

The fix is still rolling out across our fleet, all looking good so far.

identified

We are experiencing delays in starting Mac Jobs. We have identified the issue and are in the process of rolling out a fix. Thank you for your patience.

Report: "Delays in starting Mac Jobs"

Last update
Identified

We are experiencing delays in starting Mac Jobs. We have identified the issue and are in the process of rolling out a fix. Thank you for your patience.

Report: "Dropped webhooks for GitHub pipelines"

Last update
resolved

GitHub have updated their API status to operational, and we are no longer seeing related customer impact. Customers will need to push new commits for any lost pipelines.

monitoring

Some GitHub webhooks are being dropped due to an incident with GitHub. Customers may also experience a delay in scheduled workflows.

Report: "Dropped webhooks for GitHub pipelines"

Last update
Monitoring

Some GitHub webhooks are being dropped due to an incident with GitHub.Customers may also experience a delay in scheduled workflows.

Report: "Delays in starting some jobs"

Last update
postmortem

## Summary

On May 1, 2025, from 22:20 UTC to May 2, 2025, 02:00 UTC, CircleCI customers experienced delays in starting most jobs. Affected jobs were contained to the following resource classes: Docker large, Docker medium, Docker small, and Linux large. During this time customers may have also experienced delays in obtaining status checks.

## What Happened (all times UTC)

At approximately 22:05 on May 1, 2025, we initiated a database upgrade to the service that dispatches jobs. We used a blue/green deployment to stand up a second database running the upgraded version and used logical replication to keep the data across the two databases in sync. We had been running the blue (old version) and the green (new version) without issues for a couple of days, and replication was confirmed to be in sync when we triggered the cutover from blue to green.

Upon completion of the cutover process, we noticed application errors for jobs, which meant the application pods had failed to automatically pick up the new DNS route. A rolling restart of the pods was performed, and all pods were back online with no further application errors as of 22:17.

At 22:40, teams were alerted that Docker jobs were backing up. They initially investigated whether the pod restarts had caused fewer processing nodes to be online, and began to manually scale up the nodes. At 23:47, it was confirmed that only a small quantity of jobs was making it through to the processing pods, causing the backlog and ruling out an infrastructure issue. It was determined that jobs in the following resource classes were not executing: Docker large, Docker medium, Docker small, and Linux large.

At 00:40 on May 2, 2025, orphaned task records for the above-mentioned resource classes were identified. An orphaned task record is an item of work with no associated job; when these records were picked up by the service, they caused a failure that prevented the next record from being picked up. The team updated the task status to "completed" and immediately saw more jobs processing, and the backlog of jobs dropped. By 00:45, the backlog of jobs had completely cleared and the issue was thought to be remediated.

At 00:56, an alert triggered, warning of a backlog of jobs once again. Upon investigation, it was determined that only some Docker resource classes were affected (large, medium, and small); all other resource classes, including Linux jobs, were operating as expected. An investigation determined that additional orphaned task records had been written to the database after 00:40. Logical replication was manually disabled and the orphaned task records were updated at 01:55. At 02:10 the backlog of jobs had once again cleared. The team continued to monitor over the following hour with no additional occurrences of orphaned tasks and declared the incident closed at 03:39.

Post-incident, the team continued to investigate. The root cause was determined to be a race condition between the application and logical replication when the application pods were restarted. A task event was rerun and wrote to the green (new) database before the original task event status was replicated from the blue (old) database. This created a unique constraint error that broke replication. Because logical replication does not respect foreign key constraints, task records older than those already in the green database were replicated into it, creating the orphaned task records seen during the incident. The issue resurfaced immediately after draining the job queue as the failed replication task tried to restart.

## Future Prevention and Process Improvement

The incident has exposed the need to implement further controls on database writes during the upgrade process while using logical replication. Even if replication is in sync, the milliseconds of network delay incurred in transferring the data can be enough to trigger this scenario.

1. We will update the upgrade procedure to limit writes to the database for a short period of time while logical replication writes the final updates from the old database version to the new version.
2. A second data replication verification test will be added to the procedure before turning writes on for the new version.
3. Once replication is confirmed to be in sync, replication will be disabled to avoid any possibility of conflicts.
4. We will be implementing a more in-depth review process between the database and service owner teams to review the upgrade process and risks prior to performing the change.

We sincerely apologize for the disruption this incident caused to your ability to build on our platform. We understand the critical role CircleCI plays in your development workflow and take any service disruption seriously. We're committed to learning from this experience and have already implemented several measures to prevent similar occurrences in the future. Thank you for your patience and continued trust in CircleCI.
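
As an illustration of the remediation step described above (marking orphaned task records as completed so the dispatcher can move past them), here is a minimal sketch in Python. It assumes a hypothetical `tasks` table with a `status` column and a nullable reference to a `jobs` table; CircleCI's actual schema and internal tooling are not public, so every name here is an assumption.

```python
# Hypothetical sketch: find task records with no associated job ("orphaned")
# and mark them completed so the dispatcher can move past them.
# Table, column, and resource class names are assumptions, not CircleCI's schema.
import psycopg2

RESOURCE_CLASSES = ["docker/large", "docker/medium", "docker/small", "linux/large"]

def complete_orphaned_tasks(dsn: str) -> int:
    with psycopg2.connect(dsn) as conn, conn.cursor() as cur:
        cur.execute(
            """
            UPDATE tasks t
               SET status = 'completed'
             WHERE t.status = 'pending'
               AND t.resource_class = ANY(%s)
               AND NOT EXISTS (SELECT 1 FROM jobs j WHERE j.id = t.job_id)
            """,
            (RESOURCE_CLASSES,),
        )
        return cur.rowcount  # number of orphaned tasks unblocked

if __name__ == "__main__":
    print(complete_orphaned_tasks("postgresql://localhost/dispatch"))
```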

resolved

All jobs are now running normally. Thank you for your patience whilst we resolved the issue.

monitoring

We are continuing to monitor for any further issues.

monitoring

Jobs for the following resource classes will have suffered significant delays in running; these will be processed over the next X minutes:

* Docker Large, Medium and Small
* Linux Large

Those jobs will start within the next 15 minutes, so you should not need to retry them. We thank you for your patience whilst we resolve this issue.

monitoring

We're continuing to monitor the delays with starting Docker jobs. Thank you for your patience.

monitoring

Docker jobs have not recovered as expected, and customers may continue to see delays for Docker jobs starting. We are working to increase capacity and thank you for your patience.

monitoring

This incident impacted final result delivery between 22:06 and 22:17 UTC. Customers may experience delays starting Docker Large jobs as the system recovers. We will continue to monitor recovery and thank you for your patience.

monitoring

This also impacts status checks which may not have been sent to GitHub.

Report: "Delays in insights dashboard data"

Last update
resolved

We've verified our fix and insights data is refreshing as expected.

monitoring

We are monitoring our change to catch up on delayed insights data.

identified

We have identified an issue with delays in insights data. The cause has been identified and we are working on a solution.

Report: "Delays in insights dashboard data"

Last update
Identified

We have identified an issue with delays in insights data.The cause has been identified and we are working on a solution.

Report: "We are currently experiencing an outage affecting v1 and v2 API documentation pages"

Last update
resolved

This issue has been resolved. Thank you for your patience.

investigating

We are currently experiencing an outage affecting the following API documentation pages: V1 API Documentation: https://circleci.com/docs/api/v1/index.html V2 API Documentation: https://circleci.com/docs/api/v2

Report: "We are currently experiencing an outage affecting v1 and v2 API documentation pages"

Last update
Investigating

We are currently experiencing an outage affecting the following API documentation pages:V1 API Documentation: https://circleci.com/docs/api/v1/index.htmlV2 API Documentation: https://circleci.com/docs/api/v2

Report: "Test results are delayed for test insights"

Last update
resolved

This incident has been resolved.

monitoring

We will continue monitoring overnight. Thank you for your patience.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

Some users may notice a delay in their test insights. We are working on a fix. Thank you for your patience.

Report: "Test results are delayed for test insights"

Last update
Investigating

Some users may notice a delay in their test insights. We are working on a fix. Thank you for your patience.

Report: "Delays in May 1, 2025 Data in Usage API"

Last update
resolved

The issue causing delays in Usage API data has now been resolved. We thank you for your patience while our engineers worked to resolve it.

identified

Some customers will see a delay in Usage API Data for May 1, 2025. We've identified the problem and are working to resolve it. Thank you for your patience.

Report: "CircleCI UI Loading & build triggering issues"

Last update
postmortem

## Summary

On April 4, 2025, from 00:16 to 01:49 UTC (approximately 1 hour and 33 minutes), CircleCI experienced a service disruption affecting both our user interface and build capabilities. During this time, customers were unable to access the CircleCI UI or initiate new builds. The incident was caused by an inadvertently applied Web Application Firewall (WAF) rule that blocked legitimate traffic to CircleCI services. It was resolved when our engineering team identified and removed this rule. [The original status page can be found here.](https://status.circleci.com/incidents/zh1qd6lrntl7)

## What Happened (all times UTC)

The WAF is a critical security component that sits in front of our services and protects them from malicious traffic while allowing legitimate requests to pass through.

* **00:16**: A WAF rule was inadvertently introduced that began blocking legitimate traffic to CircleCI services.
* **00:26 - 00:52**: Our monitoring systems detected degraded performance across multiple services. This occurred just as our teams were concluding another [unrelated incident](https://status.circleci.com/incidents/31n0h4tcl02g), which initially caused some confusion about whether the issues might be connected. Customers began reporting an inability to access the CircleCI UI or initiate new builds, and our teams pivoted to investigate these new symptoms. The team noted a drop in GitHub webhooks and widespread connectivity issues between our frontend and backend services, spending time to ensure these weren't aftereffects of the previous incident.
* **00:52**: We established that we were looking at a completely separate incident, launched our incident process with a new incident, and assembled a dedicated response team to investigate the service disruption.
* **01:15**: Initial investigation revealed broad connectivity issues between the frontend and our backing APIs, including CORS (Cross-Origin Resource Sharing) errors. The team explored multiple potential causes, including recent deployments and infrastructure changes, but the cause remained unclear.
* **01:35**: Our automated Terraform drift detection identified a difference between our defined and current WAF settings. This discovery revealed that a WAF rule had been changed outside of our standard Terraform deployment process and was blocking legitimate traffic to the [api.circleci.com](http://api.circleci.com) and [circleci.com](http://circleci.com) CloudFront distributions.
* **01:41**: The problematic WAF rule was reverted from both affected CloudFront distributions.
* **01:49**: Our monitoring confirmed that error rates decreased across all affected services as traffic was properly routed again.
* **01:55**: Full service restoration was confirmed across the board.
* **02:59**: The incident was officially closed after a period of monitoring confirmed stable operation.

## Root Cause Analysis

While we manage all our infrastructure, including the WAF, almost entirely with Terraform, we discovered during this incident a misconfiguration in IAM controls that allowed a specific role to make changes without using our infrastructure-as-code tooling. As a result, while investigating routine security monitoring, an operator manually modified the WAF configuration, believing they were taking read-only actions. The resulting change blocked legitimate traffic to our services.

Based on the same assumptions, those investigating the incident did not prioritize the WAF configuration, expecting that any changes would have gone through our Terraform pipeline, and there was no record of such changes. The diverse symptoms produced across our platform, combined with the occurrence shortly after a separate, [unrelated incident](https://status.circleci.com/incidents/31n0h4tcl02g), led to time spent on paths of inquiry that ultimately proved fruitless.

Eventually, our automated drift detection process ran and identified the issue. While this safeguard was invaluable, nearly 80 minutes passed between the initial change and the detection. Despite the confusion, drift detection identified the exact configuration change that caused the issue and led directly to the resolution of the incident.

## Future Prevention and Process Improvement

This incident highlighted the strength of our existing systems while identifying several areas where we can improve and make them even more robust:

1. We have implemented stricter IAM policies that prevent direct modification of infrastructure managed by our infrastructure-as-code pipeline.
2. Terraform's drift detection was instrumental in identifying the root cause of this incident. We are enhancing these capabilities to provide faster alerts when critical infrastructure components deviate from their expected state. We are also adding technical guardrails to ensure all configuration management follows this approach, which helps prevent human error and provides better visibility into changes.
3. We're establishing better protocols for implementing and testing WAF rules before they reach production environments. Additionally, we are adding monitoring specifically for WAF behavior and traffic patterns to detect potential issues more quickly.
4. We're investigating additional technical controls through Service Control Policies (SCPs) that provide organization-wide restrictions on IAM roles, reducing the risk of accidental misconfigurations. These policies create hard boundaries on what actions can be performed on critical systems like our WAFs, adding an extra layer of protection against unintended changes.

We sincerely apologize for the disruption this incident caused to your ability to build on our platform. We understand the critical role CircleCI plays in your development workflow and take any service disruption seriously. We're committed to learning from this experience and have already implemented several measures to prevent similar occurrences in the future. Thank you for your patience and continued trust in CircleCI.
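
For a concrete picture of what WAF drift detection can look like, here is a minimal sketch that compares the rules deployed on a CloudFront-scoped AWS WAFv2 web ACL against a locally stored baseline using boto3. The web ACL name, ID, and baseline file are placeholders, and this is far cruder than Terraform's own drift detection; it is illustrative only, not CircleCI's implementation.

```python
# Hypothetical drift check: compare the rules deployed on a CloudFront-scoped
# WAFv2 web ACL against a locally stored baseline (e.g. exported from IaC).
# The web ACL name/id and baseline path are placeholders.
import json
import boto3

def waf_rules_drifted(name: str, web_acl_id: str, baseline_path: str) -> bool:
    # CLOUDFRONT-scoped web ACLs must be queried via us-east-1
    client = boto3.client("wafv2", region_name="us-east-1")
    deployed = client.get_web_acl(Name=name, Scope="CLOUDFRONT", Id=web_acl_id)
    deployed_rules = deployed["WebACL"]["Rules"]

    with open(baseline_path) as f:
        expected_rules = json.load(f)

    # A naive whole-object comparison; `terraform plan` is far more precise,
    # but any mismatch here is worth an alert.
    def canon(rules):
        return json.dumps(rules, sort_keys=True, default=str)

    return canon(deployed_rules) != canon(expected_rules)

if __name__ == "__main__":
    if waf_rules_drifted("frontend-acl", "00000000-0000-0000-0000-000000000000",
                         "waf_baseline.json"):
        print("WAF configuration drift detected - page the on-call")
```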

resolved

The incident has now been resolved. Thank you for your understanding and patience while our engineers investigated and mitigated the issue.

monitoring

A fix has been implemented, and we are currently monitoring the system to ensure everything is functioning as expected. Thank you for your patience.

investigating

We are investigating intermittent issues triggering pipelines or sending status updates.

investigating

We are investigating intermittent issues with loading the CircleCI UI.

Report: "Delays in May 1, 2025 Data in Usage API"

Last update
Identified

Some customers will see a delay in Usage API Data for May 1, 2025. We've identified the problem and are working to resolve it. Thank you for your patience.

Report: "Delay in starting some jobs"

Last update
resolved

This incident has been resolved.

monitoring

A fix was put in place. We are monitoring the situation.

identified

Queues should be clearing, and jobs starting normally.

investigating

We are investigating a delay in starting some jobs.

Report: "Delay in starting some jobs"

Last update
Investigating

We are investigating a delay in starting some jobs.

Report: "Final results of some jobs may not be reported in the UI"

Last update
Monitoring

This also impacts status checks which may not have been sent to GitHub.

Report: "intermittent checkout job failures"

Last update
resolved

This incident has been resolved.

monitoring

We've rolled out the fix and are monitoring.

investigating

Some customers are experiencing checkout step failures.

Report: "intermittent checkout job failures"

Last update
Investigating

Some customers are experiencing checkout step failures.

Report: "Delays in starting workflows"

Last update
postmortem

## Summary

On April 3, 2025, from 22:08 UTC to 23:45 UTC, CircleCI customers experienced increased latency and some failures with starting and canceling workflows and jobs. During this time customers may have experienced delays and difficulty viewing workflows in the UI. We appreciate your patience and understanding as we worked to resolve this incident.

## What Happened (all times UTC)

At approximately 22:00 on April 3, we initiated an upgrade to the service responsible for workflows. We expected a short delay (< 90 seconds) during the database upgrade, where calls to the database from the workflows service would be sent to a queue and retried over a 10 minute period. We expected to see the queues grow slightly during and immediately after the upgrade.

At 22:08, when the blue/green deployment was complete, we verified queries were being served. At 22:17, we identified increased latency in the workflows service, as well as some errors from jobs being dropped due to exhausting their 10 minutes of retries. At 22:29 additional engineers were engaged, and at 22:30 the team restarted the workflows pods to ensure they were all connecting to the correct database. At 22:35 a public incident was declared.

At 22:41, it was observed that all queries on the new database were hitting disk, which indicated that the database statistics tables had not been updated. The team immediately upsized the database and disabled any non-business-critical operations on the database. At 23:00, the workflows service was scaled down to a single pod to give the database capacity to recover while the statistics table was rebuilt. At 23:10, the team observed the workflows queue backing up due to the reduction in pods, as expected, but did not see an improvement in database performance.

At 23:19, the team decided to re-enable writes on the old database and reinstate its primary status to restore service to customers sooner. This work completed at 23:29. The team continued to monitor the workflows queue. At 23:45 it was determined that the workflow queue was back to normal operating levels, and no further errors were observed.

Post-incident, the team continued to investigate. The root cause was determined to be that the analyze operation to rebuild the database's statistics table, which is used for indexes, had been executed too early in the operation and was made stale by a second major version upgrade within the same deployment.

## Future Prevention and Process Improvement

The blue/green database deployment procedures have been updated to run an analysis procedure after every major version change. The team has also tested running the analyze command while a database is under pressure to confirm it has no further degrading effects on database performance. This will be noted for future remediation. Before any additional migrations are run, the team will add additional automated tests and manual checkpoints throughout the process to identify and resolve issues before the blue/green cutover.
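
To make the "run an analysis procedure after every major version change" step concrete, here is a minimal sketch, assuming direct psycopg2 access to a PostgreSQL instance, that re-runs ANALYZE on tables whose planner statistics look stale. The DSN, the staleness window, and the overall approach are illustrative assumptions, not CircleCI's actual upgrade tooling.

```python
# Hypothetical post-cutover check: after a major-version upgrade, find tables
# whose planner statistics were never (or long ago) refreshed and re-run
# ANALYZE on them. DSN and staleness window are placeholders.
import psycopg2

def analyze_stale_tables(dsn: str, max_age: str = "1 hour") -> list[str]:
    conn = psycopg2.connect(dsn)
    conn.autocommit = True  # apply each ANALYZE immediately
    reanalyzed = []
    try:
        with conn.cursor() as cur:
            # last_analyze / last_autoanalyze come from pg_stat_user_tables
            cur.execute(
                """
                SELECT schemaname, relname
                  FROM pg_stat_user_tables
                 WHERE GREATEST(COALESCE(last_analyze, 'epoch'),
                                COALESCE(last_autoanalyze, 'epoch'))
                       < now() - %s::interval
                """,
                (max_age,),
            )
            for schema, table in cur.fetchall():
                # identifiers come from the catalog, so quoting them is safe here
                cur.execute(f'ANALYZE "{schema}"."{table}"')
                reanalyzed.append(f"{schema}.{table}")
    finally:
        conn.close()
    return reanalyzed
```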

resolved

The issue impacting workflows and pipelines has now been resolved.

monitoring

Our engineers have implemented a fix for the issue impacting workflows and pipelines and are back within normal operation range. We will continue to monitor the situation. We thank you for your patience while we worked to resolve this issue.

identified

We are continuing to work on the issue impacting workflows and pipelines and are starting to see our systems recover. Thank you for your patience while our engineers are working to resolve this.

identified

We have identified the issue causing workflows and pipelines to be delayed or not start at all. Our engineers are working on a fix. We appreciate your patience and understanding as we actively work to resolve this disruption. We will keep you updated on our progress.

investigating

We are continuing to investigate this issue.

investigating

We are investigating delays in starting workflows.

Report: "Our engineers investigated an issue impacting log-in"

Last update
resolved

This incident has been resolved.

monitoring

Auth0 has applied a fix and we are seeing reduced log-in error rates as well. We will continue to monitor our systems. Thank you for your patience.

identified

We've identified an issue affecting users attempting to log in with username and password credentials. Our engineering team has determined this is related to an ongoing incident with Auth0, our authentication provider. Users can track the Auth0 incident status at https://status.auth0.com/incidents/zgyzzt12c40v. We're actively monitoring the situation and will provide updates as the issue is resolved. We apologize for any inconvenience this may cause.

Report: "Our engineers investigated an issue impacting log-in"

Last update
Identified

We've identified an issue affecting users attempting to log in with username and password credentials. Our engineering team has determined this is related to an ongoing incident with Auth0, our authentication provider. Users can track the Auth0 incident status at https://status.auth0.com/incidents/zgyzzt12c40v. We're actively monitoring the situation and will provide updates as the issue is resolved. We apologize for any inconvenience this may cause.

Report: "Delay in starting jobs"

Last update
resolved

Between 09:54 UTC and 10:01 UTC, all jobs experienced a slight delay in starting. All jobs will have run, so it is not necessary to rerun any of them. We apologize for the delay.

Report: "Delay in starting jobs"

Last update
Resolved

Between 09:54UTC and 10:01UTC all jobs experienced a slight delay in starting. All jobs will have run, so it is not necessary rerun any of them. We apologize for the delay.

Report: "CircleCI UI Loading & build triggering issues"

Last update
Update

We are investigating intermittent issues triggering pipelines or sending status updates.

Investigating

We are investigating intermittent issues with loading the CircleCI UI.

Report: "CircleCI UI Loading Issues"

Last update
Investigating

We are investigating intermittent issues with loading the CircleCI UI.

Report: "Delays in starting workflows"

Last update
Update

We are continuing to investigate this issue.

Investigating

We are investigating a delays in starting workflows.

Report: "Orb fetch causing pipeline failure"

Last update
resolved

This incident is now resolved. Orbs are functioning as normal, thank you for your patience.

monitoring

We are monitoring a solution to this issue.

identified

The error message displayed indicates that configs were deemed invalid. A fix has been identified and is being implemented.

identified

The issue has been identified and a fix is being implemented.

Report: "Orb fetch causing pipeline failure"

Last update
Update

Error message displayed is configs deemed to be invalid. A fix is identified and being implemented.

Identified

The issue has been identified and a fix is being implemented.

Report: "Customers may see delays with status updates on Github"

Last update
resolved

The issue impacting status updates for GitHub App basic status has now been resolved. Please note that a small percentage of pipelines triggered directly from the CircleCI API did not post status to GitHub and must be re-run. If you have any further issues, please reach out to support for assistance. We thank you for your patience and understanding as our engineers worked towards mitigation.

monitoring

We are continuing to monitor for any further issues.

monitoring

A small percentage of pipelines triggered directly from the CircleCI API did not successfully post status to GitHub. To post status, the pipeline must be re-run.

investigating

Our engineers are investigating an issue where some customers may see delays with status updates. The impact is limited to GitHub App basic status and does not affect OAuth basic status or GitHub checks. We appreciate your patience and understanding as we actively work to resolve this delay. We will keep you updated on our progress.

Report: "Customers may see delays with status updates on Github"

Last update
Investigating

Our engineers are investigating an issue where some customers may see delays with status updates. The impact is limited to Github App basic status and does not affect oAuth basic or Github checks.We appreciate your patience and understanding as we actively work to resolve this delay. We will keep you updated on our progress.

Report: "Increased Job Start Latencies"

Last update
Resolved

Between 7:15 UTC and 7:50 UTC, some customers may have been impacted by increased job start latency. Our engineering team promptly identified and resolved this issue and job start times have now returned to normal levels. We thank you for your patience and understanding.

Report: "Increased Job Start Latencies"

Last update
resolved

Between 7:15 UTC and 7:50 UTC, some customers may have been impacted by increased job start latency. Our engineering team promptly identified and resolved this issue and job start times have now returned to normal levels. We thank you for your patience and understanding.

Report: "Docker Executor Infrastructure Upgrade"

Last update
In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

For additional details, please refer to this announcement: https://discuss.circleci.com/t/docker-executor-infrastructure-upgrade/52282

In progress

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled

We are updating the infrastructure supporting the arm resource class and the ip_ranges feature on the Docker Executors through April 1st. Customers may experience build failures during this time. View the announcement post for more information: link.

Report: "Increase wait time for M2pro medium resource class"

Last update
resolved

The issue impacting wait times for the M2 Pro medium resource class has now been resolved. We thank you for your patience while we worked to resolve the delays.

monitoring

A fix has been implemented for the issue impacting our M2 pro medium resource class. The wait times are within normal range and we will continue to monitor the situation as the fix gets rolled out. We thank you for your patience while our engineers worked to resolve this delay.

identified

Our engineers have identified the issue causing builds using the m2pro.medium resource class to be delayed by up to 6 minutes and have implemented a fix. We are starting to see recovery in wait times. We thank you for your patience as we continue to work on this issue.

investigating

Our engineers are investigating an issue where customers using M2 Pro medium resource class may experience a higher wait time (delays of up to 6 minutes). We appreciate your patience and understanding as we actively work to resolve this delay. We will keep you updated on our progress.

Report: "Increase wait time for M2pro medium resource class"

Last update
Resolved

The issue impacting the wait time on M2 pro medium resource class has now been resolved. We thank you for your patience while we worked through to resolve the delays caused.

Monitoring

A fix has been implemented for the issue impacting our M2 pro medium resource class. The wait times are within normal range and we will continue to monitor the situation as the fix gets rolled out. We thank you for your patience while our engineers worked to resolve this delay.

Identified

Our engineers have identified the issue and implemented a fix where builds using m2pro.medium resource class were delayed for up to 6 minutes. We are starting to see recovery in the wait time. We thank you for your patience as we continue to work on this issue.

Investigating

Our engineers are investigating an issue where customers using M2 Pro medium resource class may experience a higher wait time (delays of up to 6 minutes). We appreciate your patience and understanding as we actively work to resolve this delay. We will keep you updated on our progress.

Report: "Pipelines page intermittently not loading pipelines."

Last update
resolved

This incident is now resolved.

monitoring

The pipelines page is recovering and we are observing normal behaviour. We are continuing to monitor.

identified

We have identified the cause and are continuing to work on a fix.

identified

We have identified an issue causing a timeout when loading the Pipelines page on projects. We are currently working on a potential fix. Pipelines are running as normal.

Report: "Pipelines page intermittently not loading pipelines."

Last update
Resolved

This incident is now resolved.

Monitoring

The pipelines page is recovering and we are observing normal behaviour. We are continuing to monitor.

Update

We have identified the cause and are continuing to work on a fix.

Identified

We have identified an issue causing a timeout when loading the Pipelines page on projects. We are currently working on a potential fix. Pipelines are running as normal.

Report: "Increased wait times for M2 Pro Large"

Last update
resolved

There were increased wait times for M2 Pro Large.

Report: "Increased wait times for M2 Pro Large"

Last update
Resolved

There was increased wait times for M2 Pro Large.

Report: "Outbound webhooks delayed"

Last update
resolved

The issue causing delays in outbound webhooks set up for the job-completed event has now been resolved. We thank you for your patience and understanding while our engineers worked to fix this issue.

monitoring

The issue causing delays in outbound webhooks has now been mitigated, and latencies for outbound webhooks are back to normal. We will continue to monitor the situation. Thank you for your understanding while we worked to investigate the issue.

investigating

Our engineers are currently investigating an issue causing delays in outbound webhooks. The impact is limited to customers that have outbound webhooks set up for the job-completed event. We will provide updates as soon as more information is available. Thank you for your understanding.

Report: "Outbound webhooks delayed"

Last update
Resolved

The issue causing delays in outbound webhooks setup for job completed event has now been resolved. We thank you for your patience and understanding while our engineers worked to fix this issue.

Monitoring

The issue causing delays in outbound webhooks has now been mitigated, the latencies in outbound webhooks are back to normal. We will continue to monitor the situation. Thank you for your understanding while we worked to investigate the issue.

Investigating

Our engineers are currently investigating an issue causing delays in outbound webhooks, the impact is limited to customers that have outbound webhooks setup for job completed. We will provide updates as soon as more information is available. Thank you for your understanding.

Report: "Jobs using contexts are not running"

Last update
resolved

We are continuing to observe normal behaviour. This incident is now resolved. If you see any affected workflows/jobs, they will need to be re-run on CircleCI or a new commit pushed.

monitoring

We have identified the cause and implemented a fix to the affected service. We are seeing recovery and are currently monitoring.

investigating

Jobs that use contexts are not running. We are currently investigating the cause.

Report: "Incread Queue Times for macos.m1.large.gen1"

Last update
resolved

This incident has been resolved.

monitoring

Queue times have stabilized.

identified

Capacity on the M1 resource class is limited. Customers can experience less queuing by moving to m2pro.medium or m2pro.large.

Report: "Issues loading the jobs page"

Last update
resolved

This incident has been resolved.

monitoring

The jobs page has automatically recovered. We have identified the change that caused this issue and have reverted it so that the issue cannot reoccur. Thank you for your patience.

investigating

Our engineering team is currently investigating an issue affecting some customers with loading the jobs page. Please note that jobs continue to flow through the system without interruption; the impact is limited to the user interface only. We will provide updates as soon as more information is available. Thank you for your understanding!

Report: "Delays in starting M1 Large Mac Jobs"

Last update
resolved

The issue with higher queue times for the M1 Large resource class has now been resolved. If you encounter any further delays, please consider switching to the M2 resource class for improved performance. Thank you for your patience!

monitoring

We experienced higher queue times for customers requesting M1 Large resource class between 17:05 and 17:20 UTC. Queue times have now returned to normal. If you continue to experience delays with the M1 resource class, we recommend switching to the M2 resource class for optimal performance. We will continue to monitor the situation with M1 Large resource class capacity. Thank you for your understanding!

Report: "Login Page issues with Bitbucket user not able to login"

Last update
resolved

The issue has been resolved. Thank you for your patience. Please refresh the login page and try logging in with Bitbucket again.

monitoring

We have implemented a fix and are currently monitoring the results. Please refresh the login page and try logging in using Bitbucket again.

identified

We have identified the issue affecting our system and are actively working on a resolution. Thank you for your patience while we resolve this.

investigating

We are currently investigating an issue where Bitbucket users are unable to log in through our login page.

Report: "Delays in starting M1 Large Mac Jobs"

Last update
resolved

There was a delay in starting M1 Large Mac Jobs. In some cases the delay could reach up to 10 minutes. No work was lost.

Report: "Delays in starting Mac jobs using m2pro instance"

Last update
resolved

This incident has been resolved.

monitoring

Extra capacity has been added and we are seeing wait times decrease to normal levels. Thank you for your patience, we will continue to monitor recovery.

identified

There are presently delays starting m2pro machines. We are working to resolve this issue, but it will take time due to high demand. Thank you for your patience whilst we resolve this.

Report: "Some pipelines failed to be created"

Last update
resolved

This incident is resolved. Our engineers detected failures in pipeline creation at 20:17 UTC, and the system automatically recovered by 20:18 UTC.

monitoring

The system seems to have recovered and we are monitoring. Customers should re-trigger impacted pipelines, either in the UI or by re-pushing the work to the code repository.

Report: "Some commit status updates were not updated"

Last update
postmortem

## Summary

On January 23, 2025, from 19:48 UTC to 20:43 UTC, customers using CircleCI GitHub OAuth and Bitbucket projects stopped receiving commit status updates. This was due to a code change deployed at 19:48 UTC that negatively impacted the service responsible for sending commit statuses to the Version Control System (VCS) provider.

## What Happened

On January 23, 2025, at 19:48 UTC, we deployed a change in how we send events from our service that orchestrates workflows. This change inadvertently modified the value of a key field used by a downstream service responsible for setting commit statuses. At 20:03 UTC, the team responsible for the downstream service was alerted to an increase in errors when setting commit statuses. This alert auto-resolved without intervention, delaying our response time. At 20:12 UTC, our support team notified us that customers were experiencing issues with commit status updates. This prompted an investigation. By 20:40 UTC, we had identified and reverted the faulty code change, with customer impact ceasing at 20:43 UTC.

## Future Prevention and Process Improvement

We will add more comprehensive testing to cover the events sent by our orchestration service. Additionally, we will implement synthetic tests to catch failures in setting proper commit status updates. We are also investigating why the alert auto-resolved to ensure similar issues are actioned sooner.

While investigation and remediation started promptly after being notified of the issue, there was a delay in initializing our incident protocol, which delayed the creation of a status page update and limited the information available to provide clear timing on the published update. We are revisiting our incident declaration procedures and tool configuration to provide further clarity around incident declaration and improve response time.
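
As a rough illustration of the kind of synthetic test mentioned above, the sketch below sets a commit status on a hypothetical canary repository via the GitHub REST API and then reads it back, alerting if the round trip fails. The repository, commit SHA, token, and status context are all placeholders; CircleCI's real checks are not public and are certainly more involved.

```python
# Hypothetical synthetic check: push a commit status to a canary repository via
# the GitHub REST API, then read it back and alert if it is missing.
# Repo, SHA, and token are placeholders.
import os
import requests

GITHUB_API = "https://api.github.com"
REPO = "example-org/status-canary"       # placeholder canary repository
SHA = os.environ["CANARY_COMMIT_SHA"]    # placeholder commit to decorate
HEADERS = {
    "Authorization": f"Bearer {os.environ['GITHUB_TOKEN']}",
    "Accept": "application/vnd.github+json",
}

def run_synthetic_check() -> bool:
    # 1. Set a commit status (what the status-setting service normally does).
    create = requests.post(
        f"{GITHUB_API}/repos/{REPO}/statuses/{SHA}",
        headers=HEADERS,
        json={"state": "success", "context": "ci/synthetic-check"},
        timeout=10,
    )
    create.raise_for_status()

    # 2. Read back the combined status and confirm our context is present.
    combined = requests.get(
        f"{GITHUB_API}/repos/{REPO}/commits/{SHA}/status",
        headers=HEADERS,
        timeout=10,
    )
    combined.raise_for_status()
    contexts = {s["context"] for s in combined.json().get("statuses", [])}
    return "ci/synthetic-check" in contexts

if __name__ == "__main__":
    if not run_synthetic_check():
        print("commit status round-trip failed - raise an alert")
```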

resolved

At 19:48 UTC, some customers' projects may have stopped receiving commit status updates. The incident was resolved at 20:43 UTC. To ensure that the checks are reported correctly, we recommend rerunning the impacted workflows from the start.

Report: "Customers may be seeing delays with their workflows starting and may notice issues viewing their workflows through our UI"

Last update
postmortem

## Summary

From January 21, 2025 at 23:50 UTC to January 22, 2025 at 00:56 UTC, CircleCI customers experienced increased latency with starting and canceling workflows and jobs, and experienced delays and difficulty viewing workflows in the UI. We appreciate your patience and understanding as we worked to resolve this incident.

## What Happened (all times UTC)

At approximately 23:00 on January 21, an automated alert indicated that a database instance responsible for holding archived data was almost out of free storage space. At 23:09, the team halted a blue/green deployment on the database to free a logical replication slot, thinking that may have been the cause, but that did not help the database recover.

The archival service is called synchronously by the service responsible for orchestrating workflows. When the archival service's database reached capacity, these requests started timing out, which impacted the overall performance of the workflows service. At 23:26, the workflows queue began to grow, leading to increased latency starting workflows and jobs, canceling jobs, and viewing workflows in the UI. This was not immediately attributed to the archival database issues, in part because there was a separate alert at approximately the same time related to request volume, but when the queue continued to grow after that issue resolved, a separate team began to investigate workflows further and scaled up the event consumer responsible for processing the queue at 23:44.

The team investigating the unhealthy database instance promoted a read replica to a standalone primary at 23:55. By 00:03, the workflows queue depth returned to normal, which resolved workflow latency and UI impacts. However, at around the same time, Linux machine jobs began to queue downstream due to errors trying to provision instances with our cloud provider, which was actively investigating increased API error rates to the provisioning endpoint in our region. Requests began to be fulfilled around 00:32, but due to the volume of requests being processed, we also experienced rate limiting that extended the length of impact. Our queues returned to normal levels at 00:56, and the incident was resolved at 01:26 after confirming there was no further impact.

Post-incident, the team continued to investigate. The root cause was determined to be a code change made to a function in the impacted database on January 16th, which unintentionally created an excessive number of log messages. The function has been updated to fix this behavior.

## Future Prevention and Process Improvement

We have added a max duration to the workflows retry policy for archiving workflows to allow it to fail earlier than the default timeout, limiting the potential impact on the workflows service should there be a future issue with the archival service. Longer-term, we intend to shift the workflow archival process to an event-based model to decouple the services.

While alerting did indicate an issue with the archival database, the team did not have much time to address the problem before it caused customer impact because the database was filling significantly more quickly than previously forecasted. We will be implementing forecast and anomaly monitoring for our databases to alert us of unusual activity before it reaches critical levels.
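
The "max duration on the retry policy" change can be illustrated with a small, generic sketch: retry a call with backoff, but give up once a total time budget is exhausted so a slow dependency fails fast instead of stalling its caller. The names and timings below are illustrative assumptions, not CircleCI's code.

```python
# Generic sketch of a retry policy with a maximum total duration, in the spirit
# of the change described above. Timings and exception types are placeholders.
import time

class ArchivalUnavailable(Exception):
    """Raised when the call cannot complete within the retry budget."""

def call_with_deadline(fn, *, max_duration_s: float = 5.0, base_backoff_s: float = 0.2):
    deadline = time.monotonic() + max_duration_s
    attempt = 0
    while True:
        try:
            return fn()
        except Exception as exc:  # in practice, catch only retryable errors
            attempt += 1
            sleep_for = min(base_backoff_s * (2 ** (attempt - 1)), 1.0)
            if time.monotonic() + sleep_for >= deadline:
                # Fail fast so the caller (e.g. a workflow orchestrator) is not
                # dragged down by a struggling dependency.
                raise ArchivalUnavailable("call exceeded its retry budget") from exc
            time.sleep(sleep_for)
```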

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are continuing to investigate this issue. Thank you for your patience and understanding.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue. You can still view pipelines for a specific project in the UI.

Report: "Delayed Status checks and outbound webhooks"

Last update
resolved

The incident impacting status and outbound webhooks has now been resolved. We thank you for your patience while our engineers worked on the issue.

monitoring

We are seeing signs of recovery from the issue causing delays in status checks and outbound webhooks. We will continue to monitor the situation closely.

investigating

Our engineers are currently investigating an issue that may have an impact on Status checks and outbound webhooks. We will provide further updates as more information becomes available.

Report: "M2-Pro Medium jobs delayed"

Last update
resolved

The incident impacting m2pro.medium resource class has now been resolved. We thank you for your patience while our engineers worked through this issue.

monitoring

We have implemented a fix for the issue affecting the MacOS m2pro.medium resource class and are currently observing signs of improvements. The task start time has decreased and has returned to normal levels. We will continue to monitor the situation closely. Thank you for your continued patience.

identified

Our engineers have identified an issue where builds using m2pro.medium resource class are facing delays of up to 6-8 minutes. We are actively working to mitigate the issue and increase the capacity to resolve this delay. We appreciate your patience and understanding as we work to enhance our service. We will keep you updated on our progress.

Report: "Documentation site is down"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are continuing to investigate this issue.

investigating

We are currently investigating this issue.

Report: "Unexpected Build Failures"

Last update
resolved

Between 20:01 UTC and 21:06 UTC, some users may have experienced unexpected build failures related to the configured working_directory in their configuration. The cause of these failures has been identified and reverted, and builds should now complete successfully.

Report: "MacOS m2pro.large Jobs delayed"

Last update
resolved

The issue affecting the macOS m2pro.large resource class, causing them to have a delayed job-start time, has now been fully resolved. We thank you for your patience while our engineers worked through this incident.

monitoring

We have implemented a fix for the issue affecting the MacOS m2pro.large resource class and are currently observing signs of improvements. The task start time has decreased and has returned to normal levels. We will continue to monitor the situation closely. Thank you for your continued patience.

identified

Our engineers have identified an issue causing delays to macOS m2pro.large tasks. We are working to mitigate the issue and will provide further updates as more information becomes available.

Report: "Delays sending webhooks"

Last update
resolved

Outbound webhook processing time has recovered.

monitoring

Our mitigations are working as expected, we are monitoring the change.

investigating

Outbound webhook processing is delayed. We have identified the issue and are rolling out a fix to mitigate the issue.

Report: "Trigger Pipeline modal in web UI not working"

Last update
resolved

The incident is now resolved.

monitoring

We have identified the root cause of the issue and have reverted the change. Users can now use the previous Trigger Pipeline modal as required.

investigating

We are investigating the cause of the Trigger Pipelines modal in the web UI not working as expected. Affected users can trigger pipelines via the API if required.

Report: "Auto-cancellation disabled for GitHub App pipelines"

Last update
resolved

From October 14th until November 17th, the auto-cancellation feature was disabled for all GitHub App pipelines. The issue did not impact any pipelines integrated through GitHub OAuth or any other VCS. The issue is now resolved, and expected behaviour has been restored.

Report: "Machine jobs are not starting"

Last update
resolved

This incident has been resolved.

monitoring

We're successfully processing the backlog and continuing to monitor it.

identified

We are continuing to see a backlog for machine jobs and are working on resolving that.

identified

We are continuing to work on a fix for this issue.

identified

We have implemented a fix; however, jobs are delayed as we work through the backlog of jobs which arrived during the outage. Thank you for your patience whilst we work through the backlog.

investigating

We have implemented a fix; however, jobs are delayed as we work through the backlog of jobs which arrived during the outage. Thank you for your patience whilst we work through the backlog.

identified

We're currently investigating a possible issue. We'll update as soon as we know more details.

Report: "Some customers may experience delays with Runner builds"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue.

Report: "Customers may see delays receiving credits"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Checks and statuses were not updated for GitHub App users and GitLab users"

Last update
resolved

Starting at 2:40 AM UTC, we did not update checks or statuses for GitHub App or GitLab users. This continued until 3:40 PM UTC. Statuses during that time will not be sent, but any after 3:40 PM UTC will update as normal in GitHub or GitLab. Thank you for your patience while we resolved this issue.

Report: "Jobs failing to start or in progress fail"

Last update
postmortem

## Summary:

On October 22, 2024, from 14:45 to 15:52 and again from 17:41 to 18:22 UTC, CircleCI customers experienced failures on new job submissions as well as failures on jobs that were in progress. A sudden increase in the number of tasks completing simultaneously and requests to upload artifacts from jobs overloaded the service responsible for managing job output. On October 28, 2024, from 13:27 to 14:13 and from 14:58 to 15:50, CircleCI customers experienced a recurrence of these effects due to a similar cause.

During these sets of incidents, customers would have experienced their jobs failing to start with an infrastructure failure. Jobs that were already in progress also failed with an infrastructure failure. We want to thank our customers for your patience and understanding as we worked to resolve these incidents.

The original status pages for the incidents on October 22 can be found [here](https://status.circleci.com/incidents/6yjv79g764yc) and [here](https://status.circleci.com/incidents/0crxbhkflndc). The status pages for the incidents on October 28 can be found [here](https://status.circleci.com/incidents/xk37ycndxbhc) and [here](https://status.circleci.com/incidents/8ktdwlsf2lm8).

## What Happened: (All times UTC)

On October 22, 2024, at 14:45 there was a sudden increase in customer tasks completing at the same time within CircleCI. In order to record each of these task end events, including the amount of storage the task used, the system that manages task state (distributor) made calls to our internal API gateway, which subsequently queried the system responsible for storing job output (output service). At this point, output service became overwhelmed with requests; although some requests were handled successfully, the vast majority were delayed before finally receiving a `499 Client Closed Request` error response.

![](https://global.discourse-cdn.com/circleci/original/3X/2/b/2b68322aaf27124eb5ae63a15bc0f8f2118c3f7b.png)
`Distributor task end calls to the internal API gateway`

Additionally, at 14:50, output service received an influx of artifact upload requests, further straining resources in the service. An incident was officially declared at 14:57. Output service was scaled horizontally at 15:16 to handle the additional load it was receiving. Internal health checks began to recover at 15:25, and we continued to monitor output service until incoming requests returned to normal levels. The incident was resolved at 15:52 and we kept output service horizontally scaled.

At 17:41, output service received another sharp increase in requests to upload artifacts and was unable to keep up with the additional load, causing jobs to fail again. An incident was declared at 17:57. Because output service was still horizontally scaled from the initial incident, it automatically recovered by 18:00. As a proactive measure, we further scaled output service horizontally at 18:02. We continued to monitor our systems until the incident was resolved at 18:22.

Following incident resolution, we continued our investigation and uncovered on October 25 that our internal API gateway was configured with low values for the maximum number of connections allowed to each of the services that experienced increased load on October 22. We immediately increased these values so that the gateway could handle an increased volume of task end events moving forward.

Despite these improvements, on October 28, 2024, at 13:27, customer jobs started to fail in the same way as they previously did on October 22. An incident was officially declared at 13:38. By 13:48, the system automatically recovered without any intervention and the incident was resolved at 14:13. We continued to investigate the root cause of the delays and failures, but at 14:45 customer jobs started to fail again in the same way. We declared another incident at 14:50.

In order to reduce the load on output service, we removed the retry logic when requesting storage used per task from output service. This allowed tasks to complete even if storage used could not be retrieved (to the customer's benefit). Additionally, we scaled distributor horizontally at 15:19 in order to handle the increased load. At 15:21, our systems began to recover. We continued to monitor and resolved the incident at 15:51.

We returned to our investigation into the root cause of this recurring behavior and discovered that there was an additional client in distributor that was configured with a low value for the maximum number of connections to our internal API gateway. We increased this value at 17:33.

## Future Prevention and Process Improvement:

Following the remediation on October 28, we conducted an audit of **all** of the HTTP clients in the execution environment and proactively increased those that were similarly configured to the ones in the internal API gateway and distributor. Additionally, we identified a gap in observability with these HTTP clients that prevented us from identifying the root cause of these sets of incidents sooner. We immediately added additional observability to all of the clients in order to enable better alerting if connection pools were to become exhausted again in the future.
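
To illustrate the class of fix described above (explicitly sizing HTTP client connection pools and making pool exhaustion observable), here is a generic sketch using Python's `requests`/urllib3. The pool sizes and the gateway name are placeholders; this is not CircleCI's internal client configuration.

```python
# Generic sketch: size an HTTP client's connection pool explicitly instead of
# relying on small defaults, and make pool exhaustion visible in logs.
import logging
import requests
from requests.adapters import HTTPAdapter

# urllib3 logs a warning when a pool is full and a connection is discarded;
# make sure those warnings are actually emitted somewhere we can see them.
logging.basicConfig(level=logging.WARNING)

def build_session(max_connections: int = 100) -> requests.Session:
    session = requests.Session()
    adapter = HTTPAdapter(
        pool_connections=max_connections,  # number of distinct host pools to keep
        pool_maxsize=max_connections,      # connections kept per host pool
        pool_block=False,                  # don't block callers when the pool is full
    )
    session.mount("https://", adapter)
    session.mount("http://", adapter)
    return session

# e.g. a client for a (hypothetical) internal API gateway
gateway_session = build_session(max_connections=200)
```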

resolved

The incident has been resolved. Thanks for your patience.

monitoring

Jobs are working again. If you had any jobs showing failures, you will have to re-run them. We will continue monitoring.

investigating

Some jobs are failing to start, and some jobs are having infrastructure failures. We are looking into it.

Report: "GitHub App branch config fetching failures"

Last update
resolved

From 17:30 UTC to 07:20 UTC, config fetching for GitHub App customers on branches that included a / character in the branch name was failing. A fix has been implemented and we are seeing successful config fetches from these affected branches. Please rerun any failed jobs, or push a new commit.

Report: "Bitbucket checkout failing"

Last update
resolved

From 16:39 UTC to 17:24 UTC, Bitbucket checkouts were failing. A fix has been implemented and we are seeing Bitbucket checkouts pass. Please rerun any failed jobs, or push a new commit.

Report: "Jobs failing to start or in progress fails."

Last update
resolved

The incident has been resolved. Thanks for your patience.

monitoring

Jobs are working again. If you had any jobs showing failures, you will have to re-run them. We will continue monitoring.

investigating

Some jobs are failing to start, and some jobs are having infrastructure failures. We are looking into it.

Report: "MacOS Job Starts Delayed: M2 Pro Medium"

Last update
resolved

This incident has been resolved.

monitoring

We are seeing recovery and will continue to monitor.

identified

Wait times continue to decrease. We are monitoring the fix.

identified

MacOS job starts delayed for M2 Pro medium resource class. We've identified the issue and we are working to resolve it. We will provide more updates as information becomes available and we appreciate your continued patience.

identified

The issue has been identified and a fix is being implemented.

Report: "Plans and Usage pages are unavailable"

Last update
resolved

This incident has been resolved.

monitoring

The plans and usage pages are now accessible and functioning normally.

identified

We have identified the cause of the issue and have begun remediating it. We appreciate your patience whilst we work through the issue.

investigating

We're continuing to investigate this issue. Thank you for your patience.

investigating

Users are unable to view the plans or usage pages. We're investigating this issue.

Report: "Some Runner jobs not starting"

Last update
resolved

During this incident, customers could not access the Runner Inventory page and experienced infrastructure failures for Runner jobs.