Codefresh

Is Codefresh Down Right Now? Discover if there is an ongoing service outage.

Codefresh is currently Operational

Last checked from Codefresh's official status page

Historical record of incidents for Codefresh

Report: "GCP Incident"

Last update
monitoring

We are aware of the Google Cloud Platform (GCP) Incident (https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1SsW). The analytics portion of Codefresh is currently experiencing degradation. Build Logs are still operational but rely on GCP services. Builds that utilize GCP integrations may be impacted by this incident.

Report: "The issue with running promotions & git push operations from the codefresh UI"

Last update
postmortem

**Impact:** Git commit operations via GitOps-Runtime were temporarily non-functional.

**Detection:** The issue was identified by our internal development team.

**Root Cause:** A change in the GitHub API altered a response relied upon by the underlying Git library, causing push actions to fail.

**Resolution:** We implemented a temporary fix and provided an upgrade path. GitHub has since resolved the issue on their end, and no action is currently required from users.

GitHub's Status Page - [Retroactive] Incident with Git Operations: [https://www.githubstatus.com/incidents/tyjjp463pg91](https://www.githubstatus.com/incidents/tyjjp463pg91)

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

The problem occurs during the commit phase. Our team is actively working to identify the root cause and resolve the issue. We will provide updates as soon as more information is available.

Report: "The issue with running promotions & git push operations from the codefresh UI"

Last update
Postmortem
Resolved

This incident has been resolved.

Monitoring

A fix has been implemented and we are monitoring the results.

Identified

The issue has been identified and a fix is being implemented.

Investigating

The problem occurs during the commit phase.Our team is actively working to identify the root cause and resolve the issue. We will provide updates as soon as more information is available.

Report: "Degraded performance on some UI pages and builds"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue.

Report: "Degraded performance on some UI pages and builds"

Last update
Resolved

This incident has been resolved.

Monitoring

A fix has been implemented and we are monitoring the results.

Investigating

We are currently investigating this issue.

Report: "g.codefresh.io not available for North America Region"

Last update
resolved

Related services should be restored as per our WAF's most recent status update. https://status.imperva.com/incidents/3ffjxyln9pjt

monitoring

Our WAF provider had a brief disruption, and their services are returning to normal across the North America region. https://status.imperva.com/incidents/3ffjxyln9pjt

identified

We are continuing to work on a fix for this issue.

identified

We have identified the issue: there is a problem with our WAF provider for the North America region. https://status.imperva.com/incidents/3ffjxyln9pjt

investigating

Some geographical locations are unable to connect to g.codefresh.io.

Report: "Several hosted GitOps runtimes are unavailable or missing from the UI"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

The team has discovered that some hosted GitOps runtimes are missing or no longer visible in the UI. We are currently investigating this issue and will provide an update shortly.

Report: "Codefresh marketing site is down"

Last update
postmortem

**Impact:** The marketing website for Codefresh, `https://codefresh.io/`, was unavailable globally for around 6 hours.

**Root Cause:** A configuration issue led to a traffic routing problem. Because of the nature of intermediary responses, the routing loop was not immediately diagnosed.

**Resolution:** The routing configuration was restored. The site is now fully accessible.

**Detection:** The issue was identified internally through monitoring and also reported by our team.

resolved

This incident has been resolved.

identified

We have identified the issue with an external provider, and are working with them on a resolution.

investigating

We are continuing to investigate this issue. All Codefresh platform functionalities remain unaffected and fully functional, and the UI console is fully accessible at g.codefresh.io.

investigating

We are currently investigating the issue. Please note that the problem is not affecting the product itself, which remains fully functional. You can access the platform at g.codefresh.io

Report: "g.codefresh.io is down"

Last update
postmortem

**Impact:** The platform was unavailable for approximately 15 minutes.

**Root Cause:** Human error during the execution of a deployment pipeline.

**Trigger:** The pipeline was executed with incorrect parameters.

**Resolution:** The error was fixed and the platform was restored.

**Detection:** The issue was identified internally through monitoring and also reported by our team.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating the issue.

Report: "Some builds fail to start on Codefresh SaaS environment"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Builds stuck on Validating Connection to Docker Daemon"

Last update
resolved

This incident has been resolved.

monitoring

We have applied a fix, and builds should resume normal operation. Please allow some time for the retry mechanism to initiate the builds.

identified

Hybrid Runtimes are now working normally for builds. We are still working on resolving SaaS builds not starting.

identified

The workaround of using the Hybrid Runtime is no longer valid. All builds are now stuck on "Validating Connection to Docker Daemon". We are continuing to work on resolving this issue.

identified

SaaS builds are currently stuck on "Validating Connection to Docker Daemon". We have identified the issue and are currently working on a solution. As a workaround, you can use the Hybrid Runtime for these builds.

Report: "Increasing number of delayed builds in some accounts"

Last update
postmortem

**Impact**: The issue affected customers on the CUSTOM plan.

**Detection**: Internal monitoring system.

**Root Cause**: Following the release of a new pricing model, a miscommunication between our systems caused the payment plan for some Custom plan accounts to reset to default.

**Resolution**: Changes were reverted. Corrupted data was restored from the backup.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We are still continuing to work on a fix for this issue.

identified

We are continuing to work on a fix for this issue.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Some accounts are experiencing issues with viewing Home Dashboard data"

Last update
resolved

This incident has been resolved.

investigating

The Home Dashboard data related to CI pipelines is only inaccessible for accounts with active GitOps runtimes. CI pipeline-only accounts are not affected.

investigating

We are continuing to investigate this issue.

investigating

We are currently investigating this issue.

Report: "Build Start Delay"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating the issue.

Report: "Pipeline Variable Loss After Search Filtering"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we're monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating the issue. Workaround: Do not use the search filtering option to update pipeline variables. Instead, use the scroll feature to locate and update variables.

Report: "Some Codefresh Classic builds failed to start or terminated incorrectly"

Last update
resolved

This incident has been resolved.

monitoring

We have detected some issues with connections to Firebase from our classic build manager from 01:22 to 01:33 AM UTC, Nov 14th. Some builds during this time may have failed to start, or terminated incorrectly. The issue subsided after this time period, and all systems are currently operational. Our team is monitoring and investigating.

Report: "General UI Slowness"

Last update
postmortem

**Impact:** Following changes made during recent scheduled maintenance, some customers experienced slower load times in the Codefresh SAAS platform UI. Builds were unaffected.

**Detection:** Our monitoring systems alerted us to unusual activity, and our team quickly initiated an investigation. Customer reports of slower performance further confirmed the issue, allowing us to prioritize and address it promptly.

**Root Cause:** The traffic imbalance was due to a combination of technical configurations that resulted in uneven resource allocation across our zones immediately after the maintenance. This led to one zone handling a disproportionate amount of traffic, impacting website responsiveness for some users.

**Resolution:** Our team implemented several measures to redistribute traffic evenly across all zones. These adjustments restored balanced performance, with monitoring systems ensuring stability.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating the issue.

Report: "GitOps Pages Not Loading"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

The Classic UI is still accessible. If you need to access Pipelines and Projects, please navigate to https://g.codefresh.io/projects/ directly.

investigating

We are currently looking into the issue where https://g.codefresh.io/2.0/ is not loading and showing a white screen.

Report: "Some Classic builds are stuck in Pending state"

Last update
postmortem

**Impact**: Some accounts sporadically experienced longer pending times than usual on a portion of their builds for a day.

**Detection**: The issue was reported by a customer, and shortly after confirmed by Codefresh's platform monitoring alerts.

**Root Cause**: This issue was caused by a bug in the MongoDB driver. The MongoDB driver was upgraded in Codefresh services as part of our efforts to improve performance, but this version contained a bug that caused Mongoose queries to hang when under heavy load without returning or throwing errors. This resulted in the Codefresh build manager randomly getting stuck when enough queries were hanging under certain conditions.

**Resolution**: A temporary solution to improve build queries queue behavior was initially implemented to alleviate the issue for affected customers. The actual root cause was identified the following week, and the issue was resolved by downgrading the MongoDB driver to a version that did not contain the bug.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Codefresh UI doesn't open for some users"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Codefresh GitOps performance degradation"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

A fix has been implemented and we're monitoring the results.

investigating

Users may see delays in processing events from runtimes and increased load times for GitOps pages. We are currently investigating the issue.

Report: "Partial Outage: Pipeline builds are stuck in pending due to expired certificate's"

Last update
postmortem

**Impact**: We had a small number of hybrid runners (no more than 10) that were unable to communicate with our API for a day, and therefore were unable to fetch and run pipelines.

**Detection**: We were informed of this issue by customers.

**Root Cause**: We identified an issue with our certificate rotation which failed to generate new certificates as required for this subset of runners.

**Resolution**: We were able to resolve the issue by manually recreating the certificates required, which were then updated to the runners on the next build, restoring the service for all impacted customers. Further mitigation was done to ensure the issue with certificate rotation was also rectified. We are working on monitoring improvements in this area.

resolved

We had a small number of hybrid runners (no more than 10) that were unable to communicate with our API for a day, and therefore were unable to fetch and run pipelines. We identified an issue with our certificate rotation which failed to generate new certificates as required for this subset of runners. We were able to resolve the issue by manually recreating the certificates required, which were then updated to the runners on the next build, restoring the service for all impacted customers.

Report: "Long loading times at GitOps applications dashboard with Hosted GitOps Runtimes"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue.

Report: "Some accounts are experiencing issues with viewing desired/live state of GitOps platform objects"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Some accounts are experiencing issues with classic builds"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

Report: "Support Portal is unavailable for some users"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating the issue. You can still open a ticket on this page using the 'Submit request' button: https://support.codefresh.io/hc/en-us/ or by sending an email to support@codefresh.io.

Report: "Some accounts have an issue with access to CFCR Helm Registry"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue.

Report: "GitOps UI Degraded Performance"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating the issue.

Report: "Hosted runtimes are unavailable for some customers"

Last update
resolved

This incident has been resolved.

monitoring

We have resolved this issue. Users will need to add their Personal Git Tokens at https://g.codefresh.io/2.0/git-personal-access-token for the Hosted Runtime. Please reach out to support if you have any additional questions.

identified

We are continuing to work on a fix for this issue. We estimate this will be resolved in 2-3 hours.

identified

We are continuing to work on a fix for this issue.

identified

The issue has been identified and a fix is being implemented.

Report: "Degraded performance in Codefresh Classic builds for some customers"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Degraded performance for Codefresh Classic"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

Some users might notice degraded performance for Codefresh Classic. We're investigating the issue at the moment.

Report: "Slow SAAS Builds due to AWS Issue"

Last update
resolved

This incident has been resolved.

investigating

We are continuing to investigate this issue.

investigating

We are currently seeing an impact on builds and services on our SAAS platform due to an API error impacting the provisioning of new nodes in US-EAST-1. Some builds and services are experiencing delays due to the resulting resource constraint. We will update this incident here as we see measurable changes, and the AWS incident can also be followed via the link below: https://health.aws.amazon.com/health/status.

Report: "Partial UI outage on GitOps pages"

Last update
resolved

No new complaints have been identified during the monitoring period.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating an issue with GitOps UI pages that rely on Runtime information. Please bear with us as we investigate this issue.

Report: "Classic build and pipelines pages for some customers are experiencing long load times"

Last update
resolved

Work to improve performance in these cases has been implemented. Further improvements are planned in the next few days. Although we do not anticipate this reoccurring, if you do encounter any issues with extremely slow page load times, please contact Support.

monitoring

A fix has been implemented to address this issue for all accounts, and we are monitoring the results.

monitoring

If you are experiencing long page load times (10 sec+) and have a high number of builds per day, please contact Support, who can apply a patch to your account. Our engineers have found the cause, and we are working to resolve the root issue.

monitoring

We are continuing to monitor for any further issues.

monitoring

We are continuing to monitor for any further issues.

monitoring

We are continuing to monitor UI performance for selected customers with a high number of builds.

monitoring

A fix has been implemented for the affected accounts, and we are monitoring the results. This bug is only present in accounts with a very high number of daily builds, and support is able to apply the same fix on an account-by-account basis for any additional customers as needed.

investigating

We are currently investigating an issue for some customers where the build and pipelines pages can take an inconsistent time to load, at times with significant delays. We have identified one issue and are currently implementing changes to resolve it and improve performance. This only impacts UI loading on some pages. Pipeline execution is unaffected.

Report: "Build pages intermittently not loading"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating an issue where some build pages may be having trouble loading.

Report: "We are experiencing intermittent issues with loading git-ops pages"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating this issue.

Report: "The steps catalog is not available"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We are working on fixing the issue.

Report: "Slow Response Times"

Last update
resolved

We have identified the database-related cause of the API slowdown, and have now resolved the issue.

investigating

This incident is only impacting our GitOps API. Classic Pipelines are expected to continue to work with no interruption.

investigating

We are currently investigating slow response times in some parts of our platform. If you are impacted, please subscribe to this page for updates.

Report: "General UI Slowness"

Last update
resolved

This incident has been resolved.

monitoring

We have applied a fix that resolved the issue.

identified

We have identified an issue that is causing UI slowness. We are currently working on resolving this issue.

Report: "We are experiencing issues with viewing GitOps-related pages in UI"

Last update
postmortem

We have completed our RCA for this incident; the summary is below:

**Impact:** We had significant disruption to any UI page that relied on displaying runtime-related information, leading to incomplete or unavailable data for users.

**Detection:** This issue was reported to us by customers.

**Root Cause:** An unexpected side effect of an API change caused the event handler to not recognize runtime events as runtimes and instead treat them as generic-entities. When the change was reverted, the entries in the generic-entities collection were no longer updated, and an automatic cleaning function then resulted in some UI data queries returning incorrect data.

**Resolution:** After resolving the root cause, we rebuilt the required data and reinitialized the runtime information. We have identified improvements to our E2E testing process and monitoring systems as a result of this incident that we will be implementing.

resolved

This incident has been resolved.

monitoring

We have implemented additional fixes to restore UI functionality, and we are monitoring the results.

identified

We have identified an additional issue and are working on a fix.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are continuing to investigate this issue.

investigating

We are currently investigating this issue.

Report: "Quay.io incident is impacting some image pulls"

Last update
resolved

Quay.io is operating correctly for pushes and pulls. This issue is resolved for all Codefresh related operations.

monitoring

Quay.io has implemented a fix for the issue and image pulls are operational. Codefresh default image pulls are functional at this time. We are continuing to monitor Codefresh builds.

identified

We are seeing an improved success rate at this time in image pulls from Quay for Codefresh build images. We are continuing to monitor the incident status from Quay.io.

investigating

As a partial workaround to help alleviate the issue, users can set the default marketplace registry to pull from Docker Hub. This setting will make all public typed-steps (excluding direct Freestyle steps) pull images from the specified registry.

To configure this, you will first need a registry integration in your Codefresh account for Docker Hub. Then go to account settings -> Pipeline Settings -> Advanced Options -> Public Marketplace Registry, and select your Docker Hub integration in the dropdown.

Note that this setting only affects public typed-step image pulls such as codefresh-run, and will not resolve Quay image pull issues for other cases. Image pulls will also be subject to any Docker Hub rate limits associated with your credentials.

investigating

An incident at Quay.io (https://status.quay.io/incidents/z7sbjqmb34p1) is impacting some pipeline builds, causing failures when the required images are unable to be obtained.

Report: "Quay.io is under maintenance"

Last update
resolved

Quay.io: The scheduled maintenance has been completed

investigating

Impact: It is sometimes impossible to pull images from quay.io. More details: https://status.quay.io/incidents/10b6w5v1w7ql

Report: "Sporadic connection timeouts while operating with cm://h.cfcr.io"

Last update
resolved

This incident has been resolved.

investigating

Currently some users might encounter issues while operating with the default Helm registry cm://h.cfcr.io. We're investigating this issue.

Some errors that might be seen in builds:

Error: Get https://h.cfcr.io/account/default/index.yaml: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
Error: looks like "cm://h.cfcr.io/account/default/" is not a valid chart repository or cannot be reached: plugin "bin/helmpush" exited with error

or

Error: 500: unknown error
Error: looks like "cm://g.codefresh.io/api/helm/repos/account/default/" is not a valid chart repository or cannot be reached: plugin "bin/helm-cm-push" exited with error

Workaround: As a temporary workaround, we recommend adding retry options to your helm-related steps as described here: https://codefresh.io/docs/docs/pipelines/what-is-the-codefresh-yaml/#retrying-a-step
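For illustration, the retry mechanism linked in the workaround above is configured directly on the step in the pipeline YAML. Below is a minimal sketch assuming a typed Helm step; the step name, chart, release, Helm version, and cluster context are placeholders rather than values from this incident, and the `retry` fields follow the documentation linked above.

```yaml
deploy_chart:
  type: helm
  arguments:
    action: install
    chart_name: mychart          # placeholder chart
    release_name: myrelease      # placeholder release
    helm_version: 3.2.4          # placeholder Helm version
    kube_context: my-cluster     # placeholder cluster context
  retry:                         # retry options from the linked documentation
    maxAttempts: 3               # re-run the step up to 3 times on failure
    delay: 5                     # wait 5 seconds between attempts
    exponentialFactor: 2         # grow the delay after each failed attempt
```

The same `retry` block can be attached to other step types, including freestyle steps that run Helm commands against the registry.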

Report: "We're experiencing an issue with Codefresh pipelines"

Last update
postmortem

**Impact**: We had a 64 minute window where the CI pipeline view had no data in our SAAS platform.

**Detection**: Our monitoring systems immediately alerted us to the issue.

**Root Cause**: One of our database collections went offline.

**Resolution**: We were able to sync and restore connectivity to this database collection. Some parts of this process took longer than expected, and we have implemented improvements to a number of processes to avoid a similar incident in the future.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we're currently monitoring results. All systems are operational now.

identified

In order to speed up the issue resolution, Codefresh platform went into maintenance mode. Expected resolution time is 30 minutes.

investigating

In order to speed up the issue resolution, Codefresh platform went into maintenance mode. Expected resolution time is 15 minutes.

investigating

The Codefresh platform is currently under maintenance; expected downtime is up to one hour.

investigating

We are continuing to investigate this issue.

investigating

Pipelines are missing from the Pipelines list. We are currently investigating this issue.

Report: "Pending builds on SaaS and Hybrid"

Last update
postmortem

**Impact**: We had a partial outage (some requests could not access the platform at all) and some builds were stuck in pending for 30 mins.

**Detection**: We manually detected this issue before our automated check (every 10 minutes) alerted us.

**Root Cause**: We had a parallel issue with Firebase logging, and the combination of a number of small issues as a result caused some pods to become unresponsive.

**Resolution**: We reverted our last push to production to test if this was code related. Once the revert triggered services to restart, the issue was then resolved.

resolved

This has now been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Classic Pipeline Logging Delays"

Last update
postmortem

**Impact**: Some builds had significant delays in logs appearing in the UI. The build completion state and build time were not affected.

**Detection**: Customers reported the impact to us.

**Root Cause**: We had saturated the incoming capacity of our Firebase instances, causing an inability to write new data into them. Because of this, customer builds were not able to report all logs and in many cases had delayed logs. The increased load spiked due to an increase in logging in production from another issue that caused builds to stay in pending. This caused an additional surge and therefore extended delays in logging.

**Resolution**: We have doubled our Firebase instances to better handle spikes in demand. We will also be implementing some targeted monitoring of our Firebase instances and have improved monitoring of our overall platform state.

resolved

This incident has been resolved.

monitoring

A fix has been implemented to address this issue and we are monitoring the results.

investigating

We are continuing to investigate this issue.

investigating

We are currently investigating some occurrences where Classic Pipeline steps are experiencing significant delays in the logs of the step showing in the UI. Pipelines are continuing to operate as expected.

Report: "GraphQL API outage for GitOps Platform"

Last update
postmortem

**Impact:** Our GitOps Platform's API was unresponsive for an hour.

**Detection:** We detected the issue via our automated monitoring.

**Root Cause:** Some customer requests to our API triggered an error, which then caused a crashloop in some API pods.

**Resolution:** We have updated our error handling to avoid this error in future.

resolved

This incident has been resolved.

monitoring

We have identified and resolved the issue with the API for the GitOps Platform. We are continuing to monitor this issue.

investigating

We are continuing to investigate this issue.

investigating

Our testing has detected an issue with one of the APIs used for our GitOps platform. We are currently investigating and will update this incident as work develops. Our Classic platform is unaffected.

Report: "Potential AWS Outage"

Last update
postmortem

This was confirmed as an AWS outage, and during our debugging and monitoring investigations we were able to see and confirm that builds had resumed without issue.

resolved

This incident has been resolved.

monitoring

We are continuing to see builds successfully resume on our SAAS clusters. We are actively monitoring as our runtimes work through the build backlog bottleneck created by this incident, and making adjustments as necessary to help expedite recovery.

identified

Volumes for customer builds are now able to be provisioned. Pending builds should be slowly returning to normal. We are continuing to monitor the behavior for this incident.

identified

We have confirmed that AWS endpoints are timing out and we cannot provision volumes used for builds. We are continuing to investigate what appears to be a widespread AWS issue with us-east-1.

investigating

We are currently investigating an issue with AWS (in particular us-east-1) which is impacting volume provisioning for some customers.

Report: "Builds are failing on some accounts"

Last update
postmortem

**Impact**: We had 40 minutes where API calls relying on context objects were failing.

**Detection**: Our internal monitoring detected this issue.

**Root Cause**: We introduced a new security model to our context API and, as a result, caused a callback loop between that API and another service.

**Resolution**: We reverted the code within 40 minutes of our internal monitoring raising an alert and have reimplemented the new security model in a way that doesn't cause the callback loop.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Codefresh API documentation doesn't work"

Last update
resolved

This incident has been resolved. If you still see a blank screen when accessing https://g.codefresh.io/api/, please hard-refresh the page to clear the cache (Shift + Reload button in Chrome).

monitoring

A fix has been implemented and we are monitoring the results.

investigating

The Codefresh API documentation (https://g.codefresh.io/api/) doesn't work. We're currently investigating this issue. 💡 Only the documentation is affected; the API itself is fully functional.

Report: "h.cfcr.io is not reachable"

Last update
postmortem

**Impact**: The Codefresh default Helm repository was unreachable at [h.cfcr.io](http://h.cfcr.io). Classic users relying on this were unable to run their builds until they switched to our backup endpoint.

**Detection**: This was discovered by a user.

**Root Cause**: There were issues with our DNS provider. The workaround that was provided to users during this outage can be considered a permanent fix.

resolved

At this time, all DNS issues have been resolved for cfcr.io. To reiterate, if you have implemented the workaround which uses the https://g.codefresh.io/api/helm/repos endpoint, you can safely leave this workaround in place, or revert back to https://h.cfcr.io if you wish to do so. If you are seeing any issues with cfcr.io please reach out to Codefresh Support for assistance.

monitoring

DNS issues for cfcr.io are resolved as DNS propagation has been completed. All customers (both SaaS and Hybrid) should now be able to access the Codefresh ChartMuseum using cfcr.io. Please note: If you have implemented the workaround which uses the https://g.codefresh.io/api/helm/repos endpoint, you can safely leave this workaround in place, or revert back to https://h.cfcr.io if you wish to do so. We are going to continue to monitor the situation prior to marking this incident as resolved.

identified

DNS issues for cfcr.io are now resolved for all non-hybrid customers. Hybrid customers may still experience some issues, therefore we are continuing to monitor the situation.

identified

The issue was identified with our DNS provider. The current timeline for a resolution could be up to three days. In the meantime, you can change all references to the helm registry (https://h.cfcr.io) to the direct endpoint: https://g.codefresh.io/api/helm/repos

These changes should be made in the following places:

1. Codefresh Helm integration (https://g.codefresh.io/account-admin/account-conf/integration/helmNew)
2. Pipelines with freestyle steps (see the sketch below)
3. External systems referencing the Codefresh helm registry

Please feel free to reach out to support with any questions.
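To illustrate item 2 above, here is a minimal sketch of a freestyle step switched to the direct endpoint. The account name (myaccount), repo name (default), repo alias, image tag, and API-key variable are placeholders, and the authentication details are an assumption, not part of the original update; adjust them to your own setup.

```yaml
use_direct_helm_endpoint:
  title: Add the Helm repo via the direct endpoint instead of h.cfcr.io
  image: alpine/helm:3.9.0   # placeholder image providing the helm CLI
  commands:
    # Previously: helm repo add codefresh https://h.cfcr.io/myaccount/default
    - helm repo add codefresh https://g.codefresh.io/api/helm/repos/myaccount/default --username myaccount --password ${{CF_API_KEY}}   # assumed credentials
    - helm repo update
```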

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Quay IO APAC CDN Slow Downloads"

Last update
postmortem

This was a 3rd party incident.

resolved

Download speeds in APAC have been restored. https://status.quay.io/incidents/h993c5nwlnj1

monitoring

Download speeds are restored. Quay.io are continuing to monitor https://status.quay.io/incidents/h993c5nwlnj1

identified

Users in the APAC region may be impacted by Quay.io's current CDN issue in the region. Immediate updates and history can be seen on Quay.io's status page here: https://status.quay.io/ This is a regional issue for APAC users only and is likely to impact the startup time of pipelines as images used internally are impacted.

Report: "We are experiencing issues with loading Pipelines view page"

Last update
postmortem

**Impact**: Codefresh Classic pipelines were inaccessible for around 15 minutes.

**Detection**: The issue was immediately reported to Codefresh by customers.

**Root Cause**: Changes to Codefresh platform components were tested and validated prior to deployment; however, an inconsistency during deployment caused an issue with the platform, which resulted in a brief outage that was quickly resolved.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.