Is Confluent Cloud Down Right Now? Discover if there is an ongoing service outage.

Confluent Cloud is currently Operational

Last checked Jul 29, 2025 17:43 UTC from Confluent Cloud's official status page

Historical record of incidents for Confluent Cloud

Jul 23, 2025

Report: "Confluent Cloud Metrics API is currently experiencing elevated error rates on certain historical queries for longer time ranges"

Last update 2025-07-23T16:34:09.632Z

monitoring2025-07-23T16:34:09.623Z

This issue started on July 23rd, 2025 02:30 UTC, and has been mitigated as of July 23rd, 2025 15:17 UTC. At this time, we have deployed a fix for the issue and are monitoring to see recovery.

Jul 15, 2025

Report: "We are experience degraded performance in AWS US east-1 due to a zonal outage."

Last update 2025-07-15T02:21:21.791Z

resolved2025-07-15T02:21:21.780Z

This incident has been resolved.

Jul 14, 2025

Report: "Customers might not be able to access their cluster metrics for some SKUs in the AWS us-west-2 region"

Last update 2025-07-14T19:36:59.403Z

monitoring2025-07-14T19:36:59.394Z

A fix has been implemented and we are monitoring the results.

Jul 11, 2025

Report: "Customers may face issues with running or newly submitted Flink statements in AWS us-west-2"

Last update 2025-07-11T16:58:19.836Z

investigating2025-07-11T16:58:19.833Z

Confluent is observing similar symptoms to the earlier issue: https://status.confluent.cloud/incidents/c4nvjkxz5vqc Investigation and mitigation efforts are underway.

Report: "Confluent Cloud Flink - Customers may face issues with running or newly submitted Flink statements in AWS US-West2"

Last update 2025-07-11T09:18:15.998Z

investigating2025-07-11T09:18:15.995Z

We are currently investigating the issue and attempting to mitigate or provide a work-around.

Jul 9, 2025

Report: "Confluent Cloud Customers accessing Freight Brokers in the AWS ap-south1 region might experience degraded service"

Last update 2025-07-09T15:16:32.143Z

investigating2025-07-09T15:16:32.139Z

We are currently investigating this issue.

Jul 8, 2025

Report: "Customers accessing Freight Brokers are currently degraded in the AWS ap-south1 region."

Last update 2025-07-08T20:25:53.564Z

investigating2025-07-08T20:21:22.802Z

We are currently investigating this issue.

Jul 7, 2025

Report: "Flink APIs to fetch regions and compute pools are impacted globally"

Last update 2025-07-07T18:58:59.974Z

monitoring2025-07-07T18:58:59.963Z

Starting Jul 7, approximately 17:39 UTC, Flink APIs to fetch regions and compute pools encountered failures. The issue was mitigated approximately Jul 7, 18:25 UTC and Confluent is actively monitoring the status.

Jul 5, 2025

Report: "Cluster produce/consume slowness observed in AWS ME-South-1"

Last update 2025-07-05T02:48:36.317Z

resolved2025-07-05T02:00:00.000Z

Between 02:00 UTC and 02:05 UTC some customers running clusters in AWS ME-SOUTH-1 may have experienced increased errors or slowness of their producer and consumers. This issue was detected and auto-mitigated.

Jul 3, 2025

Report: "Freight clusters in AWS us-east-1 experiencing elevated latencies for produce and consume requests"

Last update 2025-07-03T03:42:04.236Z

resolved2025-07-03T00:30:00.000Z

Freight clusters in AWS us-east-1 experiencing elevated latencies for produce and consume requests.

Jun 28, 2025

Report: "Confluent Cloud Flink - Customers may experience increased authorization failure rate for new statements"

Last update 2025-06-28T00:43:25.146Z

identified2025-06-28T00:43:25.143Z

We are continuing to work on a fix for this issue.

identified2025-06-28T00:43:16.173Z

Starting 6/27 7:00 AM UTC, customers may be experiencing increased authorization failure rate for new statements on the Confluent Cloud Flink service. The issue has been identified and a fix is being implemented. This incident replaces an earlier one (https://status.confluent.cloud/incidents/dkpy59xgrx11) that was incorrectly closed.

Jun 27, 2025

Report: "Confluent Cloud Flink - Customers may experience increased authorization failure rate for new statements"

Last update 2025-06-27T10:13:11.844Z

investigating2025-06-27T10:13:11.841Z

Starting 6/27 7:00 AM UTC, customers may be experiencing increased authorization failure rate for new statements on the Confluent Cloud Flink service. We are currently investigating the issue.

Jun 24, 2025

Report: "Confluent Cloud Metrics API is currently experiencing elevated error rates on certain historical queries for longer time ranges."

Last update 2025-06-24T21:21:05.131Z

investigating2025-06-24T21:21:05.129Z

We are currently investigating this issue.

Jun 12, 2025

Report: "Google Cloud Platform Outage Affecting Multiple Confluent Cloud Services"

Last update 2025-06-12T18:58:01.223Z

investigating2025-06-12T18:58:01.220Z

We are currently experiencing outages on all regions on Google Cloud Platform, affecting several Confluent Cloud services. As of now, customers may experience issues provisioning new resources, creating cluster links across privately networked clusters, connectors hosted on GCP, and reading historical data from Kafka clusters. At this time, we are investigating the scope and impact of the outage. For more information on the GCP outage, please refer to https://status.cloud.google.com/.

Jun 6, 2025

Report: "Cluster connectivity issues being observed in AWS US-east-1"

Last update 2025-06-06T16:24:42.919Z

investigating2025-06-06T16:24:42.915Z

We are observing a spike in connectivity issue in the AWS US-east-1 and are currently investigating this issue.

May 20, 2025

Report: "Customers will encounter issues when creating new Kafka clusters within existing Confluent Cloud Networks on Azure that use Privatelink as the access method"

Last update 2025-05-20T05:14:48.751Z

resolved2025-05-20T05:14:48.733Z

This incident has been resolved.

monitoring2025-05-19T22:51:50.932Z

A fix has been implemented and we are monitoring the results.

identified2025-05-19T20:13:51.264Z

The issue has been identified and a fix is being implemented.

investigating2025-05-19T19:50:15.871Z

We are continuing to investigate this issue.

investigating2025-05-19T19:15:59.385Z

We are currently investigating the issue

May 19, 2025

Report: "Customers will encounter issues when creating new Kafka clusters within existing Confluent Cloud Networks on Azure that use Privatelink as the access method"

Last update 2025-05-19T19:50:00.000Z

Update2025-05-19T19:50:00.000Z

We are continuing to investigate this issue.

Investigating2025-05-19T19:15:00.000Z

We are currently investigating the issue

Report: "Customers will experience issues creating and accessing new Kafka clusters using PrivateLink networking in Azure"

Last update 2025-05-19T19:15:00.000Z

Investigating2025-05-19T19:15:00.000Z

We are currently investigating the issue

May 18, 2025

Report: "Confluent Cloud Metrics may return incorrect or no data for a few customers"

Last update 2025-05-18T14:55:36.958Z

resolved2025-05-18T14:55:36.943Z

The incident has been resolved. Time of impact is from 12:41 UTC to 12:46 UTC.

monitoring2025-05-18T13:34:34.768Z

We are continuing to monitor for any further issues.

monitoring2025-05-18T13:15:28.526Z

We are continuing to monitor for any further issues.

monitoring2025-05-18T13:14:18.404Z

We are continuing to monitor for any further issues.

monitoring2025-05-18T13:11:09.559Z

We are continuing to monitor for any further issues.

monitoring2025-05-18T13:09:53.000Z

The issue was identified and mitigated at 12:50 UTC.

investigating2025-05-18T13:09:10.746Z

We are continuing to investigate this issue.

investigating2025-05-18T13:08:57.000Z

Starting 12:40 UTC, Confluent Cloud Metrics are delayed. Queries for metrics may return incomplete, incorrect or no data for a few customers.

Report: "Confluent Cloud Metrics may return incorrect or no data for a few customers"

Last update 2025-05-18T13:34:00.000Z

Update2025-05-18T13:34:00.000Z

We are continuing to monitor for any further issues.

Update2025-05-18T13:15:00.000Z

We are continuing to monitor for any further issues.

Update2025-05-18T13:14:00.000Z

We are continuing to monitor for any further issues.

Update2025-05-18T13:11:00.000Z

We are continuing to monitor for any further issues.

Monitoring2025-05-18T13:09:00.000Z

The issue was identified and mitigated at 12:50 UTC.

Update2025-05-18T13:09:00.000Z

We are continuing to investigate this issue.

Investigating2025-05-18T13:08:00.000Z

Starting 12:40 UTC, Confluent Cloud Metrics are delayed. Queries for metrics may return incomplete, incorrect or no data for a few customers.

Report: "Confluent Cloud Metrics returned incorrect or no data"

Last update 2025-05-18T13:09:00.000Z

Monitoring2025-05-18T13:09:00.000Z

The issue has been identified and mitigated.

Update2025-05-18T13:09:00.000Z

We are continuing to investigate this issue.

Investigating2025-05-18T13:08:00.000Z

Starting 12:40 UTC, Confluent Cloud Metrics were delayed. Queries for metrics may have returned incomplete, incorrect or no data.

May 15, 2025

Report: "Confluent Cloud Flink - Customers may experience increased rate of degraded statements"

Last update 2025-05-15T10:05:07.752Z

resolved2025-05-15T10:05:07.734Z

The service has been stable since the last update. The incident is now fully resolved.

monitoring2025-05-14T15:52:12.188Z

This issue has been mitigated and the team is monitoring the fix.

identified2025-05-14T12:51:13.993Z

The issue is identified and we are working on its mitigation.

investigating2025-05-14T09:37:13.879Z

Starting 5/14 10::21 AM UTC, customers may be experiencing increased rate of degraded statements and temporary spike in CFU consumption on Confluent Cloud Flink service. We are currently investigating the issue.

May 14, 2025

Report: "Confluent Cloud Flink - Customers may experience failures with DROP TABLE statements"

Last update 2025-05-14T23:42:28.115Z

resolved2025-05-14T23:42:28.098Z

The rollout completed and no further issues have been observed. The incident is now resolved as of 23:30 UTC on May 14th, 2025.

identified2025-05-13T18:48:28.020Z

The team is working on deploying updates for additional components. The ETA for the rollout has been adjusted to May 16 12:00 AM UTC.

identified2025-05-12T17:46:42.173Z

The team is working on deploying updates for additional components. The ETA for the rollout has been adjusted to May 15 12:00 AM UTC.

identified2025-05-10T20:40:09.218Z

Team identified additional components that need to be updated. ETA for rollout: 05/13 1 AM UTC

monitoring2025-05-10T18:56:18.278Z

Fix is rolled out to all regions

identified2025-05-09T14:43:37.599Z

Rollout is expected to finish by 05/10 1 AM UTC

identified2025-05-08T02:43:47.676Z

After a detailed investigation and given the issue's limited impact, the team decided to proceed with the regular release process for the fix instead of manual intervention.

identified2025-05-07T22:29:11.451Z

Investigation has shown that the issue is localized to a subset of regions in GCP and Azure. The team is working on mitigation.

investigating2025-05-07T21:21:02.951Z

Starting 5/7 8:39 PM UTC, customers may be experiencing errors while executing DROP TABLE statements on Confluent Cloud Flink service. We are currently investigating the issue.

Report: "Confluent Cloud Flink - Customers may experience increased rate of degraded statements"

Last update 2025-05-14T09:37:00.000Z

Investigating2025-05-14T09:37:00.000Z

May 12, 2025

Report: "Cloud UI Homepage Status is degraded"

Last update 2025-05-12T21:46:27.070Z

resolved2025-05-12T21:46:27.052Z

This incident has now been fully resolved.

monitoring2025-05-12T21:22:40.638Z

The Confluent Cloud UI homepage should now be visible, showing the summary of environments/clusters for most customers. Newer organizations created in the last three days may still not be showing correct summaries. The Confluent team expects to have the issue fully resolved for these remaining organizations by May 12 11:00 PM UTC.

monitoring2025-05-12T19:41:56.000Z

investigating2025-05-12T18:41:17.585Z

The Confluent Cloud UI homepage is in a degraded state and is currently unable to properly show the summary of environments/clusters. The underlying health of environments/clusters is unaffected. The team is working on a fix.

Report: "Cloud UI Homepage Status is degraded"

Last update 2025-05-12T18:41:00.000Z

Investigating2025-05-12T18:41:00.000Z

May 9, 2025

Report: "Degraded experience with Flink queries"

Last update 2025-05-09T14:08:06.804Z

resolved2025-05-09T14:08:06.785Z

This incident has been resolved.

investigating2025-05-09T03:33:13.525Z

We are observing issues with Flink queries in some cloud regions. Impacted queries could be stuck in Resuming or Pending state. The team is actively investigating.

Report: "Degraded experience with Flink queries"

Last update 2025-05-09T03:33:00.000Z

Investigating2025-05-09T03:33:00.000Z

We are observing issues with Flink queries in some cloud regions. Impacted queries could be stuck in Resuming or Pending state. The team is actively investigating.

May 7, 2025

Report: "Confluent Cloud Flink - Customers may experience failures with DROP TABLE statements"

Last update 2025-05-07T21:21:00.000Z

Investigating2025-05-07T21:21:00.000Z

Starting 5/7 8:39 PM UTC, customers may be experiencing errors while executing DROP TABLE statements on Confluent Cloud Flink service. We are currently investigating the issue.

May 1, 2025

Report: "Confluent Cloud Metrics API Delay"

Last update 2025-05-01T18:37:25.258Z

resolved2025-05-01T17:00:00.000Z

On May 1st, 2025 between 17:00 and 17:45 UTC, the Confluent Cloud Metrics API experienced ingestion delay for the "io.confluent.kafka.server/retained_bytes" metric for some customers. As a result, queries for this specific metric may have yielded incomplete, undercounted, or missing data.

Report: "Confluent Cloud Metrics API Delay"

Last update 2025-05-01T17:00:00.000Z

Resolved2025-05-01T17:00:00.000Z

On March 1st, 2025 between 17:00 and 17:45 UTC, the Confluent Cloud Metrics API experienced ingestion delay for the "io.confluent.kafka.server/retained_bytes" metric for some customers. As a result, queries for this specific metric may have yielded incomplete, undercounted, or missing data.

Apr 25, 2025

Report: "Flink - metrics API and billing are experiencing delays or failures"

Last update 2025-04-25T14:33:18.693Z

resolved2025-04-25T14:33:18.679Z

Since the fix was deployed, no further failures or delays have been observed. The incident is considered resolved

monitoring2025-04-25T11:55:19.217Z

A fix has been deployed, successfully mitigating the issues.

identified2025-04-25T11:47:24.578Z

The root cause has been identified, and the team is working to mitigate the impact on affected resources.

investigating2025-04-25T11:37:40.477Z

Flink metrics and billing within us-west-2 may be delayed or incomplete; triaging is ongoing to identify the nature of the issue.

Apr 21, 2025

Report: "Clusters not visible in UI, Terraform, and CLI"

Last update 2025-04-21T17:18:57.492Z

resolved2025-04-21T17:18:57.472Z

This problem has been mitigated as of 2025-04-21 15:52 UTC. No further issues have been observed.

monitoring2025-04-21T16:59:01.853Z

A fix has been implemented and we are continuing to monitor APIs for errors.

identified2025-04-21T16:29:47.604Z

We have identified a root cause and have put an initial mitigation in place. We are still investigating some additional errors in the API.

investigating2025-04-21T16:00:59.974Z

We are currently experiencing an issue where multiple customers are unable to view any clusters on the Cloud UI console, Terraform, or API, with Networking Services returning HTTP 500 errors. We are continuing to investigate this issue.

investigating2025-04-21T15:58:34.011Z

We are continuing to investigate HTTP 500 errors in production networking environments causing Clusters to not be visible in the Confluent UI.

investigating2025-04-21T15:57:33.409Z

We are continuing to investigate this issue.

investigating2025-04-21T15:53:56.235Z

We are currently investigating HTTP 500 errors in production networking environments

Report: "Clusters not visible in UI, Terraform, and CLI"

Last update 2025-04-21T16:00:00.000Z

Update2025-04-21T16:00:00.000Z

Update2025-04-21T15:58:00.000Z

We are continuing to investigate HTTP 500 errors in production networking environments causing Clusters to not be visible in the Confluent UI.

Update2025-04-21T15:57:00.000Z

We are continuing to investigate this issue.

Investigating2025-04-21T15:53:00.000Z

We are currently investigating HTTP 500 errors in production networking environments

Report: "Clusters not visible in UI"

Last update 2025-04-21T15:58:00.000Z

Update2025-04-21T15:58:00.000Z

We are continuing to investigate HTTP 500 errors in production networking environments causing Clusters to not be visible in the Confluent UI.

Update2025-04-21T15:57:00.000Z

We are continuing to investigate this issue.

Investigating2025-04-21T15:53:00.000Z

We are currently investigating HTTP 500 errors in production networking environments

Report: "Networking Service 500 Errors"

Last update 2025-04-21T15:53:00.000Z

Investigating2025-04-21T15:53:00.000Z

We are currently investigating HTTP 500 errors in production networking environments

Apr 18, 2025

Report: "Confluent Cloud incident affecting Control Plane Authorization"

Last update 2025-04-18T00:42:51.202Z

resolved2025-04-17T23:17:58.000Z

From 21:59 UTC to 22:21 UTC customers observed errors in the Confluent Cloud UI, control plane APIs, CLI, Terraform and Metrics API due to an Authorization Service outage affecting control plane services. The issue has been resolved and there was no impact on data plane services.

Apr 17, 2025

Report: "Confluent Cloud incident affecting Control Plane Authorization"

Last update 2025-04-17T23:17:00.000Z

Resolved2025-04-17T23:17:00.000Z

From 21:59 UTC to 22:21 UTC customers observed errors in the Confluent Cloud UI, CLI and Terraform when making calls to control plane API endpoints. The issue has been resolved and there was no impact on data plane services or to other regions.

Report: "Confluent Cloud incident in AWS us-west-2"

Last update 2025-04-17T23:17:00.000Z

Resolved2025-04-17T23:17:00.000Z

From 21:59 UTC to 22:21 UTC customers observed errors in the Confluent Cloud UI, CLI and Terraform when making calls to control plane endpoints. The issue has been resolved and there was no impact on data plane services or to other regions.

Apr 2, 2025

Report: "Errors connecting to Kafka in Azure/canadacentral"

Last update 2025-04-02T03:07:58.699Z

resolved2025-04-02T02:30:00.000Z

From approximately 02:23 - 02:43 UTC on 4/2/25, customers connecting to Kafka clusters in Azure/canadacentral may have experienced errors.

Report: "Errors connecting to Kafka in Azure/canadacentral"

Last update 2025-04-02T02:30:00.000Z

Resolved2025-04-02T02:30:00.000Z

From approximately 02:23 - 02:43 UTC on 4/2/25, customers connecting to Kafka clusters in Azure/canadacentral may have experienced errors.

Apr 1, 2025

Report: "Cluster Unavailability in Azure North Europe"

Last update 2025-04-01T13:53:42.006Z

resolved2025-04-01T13:53:41.990Z

This incident has been resolved.

monitoring2025-04-01T11:45:46.914Z

A fix has been implemented and we are monitoring the results.

identified2025-04-01T10:17:16.880Z

The issue has been identified as being caused by the ongoing Microsoft Azure North Europe outage. The Microsoft team is actively working on recovery.

investigating2025-04-01T09:21:25.948Z

We are currently investigating this issue.

Report: "Cluster Unavailability in Azure North Europe"

Last update 2025-04-01T09:21:00.000Z

Investigating2025-04-01T09:21:00.000Z

We are currently investigating this issue.

Mar 28, 2025

Report: "Kafka cluster and network provisioning delays"

Last update 2025-03-28T18:22:29.774Z

resolved2025-03-28T14:23:00.000Z

On March 28, 2025, between 14:23 and 16:40 UTC, the provisioning of Kafka clusters and networks were delayed. The root cause has been resolved and Kafka clusters and networks are being deployed. There is no customer action required.

Report: "Kafka cluster and network provisioning delays"

Last update 2025-03-28T14:23:00.000Z

Resolved2025-03-28T14:23:00.000Z

Mar 25, 2025

Report: "Delays in provisioning Kafka clusters in Confluent Cloud"

Last update 2025-03-25T23:37:33.359Z

resolved2025-03-25T18:30:12.000Z

This incident has been resolved.

identified2025-03-25T18:13:20.581Z

Customers may experience delays in provisioning new Kafka clusters in Confluent Cloud. Confluent engineering has identified the root cause and is working on the fix.

Report: "Delays in provisioning Kafka clusters in Confluent Cloud"

Last update 2025-03-25T18:30:00.000Z

Resolved2025-03-25T18:30:00.000Z

This incident has been resolved.

Identified2025-03-25T18:13:00.000Z

Customers may experience delays in provisioning new Kafka clusters in Confluent Cloud. Confluent engineering has identified the root cause and is working on the fix.

Mar 21, 2025

Report: "Confluent Cloud scheduled maintenance"

Last update 2025-03-21T02:54:00.000Z

Completed2025-03-21T02:54:00.000Z

The scheduled maintenance has been completed.

Update2025-03-21T01:12:00.000Z

Scheduled maintenance is still in progress. We will provide updates as necessary.

In progress2025-03-21T01:10:00.000Z

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled2025-03-21T01:01:00.000Z

Confluent Cloud is currently having a scheduled maintenance operation from 1:00 A.M to 3:00 A.M. UTC on March 21, 2025. During this time, managing Stream Shares and topic Access Requests will be disabled. All other Control Plane APIs and Data Plane APIs are not affected. Regular operations will resume shortly after. Learn more here: https://support.confluent.io/hc/en-us/articles/34941970895636-March-21st-2025-1-00-3-00-AM-UTC-Scheduled-Maintenance-of-Confluent-Cloud?ajs_aid=e7adf4e6-b539-43e6-ba66-7055abf695f9&ajs_uid=1122684

Mar 19, 2025

Report: "Azure eastus Degradation"

Last update 2025-03-19T03:35:25.295Z

resolved2025-03-19T03:35:25.276Z

Azure has mitigated the networking issues and all errors on Confluent's end are recovered. See https://azure.status.microsoft/en-us/status for more details.

monitoring2025-03-19T02:15:51.776Z

Azure is working to restore the networking issues, see https://azure.status.microsoft/en-us/status for more details. We are seeing recovery on our end and actively monitoring the situation.

investigating2025-03-19T01:31:46.823Z

Azure has identified a networking issue in eastus, see https://azure.status.microsoft/en-us/status for more details. Customers may also run into issues provisioning new clusters in eastus at this time.

investigating2025-03-19T00:15:38.646Z

Confluent engineering is investigating availability degradation on Azure eastus clusters. Customers using this region may experience unavailability and elevated latencies. In addition, customers may have issues using Schema Registry in this region.

Report: "Azure eastus Degradation"

Last update 2025-03-19T03:35:00.000Z

Resolved2025-03-19T03:35:00.000Z

Azure has mitigated the networking issues and all errors on Confluent's end are recovered. See https://azure.status.microsoft/en-us/status for more details.

Monitoring2025-03-19T02:15:00.000Z

Azure is working to restore the networking issues, see https://azure.status.microsoft/en-us/status for more details. We are seeing recovery on our end and actively monitoring the situation.

Update2025-03-19T01:31:00.000Z

Investigating2025-03-19T00:15:00.000Z

Mar 17, 2025

Report: "Flink Compute Pool provisioning degraded for GCP us-east1"

Last update 2025-03-17T03:06:11.031Z

resolved2025-03-17T03:06:11.012Z

This incident has been resolved.

monitoring2025-03-17T02:06:37.067Z

Hot fix is ready to be deployed across the clusters and we are monitoring the progress.

investigating2025-03-17T00:37:41.727Z

All the affected workloads are mitigated and compute pools are provisioned.

investigating2025-03-16T20:19:13.729Z

We are experiencing degraded performance in provisioning for Flink customers in us-east1. Existing workloads should be fine. Engineers have identified the root cause and working on the fix.

Report: "Flink Compute Pool provisioning degraded for GCP us-east1"

Last update 2025-03-17T03:06:00.000Z

Resolved2025-03-17T03:06:00.000Z

This incident has been resolved.

Monitoring2025-03-17T02:06:00.000Z

Hot fix is ready to be deployed across the clusters and we are monitoring the progress.

Update2025-03-17T00:37:00.000Z

All the affected workloads are mitigated and compute pools are provisioned.

Investigating2025-03-16T20:19:00.000Z

We are experiencing degraded performance in provisioning for Flink customers in us-east1. Existing workloads should be fine. Engineers have identified the root cause and working on the fix.

Mar 14, 2025

Report: "Flink workspace management degraded"

Last update 2025-03-14T02:17:44.590Z

resolved2025-03-14T02:17:44.572Z

A fix has been rolled out and this issue is fully resolved.

monitoring2025-03-14T02:12:24.728Z

A fix has been implemented and we are monitoring the results.

investigating2025-03-14T01:41:33.022Z

Users may experience issues managing Flink workspaces, and/or running statements from within a workspace. The `Run` button on a statement in a workspace may be greyed out. Confluent engineering is investigating.

Mar 12, 2025

Report: "KSQL and Schema Registry creation/deletion degraded performance"

Last update 2025-03-12T00:50:23.003Z

resolved2025-03-12T00:50:22.990Z

This incident has been resolved.

monitoring2025-03-11T23:54:07.902Z

The issue has been mitigated and we are actively monitoring it

identified2025-03-11T23:22:03.042Z

The issue has been identified and a fix is being implemented.

investigating2025-03-11T22:27:06.842Z

Creation/Deletion of Schema Registry and KSQL, as well as API key creation/deletion for new or existing KSQL and Schema Registry clusters is currently impacted. We are currently investigating the issue

Mar 5, 2025

Report: "Azure southcentralus availability degredation"

Last update 2025-03-05T02:47:27.025Z

resolved2025-03-05T02:47:27.009Z

This incident has been resolved.

investigating2025-03-05T01:36:18.585Z

Confluent engineering is investigating availability degradation on Azure southcentralus clusters.

Feb 27, 2025

Report: "Egress IPs are not discoverable."

Last update 2025-02-27T02:31:44.960Z

resolved2025-02-27T02:31:44.944Z

This incident has been resolved.

monitoring2025-02-27T01:27:50.474Z

A fix has been implemented and we are monitoring the results.

identified2025-02-27T00:39:27.689Z

The issue has been identified and a fix is being implemented.

investigating2025-02-26T23:43:26.293Z

Egress IP addresses are not currently being displayed in the Confluent Cloud UI as described in https://docs.confluent.io/cloud/current/connectors/static-egress-ip.html. They are also not being returned by the equivalent APIs, CLI, or Terraform. We are currently investigating this issue.

Feb 25, 2025

Report: "Experiencing partial outage in AZURE in US EAST 2 region"

Last update 2025-02-25T11:19:28.627Z

resolved2025-02-25T11:19:28.610Z

This incident has been resolved.

monitoring2025-02-25T06:43:23.986Z

The issue has been mitigated, and we are actively monitoring the clusters.

monitoring2025-02-25T05:03:11.000Z

Azure cloud confirmed that the outages have resulted from Unexpected VM reboots. "Impact Statement: Starting at approximately 01:40 UTC on 25 Feb 2025, Azure customers in East US 2 may have experienced VM reboots and/or increased response latencies in the region.Current Status: We are aware of the issue and actively investigating. Initial findings indicate that a subset of VMs in East US 2 may have rebooted. The next update will be provided in 60 minutes or sooner if there are significant developments.This message was last updated at 04:53 UTC on 25 February 2025" We are continuing to monitor the system and mitigate the issue working with Azure.

monitoring2025-02-25T03:38:44.424Z

Recovery is in progress, we are continuing to monitor.

investigating2025-02-25T03:05:55.731Z

We are currently investigating the issue and attempting to mitigate.

Feb 20, 2025

Report: "Apache Flink incident in AWS us-west-2"

Last update 2025-02-20T16:36:22.504Z

resolved2025-02-20T16:36:22.488Z

This incident has been resolved.

investigating2025-02-20T15:47:53.299Z

We are experiencing availability problems with Flink in AWS us-west-2. The symptoms started at 11:30 UTC. Customers may observe availability issues. We are currently investigating, and will update as we know more.

Report: "Confluent Cloud incident affecting Kafka clusters in Azure norwayeast"

Last update 2025-02-20T12:25:00.424Z

resolved2025-02-20T12:25:00.409Z

This incident has been resolved.

investigating2025-02-20T12:06:20.013Z

We are experiencing connectivity issues with Kafka in Azure norwayeast. The problems started at 09:45 UTC. We are currently investigating, and will update as we know more.

Feb 18, 2025

Report: "Cluster Provisioning - degraded performance"

Last update 2025-02-18T16:12:24.663Z

resolved2025-02-18T16:12:24.647Z

Provisioning performance is back to normal

monitoring2025-02-18T13:12:15.635Z

We have deployed a fix for this issue and are monitoring the affected metrics.

investigating2025-02-18T09:56:31.882Z

We are continuing to investigate this issue.

investigating2025-02-18T08:47:49.627Z

We are continuing to investigate this issue.

investigating2025-02-18T08:47:29.781Z

We have noticed degraded performance when provisioning dedicated Kafka clusters and other products in a few busiest regions. We are investigating the root cause and will provide another update at 10AM UTC

Feb 14, 2025

Report: "connectivity issues in AWS eu-north-1. Customers may experience availability issues in the region."

Last update 2025-02-14T02:29:51.308Z

resolved2025-02-14T02:29:51.293Z

We have fully recovered and operational now.

monitoring2025-02-14T01:34:06.000Z

We have mostly recovered at this time except for 3 clusters that still have some partial impact. Update from the underlying cloud service provider side: We are continuing to see recovery for error rates and latencies for multiple AWS Services in the EU-NORTH-1 Region. We are monitoring the network and as we work towards full recovery, some requests may continue to timeout or be throttled. We recommend customers retry failed requests where possible. We will continue to provide additional information as we have it, or within the next 60 minutes.

investigating2025-02-14T00:51:04.233Z

We are currently experiencing partial unavailability in the AWS EU-NORTH-1 region, affecting a single Availability Zone (AZ). Some customer workloads may be impacted partially in terms of availability and performance. Some clusters remain operational and can process client workloads on certain brokers. However, external network connectivity issues have been detected. Our automation systems are demoting affected brokers in the impacted AZ to mitigate the issue. Further updates will be provided as we work towards resolution.

investigating2025-02-14T00:11:05.000Z

We have limited the impact to a single availability zone (AZ) thats partially unavailable. Clusters in this AZ are able to process client workloads on some kafka brokers.

investigating2025-02-14T00:01:36.048Z

We are continuing to investigate this issue.

investigating2025-02-13T23:43:21.569Z

We are having connectivity issues in AWS eu-north-1. Customers may experience availability issues.

Feb 12, 2025

Report: "Kafka Rest API unreachable for certain dedicated networks"

Last update 2025-02-12T21:36:02.008Z

resolved2025-02-12T21:36:01.990Z

This incident has been resolved.

monitoring2025-02-12T20:20:40.387Z

The fix has been rolled out to all affected sites, and we are currently monitoring the incident.

identified2025-02-12T19:15:55.194Z

We have identified an issue due to an incompatible software rollout affecting AWS us-east-1 customers. A fix is being rolled-out in waves and is estimated to take a few hours. We will update this incident as we know more.

investigating2025-02-12T18:39:40.889Z

We are experiencing an issue where some customers are getting errors while hitting Kafka Rest endpoints. We are currently investigating, and will update as we know more.

Feb 6, 2025

Report: "Experiencing connectors provisioning delays"

Last update 2025-02-06T12:05:11.052Z

resolved2025-02-06T12:05:11.027Z

This incident has been resolved.

monitoring2025-02-06T11:47:57.479Z

A fix has been implemented and we are monitoring the results.

investigating2025-02-06T11:27:40.661Z

We are currently investigating this issue.

Jan 16, 2025

Report: "Customers experiencing HTTP 401 errors on Schema Registry API requests"

Last update 2025-01-16T02:09:16.215Z

resolved2025-01-16T02:09:16.203Z

This incident has been resolved.

monitoring2025-01-16T02:07:45.623Z

This incident has been resolved.

monitoring2025-01-16T01:54:38.189Z

The fix has been rolled out and we are monitoring the results.

identified2025-01-16T01:15:06.565Z

The issue has been identified and a fix is being implemented.

investigating2025-01-16T00:45:34.322Z

Some customers are experiencing HTTP 401 errors on Schema Registry API requests. Engineers are engaged on the issue and are troubleshooting the cause.

Jan 12, 2025

Report: "Metrics API is Experiencing Delays"

Last update 2025-01-12T00:13:10.745Z

resolved2025-01-11T21:45:34.000Z

Metrics API is now operating normally. The root cause of the issue was identified and a permanent fix applied.

investigating2025-01-11T13:23:01.633Z

Metrics-API is experiencing higher latency and error rates starting January 11, 12:09 UTC. We are continuing to investigate this issue.

monitoring2025-01-10T22:59:49.798Z

The Metrics API system has been operating with normal behavior since around 21:40 UTC

investigating2025-01-10T20:52:42.835Z

We are continuing to investigate this issue.

investigating2025-01-10T17:36:25.436Z

We are continuing to investigate this issue.

investigating2025-01-10T15:38:43.516Z

Metrics-api is experiencing higher latency and error rates starting 15.15 UTC.

Jan 10, 2025

Report: "Some clusters seeing problems provisioning nodes in Azure East US2 deployments"

Last update 2025-01-10T20:46:09.194Z

resolved2025-01-10T20:46:09.178Z

This incident has been resolved. The affected clusters have recovered.

investigating2025-01-10T17:47:19.000Z

We are currently investigating the issue. It was reported at 16:56 UTC

Report: "Metrics API is Experiencing Delays"

Last update 2025-01-10T13:57:44.580Z

resolved2025-01-10T13:57:44.563Z

This incident has been resolved.

investigating2025-01-10T13:56:55.000Z

Incident mitigated

investigating2025-01-10T13:56:09.254Z

We are continuing to investigate this issue.

investigating2025-01-10T13:36:48.091Z

Metrics-api is experienced higher than usual latencies and error rates from 10.45 to 11.15 UTC Metrics-api is experiencing higher than usual latencies and error rates since 12.10 UTC

Report: "Experiencing network issues within Azure East US2 deployments"

Last update 2025-01-10T03:23:11.739Z

resolved2025-01-10T00:30:41.000Z

This incident has been resolved.

identified2025-01-09T16:45:08.433Z

Impact has been identified solely as Azure network outage; more can be found at https://azure.status.microsoft/en-us/status. We have taken all the actions necessary to mitigate affected customers' issues while Azure restores their service availability.

investigating2025-01-09T14:55:44.457Z

Due to an ongoing Azure network failure, customers may experience connectivity issues within the Azure East US2 region. We are investigating possible mitigation procedures for affected customers while working with the cloud provider to find a resolution.

Dec 26, 2024

Report: "Confluent Cloud - Degraded Performance - Azure-South-Central-US"

Last update 2024-12-26T23:45:22.303Z

resolved2024-12-26T23:45:22.288Z

All known issues directly caused by this incident have been mitigated. Any customers still experiencing issues should contact Confluent support. There will be no more updates for this issue after this.

identified2024-12-26T23:01:40.939Z

Customer impact is mostly mitigated at this point. We are still validating that all workloads are working correctly, and will provide the next update no later than 4 PM PST.

identified2024-12-26T21:56:55.286Z

We are starting to see partial recovery in the region. We are continuing efforts to mitigate impact, and will provide another update no later than 3 PM PST.

identified2024-12-26T21:25:23.890Z

Impact appears to be caused by an Azure outage that started at approximately the same time. Confluent engineers are closely attempting to mitigate impact before the restoration of Azure systems. We will provide an update by 2pm PST. More details on the issue can be found at: https://azure.status.microsoft/en-us/status

investigating2024-12-26T20:54:48.164Z

We are continuing to investigate the issue, and will provide another update by 2 PM PST. Impact remains limited to Azure South Central US at this time.

investigating2024-12-26T19:49:25.159Z

Some customers may experience degraded performance in the Azure cloud South-Central-US region. The problem started at 11:04 AM AM PST. We have identified the issue and working with the cloud provider for a resolution.

Dec 20, 2024

Report: "We are experiencing problems with some Flink jobs."

Last update 2024-12-20T20:55:24.274Z

resolved2024-12-20T20:55:24.256Z

The situation is resolved and we are proactively reaching out to any customers that may have been impacted.

investigating2024-12-20T19:32:35.000Z

We are experiencing problems with some Flink jobs. We are currently investigating, and will update as we know more.

Dec 19, 2024

Report: "Single sign-On (SSO) authentication and authorization flows are not working as expected"

Last update 2024-12-19T20:48:59.232Z

resolved2024-12-19T20:48:59.219Z

This incident has been resolved.

monitoring2024-12-19T20:44:39.548Z

The fix has been rolled out and we are monitoring the results.

identified2024-12-19T19:10:04.453Z

The fix is being rolled out to production. We will provide the next status update at 1 pm Pacific Time.

identified2024-12-19T18:42:25.474Z

This issue has been identified as impacting Schema Registry operations when group mappings are used as part of Single sign-on. The root cause has been identified and engineers are working on a mitigation.

investigating2024-12-19T18:00:44.511Z

Confluent is aware of SSO related authentication and authorization issues related to multiple parts of the Confluent Cloud platform. Engineers are actively investigating the issue.

Dec 17, 2024

Report: "Some KSQL clusters may experience high saturation"

Last update 2024-12-17T15:43:36.139Z

resolved2024-12-17T15:43:36.124Z

This incident has been resolved.

monitoring2024-12-17T14:36:49.198Z

The fix has been rolled out and we are monitoring the results.

identified2024-12-17T09:52:53.921Z

The fix is being rolled out to production. We will provide the next status update at 6am Pacific Time.

identified2024-12-17T04:03:51.982Z

The issue has been identified and a fix is being implemented.

investigating2024-12-17T04:02:41.150Z

We have identified the cause of the issue and in process of rolling out the solution.

investigating2024-12-17T03:59:20.949Z

Some KSQL clusters may be experiencing Node lag that may cause increased consumer lag. The impact started at Dec 13 2024 and impacts all regions. We are currently investigating and will update as we know more.

Dec 13, 2024

Report: "Confluent Cloud API requests may return HTTP 401 errors"

Last update 2024-12-13T07:22:52.333Z

resolved2024-12-13T02:39:16.845Z

The issue is mitigated.

monitoring2024-12-13T01:39:45.678Z

Confluent engineers have applied a mitigation and are monitoring systems. Mitigation was applied at 01:31 UTC.

investigating2024-12-13T01:29:28.000Z

Users of Confluent Cloud may experience API requests returning HTTP 401 errors. Engineers are engaged and are in the process of remediating the issue.

Dec 12, 2024

Report: "Confluent Cloud - Degraded Performance - AWS ap-northeast-1"

Last update 2024-12-12T20:12:31.575Z

resolved2024-12-12T20:12:31.564Z

This incident has been resolved.

identified2024-12-12T20:12:06.066Z

The issue has been mitigated.

identified2024-12-12T12:17:06.948Z

Some customers may experience degraded performance in the AWS cloud ap-northeast-1 region. The problem started at 10 AM UTC. We identified the root cause and are working on mitigating the issue. We will update once the issue is mitigated.

Dec 6, 2024

Report: "Control Plane resources unavailable using UI/CLI"

Last update 2024-12-06T01:11:29.152Z

resolved2024-12-06T01:11:29.139Z

This incident is resolved as of Dec 6th, 1:00 AM UTC.

monitoring2024-12-06T01:09:21.690Z

We have fixed the issue, and customers should see the UI and CLI functioning. We believe the problem should be mitigated by Dec 6th, 1:00 AM UTC.

investigating2024-12-05T23:06:01.720Z

We are continuing to investigate this issue.

investigating2024-12-05T23:05:37.187Z

We are currently experiencing UI and CLI issues. These issues impact workflows, such as new cluster provisioning through UI/CLI. The problems started on Dec 5th, 10:20 PM UTC. We are investigating and will update you as we learn more.

Nov 26, 2024

Report: "Cluster provisioning and expansion issues in GCP me-central-2 region."

Last update 2024-11-26T05:50:39.861Z

resolved2024-11-26T05:50:39.846Z

GCP has confirmed that all storage capacity related issues were resolved by November 22 2024.

identified2024-10-28T20:23:55.291Z

Starting on October 25 10:30pm UTC, customers may experience cluster expansion issues in the GCP me-central-2 region. Provisioning of new single zone clusters in me-central-2 region may also fail intermittently and new multi-zone clusters in the same region will have reduced availability due to the underlying cloud provider's storage capacity availability. We have identified the issue and working with the cloud provider for a resolution.

Nov 25, 2024

Report: "Provisioning of clusters and network resources is blocked in all cloud regions"

Last update 2024-11-25T14:09:50.668Z

resolved2024-11-25T14:09:50.649Z

No further issues have been observed. The incident is now resolved as of 14:00 UTC on November 25, 2024.

monitoring2024-11-25T12:31:07.981Z

We have identified the cause of the problem and have implemented mitigation steps. Provisioning was unblocked as of 12:16 UTC on November 25, 2024. We will continue to monitor for any issues and plan to resolve this incident in 1 hour.

investigating2024-11-25T12:13:54.000Z

We are experiencing issues with provisioning clusters and network resources in all cloud regions. The problem started at 11:25 UTC on November 25, 2024. We are currently investigating, and will update as we know more.

Nov 14, 2024

Report: "Confluent Cloud UI is unavailable"

Last update 2024-11-14T09:38:31.062Z

resolved2024-11-14T09:38:31.045Z

The issue has been resolved now, the UI is fully functional now.

monitoring2024-11-14T09:33:33.124Z

We are continuing to monitor for any further issues.

monitoring2024-11-14T09:27:28.497Z

The issue has now been fixed. We will continue to monitor the UI availability.

identified2024-11-14T09:13:20.000Z

The new fix is being deployed, will provide an update shortly.

identified2024-11-14T08:22:08.375Z

An additional fix is needed to resolve this issue, we are working on deploying it.

identified2024-11-14T07:18:00.761Z

The fix is being deployed, we will provide an update in around 30 minutes.

identified2024-11-14T06:45:36.894Z

The issue has been identify and a fix is being implemented.

investigating2024-11-14T06:13:16.876Z

We are investigating the issue and will provide an update shortly.

Report: "Confluent Cloud Control Plane API/UI unavailability"

Last update 2024-11-14T09:35:14.359Z

resolved2024-11-14T09:35:14.346Z

This incident has been resolved.

monitoring2024-11-13T18:12:15.382Z

5xx errors from Confluent Gateway service for 5-10 minutes. Status is back to normal, we are monitoring.

Nov 5, 2024

Report: "Confluent Cloud - Metrics API is currently experiencing elevated latency and error rate"

Last update 2024-11-05T06:55:46.395Z

resolved2024-11-05T06:55:46.375Z

This incident has been resolved.

investigating2024-11-04T23:13:17.313Z

We are currently investigating this issue.

monitoring2024-11-04T18:18:26.744Z

A fix has beenimplemented and the systems are recovering.

investigating2024-11-04T17:36:06.668Z

Confluent Cloud Metrics API is currently experiencing elevated latency and error rate.

Oct 24, 2024

Report: "Some Clusters are incorrectly showing `Provisioning` status."

Last update 2024-10-24T23:48:51.651Z

resolved2024-10-24T23:48:51.637Z

This incident has been resolved. All clusters should show their correct status.

monitoring2024-10-24T22:53:28.808Z

Some clusters in an `Up` state were incorrectly showing as in a `Provisioning` state. This issue has been mitigated and we are continuing to monitor it.

Oct 19, 2024

Report: "Cluster Creation Delayed - AWS, GCP, Azure"

Last update 2024-10-19T00:55:05.867Z

postmortem2024-10-19T00:52:30.310Z

Tracked in RCCA

resolved2024-10-18T22:13:41.096Z

This incident has been resolved.

monitoring2024-10-18T19:40:50.678Z

A fix has been implemented and we are monitoring the results.

identified2024-10-18T19:22:32.956Z

The issue has been identified and a fix is being implemented.

investigating2024-10-18T19:18:50.000Z

Some customers may experience delayed cluster provisioning.

Oct 11, 2024

Report: "docs.confluent.io is unavailable"

Last update 2024-10-11T20:13:03.938Z

resolved2024-10-11T20:13:03.921Z

This incident has been resolved.

investigating2024-10-11T19:38:29.436Z

We are currently experiencing service distrubtion on our Confluent documentation site docs.confluent.io. Affected users, might not be able to access the online Confluent documentation due to this issue.

Report: "Authentication Issues"

Last update 2024-10-11T00:08:36.154Z

resolved2024-10-11T00:08:36.141Z

This incident has been resolved.

identified2024-10-10T22:47:28.016Z

We have identified a mitigation and are applying a fix, clusters should expect to see recovery over the next 30 minutes.

investigating2024-10-10T20:56:12.000Z

Some customers may experience SSL issues when trying to connect to Kafka clusters in the following clouds and regions: Azure: westeurope, eastus2, azure, centralus AWS: us-west-2 GCP: australia-southeast1, us-west2 We are currently investigating the issue and attempting to mitigate or provide a work-around.

Sep 28, 2024

Report: "Some Confluent Cloud users might experience issues with Kafka read/write unavailability in AWS us-east-1 region"

Last update 2024-09-28T04:19:07.086Z

resolved2024-09-28T04:19:07.072Z

This incident has been resolved.

monitoring2024-09-28T00:09:13.034Z

We are continuing to monitor for any further issues.

monitoring2024-09-28T00:09:02.541Z

We have fixed the issue and are monitoring it to ensure it does not recur. As of September 28, 2024, 12:07 AM UTC, customers should see normal operations with Confluent systems.

identified2024-09-27T23:09:38.886Z

We have identified the issue and are actively working on the mitigation.

investigating2024-09-27T21:58:33.102Z

Some Confluent Cloud users might experience issues with Kafka read/write unavailability in the AWS us-east-1 region. The problem started at 20:11 UTC. We are currently investigating and will update you as we learn more.

Sep 24, 2024

Report: "CCloud UI is down"

Last update 2024-09-24T14:44:04.662Z

resolved2024-09-24T14:44:04.649Z

This incident has been resolved.

monitoring2024-09-24T14:24:21.892Z

A fix has been implemented and we are monitoring the results.

identified2024-09-24T14:19:13.149Z

The issue has been identified and a fix is being implemented.

investigating2024-09-24T14:17:04.858Z

We are currently investigating the cause and a fix to restore functionality. Other services should not be affected