Historical record of incidents for Upstash
Report: "QStash: Degraded performance"
Last update: We are currently investigating this issue.
Report: "Degraded Performance"
Last update: We are currently investigating this issue.
Report: "Performance Degradation"
Last update: We are currently investigating this issue.
Report: "Global Ireland (eu-west-1) Degraded Performance"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
Report: "Global Ireland (eu-west-1) Degraded Performance"
Last update: A fix has been implemented and we are monitoring the results.
Report: "QStash: Degraded performance"
Last update: An internal cleanup task for QStash events, coinciding with a disk-layer compaction task, caused performance degradation for some users. Mitigation: The team mitigated the event by pausing some of these tasks and monitored the status for a while. Fixes: Improvements are being applied to these tasks so they use resources more gracefully. Disk resources have been increased to handle a bigger burst of load.
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are continuing to investigate this issue.
We are currently investigating this issue.
Report: "QStash: Degraded performance"
Last update: We are continuing to investigate this issue.
We are currently investigating this issue.
Report: "QStash: Degraded performance in request processing and event logs"
Last update:

### Product: QStash

### Incident Summary
Due to high load, the volume of QStash event logs reached a point that caused latency in the underlying data store operations. Event log creation slowed down and led to performance degradation in QStash request processing. To resolve the performance degradation in QStash requests, the event logging module was turned off temporarily. After deploying a hotfix and configuration changes, we eventually turned event logging back on and the system returned to a stable state.

### Root Cause
At 07:15 UTC we received alerts about the performance degradation and started the investigation. We discovered long-running queries for syncing event logs from the main QStash servers to the QStash event server. To resolve the performance degradation, we turned off the event logging functionality as an immediate action. This brought performance back to normal levels for QStash requests but left event log processing disabled. We deployed a hotfix during the day to remove some redundant calls and alleviate the impact. Around 16:20 UTC, we observed another performance degradation on QStash requests due to a load increase, and disabled event log processing again. In the following hours, we deployed a configuration change to relax the job interval durations for the event log tasks and turned event logging back on. This configuration change resolved the performance issues without any further problems.

### Impact
During the problematic timeframes, when slow event log processing was observed, QStash requests experienced high latency, which caused timeouts for customers. No events were lost. Duplicate event deliveries were observed due to a number of restarts during the incident.

### Resolution
Improvements have been applied to the event logging module to prevent the same issue from happening again. We have also planned to upgrade the underlying disks to stronger models.
QStash service and Event logs are fully functional without any remaining issues.
Monitoring: Main QStash service is back to normal. Event logs service is back online but events will lag by a few minutes.
Main QStash service is back to normal. Event logs are still temporarily unavailable.
We are continuing to investigate this issue.
We are continuing to investigate this issue.
We are currently investigating this issue.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "QStash: Degraded performance in request processing and event logs"
Last updateQStash service and Event logs are fully functional without any remaning issues.
Monitoring: Main QStash service is back to normal. Event logs service is back online but events will be lagging a few mins.
Main QStash service is back to normal. Event logs are still temporarily unavailable.
We are continuing to investigate this issue.
We are continuing to investigate this issue.
We are currently investigating this issue.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "Performance degradation on QStash"
Last update: We are currently investigating this issue.
Report: "Performance degradation on QStash"
Last update: At 09:43 UTC, QStash experienced degraded service when a high number of requests to a specific domain were throttled, resulting in timeouts during an unexpected phase of TCP connection establishment. These requests and the resulting retries consumed excessive network resources and negatively impacted all users. We added more resources to QStash as a quick remediation and, as the resolution, improved the timeout mechanism to detect such cases and fail faster.
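For readers unfamiliar with the fail-fast pattern referenced in the resolution above, the general idea is to bound the whole request, including connection establishment, with an explicit deadline instead of waiting on default TCP timeouts. The sketch below is purely illustrative; the helper name, URL, and 2-second deadline are assumptions, not QStash internals.

```typescript
// Illustrative fail-fast HTTP call: abort if the request (including the
// TCP/TLS connection phase) does not complete within a short deadline.
async function fetchWithDeadline(url: string, timeoutMs = 2000): Promise<Response> {
  const controller = new AbortController();
  const timer = setTimeout(() => controller.abort(), timeoutMs);
  try {
    // fetch rejects with an AbortError once the controller aborts,
    // so a stalled connection attempt fails fast instead of hanging.
    return await fetch(url, { signal: controller.signal });
  } finally {
    clearTimeout(timer);
  }
}

// Hypothetical usage:
// const res = await fetchWithDeadline("https://example.com/webhook", 2000);
```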
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "Regional AWS eu-west-1 Cluster Performance Degradation Issue"
Last update:

### Incident Summary
During a maintenance update to the regional Upstash Redis databases in AWS eu-west-1, several databases hosted in that region unnecessarily triggered a full synchronisation between their primary and backup replicas.

### Root Cause
A full synchronisation invalidates all data in the target replica and starts a fresh re-population from the source replica. Under normal circumstances, full synchronisation is required only when data integrity is lost in one of the replicas, which was not the case here.

### Impact
This incident impacted the performance of regional databases on AWS eu-west-1 only. The full synchronisation caused very high CPU load and degraded performance on some of the databases that have a replica in this region. Moreover, our system throttles databases that are going through this operation to allocate more CPU to the synchronisation so it finishes sooner. No data was lost and no consistency was violated.

### Resolution
As a quick remediation, we unthrottled the affected databases at 15:06 UTC and enabled more throughput; however, high latency was still observed until the full synchronisation completed at 21:23 UTC. A fix has been prepared to avoid this unnecessary full synchronisation on regional databases and will be deployed shortly. This issue is not present on Upstash Global databases, which are our new-generation infrastructure and now our default offering. We will reach out to our regional users about how to migrate to Upstash Global going forward.
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
Regional AWS eu-west-1 cluster is experiencing performance degradation, and we are adding more resources to the cluster.
Report: "We experienced a very short period of API downtime for the incoming requests to QStash due to urgent maintenance to ensure the stability and performance of our services. Our team acted quickly to address the issue, and everything is now fully operational."
Last update: We experienced a very short period of API downtime for the incoming requests to QStash due to urgent maintenance to ensure the stability and performance of our services. Our team acted quickly to address the issue, and everything is now fully operational.
Report: "QStash Workflow Run Failure"
Last update: The latest QStash release caused some Workflow runs to fail due to a bug in the Workflow URL detection mechanism. Affected workflows did not start at all and moved directly to the DLQ. These workflow runs, which show a "detected non-workflow destination" message in the response body, can be retried from the DLQ. This was a partial failure lasting from 10:10 to 11:00 UTC; not all users' workflows were affected.
Report: "Disk failure in some Regional Databases"
Last update: We observed a disk failure on some instances at 16:52 UTC and restarted the affected components to bring them back online. The issue was resolved at 17:13 UTC. During the incident, database reachability was affected; no data was lost.
Report: "Performance degradation on QStash"
Last update:

**Product:** QStash
**Impact:** Degraded performance, delayed processing of events, and duplicate event deliveries for some customers

## Incident Summary
QStash experienced an incident marked by a sudden and extreme load on our servers. This caused a degradation in performance, with extremely high latency for event processing for all users. We also noticed some events being delivered multiple times to some users. To mitigate the high load, we increased capacity as our initial response while the investigation proceeded. Eventually, fixes for the issues were confirmed with an issue reproducer and deployed to production.

## Root Cause Analysis
In a certain type of usage, failure handling via [failureFunction](https://upstash.com/docs/workflow/basics/serve#failurefunction) can cause recursive calls, which leak tasks into the queue and put severe load on the QStash servers. This also triggered an edge case that caused some events to be delivered multiple times.

## Resolution
Two hotfixes to the QStash processes were deployed:
1. Prevent recursive calls within the failure function.
2. Eliminate duplicate deliveries while keeping the "at least once delivery" guarantee.

These were verified to resolve the root cause, normalizing server load and restoring standard event processing operations.

## Impact on Customers
High latency of event processing was observed for all users. Some users received duplicate event deliveries. No events were lost, and all were delivered as part of our "at least once delivery" guarantee. Customers do not need to take any corrective action, as workflows have returned to normal and preventive fixes have been deployed.
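As context for the root cause above, the fix implies that a failureFunction should not itself publish or enqueue new QStash work, since a failure inside the handler could otherwise re-trigger runs recursively. Below is a minimal sketch with the @upstash/workflow Next.js adapter; the route contents, step name, and logging are illustrative assumptions rather than the code involved in this incident, and the exact callback signature is documented at the failureFunction link above.

```typescript
// Hypothetical app/api/workflow/route.ts using @upstash/workflow.
import { serve } from "@upstash/workflow/nextjs";

export const { POST } = serve(
  async (context) => {
    // Normal workflow steps run here.
    await context.run("example-step", async () => {
      // ... business logic (placeholder) ...
    });
  },
  {
    // Invoked when the workflow run ultimately fails.
    // Keeping this handler free of calls that publish or enqueue new
    // QStash work avoids the recursive-call pattern described above.
    failureFunction: async (failureInfo) => {
      console.error("workflow run failed", failureInfo);
    },
  }
);
```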
We will be sharing a postmortem about the incident soon.
The issue has been identified and a fix is being implemented.
We are currently investigating this issue.
Report: "Performance degradation on QStash"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are continuing to work on a fix for this issue.
The issue has been identified and a fix is being implemented.
We are currently investigating this issue.
Report: "Performance degradation on QStash"
Last update: This incident has been resolved.
We are continuing to monitor for any further issues.
Our data processing infrastructure is running behind. No data has been lost and the system should be caught up shortly.
Report: "QStash API Not Reachable"
Last update: This incident has been resolved.
The issue has been identified and a fix is being implemented.
Report: "Partial Degraded Performance"
Last update: Some of the databases in the us-east-1 region might have experienced increased latencies.
Report: "Degraded availability on Vector us-east-1"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
Report: "Degraded performance on us-east-1 Kafka"
Last update: This incident has been resolved.
We are working on a fix and monitoring the cluster performance.
We are currently investigating this issue.
Report: "Maintenance - Global Ap-Southeast-1"
Last update: We have taken actions to increase the capacity of the region. During the operation, clients might have experienced higher than usual latencies for about 15 minutes.
Report: "Connectivity Issues - Global eu-central-1"
Last update: This incident has been resolved.
We are currently investigating this issue.
Report: "Connectivity Issues - Global Ap-Southeast-1"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are continuing to investigate this issue.
We are currently investigating this issue.
Report: "Degraded Availability in Global Us-East-1"
Last update: This incident has been resolved.
We are currently investigating this issue.
Report: "Degraded Availability on Redis"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
Some databases are not reachable. We are currently investigating the issue.
Report: "Client connections are intermittently dropping"
Last update: This incident has been resolved.
We found that a small number of database connections were intermittently dropping. We have applied a fix and are observing now.
Report: "Degraded Availability on QStash"
Last update: This incident has been resolved.
We are continuing to monitor for any further issues.
A fix has been implemented and we are monitoring the results.
An issue in the persistence layer is causing degraded availability for QStash. We have identified the cause and are working on a fix right now.
Report: "Partial Downtime for REST Clients on Kafka EU-WEST-1 Clusters"
Last update: We regret to inform you that we experienced partial downtime during our recent maintenance. This downtime specifically impacted our REST clients. We apologize for any inconvenience caused and assure you that our team has fixed the connectivity issues.
Report: "Degraded performance on QStash"
Last update: QStash experienced degraded performance. After receiving an alert from our monitoring system, our team intervened and restored stability.
Report: "Elevated latency in us-east-1"
Last update: This incident has been resolved.
We are monitoring the status now.
We have identified the issue and applied the fix.
We are currently experiencing elevated latency in us-east-1 and are investigating the issue. We will share updates as they become available.
Report: "Degraded performance on QStash"
Last update: A surge in user activity resulted in unusually high traffic, causing a temporary disruption of the QStash service.
Report: "Degraded performance on QStash"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "Degraded performance on Stash"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
Degraded performance on QStash.
Report: "Degraded performance on AWS EU-WEST-1 region"
Last update: This incident has been resolved.
We have identified a heavy system load on some of the database servers. We are adding new machines to the pool to prevent this from happening again. Some databases may have observed high latencies or short disconnections during the event.
Report: "QStash Unavailable"
Last update: QStash had a short period of unavailability. More resources were automatically allocated to the instance during this time. We are taking steps to optimize resource availability by allocating further resources.
Report: "QStash Unavailable"
Last update: We have allocated more resources to the instance. QStash is stable.
QStash had a short period of unavailability. More resources were automatically allocated to the instance during this time. We are taking steps to optimize resource availability by allocating further resources.
Report: "Degraded performance on Kafka eu-west-1 cluster"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the cluster.
We have identified the problem and are waiting for a resolution from our cloud provider.
We are experiencing a cloud-provider-related problem at the moment and are investigating.
Report: "Database Backup/Restore Performance Degradation on AWS US-EAST-1 Cluster"
Last update: This incident has been resolved.
We have observed degraded performance on database backup/restore operations on the AWS US-EAST-1 cluster. The functionality has been temporarily disabled.
Report: "US-EAST-1 Free Tier Rest Service Outage"
Last update: There was a partial outage in the REST service in one of the us-east free tier clusters.
Report: "US-EAST-1 Free Tier Rest Service Outage"
Last updateThere was a partial outage in the REST service in one of the us-east free tier clusters
Report: "This is an example incident"
Last updateWhen your product or service isn’t functioning as expected, let your customers know by creating an incident. Communicate early, even if you don’t know exactly what’s going on.
Empathize with those affected and let them know everything is operating as normal.
As you continue to work through the incident, update your customers frequently.
Let your users know once a fix is in place, and keep communication clear and precise.