Historical record of incidents for PostHog
Report: "Errors from an upstream provider outage"
Last update: We're experiencing elevated errors from an upstream provider. We're monitoring the issue and will post an update soon.
Report: "Cohort recalculations taking longer than expected"
Last update: We've spotted that a small number of cohorts are stuck in a recalculating state, and a larger number are taking longer than 24 hours to automatically recalculate as they should. We've identified the issue and have deployed a fix.
Report: "Queries are slow to run"
Last update: We've been alerted to an increase in query times. We're currently investigating the issue, and will provide an update once we identify the root cause.
Report: "Elevated errors on us.posthog.com"
Last update: We're seeing elevated errors loading the PostHog interface. We're investigating and we'll update you as we know more.
Report: "Elevated errors on us.posthog.com"
Last updateWe're seeing elevated errors loading the posthog interface. We're investigating and we'll update you as we know more.
Report: "Data Processing Delays - Reporting Tools Affected"
Last update: The ingestion delay incident has been resolved.
Due to delays in a maintenance process, our data processing infrastructure is running behind, which is causing inaccuracies in the reporting tools. No data has been lost and the system should be caught up shortly.
Report: "EU: elevated errors on web UI"
Last update: This incident has been resolved.
The situation is back to normal. We found the root cause to be in our networking stack and are preparing a long-term fix for it. Thanks for your patience!
The situation seemed to have calmed down; we're investigating the root cause.
We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
Report: "Data Processing Delays - Reporting Tools Affected"
Last updateThe ingestion delay incident has been resolved
Due to delays in a maintenance process, our data processing infrastructure is running behind which is causing inaccuracies in the reporting tools. No data has been lost and the system should be caught up shortly.
Report: "EU: elevated errors on web UI"
Last updateThis incident has been resolved.
Situation is back to normal. We found the root cause being in our networking stack. We're preparing a long term fix for it. Thanks for your patience!
The situation seemed to have calmed down, we're investigating the root cause.
We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
Report: "US: Delayed event ingestion"
Last update: The backlog has been fully processed and event ingestion is back to normal. Thank you for bearing with us and apologies for the disruption.
We are working through the lagged backlog and still monitoring progress.
We have increased consumer resources to speed up the resolution and are continuing to monitor the rate.
We identified another related issue and rolled out the appropriate fix. The lag should be coming down, and we are continuing to monitor it.
We identified the issue and rolled out a fix. The event lag is dropping, and we are continuing to monitor it.
We're currently falling behind on event ingestion. No data loss has occurred, and we're actively investigating the issue.
Report: "US: Delayed event ingestion"
Last updateThe backlog has been fully processed and event ingestion is back to normal. Thank you for bearing with us and apologies for the disruption.
We are consuming the lagged backlog and still monitoring the progress.
We have increased the consumer resources to speed up the resolution and keep monitoring the rate.
We identified another related issue and rolled the appropriate fix. The lag should be down and we keep monitoring it.
We identified the issue and rolled out a fix. The event lag is dropping, and we keep monitoring it.
We're currently falling behind on event ingestion. No data loss has occurred, and we're actively investigating the issue.
Report: "Data Processing Delays - Reporting Tools Affected"
Last update: This incident has been resolved.
We identified the issue and the ingestion pipeline is catching up.
Our data processing infrastructure is running behind, which is causing inaccuracies in the reporting tools. No data has been lost.
Report: "Posthog Cloud EU Database Maintenance"
Last updateThe scheduled maintenance has been completed.
Scheduled maintenance is currently in progress. We will provide updates as necessary.
We are performing scheduled maintenance on our EU Cloud ClickHouse database. We don't expect significant disruption, but there may be some slow queries or ingestion delays.
Report: "Increased parts count impacting performance"
Last update: Part counts are back to normal and the cluster is responding normally again. We'll keep monitoring it and let you know if we see misbehavior again.
We are currently investigating increased part counts on our datastore and why these parts are not being merged as they should. This will cause increased query times.
Report: "US: Delayed event ingestion"
Last update: We've caught up on our backlog of messages. Ingestion rates look optimal. Parts are being merged as they should. New nodes are fully online. Query latencies are looking great at 100ms avg. Should be smooth sailing from here on out. Enjoy your Friday!
There were some recurring errors in our infrastructure that led us to restart ClickHouse nodes. We are falling behind on event ingestion, as we are replacing some nodes in our ClickHouse cluster. This will increase lag in our ingestion pipeline. Performance may be impacted during this time too. We are still working on this and monitoring it.
Report: "Elevated API Errors"
Last update: We've resolved the incident. This just affected querying and no data was lost. We're still working on finding the root cause for this issue (our ClickHouse nodes were segfaulting without warning) and will continue to monitor.
We're experiencing failures to load data across the entire app at the moment. We've identified the root cause and are working to resolve this asap.
We're experiencing failures to load data across the entire app at the moment. We've identified the root cause and are working to resolve this asap.
Report: "US degraded performance"
Last update: Lag has recovered and the system is completely functional again. Sorry for any inconvenience caused by this incident.
The cluster is now responsive and data ingestion has resumed. The app is responding better now. We are still monitoring a couple of fixes we have pushed. We identified a query that was flooding the cluster, which may have been the root cause of this.
We have recovered a good part of the cluster, but we are still working to bring it back completely. Performance may still be degraded. We think some problematic queries may have been the root cause; we are still investigating.
We are trying to bring the cluster back. The app may be completely unresponsive, and lag is expected during this time. We'll provide an update as soon as possible.
We have detected a partial outage in our ClickHouse cluster, and it's impacting application responsiveness and the performance of loading insights. We are investigating the root cause.
Report: "Elevated API Errors"
Last update: We've not seen a recurrence of this issue, so we're closing this incident now.
We're still investigating high load for some offline functionality (i.e. exports), but the vast majority of the app should work fine now.
Our US app instance is down and pods are unhealthy. We're figuring out why and are working on resolving it. Data ingestion and feature flags are not affected.
Report: "US degraded performance"
Last update: We rolled out a change that increased load, causing queries to be slower. We rolled back that change, so performance should be back to normal.
We have spotted that our data infrastructure is under heavy load, which is increasing the time the app takes to load insights or causing errors when loading them. We are investigating the root cause.
Report: "Elevated API Errors"
Last update: The problem has been fixed.
We're experiencing an elevated level of API errors and are currently looking into the issue.
Report: "Intermitting API erorrs - API endpoints & feature flags"
Last updateLooking good, resolved!
We scaled the infrastructure components and are monitoring this. First signs indicate recovery. We will come back with an update once this is verified.
We identified intermittent 500s on several API endpoints, including feature flags. The cause appears to be an underprovisioned infrastructure component. We're working on a fix. Apologies for any inconvenience.
Report: "Elevated API Errors"
Last update: We identified previously undetected underprovisioning in one of our network components. We have scaled this up and are working on a fix to mitigate this long-term. Thank you for your patience.
Performance and error rate are back to normal levels. We're still investigating the root cause of this issue.
We are continuing to investigate this issue. Notice about US: this incident never affected the US environment. The "partial outage" status was wrong for that region, and we will correct it later. Apologies for the inconvenience.
The error rate has gone down; we're still looking for the root cause.
Elevated error rates are coming up again; we're investigating.
We identified a surge in memory usage and workload eviction events. We scaled up the feature flag and web app services to mitigate. We're monitoring this.
The situation has calmed down after scaling up resources. We're still investigating the root cause. Notice: an earlier message reported that this was about the US region. That was wrong; this only affects the EU region. Apologies for the initial incorrect report.
We are continuing to investigate this issue.
We're experiencing an elevated level of API errors, including feature flags, and are currently looking into the issue.
Report: "Data Processing Delays - Reporting Tools Affected"
Last update: This incident has been resolved.
We're monitoring the ingestion pipeline, as it processes the delayed messages. We're estimating that the system will fully recover within an hour.
We are still investigating intermittent latency spikes in the event ingestion pipeline. Events are still being processed with a delay, which should decrease over time.
We are still investigating the root cause of the issue. Events are still delayed, but the delay is no longer increasing. We hope to have a resolution shortly.
Our data processing infrastructure is running behind, which is causing inaccuracies in the reporting tools. No data has been lost and the system should be caught up shortly.
Report: "API Query endpoint intermittently 500'ing"
Last update: This incident was resolved over the weekend.
We've shed load and haven't seen errors recur yet. We'll continue monitoring this over the weekend.
The API query endpoint is throwing intermittent 500 errors due to capacity limits on our end. We are working to fix this and to make the errors clearer. If known-valid queries are failing with 500s, we recommend retrying them with exponential backoff.
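As a rough illustration of the retry recommendation in the update above, here is a minimal Python sketch using the standard requests library. It is not an official PostHog example: the endpoint URL, project ID, API key, and query payload are placeholder assumptions you would replace with your own values.

import random
import time

import requests

# Placeholder values (assumptions for illustration, not taken from the incident report).
QUERY_URL = "https://us.posthog.com/api/projects/<project_id>/query/"
HEADERS = {"Authorization": "Bearer <personal_api_key>"}
PAYLOAD = {"query": {"kind": "HogQLQuery", "query": "select count() from events"}}

def query_with_backoff(max_attempts: int = 5, base_delay: float = 1.0) -> dict:
    """POST the query, retrying 5xx responses with exponential backoff and jitter."""
    for attempt in range(max_attempts):
        response = requests.post(QUERY_URL, headers=HEADERS, json=PAYLOAD, timeout=30)
        if response.status_code < 500:
            response.raise_for_status()  # surface 4xx errors immediately
            return response.json()
        # On a 5xx, wait 1s, 2s, 4s, ... plus a little jitter before retrying.
        time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.5))
    raise RuntimeError(f"Query still failing with 5xx after {max_attempts} attempts")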
Report: "US Error Tracking Processing Delays"
Last update: Bug fixed, ingestion workers scaled back up and lag recovering rapidly. No data loss should be observable.
We've identified the root cause of the issue. We are reprocessing exception events and continuing to monitor to make sure the pipeline fully recovers.
We are currently experiencing downtime in our error tracking data pipeline while a bug is being resolved. No data loss has occurred.
Report: "Elevated API Errors Evaluating Feature Flags"
Last update: After adding more database capacity, feature flag evaluation has recovered to normal levels. We're closing this incident now but will keep monitoring, and we're working on a long-term fix. Apologies for the inconvenience.
We saw a surge in feature flag evaluations and increased backend and database capacity. We're seeing the first signs of recovery.
US: We're experiencing an elevated level of feature flags API errors and are currently investigating.
Report: "Processing Delays"
Last update: We've resolved the issue and ingestion has caught up to real time.
We're keeping a close eye on our ingestion delay. Events might take up to 35 minutes to show up inside PostHog in our EU Cloud. No data has been lost.
Our EU data processing infrastructure is running behind, which is causing inaccuracies in the reporting tools. No data has been lost, and the system should catch up shortly. We're monitoring it closely.
Report: "Data pipeline delivery delays in US Cloud"
Last update: We've identified and fixed the bottleneck, and we've improved our alerting to avoid the issue in the future.
Pipeline destinations are currently experiencing delays in US Cloud. This means deliveries may be sent significantly later than the events that trigger them. No data has been lost, and the deliveries will happen as we catch up on processing.
Report: "Elevated API Errors for Feature Flag evaluation"
Last update: The load issue has been resolved.
We're experiencing an elevated level of API errors when evaluating feature flags, due to unexpected load. We're currently investigating.
Report: "Elevated feature flag and local evaluation API Errors"
Last update: The load spike has been identified and resolved; error rates and API latency have returned to normal.
We're seeing unexpected database load causing query timeouts and elevated latency on these endpoints.
Report: "Elevated API Errors - Feature Flags and Local Evaluation"
Last update: Load has dropped and our error rate has returned to normal levels.
We're experiencing an elevated level of API errors when evaluating feature flags, due to unexpected load. We're currently investigating.
Report: "Elevated capture errors in the US region"
Last update: A patch was applied and we are no longer seeing errors.
We're experiencing elevated capture endpoint error rates due to unanticipated Kafka cluster patching. The vast, vast majority of requests are being retried successfully by our network edge routers, but some very large volume customers may see a very small number of terminally failed requests.
Report: "Batch exports not making progress in US Cloud"
Last update: This incident has been resolved.
We have narrowed down the problem to a very small set of Snowflake batch exports that we have manually cancelled. If you were affected we will be reaching out. All other batch exports are fully recovered or on the path to recovery. Performance of ongoing batch exports will soon be on pace with real time once again.
We were unable to make a full recovery, and the issue seems to persist. We are investigating new potential fixes. In the meantime, batch exports will be delayed.
We are monitoring the backfill process for Snowflake batch exports and any pending large batch exports for other destinations. All backfills are progressing normally. Ongoing batch exports are operating normally, but users with pending backfills may see us lag behind real time until all the backfilling is done.
We have deployed our fixes and have managed to resolve the concurrency issues with Snowflake batch exports. Most batch exports besides Snowflake should be fully recovered, with the exception of larger batch exports that will still need some time to work through the backlog. We will shortly begin backfilling any Snowflake batch exports that were cancelled due to this incident.
We are continuing to work on a fix for this issue.
We have reason to believe the cause of the problem is a deadlock happening while connecting to Snowflake. We are attempting to deploy a patch that would deal with the deadlock when it happens, leaving the investigation of what is causing the deadlock for later. Assuming the patch is successful in addressing the problem, we will begin backfilling any Snowflake batch exports that were cancelled.
We have reason to believe that the problem is related to Snowflake batch exports. As a consequence, most Snowflake batch exports are being cancelled to be retried at a later date. We are investigating how to remediate the problem with Snowflake. Users of other destinations should see batch exports recovering over time. Depending on the size of the data exported, this recovery could take more or less time.
We have been making slow progress on batch exports and a backlog has built up, particularly on larger batch exports. It is taking us some time to work through the backlog, so users may see batch exports be delayed in delivering data. No data loss has happened nor is it expected.
Report: "US ingestion lag"
Last update: After monitoring, we have seen that all systems are working normally.
We identified the root cause and pushed a fix; event ingestion and latency are now back to normal. We'll keep monitoring the infrastructure.
Our ingestion infrastructure is processing slowly, causing delays in event ingestion. We are investigating the root cause.
Report: "EU: elevated feature flag evaluation errors"
Last update: EU: We observed elevated error rates in feature flag evaluation that may have led to some requests timing out between 15:00 UTC and 17:00 UTC. We apologize for the inconvenience and have started improving our alerting to catch this earlier.
Report: "EU: feature flags and surveys with elevated error rates"
Last update: EU: We observed increased error rates for feature flags and surveys. While mitigating the initial feature flag issues, we restarted some internal components, which caused further issues. Surveys showed elevated error rates between ~15:22 UTC and 15:36 UTC; feature flags showed elevated error rates between ~15:00 UTC and 15:36 UTC. A large number of timeouts in our database in the EU region caused the high feature flag error rates and service disruptions. Apologies for this disruption.
Report: "US: Increased person processing load is causing locks on the replica DB"
Last update: This incident has been resolved.
We scaled up processing to handle the spike. We are monitoring the situation.
Performance on the PostHog read replica database is somewhat degraded due to a high load of person ingestion processing. This is occasionally affecting flag evaluation, since the feature flag service depends on the read replica database.
Report: "EU: Data Processing Delays - Reporting Tools Affected"
Last update: Issue resolved.
We have identified an issue with our service that builds the list of events and properties to search for when querying data. We are deploying a fix now and hope to see recovery in the coming hours. Until then, UI tools for querying your data may be missing information you would expect. No data loss has occurred, and event ingestion itself is unaffected.
Report: "Taxonomy updates delayed in EU"
Last update: Issue resolved.
The issue is resolved, and we are catching up on newly seen events and properties.
Our taxonomy generation system (for the event and property definitions you use in filters and elsewhere) is currently delayed as we fix a minor schema bug. This means new event names or properties you just sent to PostHog won't be available for use in places like filters or insights. We have identified the bug and expect to resolve it shortly.
Report: "PostHog Cloud UI in EU is down"
Last update: Systems have been stable over the last few hours, and metrics are showing normal behaviour over a longer period for both flag evaluation and the web UI. Thank you for your patience!
Flag evaluation is back to normal; we're monitoring the overall state.
The Cloud UI is back up; we're investigating the impact on flag evaluation.
We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
Report: "PostHog app in EU is not loading"
Last update: We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
PostHog Cloud is unavailable; the surveys API and local_evaluation API are also affected. A fix is being worked on at the moment.
We're experiencing issues loading the PostHog app in EU. Data ingress does not appear to be affected.
Report: "US app has intermittent errors"
Last update: We identified a migration that had an unintended impact on our database. We've cleared the lock and are watching database health stabilize. Everything is looking back to normal at this time.
We're seeing intermittent errors with loading us.posthog.com, and we're investigating why. This isn't impacting the ingestion of data.
Report: "Elevated API Errors"
Last update: We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
We're experiencing an elevated level of API errors and are currently looking into the issue.
Report: "Batch exports delayed in EU"
Last update: All pending batch export runs have completed and new batch export runs are progressing normally. The incident is resolved.
We have identified the root cause of the problem and are in the process of deploying a fix.
We have noticed batch exports experiencing a delay of several hours in PostHog Cloud EU. We are investigating the problem. Batch exports in PostHog Cloud US are not affected and operating normally.
Report: "[US] increased errors on feature flags, ingestion and app"
Last update: We briefly had a spike in errors on our US instance for various endpoints due to a rollout. We rolled back and error rates dropped.
Report: "Data Processing Delays - Reporting Tools Affected"
Last update: All processing is back to normal.
We've identified some bottlenecks slowing down processing. We should be back to real time shortly.
Our data processing infrastructure is running behind, which is causing inaccuracies in the reporting tools. No data has been lost and the system should be caught up shortly.
Report: "Event taxonomy processing delays"
Last update: We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
We've spotted that something has gone wrong. We're currently investigating the issue, and will provide an update soon.
System recovered. We are continuing to monitor.
We've spotted a problem in processing event taxonomy updates - property and event definitions. We're working to fix the problem, and while working on that fix, some updates to event taxonomy will be delayed. This impacts e.g. whether new events or properties are available for filtering.
Report: "EU: Elevated error rate on data capture"
Last update: We resolved the issue and everything is operational again. One of our reverse-proxy instances scaled in ungracefully, which caused routing errors. After manually terminating it, the services recovered. We saw elevated errors from 16:01 UTC to 16:48 UTC. A good portion of these were recovered by internal retries, but we can't be certain right now that we haven't lost some events. We will analyze this and provide a long-term fix so that it won't happen again. Our apologies, as we were not able to capture all data during this time.
We found an issue in our networking layer and it seems to be recovering now. We're monitoring the situation.
We've spotted that something has gone wrong. We're seeing elevated error rates on capture on the web app. We're currently investigating the issue, and will provide an update soon.
Report: "EU Maintenance - Data Processing Delays"
Last update: Ingestion has caught up. All is good, as expected.
The maintenance operations are done; we are monitoring and waiting for ingestion and data processing to catch up. Again, no data has been lost during this standard procedure. Thank you for your patience!
Due to a planned maintenance activity, we're expecting ingestion and data processing delays in EU. No data will be lost during this operation. Thank you for your patience!
Report: "EU ingestion lag"
Last update: We have fixed the underlying issue and ingestion latency is back to normal. All data is up to date now.
We have identified that there is lag in the events ingestion pipeline. We are investigating what could be the root cause. No data has been lost.
Report: "Web app down"
Last update: We rolled back and fixed the issue. After a period of monitoring, we're now closing this incident.
We've rolled back to a previous version and the web app has recovered. We're monitoring until the bug fix is merged and the latest web app version is deployed.
The PostHog web app is down in all regions due to a bug in our HTML rendering. All data pipeline components are still fully functional, and no data will be lost.
Report: "Maintenance - Data Processing Delays"
Last update: This incident has been resolved.
The main work of the maintenance operations is done. We're monitoring ingestion and data processing as they catch up. Thanks again for your patience!
Due to a planned maintenance activity, we're expecting ingestion and data processing delays. No data will be lost during this operation. Thank you for your patience!
Report: "Event processing delays on EU Cloud"
Last update: This incident has been resolved.
We're investigating ingestion delays on EU Cloud.
Report: "Web app unavailable"
Last update: We improved our monitoring so we can catch similar issues before they affect production.
The app is back up now; we're investigating the root cause.
We've seen that the web app is unavailable and we're investigating. Data ingestion is not affected.
Report: "JS static assets not loading"
Last update: The incident was resolved an hour ago. We're blocking new deployments until we root-cause the issue.
The issue was triggered again; we're rolling back quickly this time.
Report: "JS static assets not loading"
Last update: This incident is resolved. You may need to hard-refresh (CMD + Shift + R) in order for the page to load. For some reason, our GitHub workflows skipped the "upload static assets to s3" step but rolled out anyway. We're investigating why this happened.
We've again spotted that you can't load the PostHog app at the moment. We're investigating the cause. Data ingestion is not affected.
Report: "Issues loading the posthog site"
Last updateWe've spotted and fixed the issue with our static asset pipeline and all environments are back online and available!
We've spotted you can't load the PostHog app at the moment We're investigating the cause Data ingestion is not affected
Report: "Live Stream service unavailable"
Last update: We've spotted and addressed the root cause, and the service is back up and running. Sorry for the inconvenience, and enjoy those fresh, free-range live events streaming to your browser!
Something has gone wrong with our livestream service, which is responsible for reporting live events to the activity page. We are investigating now and will report back once we have found the root cause!