Historical record of incidents for PlayFab
Report: "Audit Logs are not being collected"
Last update: We are investigating an issue with Audit Log persistence.
Report: "PlayStream Processing Delay"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are continuing to investigate this issue.
We are currently experiencing delayed processing of events for a subset of players. No data has been lost and engineers are working to resolve this as soon as possible.
Report: "Increased latency in user api's"
Last updateThis incident has been resolved.
Api latency has returned to normal, we'll continue monitor while investigations continue.
We have observed increased API latency and error rates starting at approximately 11 am pacific today for the following APIs: UpdateUserData, GetUserData, GetPlayerCombinedInfo and are investigating.
Report: "PlayStream Processing Delay"
Last updateThis incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are continuing to investigate this issue.
We are currently experiencing delayed processing of events for a subset of players. No data has been lost and engineers are working to resolve this as soon as possible.
Report: "Increased latency in user api's"
Last updateThis incident has been resolved.
Api latency has returned to normal, we'll continue monitor while investigations continue.
We have observed increased API latency and error rates starting at approximately 11 am pacific today for the following APIs: UpdateUserData, GetUserData, GetPlayerCombinedInfo and are investigating.
Report: "Reports are delayed for May 21st"
Last update: This incident has been resolved.
We are currently investigating an issue with processing analytics reports and daily title report emails for May 21st.
Report: "Reports are delayed for May 21st"
Last updateThis incident has been resolved.
We are currently investigating an issue with processing analytics reports and daily title report emails for May 21st.
Report: "Partial outage for Economy V2 affecting Catalog APIs and Inventory APIs"
Last update: Migration has completed and incident mitigation is complete.
The issue has been mitigated; we're monitoring for complete recovery.
The issue has been identified; it was caused by migrations for service improvements. We are currently working on a fix.
Report: "Partial outage for Economy V2 affecting Catalog APIs and Inventory APIs"
Last updateMigration is completed and incident mitigation is complete
Issue has been mitigated, we're monitoring for complete recovery
The issue has been identified, it was due to migrations for service improvements, we are currently working on a fix
Report: "Scheduled Task Failures"
Last update: We have identified an issue that impacted a subset of scheduled tasks. From 5/19 10:30 AM - 5/20 12:00 AM PDT, there were failures for scheduled tasks that run actions on each player in a segment. This issue has been resolved.
Report: "Scheduled Task Failures"
Last updateWe have identified an issue that impacted a subset of scheduled tasks. From 5/19 10:30AM - 5/20 12:00AM PDT, there were failures for scheduled tasks that run actions on each player in a segment. This issue has been resolved.
Report: "Issues related to saving Title Data on Game Manager"
Last update: We are currently investigating this issue.
Report: "Economy V2 APIs availability issue"
Last update: This incident has been resolved.
We have identified an issue that affected a subset of the Economy V2 APIs. The issue is fixed and we are monitoring the status of the affected APIs.
Report: "Service Degradation across APIs"
Last update: This incident has been resolved.
We are continuing to investigate this issue.
We are observing recovery in error rates.
We are continuing to investigate this issue.
We are currently investigating increased API errors across APIs.
Report: "May 6th Data Connections and Playstream Actions are delayed"
Last update: This incident has been resolved.
We are currently investigating this issue.
Report: "Economy v1 Game Manager pages not loading"
Last update: This incident has been resolved.
We're aware of the problem and are working on a fix.
Report: "PlayFab api limits exceeded"
Last updateThis incident has been resolved.
A fix has been implemented and we're monitoring the results.
We've identified the cause of unexpected limits exceptions, and are attempting to mitigate.
We're investigating an issue where some titles are reporting unexpected errors related to limits being exceeded.
Report: "Character API instability"
Last update: A delay in database index updates resulted in inconsistent behavior related to characters; for example, after a Character was successfully created, it could not be found. While all of the operations eventually succeeded, the delay affected multiple games. The issue has now been corrected.
Report: "Service Degradation on Login APIs"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating increased errors with our Login* APIs.
Report: "Reports have been delayed since March 28th."
Last update: This incident has been resolved.
The issue has been identified and a fix is being implemented.
A fix has been implemented and we are monitoring the results.
We are currently investigating an issue with processing analytics reports and daily title report emails from March 28th.
Report: "MPS Build and Allocation Failures"
Last update: On March 24th, 2025, between 9:00 PM and 12:00 AM PST, customers intermittently encountered failures with PlayFab Multiplayer Server (MPS) APIs, such as build or allocation calls. The incident was caused by the unhealthy state of a cluster, which was triggered by an experimental feature enabled by a high load customer, resulting in pod restarts due to high CPU usage. We resolved the issue by deploying a hotfix to address the bug.
### Impact
During the incident, customers experienced intermittent failures when using MPS APIs. The issue was isolated to titles leased to a specific cluster. Titles on other clusters were not affected.
### Root Cause Analysis
The root cause of the incident was pod restarts triggered by unnecessary recurring calls from a new experimental feature enabled by a high load customer. The feature caused grains to initialize with leases in stamps not associated with the title, leading to delays in processing heartbeat requests. This filled the message queue on the grain, resulting in excessive CPU usage and pod restart events.
### Action Items
To prevent similar incidents from occurring in the future, we have implemented the following actions:
* Verified the functionality of the experimental feature.
* Checked flags in Cosmos DB and API usage for predictive standby.
* Initiated a re-evaluation of the design and test coverage for the feature.
One of our clusters experienced issues processing calls between 4 AM and 7 AM UTC on March 25th, causing some calls to fail. The issue was resolved after deploying a fix, but customers may have intermittently encountered failures with MPS APIs, such as build or allocation calls, during that time.
Report: "Experimentation Service Degradation"
Last update: Between 2025-03-18 19:00 UTC and 2025-03-19 17:00 UTC, some customers saw InternalServerErrors being returned from the Experimentation/GetExperiments API, or a blank page when loading the Experiments page in Game Manager. The incident was caused by the deployment of a bad configuration used by the experimentation service. We resolved the issue by fixing the configuration and redeploying the impacted service.
### Impact
Any title that attempted to retrieve their experiment information via Game Manager or API saw InternalServerErrors or a blank Experiments page for the duration of the incident. There was no impact to any other Experimentation APIs or to the operation of the experiments themselves.
### Root Cause Analysis
The root cause of the incident was the rollout of an incorrect configuration used by the Experimentation service.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
· Enhanced monitoring and alerting systems to detect and report errors in service-to-service communication.
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
We are currently experiencing an issue with the experimentation page not loading correctly. Our team is actively investigating and working to resolve the problem as quickly as possible.
Report: "5XX error rate and higher latency across many PlayFab APIs"
Last update: This incident has been resolved.
We experienced an increased number of 5XX errors; we have identified the issue and mitigated the cause. We're currently monitoring the recovery.
Report: "Game Manager Login Issues"
Last update: This incident has been resolved.
We are monitoring Game Manager account issues. Customers who are experiencing login issues should be assured that their data is unaffected and will be available once their account access has been restored. If your account has been locked, please use the Contact Us form located at https://playfab.com/contact to have your account unlocked. For now, we recommend that customers seeing the PlayFab Account to Microsoft Account migration flow select "Migrate Later".
We are continuing to troubleshoot Game Manager account issues. Customers who are experiencing login issues should be assured that their data is unaffected and will be available once their account access has been restored. If your account has been locked, please use the Contact Us form located at https://playfab.com/contact to have your account unlocked. For now, we recommend that customers seeing the PlayFab Account to Microsoft Account migration flow select "Migrate Later".
We have resolved the Game Manager login and logout issues. We are investigating an issue with PlayFab account to Microsoft Account (MSA) migration. For now, we recommend that customers seeing the account migration flow select "Migrate Later".
We have identified the issue and are working on deploying a mitigation.
Some customers are currently experiencing issues logging into or out of Game Manager. We are currently investigating the issue. In the interim, we recommend that customers migrating to MSA select "Migrate Later" in the MSA migration flow.
Report: "Delay in event delivery to Data Connections and Data Explorer"
Last update: PlayFab engineers identified an issue with event delivery that resulted in a maximum delay of 50 minutes across all titles. This issue has been fixed, and all delayed data has been processed.
Report: "Errors when creating players in new titles in Game Manager"
Last update: We have reprocessed the titles whose namespaces were not properly initialized. All titles are expected to be fully functional.
A fix has been deployed. New titles created now are able to create players. We are now looking for titles that have been created since the issue was introduced.
When a new title is created in a new namespace, player creation fails. We are working on a fix.
Report: "Authentication with Microsoft account failing for some users"
Last update: The incident has been resolved.
A fix has been deployed; we're monitoring the service status and rate of errors.
Authentication is failing for some users authenticating with Microsoft accounts. We're currently investigating.
Report: "Increased Error Rates for PSN APIs"
Last update: Between 3:15 PM PST on February 7th, 2025, and 3:00 PM PST on February 8th, 2025, some customers experienced increased latency and intermittent failures when making PlayStation Network-related calls (e.g., LoginWithPSN, RedeemPlayStationStoreInventoryItems) on PlayFab's service. The incident was caused by a PlayStation Network outage leading to a high rate of InternalServerErrors for API calls dependent on the PlayStation Network. We monitored the situation and the issue was resolved once the PlayStation Network recovered.
### Impact
All PlayStation Network-related API calls experienced increased latency and an intermittent failure rate over the course of 24 hours. This incident impacted the overall availability SLA for PlayStation-related services during this period.
### Root Cause Analysis
The root cause of this incident was an external issue with the PlayStation Network. The PlayStation Network outage was outside of our direct control and required intervention from PlayStation to resolve.
### Action Items
• Improve our communication protocols with partners to receive timely updates on outages and recovery status.
• Enhance monitoring and alerting systems to detect and report anomalies in a more granular manner for external dependencies.
This incident has been resolved.
We are observing increased errors for PSN APIs because of a third party outage and are currently monitoring.
We are observing increased errors for the LoginWithPSN APIs because of a third party outage and are currently monitoring.
Report: "PlayStream Processing Delay"
Last update: On February 12th, 2025, between 11:09 AM and 12:14 PM PST, some customers experienced delays in the updating of leaderboard dashboards due to an issue with the PlayStream processor. The incident was caused by a failed authentication error from a network configuration change which was not correctly assigned to the managed identity. We resolved the issue by deleting the stats processor pods in the partially created cluster and ensuring the monitor reported healthy status.
### Impact
The delay in updating leaderboard dashboards lasted 1 hour and 4 minutes, affecting the PlayStream processor's ability to update its processing status.
### Root Cause Analysis
The root cause of this incident was a human error in configuration. A new rollout was initiated earlier in the day, but the cluster was not fully created and the deployment should have been for an earlier version. This incomplete status led to missing role assignments and managed identities, resulting in authentication errors in the stats processor.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
· We have improved our testing and validation procedures for network configuration changes to catch such bugs before they reach production.
· We have enhanced our monitoring and alerting systems to detect and report any anomalies in the load balancer's behavior and performance.
· We investigated and fixed the health probe for the PlayStream processors to ensure proper assignment of managed identities.
There was a PlayStream incident between 10:50 AM and noon, which caused a delay in updating the leaderboard dashboards. The issue was resolved at noon, and processing has returned to normal.
Report: "Reduced API availability"
Last update: On January 22, 2025, between 10:44 AM and 11:15 AM PST, some customers experienced increased latency in PlayFab's API. The incident was caused by a network configuration issue during the migration to new Redis instances, which resulted in ports being blocked. We resolved the issue by rolling back to the previous Redis cluster and restarting the pods.
### Impact
The APIs experienced increased latency; however, the availability remained above the Service Level Objective (SLO).
### Root Cause Analysis
The issue was caused by the migration to new Redis instances, which resulted in the use of ports that were not included in the exclusion list. The issue was not detected sooner because the alert was set as severity 4 and was not noticed immediately. Availability numbers were not impacted by the change.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
· Exclude the full range of Redis ports.
· Improved our testing and validation procedures for network configuration changes to catch such issues before they reach production.
· Improved deployment process of infrastructure changes by rolling out updates to a subset of users.
This incident was resolved yesterday afternoon. Apologies for the late update.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
Customers have been experiencing a reduction in API availability and increased latency since an infrastructure upgrade. We are investigating and preparing to roll back the infrastructure change.
Report: "Reduced availability of Group Apis"
Last updateThere was a dip in availability of the following apis from 8.46am PST to 9 am PST. - Group/ListMembershipOpportunities - Group/ListGroupInvitations
Report: "Legacy Multiplayer APIs unavailable"
Last update: This incident was resolved at 23:51 UTC.
Affected APIs:
- Matchmaker/RegisterGame
- GameServer/SetGameServerInstanceTags
- GameServer/RefreshGameServerInstanceHeartbeat
- GameAcquisition/Matchmake
Report: "API execution times degraded"
Last update: This incident has been resolved; API response times have returned to normal.
A fix has been deployed; we're monitoring the service.
We are aware of and investigating an issue that is causing API execution times to be slowed.
Report: "Increased error rates for multple Economy V2 APIs, increased delayed for PlayStream events, and increased delay for maketplace APIs"
Last updateOn January 8th, 2025, between 5:30 PM and 9:02 PM PST, some customers experienced delayed playstream processing and increased error rates for economy requests when accessing PlayFab's API. The incident was caused by network connectivity issues from backend services. We resolved the issue by migrating traffic to a health backend. ### Impact The incident resulted in delayed playstream processing and increased error rates for economy requests. Additionally, the Economy V2 SLA dipped to 99% reliability during the impact period. ### Root Cause Analysis The root cause of the incident was identified as network connectivity errors from a specific backend. The connectivity issues were not caused by any recent changes. ### Action Items To prevent similar incidents from happening again, we have taken the following actions: · Enhanced our monitoring and alerting systems to detect and report any anomalies in the load balancer's behavior and performance. · Automatic failure when backend services are impacted
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue is under investigation. Multiple Economy APIs have degraded performance, including Inventory APIs, redemption APIs, and PlayStream for Economy.
Report: "Increased ConcurrentEditError Response Rate When Accessing User Title Data"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
Some titles have been experiencing an increased ConcurrentEditError response rate when accessing user title data since 1/13. We have identified the cause of this issue and are in the process of resolving it.
Report: "Errors when creating or editing Data Connections"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are investigating an issue where customers may receive an error when using the PlayFab portal to create or edit a Data Connection for their title. Existing Data Connections are not impacted.
Report: "Player data unavailable for a small subset of players"
Last update: For five days, between January 10, 2025 and January 15, 2025, accessing data for player profiles with custom data attachments that were "null" and set to public permissions would fail with an "InternalServerError". These now return successfully.
Report: "January 13 reports are delayed"
Last update: This incident has been resolved. We identified the issue and reprocessed the impacted reports.
We are currently investigating an issue with processing analytics reports for January 13.
Report: "Increased Economy V2 Search Latency"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "Processing delays impacting scheduled tasks."
Last update: On December 11th, 2024, between 9:20 AM and 11:20 AM PST, some customers experienced processing delays with PlayFab's scheduled tasks. The incident was caused by a bad configuration change. The issue was resolved by reverting the configuration change.
### Impact
During the incident, the scheduled task processor failed to process any messages for approximately 2 hours. Scheduled tasks queued to run during this time were delayed or did not trigger, but no customer data was lost.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
* Created a repair item to update our mock unit tests to catch regressions related to this configuration change.
* Investigated why end-to-end tests and integration deployment did not catch this issue.
* Added a new production alert for no messages processed within a specified time frame.
* Adjusted existing production alerts to trigger faster when no scheduled tasks/messages are processed.
We observed delays in processing scheduled tasks between 9:30 AM - 11:20 AM PST. During this time, scheduled tasks that were scheduled to execute may have not executed. This incident is now resolved.
We are investigating an issue impacting message processing related to scheduled tasks.
Report: "Event Export jobs to AWS S3 are paused."
Last update: On December 9th, 2024, between 3:00 PM and 11:00 AM PT the next day, some customers experienced issues with corrupted export files when accessing PlayFab's event data export to S3 buckets. The incident was caused by a bug introduced during an update to legacy code. We resolved the issue by deploying a hotfix and reprocessing the corrupted data.
### Impact
There were 58 titles affected by this incident, specifically those configured for event exports to S3 buckets. The exports contained invalid characters, causing downstream parsing and decompression issues. The affected data was backfilled successfully by December 11th, 2024, at 7:00 PM PT. During the mitigation, exports to S3 were paused to prevent further impact.
### Root Cause Analysis
The bug in the export process was introduced during an update to legacy code, which led to additional padding bytes being included in the export data. The codebase had not been actively maintained and lacked end-to-end tests, leaving the bug undetected during manual testing.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
* Enhanced our monitoring and alerting systems to detect anomalies in export data.
* Refactored the code for downloading and uploading blobs to S3.
* Added end-to-end tests for exports to blob and S3.
* Created tools for backfilling corrupted data.
Reprocessing of S3 Event Exports for the period between Dec 9th, 1:00 PM PST, and Dec 10th, 6:00 PM PST has been completed. Customers are advised to check their S3 buckets for the updated data.
A fix has been deployed and we have resumed processing S3 Event Exports. The engineering team is working on going back to reprocess exports that may have had missing or corrupted data between Dec 9th, 1pm PST and Dec 10th, 6pm PST. We will post additional updates when reprocessing is completed.
The issue has been identified and a fix is being implemented.
We have identified an issue with Event Export jobs to S3 where some uploads contain invalid characters that may cause issues with parsing or decompressing the contents. S3 Event Export jobs are being paused and data is queued while we investigate and deploy a fix, at which time jobs will resume.
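To illustrate the kind of downstream decompression failure described in this incident, the sketch below shows how trailing padding bytes appended after a compressed payload can be detected and tolerated on the consumer side. It assumes gzip-compressed export objects and NUL-byte padding; neither detail is confirmed by the incident record, so treat this as a hypothetical illustration rather than PlayFab's actual export format or tooling.

```python
# Hypothetical illustration (not PlayFab tooling): detect and skip trailing
# padding bytes that were appended after a gzip-compressed export object.
import gzip
import zlib


def decompress_tolerating_padding(blob: bytes) -> bytes:
    """Decompress a gzip blob even if extra bytes follow the gzip stream."""
    decomp = zlib.decompressobj(wbits=zlib.MAX_WBITS | 16)  # 16 => gzip wrapper
    payload = decomp.decompress(blob) + decomp.flush()
    if decomp.unused_data:
        # Bytes left over after the gzip stream ended; strict readers treat
        # these as corruption, which matches the parsing failures described.
        print(f"ignored {len(decomp.unused_data)} trailing byte(s) of padding")
    return payload


if __name__ == "__main__":
    clean = gzip.compress(b'{"EventName": "player_logged_in"}\n')  # sample record
    padded = clean + b"\x00" * 8  # simulate extra padding bytes after the stream
    print(decompress_tolerating_padding(padded).decode())
```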
Report: "Economy V2 increase in Service Unavailable errors"
Last update: On December 5th, 2024, between 5:48 AM and 8:30 AM PST, some customers experienced intermittent errors and delays when accessing PlayFab's Catalog API. The incident was caused by an isolated issue which resulted in connection failures to the Catalog APIs. We resolved the issue by restarting the problematic instance and applying an ad-hoc fix.
### Impact
During the incident, customers faced reduced Catalog API availability.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
* Enhanced our monitoring and alerting systems to detect and report any network anomalies.
Economy V2 Catalog APIs are experiencing an increase in 503 Service Unavailable errors.
Report: "Economy V2 Service Availability"
Last update: The issue has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "5XX error rate and higher latency across many PF APIs"
Last updateHigh number of 5XX errors and higher latency across multiple PlayFab APIs
Report: "Economy V2 APIs timing out"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating this issue.
Report: "Data Explorer advanced not working for some titles"
Last update: Between 13:07 PST on November 14th, 2024, and 02:48 PST on November 15th, 2024, some customers experienced issues accessing player data from the Data Explorer pages in PlayFab's Game Manager. The incident was caused by a misconfigured internal URL in the service powering the Data Explorer page. The issue was resolved by a configuration update that corrected the URL.
### Root Cause Analysis
The incident was triggered by a recent service update that changed how Game Manager sends requests to the PlayFab Insights databases. A bug in the code caused the incorrect domain to be used for some titles, resulting in errors when users tried to query past events.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
* Fixed the bug in the code to ensure the correct domain configuration.
Between 2024-11-13 06:40 UTC and 2024-11-15 00:45 UTC, new titles and titles that reset their query connections may have been unable to query data using Data Explorer advanced. Data Explorer basic was not impacted by this issue. There was no impact to the underlying data or data ingestion during this time.
Report: "Developer authentication to PlayFab UI"
Last update: The incident has been resolved.
A fix has been implemented; we're monitoring now.
We're aware of an issue with authentication to the PlayFab web interface. We believe the source of the issue is understood and are attempting to mitigate.
Report: "Increased API Error Response Rate for Redemption"
Last update: On November 11th, 2024, between 4:30 PM and 6:30 PM PST, some customers experienced a high rate of internal server errors when accessing PlayFab's Inventory/Redeem APIs. The incident was caused by a new deployment that introduced a code defect, resulting in increased 500 error responses. We resolved the issue by rolling back the deployment.
### Root Cause Analysis
The root cause of the incident was a human error in the code, where an incorrect config was deployed leading to new Redeem API errors.
### Action Items
· To prevent similar incidents from happening again, we improved our testing and validation procedures.
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
Most Redeem APIs for Economy V2 have been affected due to a recent deployment and present degraded performance. We have identified the issue and are currently rolling out a fix.
Report: "Errors in Data Explorer Search"
Last update: This incident has been resolved.
The issue has been identified and a fix is being implemented.
We are continuing to investigate this issue.
We are experiencing errors in Data Explorer Search. We are investigating.
Report: "Delay in PlayStream Entity Events"
Last update: The issue has been resolved. PlayStream services should be operating normally.
We have deployed the fix for the identified issue. Engineers are continuing to monitor to ensure there are no further issues.
The issue has been identified and a fix is being implemented.
We are currently investigating an issue that is causing delays in PlayStream entity events since 2024-11-12 23:40 PST (2024-11-13 07:40 UTC)
Report: "Delayed transaction history for Economy V2"
Last update: This incident has been resolved.
We are currently investigating this issue.
Report: "Reports and Trends reporting incorrect metrics for 11/2/2024"
Last update: The issue has been resolved and all services should be operating normally.
We are currently investigating this issue.
Report: "Data from Insights S3 Exports is delayed"
Last update: From 2024-10-29 19:15 UTC until 2024-10-30 18:15 UTC, some customers experienced delays in data exports to S3. This incident was caused by a misconfiguration in API routing that impacted the export service. The issue was resolved by correcting the configuration used for API routing.
### Impact
All Insights data exports to S3 were delayed due to errors in the export service. There was no data loss; the export service continued to retry until success. Data caught up in less than an hour after the configuration was fixed.
### Root Cause Analysis
The root cause of the incident was a human error in configuration.
### Action Items
To prevent similar incidents from happening again, we have taken the following actions:
· Improved our monitoring and alerting systems to detect and report anomalies in the export service's API requests.
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
We are currently investigating delays exporting Insights data to S3. Impact started around 2024-10-29 19:00 UTC.
Report: "Performance Issues for Economy V1 Catalog Game Manager Page"
Last update: Slow performance on the Economy v1 catalog page has been resolved.
A fix has been implemented and we're monitoring the results.
We have identified a performance issue on the Economy V1 catalog Game Manager page, where catalogs with more than 300 items have difficulty loading.
Report: "Incomplete Title Overview data in Game Manager"
Last update: The incident has been resolved; dashboards now load as expected.
The issue has been identified and a fix is being implemented.
We are currently investigating an issue that is preventing some data from loading in the Title Overview page in Game Manager.
Report: "Multiplayer servers are experiencing issues - [Region: South Central US]"
Last update: The issue has been resolved and all services should be operating normally.
The issue is now resolved.
Quick update: this issue still persists, and we recommend that customers use other regions (such as East US) instead of South Central US at this time.
This issue continues to persist. We recommend that customers use other regions (such as East US) instead of South Central US at this time.
We are continuing to work on a fix for this issue.
We are currently experiencing service issues due to limited compute capacity availability in the ‘South Central US’ Azure region. The team is actively working to resolve this issue and restore full service as quickly as possible. We apologize for any inconvenience this may cause and appreciate your patience.