Gainsight

Is Gainsight Down Right Now? Check whether an outage is currently ongoing.

Gainsight is currently Operational

Last checked from Gainsight's official status page

Historical record of incidents for Gainsight

Report: "CS - US1: Investigating Elevated Error Rates"

Last update
postmortem

**Incident:** 27th May, 2025: Some customers may have experienced elevated levels of latency or issues logging into our CS-US1 application. **Root Cause:** A previously undetected cache issue was discovered. This issue caused increased latency for some services. **Recovery Action:** As alerts were received, we blocked queues to investigate. Once the source was identified, a fix was applied which restored performance and availability. Queues were unblocked once stability was confirmed. **Preventive Measures:** The mentioned fix has resolved this issue. We will continue to review and learn from issues like this in an effort to provide the best possible customer experience. Please contact [support@gainsight.com](mailto:support@gainsight.com) if there are any questions.

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are investigating a sudden increase in error rates which may lead to degraded performance or service interruption.

Report: "CS - US1: Investigating Elevated Error Rates"

Last update
Postmortem
Resolved

This incident has been resolved.

Monitoring

A fix has been implemented and we are monitoring the results.

Investigating

We are investigating a sudden increase in error rates which may lead to degraded performance or service interruption.

Report: "CS - EU: Investigating Elevated Error Rates"

Last update
resolved

This incident has been resolved. Self healing measures were triggered to resolve this issue and we monitored for an extended time to ensure stability. We are making threshold adjustments to prevent similar issues moving forward. Please contact support@gainsight.com with any questions.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are investigating a sudden increase in error rates which may lead to degraded performance or service interruption.

Report: "Gainsight CS - US1 - Latency"

Last update
resolved

Queues are back to normal levels.

monitoring

We have identified the issue causing delays and taken recovery actions. Queues are now processing as expected and the backlog will clear in the next couple of hours. We are monitoring closely.

investigating

We are investigating reports of latency in processing queues of CS-US1 environment. We will post further updates as we have more details.

Report: "Restrospective: Issues with C360"

Last update
resolved

Customers may have experienced issues accessing tabs in C360 today. We have identified and resolved the issue.

Report: "Identified rule failures"

Last update
resolved

The Salesforce issue has been resolved. Gainsight Administrators would have received details regarding any failed rules and can re-run where needed.

monitoring

We are still tracking the ongoing Salesforce incident and will update our status once resolved.

identified

Salesforce status now reports "Feature Degradation" which could still impact the experience for some Gainsight users. We will continue to monitor Salesforce and will update our status once resolved.

identified

We will continue to monitor Salesforce and will update status once resolved.

identified

We have identified that rule failures are occurring due to an ongoing Salesforce (SFDC) incident. Our team is actively monitoring the situation and will provide updates as more information becomes available.

Report: "Delays in Journey Orchestrator"

Last update
resolved

Queues have returned to expected levels.

monitoring

We have identified an issue with the Journey Orchestrator service. A fix has been applied to resolve the issue and we are monitoring it closely. We anticipate a delay in processing, which should catch up within a few hours.

Report: "Gainsight CS - US1 - Degraded Performance"

Last update
resolved

The issue is resolved. We will share RCA details as they become available.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are observing degraded performance in CS-US1.

Report: "Gainsight CS - US1 - Latency"

Last update
postmortem

**Incident:** 26th August, 2024: Some customers may have experienced elevated levels of latency while using our CS-US1 application.

**Root Cause:**

* There was an increased frequency of database calls tied to a recent hotfix. The increased traffic was intermittent but led to slower API response times, which impacted the customer experience in the browser.
* Customers in CS-US1 may have experienced the latency while browsing. Impact was lower in our CS-US2 and CS-EU regions.
* Sanity testing in lower environments did not initially reproduce this issue, which was only observed with production traffic in US1.

**Recovery Action:** Once the source was identified, a subsequent fix was applied which restored performance to expected levels.

**Preventive Measures:** The mentioned fix has resolved this issue. We will continue to review and learn from issues like this in an effort to provide the best possible customer experience. Please contact [support@gainsight.com](mailto:support@gainsight.com) if there are any questions.

resolved

The issue is resolved. Systems are back to their normal performance level. We will share the RCA as soon as it is available.

monitoring

We are continuing to monitor for any further issues.

monitoring

We are keeping the incident open at this time to monitor for further issues.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are investigating reports of latency with loading pages in CS-US1. We will post further updates as we have more details.

Report: "Retrospective: CS - US1 Export Failures"

Last update
resolved

CS-US1: Some customers may have experienced issues while exporting from Dashboard and Success Snapshot from 13:00 to 18:00 UTC. We restarted related services which resolved this issue. Please email support@gainsight.com with any questions.

Report: "Gainsight CS - EU - Elevated errors in NXT Authentication"

Last update
postmortem

**Incident Summary for issue on 28 May 2024 (External)** **Gainsight CS - EU - Elevated errors in NXT Authentication**

On **2024-05-28** between **07:31 and 08:45 UTC**, users of the Gainsight Application in the CS EU Cloud experienced intermittent application availability issues. The Gainsight UI was inaccessible for approximately 75 minutes during this window.

**Root Cause:** Investigations have identified the following cause of the incident:

* An infrastructure component, specifically the backend worker service (Kubernetes Karpenter), was upgraded to a newer version to patch critical security and other updates.
* This change had already been successfully executed in the STAGE and other PROD environments.
* During the EU environment upgrade, all metadata configurations were transferred except for one critical rule.
* The missing rule allowed for UDP communication to DNS servers.
* Due to the absence of this rule, DNS requests could not be resolved, causing microservices on newly provisioned worker nodes to fail. Microservices on older worker nodes were unaffected.
* These failures resulted in a significant number of stale threads/connections in a short time frame, rendering the API Gateway unresponsive.
* Updating the missing rule in the Network Security Group and reprovisioning the worker nodes resolved the issue.
* Pending rule jobs were either skipped or resubmitted as necessary.

**Recovery Action:**

1. Updated the missing UDP rule in the Network Security Group.
2. Restarted all affected services.

**Preventive Measures:**

1. Ensure network rule consistency before and after any upgrade – this process has been initiated.
2. Schedule critical security updates and even low-risk infrastructure changes during non-peak hours, despite previous successes in other environments, to minimize impact.
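The missing rule in this incident blocked UDP traffic to the DNS servers, so services on newly provisioned worker nodes could not resolve names. A lightweight way to catch this class of regression before a node takes traffic is a DNS resolution smoke test run during provisioning. The sketch below is purely illustrative and is not Gainsight's tooling; the hostnames are hypothetical placeholders and only the Python standard library is assumed.

```python
import socket
import sys

# Hypothetical targets: substitute whatever internal and external hostnames
# the services on a new worker node actually depend on.
HOSTNAMES = ["example.com", "kubernetes.default.svc.cluster.local"]

def can_resolve(hostname: str) -> bool:
    """Return True if the system resolver can resolve the hostname."""
    try:
        socket.getaddrinfo(hostname, None)
        return True
    except socket.gaierror:
        # Raised when resolution fails, e.g. because UDP/53 egress is blocked.
        return False

if __name__ == "__main__":
    failures = [name for name in HOSTNAMES if not can_resolve(name)]
    if failures:
        print("DNS resolution failed for: " + ", ".join(failures))
        sys.exit(1)  # non-zero exit lets a provisioning pipeline hold the node back
    print("DNS resolution OK")
```

Run from a newly provisioned node (or a pod scheduled onto it), a non-zero exit can gate the node out of service until the network rules are confirmed.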

resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and all services are back to normal. The queues have also been released and the jobs will catch up in the next couple of hours. We are monitoring closely.

identified

The issue has been identified and a fix is being implemented.

investigating

Gainsight NXT application is still down. We are working with the upstream service provider. We will post updates as soon as they are available.

investigating

We are investigating errors while logging into the Gainsight NXT application. We will post updates as soon as they are available.

Report: "CS - US1, US2, and EU: We are investigating intermittent errors tied to Cockpit / CTA360."

Last update
resolved

The issue is now resolved and all systems are back to normal.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are investigating an increase in error rates for Cockpit / CTA360, which may lead to degraded performance or service interruption. This may impact performance for services tied to CTA including Timeline.

Report: "Gainsight CS - US1, US2 and EU - Investigating Elevated Error Rates in Rules and Data Designer"

Last update
resolved

The issue is now resolved and all systems are back to normal.

monitoring

A fix has been implemented and we are closely monitoring the results.

identified

We have identified the issue causing the failures and are working on a fix.

investigating

We are investigating a sudden increase in error rates of Rules and Data Designer, which may lead to degraded performance or service interruption.

Report: "Gainsight CS Slowness"

Last update
resolved

This incident is now resolved.

monitoring

We have identified the issue and implemented a fix. The pages are now loading normally. We are monitoring closely.

investigating

We are investigating reports of Gainsight pages taking a long time to load. We will update this page with more details as soon as we have further information.

Report: "CS - US1: Investigating Email Queue Delays"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are continuing to investigate this issue.

investigating

We are investigating email service delays in CS - US1.

Report: "CS - US1: Investigating Rules Queue Delays"

Last update
resolved

Queues have returned to expected levels.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are investigating delays related to our Rules Queue and will provide an update as we have more information.

Report: "CS - US1 Region - Rules failure for a few customers"

Last update
resolved

Issue is resolved.

monitoring

We have identified and fixed an issue that has caused a few 'Rules with scorecards action' failures for a small subset of customers. Impacted customers may review their Rules and rerun the failed ones. Please reach out to support@gainsight.com if you have any questions or need help in rerunning any failures in your org.

Report: "CS - US1 - Delays with Gainsight Assist messages in Timeline"

Last update
resolved

Backlog processing is now complete and all systems are back to normal.

monitoring

A fix has been implemented and we are monitoring the results as we work through the backlog. New messages are still processing and logging as expected. A small number of customers may experience delays for messages queued earlier in the day.

identified

New messages are now processing and logging as expected. Please expect delays with existing messages as we work on the fix.

investigating

We are investigating delays with Gainsight Assist messages displaying in Email to Timeline (E2T). This includes Chrome and Outlook plugin activities.

Report: "Gainsight CS - EU - Rules and Data Designer failure for a few customers"

Last update
postmortem

**Incident:** An isolated number of customers experienced degraded performance in CS-EU Rules and Data Designer on the 9th of January, 2024. **Root Cause:** It was determined that a configuration issue in an underlying service caused the impact for Rules and Data Designer services only. **Recovery Action:** Paused the affected services and corrected the configurations. Once unpaused, services were processing as expected. **Preventive Measures:** We have made adjustments to configuration and testing controls to prevent issues like this moving forward. Please email [support@gainsight.com](mailto:support@gainsight.com) with any questions.

resolved

This incident has been resolved.

monitoring

We have identified and fixed an issue that has caused a few Rules and Data Designer failures for a small subset of customers. Impacted customers would have received failure notification and can rerun the failed rules and workflows. Please reach out to support@gainsight.com if you have any questions or need help in rerunning any failures in your org.

Report: "CS - EU: Investigating Elevated Error Rates"

Last update
postmortem

**Incident:** An isolated number of customers experienced degraded performance in CS-EU Rules on the 3rd of January, 2024. This could have also intermittently impacted the ability to log into the application. **Root Cause:** This incident was the result of an elevated number of API requests coming from a single microservice. The unexpected increase led to a build-up of connections, impacting performance on a subset of API servers. Rate limiting functionality was not configured as expected in this case. **Recovery Action:** Once the affected systems and related traffic were identified, isolating and restarting the affected API services resolved the issue immediately. **Preventive Measures:** We have corrected the rate limiter functionality for the microservice that caused this issue.
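The root cause here was a single microservice issuing far more API calls than expected while the intended rate limiting was not in place. As a generic illustration of the kind of safeguard the postmortem refers to (not Gainsight's actual implementation), a per-client token bucket caps sustained request rates while still allowing short bursts:

```python
import time

class TokenBucket:
    """Minimal token-bucket rate limiter: allows roughly `rate` requests per
    second with bursts of up to `capacity` requests."""

    def __init__(self, rate: float, capacity: float):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum burst size
        self.tokens = capacity
        self.last_refill = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        elapsed = now - self.last_refill
        self.last_refill = now
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

# Example: cap a noisy caller at 50 requests/second with bursts of 100.
limiter = TokenBucket(rate=50, capacity=100)
if not limiter.allow():
    pass  # reject or queue the request instead of forwarding it upstream
```

Placed in front of the API gateway or inside the calling service, requests that exceed the budget are rejected or queued instead of piling up connections on the API servers.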

resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are investigating a sudden increase in error rates which may lead to degraded performance or service interruption.

Report: "Gainsight CS - US2 - Rules Delays for a subset of customers"

Last update
resolved

Rules processing is back to its normal state.

monitoring

We observed delays in Gainsight CS US2 Rules processing for a subset of customers. The source of this delay has been identified and addressed. However it can take some time for the queues to catch up. The issue is now resolved and we will continue to monitor closely. Please reach out to support@gainsight.com as needed with any questions.

Report: "Gainsight CS - US1 - Degraded Performance"

Last update
postmortem

**Incident:** 12th February, 2024: Between the times of 6:30 PM and 7:00 PM UTC, some CS-US1 customers may have experienced intermittent delays in application load times. **Root Cause:** The issue was the result of query enhancements introduced in the latest release. This caused unexpected API latency which affected multiple areas of CS-US1. Application resiliency measures were effective in maintaining availability but there were still periods of poor performance, especially between 6:48 PM and 7:00 PM UTC. **Recovery Action:** Once the source was identified, the related functionality was rolled back which restored performance to expected levels. A hotfix was applied on the 14th of February for final resolution. **Preventive Measures:** The mentioned fix has resolved this issue. We will continue to review and learn from issues like this in an effort to provide the best possible customer experience. Please contact [support@gainsight.com](mailto:support@gainsight.com) if there are any questions.

resolved

This incident has been resolved. We will post RCA details as they become available.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are continuing to investigate this issue.

investigating

We are investigating reports of degraded performance in CS-US1.

Report: "Gainsight CS - US1 - Degraded Performance"

Last update
postmortem

**Incident:** 12th February, 2024: Between the times of 2:30 PM and 3:30 PM UTC, some CS-US1 customers may have experienced intermittent delays in application load times. **Root Cause:** The issue was the result of query enhancements introduced in the latest release. This caused unexpected API latency which affected multiple areas of the CS-US1 application. **Recovery Action:** Once the source was identified, the related functionality was rolled back which restored performance to expected levels. **Preventive Measures:** This issue was previously undetected as it only occurred with very high concurrency. We are modifying testing methods and monitoring solutions to detect issues like this moving forward. We will continue to review and learn from issues like this in an effort to provide the best possible customer experience. Please contact [support@gainsight.com](mailto:support@gainsight.com) if there are any questions.

resolved

Between the times of 2:30 and 3:30 PM UTC, some customers may have experienced intermittent delays in application load times. We have identified the affected nodes and have resolved the issue.

Report: "CS-US1: Delays in Rules Queue"

Last update
resolved

Queues have returned to expected levels.

monitoring

We are continuing to monitor for any further issues.

monitoring

We have observed delays in Rules Queues and we are monitoring closely.

Report: "CS - US1, US2, and EU: Investigating issues with GS Assist Chrome Plugin"

Last update
resolved

The enhanced GS Assist Chrome plugin (version 3.2.3) was published and approved by the Chrome Web Store. Please install the latest version for a better experience. Reach out to support@gainsight.com if you have any questions. Thank you for your patience.

identified

We are continuing to work on a fix for this issue.

identified

We are still working with Chrome Web Store Support to publish the enhanced Gainsight Assist Extension and will share details once they approve.

identified

The Gainsight team has submitted an enhancement for our Chrome plugin to the Chrome Web Store. However, the review and publication process may take up to 3 business days. We will update the status once the new version is published.

identified

The issue has been identified and a fix is being implemented.

investigating

We have received reports where the GS Assist Chrome Plugin for Gmail is failing to load for some customers, possibly due to interface changes in Gmail. We are investigating and will update when we have more information.

Report: "CS - EU - Investigating Delays"

Last update
resolved

Incident: Some EU customers experienced delays in Rules and Journey Orchestrator on the 8th of June, 2023. Root Cause: After noticing the delays, Engineers temporarily paused related queues to investigate. While troubleshooting, we discovered network connectivity issues tied to underlying hardware. Recovery Action: While still in a paused state, action was taken to assign affected resources to new hardware. Once back online with new hardware, jobs were processing as expected. Preventive Measures: Redundant architecture design and quick action helped prevent major impact. Still, we are inspecting monitoring, alerting, and automation to ensure optimal design. Additionally, we are confirming self-healing configurations with upstream service providers.

Report: "CS - EU - Delays observed in Connector and People Dataload jobs"

Last update
resolved

We have observed delays in Connector jobs and People Dataload jobs for a subset of customers. Our engineers have identified and fixed the issue. A few customers with Dataload Rule failures would have received failure notifications and can rerun the failed rules and workflows.

Report: "CS - EU: Investigating Elevated Error Rates"

Last update
resolved

We have identified the issue with an upstream service provider. New logins may experience slowness or errors, while existing sessions are not affected. We are working with the service provider on a fix.

Report: "CS - US1, US2, and EU: Investigating Elevated Error Rates with Authentication"

Last update
resolved

This incident has been resolved.

monitoring

We have identified an issue with an upstream authentication service provider which may have caused errors for some users while accessing Gainsight CS applications. US1 and US2 applications are working as expected at this time but we are monitoring closely.

investigating

We are investigating intermittent errors tied to Gainsight CS authentication. Some users may experience issues with logging in. Users who are already logged into NXT tenants or using the SFDC version of Gainsight CS are not affected. Others may experience slowness while logging in. We will post updates as they are available.

Report: "CS - US1: Cloud Storage Service Interruption"

Last update
resolved

This incident has been resolved. Any failed rules related to this issue would have triggered notifications.

monitoring

Within the last 90 minutes, upstream cloud storage functionality was unavailable intermittently for DataOperations or Dataload jobs, which would have also impacted Rule report downloads for some customers. All functionality is back to normal.

Report: "Investigating Elevated Error Rates with Authentication in US1 and US2"

Last update
resolved

This incident has been resolved and authentication services are performing normally.

identified

We have identified an issue with our authentication service provider which may cause errors for some users while accessing the Gainsight CS application. Users who are already logged into NXT tenants or using the SFDC version of Gainsight CS are not affected. Others may experience slowness while logging in. We are closely monitoring and will post updates as they are available.

Report: "CS-US1 - Degraded performance in Timeline"

Last update
postmortem

**Incident:** Some customers may have experienced degraded performance in CS-US1 Timeline services between 15:44 and 15:48 UTC on the 28th of July. **Root Cause:** A memory issue on multiple web servers led to Timeline becoming unresponsive intermittently, while other servers in the pool continued to serve traffic. **Recovery Action:** Monitoring systems alerted engineers to a memory issue on Timeline web servers. Engineers quickly made adjustments and redeployed Timeline web servers as a temporary fix. **Preventive Measures:** A permanent hotfix will be deployed in our next release window.

resolved

Some customers may have experienced slowness or errors while trying to load Timeline in CS-US1. Engineers responded quickly to restore services. We will post root cause details as they become available.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are investigating performance issues related to Timeline and will update as we have more information.

Report: "CS-US1 - Investigating - Elevated error rates"

Last update
postmortem

**Incident:** Some customers may have experienced degraded performance while trying to load CS-US1 services between 18:25 and 18:38 UTC on the 18th of July. **Root Cause:** API Gateway instability due to network contention at the data layer was found to be the root cause. **Recovery Action:** Engineers performed a rolling restart of API services once this issue was detected. **Preventive Measures:** System configuration adjustments have been made to prevent these issues moving forward.

resolved

Some customers may have experienced slowness or errors while trying to load CS-US1 services between 18:25 and 18:38 UTC. Engineers responded quickly to restore impacted services. We will post root cause details as they become available.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are continuing to investigate this issue.

investigating

We are investigating elevated error rates which may result in slowness or availability issues for some customers. More details to follow.

Report: "Investigating Elevated Error Rates with Authentication"

Last update
postmortem

**Incident:** Between 16:14 and 17:16 UTC on the 13th of March, CS-US1 customers may have experienced errors while trying to log into the application. Users who were already logged into NXT tenants or using the SFDC version of Gainsight were not affected. Others may have experienced slowness while logging in. **Root Cause:** This was caused by a widespread network connectivity issue with an upstream Cloud Provider tied to Authentication Services used by our applications. **Recovery Action:** While the Service Provider evaluated manual failover, the network issue at the Cloud Provider level was resolved before further action was required. **Preventive Measures:** The upstream Service Provider is reviewing their detection logic and has enhanced their monitoring mechanisms to identify issues like this moving forward.

resolved

This incident has been resolved. We will relay RCA details as they become available.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We have identified an issue with our authentication service provider which may cause errors for some users while accessing the Gainsight NXT application. Users who are already logged into NXT tenants or using the SFDC version of Gainsight are not affected. Others may experience slowness while logging in. We are closely monitoring and will post updates as they are available.

Report: "US1 - Connector Delays"

Last update
postmortem

**Incident:** Beginning around 12:00 UTC on the 13th of June, Engineers were alerted to elevated queue levels for Connector services in CS-US1. **Root Cause:** A leader node was found to have higher than usual disk activity which prevented optimal job execution for Connector services. **Recovery Action:** Engineers scaled the number of Connector instances to correct the issue temporarily. Additionally, Engineers skipped long-running and duplicate jobs to help recover. **Preventive Measures:** System configuration adjustments have been made to prevent these issues moving forward.

resolved

This incident has been resolved. A subset of customers faced connector queue delays during this incident window. We will add RCA details as they become available.

monitoring

A fix has been implemented and we are monitoring. The Connectors queue was blocked for analysis and troubleshooting during this incident. We have since unblocked, and any duplicate sync jobs were aborted with no data impact. Please expect delays while the queue clears.

identified

The issue has been identified and a fix is being implemented.

investigating

Beginning around 12:00 PM UTC today, we detected a delay in Connectors traffic and adjusted accordingly. As we still have queue delays, we are investigating further and will update as more information becomes available.

Report: "CS-US1 - Data Designer Jobs delayed"

Last update
resolved

We have identified and resolved the issue. We expect the pending jobs to complete in the next 2 hours.

investigating

We are investigating reports of delays in Data Designer Jobs. We will post further updates as more information becomes available.

Report: "CS-US1 - Delays observed in Connector and People Dataload jobs"

Last update
resolved

All queue backlogs have cleared. A small subset of customers with Rules failures would have received failure notification and can rerun the failed rules and workflows.

monitoring

A fix has been implemented and we are monitoring. The Connectors queue was blocked for analysis and troubleshooting during this incident. We have since unblocked, and any duplicate sync jobs were aborted with no data impact. Please expect delays while the queue clears.

identified

We have observed delays in Connector jobs and People Dataload jobs for a subset of customers. Our engineers have identified the issue with an underlying infrastructure component. We are working actively to resolve it. Please reach out to support@gainsight.com with any questions.

Report: "CS-US1 - Retrospective: Isolated Rule Delays"

Last update
resolved

We have received reports of delays in rules executions for a subset of customers. Initial impact started around 6:00 AM UTC on the 27th of July. The source of this delay has been identified and addressed. However it can take some time for the queues to catch up. The issue is now resolved and we will continue to monitor closely. Please reach out to support@gainsight.com as needed with any questions.

Report: "US1 Region - Queues blocked for investigation"

Last update
resolved

This incident has been resolved. Our Network Operations Center Team detected errors from backend systems and immediately blocked queues to prevent major impact. After careful analysis, we were able to resolve and unblock queues. Please reach out to support@gainsight.com with any questions.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are investigating an increase in backend exceptions and have temporarily blocked queues to prevent impact.

Report: "US1 Region - Rules and Data Designer failure for a few customers"

Last update
resolved

We have identified and fixed an issue that has caused a few Rules and Data Designer failures for a small subset of customers. Impacted customers would have received failure notification and can rerun the failed rules and workflows. Please reach out to support@gainsight.com if you have any questions or need help in rerunning any failures in your org.

Report: "US1 Region - Rules Workflow failure for few customers"

Last update
resolved

We have identified and fixed an issue that has caused a few 'Rule Workflows' failures for a small subset of customers. Impacted customers may review their Rules and rerun the failed workflows. Please reach out to support@gainsight.com if you have any questions or need help in rerunning any failures in your org.

Report: "Retrospective: Sandbox Creation Failures"

Last update
postmortem

**Incident:** Attempts to clone or create new tenants were failing. This included customer sandboxes which would be cloned from production instances. **Root Cause:** We discovered that creation would fail at the instance configuration stage. After review with an upstream service provider, we learned there were unannounced changes made with this provider which affected API behavior and ultimately led to the failures. **How this was missed:** We were not made aware of the mentioned API changes until reviewing with the vendor. **Recovery Action:** Once the source of the issue was determined, we made adjustments to API calls which fixed the issue. **Preventive Measures:** We're working to improve communication with service providers to ensure awareness of pending changes before they happen.

resolved

This incident has been resolved. The sandbox creation issues were tied to API call failures with an upstream service provider. We will release more details as they become available.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are continuing to investigate this issue.

investigating

Thank you for your patience as we continue to investigate.

investigating

Customers may experience issues while creating sandboxes from Sandbox Management. Sandbox requests are currently being queued while we investigate.

Report: "Retrospective: Isolated Rule Failures"

Last update
resolved

This incident has been resolved.

monitoring

We have received reports of Rules failing for a subset of customers. The status of Rules execution was spuriously marked as failed in spite of a successful execution. The source of this anomaly has been identified and addressed. The issue is now resolved and we will continue to monitor closely. Please reach out to support@gainsight.com as needed with any questions.

Report: "US1 Region - Rules Engine and Data Processing Delays for few customers"

Last update
resolved

Queues have returned to expected levels. If you have questions, please open a ticket with us by sending a note to support@gainsight.com.

monitoring

We have identified delays in the Rules Engine and Data Processor Queue for some Customers. We have taken recovery action to help clear the backlog and are closely monitoring as the Queues continue to drain. We anticipate a delay of about 4 hours in the processing. If you have questions, please open a ticket with us by sending a note to support@gainsight.com.

Report: "Partial Impact of Gainsight Home"

Last update
postmortem

**Incident:** "My Portfolio" widgets failed to load for some users with specific filter configurations under Gainsight Home. Users may have experienced the issue between January 09, 2023 - 08:45 UTC and January 10, 2023 - 00:50 UTC. **Root Cause :** A production change was performed on the 9th of January to fix an existing issue. As a result, My Portfolio under Gainsight Home failed to load if new global filters were added. **How this was missed:** As the mentioned fix was applied, multiple rounds of testing were performed. Unfortunately, we did not have test automation set up for the impacted use case. **Recovery Action :** Once the source of the issue was determined, we rolled back the affected module to resolve. Functionality was restored immediately after rollback. We will be rolling out a new, permanent fix in the coming release while taking the affected filters into consideration. ‌ **Preventive Measures:** Review and fix the edge-case scenario. Review of Synthetic Monitoring measures.

resolved

We have deployed a fix to resolve this issue. Please contact support@gainsight.com if you have any questions.

investigating

Thanks for your patience as we continue to investigate this issue.

investigating

Some customers may be experiencing an error when on Gainsight Home. We are aware of the issue and acting to correct it. More information will be available here soon.

Report: "Intermittent failures in loading of Reports, Cockpit, Timeline and Dashboards"

Last update
resolved

Today, 20 Dec 2022, after 12:50 UTC, a subset of customers would have experienced intermittent failures while loading Reports, Cockpit, Timeline, and Dashboards. Our team identified this issue and resolved it immediately. The issue was caused by a single node going faulty during a routine scale-up process. We take availability and performance seriously and are constantly evolving our failure detection workflows to ensure that we mitigate such issues in the future. We apologize for the inconvenience this issue may have caused. Please reach out to support@gainsight.com if you have questions.

Report: "Investigating issues with loading reports"

Last update
resolved

This incident has been resolved. We will follow up with RCA details as they become available.

investigating

We are investigating intermittent issues with Reports impacting a subset of customers.

Report: "Rules Engine Processing Delays for few customers"

Last update
resolved

Queues are back to normal operation. If you have questions, please open a ticket with us by sending a note to support@gainsight.com.

monitoring

We have identified an issue causing processing delays in the Rules Engine Queue for some Customers. We have taken recovery action to help clear the backlog and are closely monitoring as the Queues continue to drain. We anticipate a delay of about 4 hours in the processing of rules. If you have questions, please open a ticket with us by sending a note to support@gainsight.com.

Report: "Rules Engine Processing Delays for few customers"

Last update
resolved

Queues are back to normal operation. If you have questions, please open a ticket with us by sending a note to support@gainsight.com.

monitoring

Our rule processing infrastructure is running behind, which is causing delays in rule processing. No data has been lost and we are closely monitoring.

Report: "Retrospective : Issues While Loading Dashboards"

Last update
resolved

On 2022-09-14 between 11:28 & 12:01 UTC, loading of Dashboards failed intermittently for about 28 minutes for a subset of customers. Root Cause: A critical security patch was rolled out which caused an unexpected race condition. While our monitor health checks were successful, not all components were in a healthy state. This caused intermittent connectivity for users connecting to the impacted host. Recovery Action: The affected host was removed from the server fleet. We then reviewed logs and confirmed the status of the microservice. Preventive Measures: As our upstream service provider deploys patches on start-up, we have disabled this feature so we can analyze and implement an approach that fits our environment. We have also adjusted monitor health checks to catch issues like this moving forward.
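The retrospective notes that the monitor health checks passed even though not every component on the host was healthy. One common remedy, sketched here only as a hypothetical example with made-up endpoints, is a deep health check that verifies each dependency individually and reports unhealthy if any of them fails:

```python
import json
import sys
from urllib import request, error

# Hypothetical component endpoints; a real check would list the service's
# actual dependencies (cache, queue, database proxy, etc.).
COMPONENT_CHECKS = {
    "reporting": "http://localhost:8080/health/reporting",
    "cache": "http://localhost:8080/health/cache",
}

def check(url: str, timeout: float = 2.0) -> bool:
    """Return True only if the endpoint answers 200 with {"status": "ok"}."""
    try:
        with request.urlopen(url, timeout=timeout) as resp:
            body = json.loads(resp.read().decode("utf-8"))
            return resp.status == 200 and body.get("status") == "ok"
    except (error.URLError, ValueError):
        return False

if __name__ == "__main__":
    unhealthy = [name for name, url in COMPONENT_CHECKS.items() if not check(url)]
    if unhealthy:
        print("Unhealthy components: " + ", ".join(unhealthy))
        sys.exit(1)  # signal the load balancer to take this host out of rotation
    print("All components healthy")
```

Wired into a load balancer's health probe, a failure on any single component would pull the host from the fleet automatically instead of letting it serve intermittent errors.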

Report: "Rule Processing Delays"

Last update
resolved

The fix is working as expected. Please expect rule delays over the next hour as we recover.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

Our rule processing infrastructure is running behind, which is causing delays in rule processing. No data has been lost and we are investigating.