Historical record of incidents for Xplenty
Report: "Intermittent Dashboard Errors"
Last update: The incident has been resolved. We apologize for the inconvenience caused.
A fix has been implemented and we are monitoring the results. We apologize for the inconvenience caused.
We are currently investigating the issue which is causing the dashboard to return intermittent errors.
Report: "Jobs Stuck on Pending"
Last update: The incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating an issue which caused jobs to be stuck on pending.
Report: "Issues retrieving data from Bing Ads"
Last update: The incident has been resolved.
A fix has been implemented. Please reconnect your Bing Ads connection and rerun the jobs on a newly provisioned cluster.
Customers may experience issues with jobs that read data from Bing Ads. Our engineers are investigating the issues.
Report: "Partial Job Executions Failure"
Last update: Some jobs failed to execute on the clusters due to an issue with our upstream providers. The issue was resolved on April 13 at 12:30 UTC.
Report: "Some Clusters Stuck on Creating"
Last update: The incident has been resolved.
Clusters are now being created. We are continuing to monitor the issue.
We are seeing some clusters stuck on Creating in the Virginia region and have identified that this is due to an issue with our upstream provider. We will provide updates as soon as we have them.
Report: "Job Notification Issue Causing Job Failures"
Last update: From Feb 24th, 14:30 UTC to 19:41 UTC, an issue with our job notification service caused jobs to skip the Running state and display a blank runtime. While Dataflow packages may have run normally (pending further confirmation), Workflows were affected, leading to job failures. The issue has now been resolved. If needed, please rerun the jobs or use "Run One-Off" on the schedule page. A post-mortem will be released soon. We sincerely apologize for the inconvenience.
Report: "Job Failures on File Storage Destination"
Last update: From 10:25 UTC to 13:00 UTC, job failures occurred in packages using the file storage destination component due to a faulty deployment. The issue has been resolved. We apologize for any inconvenience this may have caused.
Report: "Dashboard Loading Issue"
Last update: This incident has been resolved.
We are aware of an intermittent issue affecting the dashboard. If you encounter this problem, please try refreshing the page. Our development team is actively investigating the root cause and will implement a fix as soon as possible. We apologize for the inconvenience and appreciate your patience.
Report: "API down due to an emergency maintenance of our upstream provider"
Last update: This incident has been resolved.
Our API is currently down due to emergency maintenance by our upstream provider. We will provide an update shortly.
Report: "Oregon SSH server down and experiencing job failures"
Last update: There was an issue with our SSH server in the Oregon region; the dev team applied a fix as of 21:40 UTC.
Report: "Dashboard and API Issues"
Last update: The issue was due to our upstream provider. The incident has been resolved.
We are currently investigating this issue.
Report: "Clusters Stuck on Creating State"
Last update: From Nov 12, 12:30 UTC until 14:00 UTC, we detected an issue in our cluster provisioning which caused clusters to be stuck on Creating due to a bad deployment. This should not have affected any scheduled jobs, but it may have delayed their start times. The issue has now been resolved and we apologize for the inconvenience caused.
Report: "Dashboard and API issues"
Last update: The incident has been resolved.
The dashboard is now up and we are monitoring.
The dashboard and API are currently not loading because of an issue with our upstream provider.
Report: "Intermittent Jobs and Clusters Stuck on Pending"
Last update:

### **Root Cause**
During our recent database maintenance on September 16th at 6 AM UTC, we encountered resource limitations from our upstream provider, which resulted in some worker tasks being missed.

### **Resolution and Mitigation**
* **Immediate Actions Taken:** We immediately stabilized the environment by restarting affected services and applications to minimize disruption.
* **Long-Term Measures:** To prevent this issue from happening again:
  * Implemented automatic termination of long-idle connections to free up resources.
  * Enhanced our monitoring for pending jobs, ensuring that any long-running tasks are promptly identified and addressed.

### **Preventive Actions**
* **Monitoring Improvements:** We have implemented monitoring for jobs stuck in a pending state, enabling us to remain proactive in addressing long-running tasks and responding before they impact operations.
* **Additional Measures:** We have increased the resources allocated to our database and are working closely with our upstream provider to ensure resource availability.

### **Next Steps**
We will continue to monitor the situation closely and make adjustments to workflows or settings as needed. Our team is committed to preventing future incidents of this nature, and we sincerely apologize for any inconvenience caused by this issue.
During our recent database maintenance, we encountered intermittent resource limitations from our upstream provider, which resulted in some worker tasks being missed. We have put measures in place and this issue is now resolved.
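One of the long-term measures above is the automatic termination of long-idle connections. A minimal sketch of what such a cleanup could look like is shown below; it assumes a PostgreSQL metadata database and the psycopg2 driver, and the 30-minute threshold is purely illustrative, since the report does not name the actual database or settings.

```python
# Hypothetical sketch: terminate database connections that have been idle
# longer than a threshold. Assumes PostgreSQL + psycopg2; the database,
# threshold, and credentials are illustrative, not details from the report.
import psycopg2

IDLE_LIMIT = "30 minutes"  # illustrative threshold

def terminate_long_idle_connections(dsn: str) -> int:
    """Terminate idle backends older than IDLE_LIMIT; return how many were killed."""
    # pg_terminate_backend requires a role allowed to signal other backends.
    query = """
        SELECT pg_terminate_backend(pid)
        FROM pg_stat_activity
        WHERE state = 'idle'
          AND pid <> pg_backend_pid()
          AND now() - state_change > %s::interval
    """
    conn = psycopg2.connect(dsn)
    try:
        with conn.cursor() as cur:
            cur.execute(query, (IDLE_LIMIT,))
            return cur.rowcount
    finally:
        conn.close()

if __name__ == "__main__":
    killed = terminate_long_idle_connections("dbname=app user=maintenance")
    print(f"terminated {killed} idle connections")
```

In practice a job like this would run on a schedule, with its counts feeding into the pending-job monitoring the post-mortem describes.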
Report: "Intermittent Dashboard and API Issues"
Last update: The updates appear to have resolved the issue and the incident is now resolved.
A mitigation has been implemented and we are monitoring the results. The issue may have been caused by a database resource constraint, but we have yet to confirm this. Thank you for your patience and we apologize for the inconvenience caused.
We are continuing to investigate the issue with our upstream provider.
We are currently investigating this issue, which may have been caused by the earlier database migration maintenance.
Report: "Increased Cluster Creation Times"
Last update: The incident has been resolved. We apologize for the inconvenience.
We have identified the issue and put a fix in place; cluster provisioning times should be back to normal now. We are monitoring the situation.
We are currently investigating an issue with increased cluster creation time.
Report: "Dashboard and API issues"
Last update: The downtime was caused by a bad deployment. We have rolled back the deployment for the time being. We apologize for the inconvenience caused.
We are currently investigating this issue.
Report: "Clusters and Jobs Stuck on Pending, Long Running Package Validation"
Last update: This incident has been resolved. Upon investigation, we identified that the root cause of this issue lies with our upstream provider.
A fix has been implemented and we are monitoring the results.
We are continuing to investigate this issue.
We are currently investigating this issue.
Report: "Dashboard and API issues"
Last update: This incident has been resolved.
We are continuing to monitor for any further issues.
A fix has been implemented and we are monitoring the results.
The dashboard and the API are currently not accessible due to an issue with our upstream provider. We are in touch with them to expedite a resolution.
Report: "Dashboard and API Issues"
Last update: The incident has been resolved.
Applications are now starting to go online and jobs should now run fine. We are monitoring further.
The dashboard and the API are currently not accessible due to an issue with our upstream provider. We are in touch with them to expedite a resolution.
Report: "Jobs with Redshift connections are failing"
Last update: This incident has been resolved.
The issue has been fixed and Redshift Source is fetching the schema properly.
We noticed that jobs with Redshift connections are unable to fetch the schema and we are working on it.
Report: "Failed Jobs due to Salesforce Destination Issue - 'No SLF4J providers were found.'"
Last update: This incident has been resolved. Thank you for your patience.
We are aware of failed jobs with our Salesforce destination and are currently investigating the issue. This issue started at around 12:00 PM UTC and we are working on getting this resolved as soon as possible.
Report: "Failed Jobs On Newly Created Cluster"
Last update: From 7:00 AM UTC to 7:51 AM UTC, an issue was detected which caused jobs run on a newly provisioned cluster to fail. A fix has been rolled out and job runs should now be back to normal. We apologize for the inconvenience caused.
Report: "Scheduled Jobs Were Not Triggered"
Last update: From 3:25 UTC until 6:31 UTC, an issue was detected which caused scheduled jobs not to run. We've issued a fix and scheduled jobs should now be back up and running.
Report: "Xplenty dashboard not accessible"
Last update: A fix has been rolled out and the dashboard should now be back up and running.
Currently, the Xplenty dashboard at https://app.xplenty.com/ is not accessible. We are looking into the issue.
Report: "Issue with Facebook API Connections"
Last update: Meta has approved and reinstated our OAuth application, and this issue has been resolved. Please reconnect your existing Facebook Ads Insights connections; jobs should run fine moving forward.
Unfortunately, we still have not received an update from Meta regarding the review of our app. They took it offline, saying they needed to review it, and we have responded to their requests three times in the last 10 days, but we are still not getting any response beyond automated emails each time we submit an appeal. We are doing all we can to speak with someone at Meta to get an update and a resolution. Apologies again for the impact caused here; it's incredibly frustrating for us too. We hope to be able to share a more positive update by the end of the week.
We are currently resolving this issue.
Report: "Jobs are in pending status"
Last update: This incident has been resolved.
We are currently investigating this issue.
Report: "Intermittent connection issue to REST API and Database connections"
Last update: This incident has been resolved.
We are continuing to work on a fix for this issue.
Our engineering team has identified that the connection issues are only impacting our Virginia region. We will have more updates as they become available.
We are continuing to investigate this issue.
We have received reports of intermittent connection issues affecting REST API and database connections. Our engineering team is investigating and will provide an update as soon as we have more information.
Report: "Dashboard and API Downtime"
Last update: From 8:46 AM UTC to 9:09 AM UTC, an issue was detected which caused the API and dashboard to be down. A fix has been rolled out and the components should now be back up and running. This should not have affected scheduled jobs.
Report: "Proxy Issue due to upstream provider on Sydney region"
Last update: From 11:54 PM UTC to 12:01 AM UTC, an issue was detected on our proxy server in the Sydney region which caused jobs with database connections to fail. The root cause was a hardware issue with our upstream provider. A fix has been implemented and jobs should now work as expected.
Report: "Intermittent Job Failures and Clusters Stuck on Pending"
Last update:

### **Issue Summary**
From 7:11 AM UTC to 8:58 AM UTC, an intermittent number of jobs and clusters were stuck on pending or returned errors.

### **Root Cause**
The root cause of this outage was our Redis component reaching 100% memory usage, which caused the intermittent issues. Redis is used as a caching mechanism for our application.

### **Resolution and recovery**
Here are the steps we are taking to ensure that the incident does not happen again:
* Vertically scaled up Redis for more memory.
* Improved monitoring so we can quickly detect Redis-related memory issues.

We appreciate your patience and again apologize for the impact to you, your users, and your organization. We thank you for your business and continued support.

Sincerely,
[Integrate.io](http://integrate.io/) Engineering
Beginning at approximately 12:40 AM UTC until 2:30 AM UTC, there was an issue in one of our infrastructure components used for caching, which affected clusters and jobs provisioned during that time period. The issue has now been fixed by our engineers.
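The resolution above mentions improved monitoring to quickly detect Redis-related memory issues. A hedged sketch of such a check is below; it assumes the redis-py client, and the 90% threshold and print-based alert are illustrative placeholders rather than Integrate.io's actual tooling.

```python
# Hypothetical sketch: warn when Redis memory usage approaches its limit.
# Assumes the redis-py client; the threshold and alert mechanism are illustrative.
import redis

THRESHOLD = 0.90  # alert at 90% of maxmemory (illustrative)

def check_redis_memory(host: str = "localhost", port: int = 6379) -> None:
    client = redis.Redis(host=host, port=port)
    info = client.info("memory")
    used = info["used_memory"]
    limit = info.get("maxmemory", 0)
    if limit and used / limit >= THRESHOLD:
        # Replace with a real pager/alerting integration in practice.
        print(f"ALERT: Redis at {used / limit:.0%} of maxmemory ({used} / {limit} bytes)")
    else:
        print(f"Redis memory OK: {used} bytes used")

if __name__ == "__main__":
    check_redis_memory()
```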
Report: "Issues with upstream DNS provider"
Last update: This incident has been resolved.
We are experiencing issues with an upstream DNS provider impacting login access. Please contact our support team for immediate assistance.
Report: "Failing jobs on Salesforce destination"
Last update: The incident occurred between 7:11 AM UTC and 8:58 AM UTC and has already been resolved. Please rerun the package on a newly created cluster to overcome errors with the Salesforce destination.
Report: "Jobs failing due to JDBC connection issues."
Last update:

### **Issue Summary**
From 15:01 UTC to 17:19 UTC, our Virginia proxy server became inaccessible. Due to this, all jobs with database and SFTP connections on this proxy failed. The issue was caused by our upstream provider. Customers in our other regions were unaffected.

### **Root Cause**
The root cause of this outage was an issue with the upstream provider on the particular instance.

### **Resolution and recovery**
Here are the steps we are taking to ensure that the incident does not happen again:
* Improve fault tolerance with automated proxy server failover so that there will be minimal downtime if a proxy hardware issue recurs.

We appreciate your patience and again apologize for the impact to you, your users, and your organization. We thank you for your business and continued support.

Sincerely,
[Integrate.io](http://Integrate.io) Engineering
This incident is now resolved. Jobs are running fine.
We are currently investigating this issue.
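The mitigation above calls for automated proxy server failover. One minimal way to approximate that is a health check that falls back to a standby endpoint when the primary is unreachable, as sketched below; the hostnames, port, and timeout are assumptions for illustration and not details from the report.

```python
# Hypothetical sketch: pick the first reachable proxy endpoint, so traffic can
# fail over to a standby if the primary proxy is unreachable. Hostnames, port,
# and timeout are illustrative; the report does not describe the real mechanism.
import socket

PROXY_CANDIDATES = [
    ("proxy-primary.example.internal", 1080),
    ("proxy-standby.example.internal", 1080),
]

def pick_healthy_proxy(timeout: float = 3.0) -> tuple[str, int]:
    for host, port in PROXY_CANDIDATES:
        try:
            with socket.create_connection((host, port), timeout=timeout):
                return host, port
        except OSError:
            continue  # endpoint unreachable; try the next candidate
    raise RuntimeError("no healthy proxy endpoint available")

if __name__ == "__main__":
    host, port = pick_healthy_proxy()
    print(f"routing database/SFTP tunnels through {host}:{port}")
```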
Report: "Jobs are in pending status"
Last update: Clusters are up and running. We have fixed the issue.
We are currently investigating the issue.
Report: "Jobs are failing and connections not working."
Last update: This incident has been resolved.
We are currently investigating this issue.
Report: "Dashboard and API Offline"
Last update: The dashboard and APIs are working fine now. RCA: Our dashboard and API were offline due to an issue with our upstream provider. All looks good now.
The Dashboard and API are working fine.
Our upstream hosting provider (Heroku) still seems to be having issues. We are monitoring further.
The dashboard and API seem to be recovering. We are monitoring further.
Our upstream provider is currently investigating the issue.
Our dashboard and API are currently offline due to an issue with our upstream provider. We are currently waiting for updates from the said provider.
Report: "Apache Log4j2 issue"
Last update: This incident has been resolved.
Xplenty is aware of the recently disclosed security issue affecting the open-source Apache "Log4j" utility (CVE-2021-44228). At this time, we can confirm that Xplenty is NOT impacted by this CVE. We strongly encourage customers who manage environments containing Log4j to update to the latest version.
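For customers who do manage environments containing Log4j, a rough way to inventory copies on a host is to list log4j-core JARs and their versions so they can be compared against the Apache advisory for CVE-2021-44228. The sketch below is only illustrative (it detects versions embedded in file names and nothing else) and is not an official Xplenty tool.

```python
# Hypothetical helper: list log4j-core JARs found under a directory tree so
# their versions can be checked against the Log4j advisories for CVE-2021-44228.
# Only detects versions embedded in file names; not an exhaustive scanner and
# not an official Xplenty utility.
import re
from pathlib import Path

JAR_PATTERN = re.compile(r"log4j-core-(\d+\.\d+(?:\.\d+)?)\.jar$")

def find_log4j_jars(root: str) -> list[tuple[Path, str]]:
    hits = []
    for path in Path(root).rglob("log4j-core-*.jar"):
        match = JAR_PATTERN.search(path.name)
        if match:
            hits.append((path, match.group(1)))
    return hits

if __name__ == "__main__":
    for path, version in find_log4j_jars("/opt"):
        print(f"{path}: log4j-core {version} -- compare against the Apache advisory")
```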
Report: "Job failures in the Ireland region"
Last update: This incident has been resolved.
The incident has been resolved. Jobs are running normally in the Ireland region.
We are continuing to investigate this issue.
We are investigating an increase in job failures in the Ireland region.
Report: "Clusters Stuck on Creation"
Last update: From June 26th at 11:30 PM UTC until 07:14 AM UTC, an issue was identified which caused clusters to be stuck in the creating state. Clusters became available at around 07:14 AM UTC after a fix was put in place. The root cause was a rare faulty message that was stuck in our message queue mechanism. We have found the reason and implemented a fix to ensure this does not happen again. We have also added a mechanism to handle the faulty-message scenario in case something similar happens in the future (this incident revealed it to be a single point of failure). We apologize for the inconvenience caused; please do reach out if there's anything we can do to help.
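The fix above adds a mechanism to address the faulty-message scenario. A common pattern for that class of problem is a dead-letter guard that parks a repeatedly failing message instead of letting it block the queue; the sketch below illustrates the idea with hypothetical names and a made-up retry budget, since the report does not describe the actual queue technology.

```python
# Hypothetical sketch of a poison-message guard: if a queue message keeps
# failing, move it to a dead-letter store instead of letting it block the
# queue. The message shape, retry limit, and helpers are illustrative.
MAX_ATTEMPTS = 3  # illustrative retry budget per message

def requeue(message):
    """Placeholder: in a real system this would re-enqueue the message."""
    pass

def process_with_dead_letter(message, handler, dead_letter_store):
    attempts = message.get("attempts", 0)
    try:
        handler(message["body"])
    except Exception as exc:
        if attempts + 1 >= MAX_ATTEMPTS:
            # Park the faulty message so it cannot block further processing.
            dead_letter_store.append({"body": message["body"], "error": str(exc)})
        else:
            message["attempts"] = attempts + 1
            requeue(message)

if __name__ == "__main__":
    def always_fails(body):
        raise ValueError("bad payload")

    dlq: list = []
    process_with_dead_letter({"body": "provision-cluster-42", "attempts": 2}, always_fails, dlq)
    print("dead-lettered:", dlq)
```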
Report: "Dashboard Issues"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
We are currently investigating issues with the dashboard not loading.
Report: "Job Failures on MySQL Destination"
Last update: This incident has been resolved.
From 4:40 UTC, jobs with a MySQL destination started to fail due to a bad deployment. We have rolled out a fix to remediate the issue and are currently monitoring. We apologize for the inconvenience caused.
Report: "Clusters Stuck on Creation"
Last update: From 7:55 AM UTC to 11:02 AM UTC, an issue was identified which caused clusters to be stuck in the creating state. A fix has now been implemented and jobs that were waiting for a cluster should now run and succeed. Apologies for the inconvenience caused.
Report: "Connectivity and related Job Failures"
Last update:

### **Issue Summary**
From 17:13 UTC to 19:15 UTC, a Virginia proxy server became overloaded. Due to this, all jobs with database and SFTP connections on this proxy failed. The proxy server had an out-of-memory issue caused by an increased spike in connections. Customers in our other regions were unaffected.

### **Root Cause**
The root cause of this outage was a spike in connections causing the proxy server to run out of memory.

### **Resolution and recovery**
Here are the steps we are taking to ensure that the incident does not happen again:
* Doubled the proxy server’s memory.
* Adjusted the proxy server’s memory swap settings.

We appreciate your patience and again apologize for the impact to you, your users, and your organization. We thank you for your business and continued support.

Sincerely,
Xplenty Engineering
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
We have noted connectivity issues and job failures showing a connection refused error. We are currently investigating this issue.
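The post-mortem above attributes the outage to a spike in connections that exhausted the proxy server's memory. A hedged sketch of a complementary check, counting established connections to an assumed proxy port with psutil, is shown below; the port and alert threshold are illustrative, not values from the report.

```python
# Hypothetical sketch: watch the number of established connections to a proxy
# port and warn when a spike could push the host toward its memory limit.
# The port, threshold, and psutil usage are illustrative assumptions; listing
# all sockets may require elevated privileges on some systems.
import psutil

PROXY_PORT = 1080              # illustrative
CONNECTION_ALERT_LIMIT = 5000  # illustrative

def count_proxy_connections(port: int = PROXY_PORT) -> int:
    return sum(
        1
        for conn in psutil.net_connections(kind="tcp")
        if conn.laddr and conn.laddr.port == port and conn.status == "ESTABLISHED"
    )

if __name__ == "__main__":
    active = count_proxy_connections()
    if active >= CONNECTION_ALERT_LIMIT:
        print(f"ALERT: {active} established proxy connections")
    else:
        print(f"proxy connections OK: {active}")
```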
Report: "Connectivity and related Job Failures"
Last update:

Yesterday we experienced an issue in the Virginia region causing jobs with database and SFTP connections to fail. Today we are providing an incident report that details the nature of the outage and our response. We understand this service issue has impacted our valued customers, and we apologize to everyone who was affected.

### **Issue Summary**
From 18:17 UTC to 23:00 UTC, the Xplenty proxy server in Virginia was down. Due to this, all jobs with database and SFTP connections failed. The Virginia proxy server had a memory issue caused by an increased spike in connections. Customers in other regions and their jobs were unaffected.

### **Root Cause**
The root cause of this outage was an increased spike in connections that overloaded our proxy server's memory.

### **Resolution and recovery**
Here are the steps we are taking to ensure that the incident does not happen again:
* Soft limits on connection tunnels per account to avoid proxy server congestion.
* Improved monitoring which shows active connections with better granularity.

Xplenty is committed to continually improving our technology and operational processes to prevent outages. We appreciate your patience and again apologize for the impact to you, your users, and your organization. We thank you for your business and continued support.

Sincerely,
Xplenty Engineering
The incident has been resolved.
We are continuing to monitor the system. No further issues so far.
A fix has been implemented and jobs should continue working. We are currently monitoring the system.
Our team has identified the issue and is working to roll out a fix
We are continuing to investigate this issue.
We have noted database connectivity issues and job failures showing a connection refused error. We are currently investigating this issue.
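The first mitigation above, soft limits on connection tunnels per account, can be pictured as a counter that is consulted before a new tunnel is opened. The sketch below uses hypothetical names, an in-memory store, and an arbitrary limit of 50; the report does not describe the real implementation.

```python
# Hypothetical sketch: enforce a soft per-account limit on open connection
# tunnels before accepting a new one. Names, the limit, and the in-memory
# store are illustrative assumptions.
from collections import defaultdict

SOFT_TUNNEL_LIMIT = 50  # illustrative per-account limit

class TunnelRegistry:
    def __init__(self, limit: int = SOFT_TUNNEL_LIMIT):
        self.limit = limit
        self.open_tunnels: dict[str, int] = defaultdict(int)

    def try_open(self, account_id: str) -> bool:
        """Return True if the account may open another tunnel."""
        if self.open_tunnels[account_id] >= self.limit:
            return False  # soft limit hit; queue or reject the new tunnel
        self.open_tunnels[account_id] += 1
        return True

    def close(self, account_id: str) -> None:
        self.open_tunnels[account_id] = max(0, self.open_tunnels[account_id] - 1)

if __name__ == "__main__":
    registry = TunnelRegistry()
    print("first tunnel allowed:", registry.try_open("acct-123"))
```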
Report: "Pending Jobs on Oregon Region"
Last update:

Last Friday we experienced an issue in the Oregon region causing jobs to be stuck on pending, though they eventually ran. Today we are providing an incident report that details the nature of the outage and our response. The following is an incident report for the Pending Jobs on Oregon Region incident that occurred on Friday, August 28th 2020. We understand this service issue has impacted our valued customers, and we apologize to everyone who was affected.

### Issue Summary
From 11:30 UTC to 16:06 UTC, the Xplenty job monitoring service in the Oregon region was down. Due to this, no new customer jobs could be started on Xplenty’s infrastructure and they were stuck in a pending state. The deployment in charge of the job monitoring service couldn’t scale up due to network interface limits relative to our instance count.

### Timeline (all times UTC)
* 11:30 UTC: Jobs stuck on pending in the Oregon region; downtime begins.
* 11:30 UTC: PagerDuty alerts the team and the investigation begins.
* 12:00 UTC: Xplenty contacts our cloud provider to check for any issues.
* 12:30 UTC: Job & cluster processing engines are operational.
* 13:45 UTC: Xplenty tweaks the autoscaler configuration.
* 13:50 UTC: 100% of service is restored and operational.

### Root Cause
The root cause of this outage was that our deployment couldn’t scale up due to an autoscaler misconfiguration.

### Resolution and recovery
The Xplenty development team has tweaked the configuration to ensure that the incident does not happen again. Xplenty is committed to continually improving our technology and operational processes to prevent outages. We appreciate your patience and again apologize for the impact to you, your users, and your organization. We thank you for your business and continued support.

Sincerely,
Xplenty Engineering
The incident has been resolved.
Jobs should now continue running fine. We are currently assessing the root cause and will be providing an update.
We currently have jobs pending in the Oregon region. We are investigating the issue.
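The root cause above (the deployment could not scale up due to network interface limits relative to the instance count) boils down to a capacity calculation. The sketch below works through an EKS-style pods-per-node bound purely as an illustration; the instance figures and the formula are assumptions about a typical AWS setup, not numbers from the report.

```python
# Hypothetical capacity check, illustrating the kind of limit described in the
# post-mortem: with ENI-based networking, the pods a node can host are bounded
# by its network interfaces and IPs per interface. The numbers below are
# illustrative, not Xplenty's actual settings.
ENIS_PER_NODE = 3   # network interfaces per node (illustrative)
IPS_PER_ENI = 10    # secondary IPs per interface (illustrative)
NODE_COUNT = 4      # current instance count (illustrative)

def max_pods_per_node(enis: int, ips_per_eni: int) -> int:
    # EKS-style formula: ENIs * (IPs per ENI - 1) + 2
    return enis * (ips_per_eni - 1) + 2

def can_scale_to(desired_pods: int) -> bool:
    capacity = NODE_COUNT * max_pods_per_node(ENIS_PER_NODE, IPS_PER_ENI)
    return desired_pods <= capacity

if __name__ == "__main__":
    print("cluster pod capacity:", NODE_COUNT * max_pods_per_node(ENIS_PER_NODE, IPS_PER_ENI))
    print("can scale job monitor to 150 pods?", can_scale_to(150))
```

If the desired replica count exceeds this bound, an autoscaler configured without regard to it will stall, which matches the behavior described in the report.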
Report: "Jobs Failure on Packages With File Storage Component"
Last update: Between 6:37 AM UTC and 8:32 AM UTC, the system had issues running jobs containing file storage components, which caused those jobs to fail without any logs. The root cause was a bad deployment and a fix has been put in place. We apologize for the inconvenience caused.
Report: "Xplenty API is slow and jobs are pending"
Last update: The issue has been resolved.
A fix has been implemented and we are monitoring the results.
The Xplenty API is slow and jobs are pending. We are investigating this.
Report: "Connectors Issue"
Last update: All the connections are working fine.
We have applied the fix and are currently monitoring the issue.
While we are still working on a fix, we recommend reconnecting the affected connections and rerunning the packages.
We are currently having issues with Salesforce, Google, and Bing Ads connections. We are investigating this.
Report: "Jobs Update Issue"
Last update: This incident has been resolved.
We are continuing to monitor for any further issues.
The issue has been identified and a fix has been implemented. We are monitoring for further issues.
We are having issues with our job update handler due to the API server maintenance a few hours ago. We have put the system into maintenance mode while we investigate the issue.