Is Simplero Down Right Now? Discover if there is an ongoing service outage.

Simplero is currently Operational

Last checked Jul 29, 2025 14:40 UTC from Simplero's official status page

Historical record of incidents for Simplero

Jun 19, 2025

Report: "Microsoft has decided to mark some of our domains as "suspicious""

Last update 2025-06-19T13:35:23.697Z

identified2025-06-19T13:30:30.000Z

There's no good reason for this, as far as we're aware. It's happened once before, many moons ago, and it was a fluke. It's something they do from time to time, and there's little we can do. We've reached out to them and we hope they'll get it fixed soon. In the meantime, please:
* mark the site as 'safe'
* tell your clients to do the same
* tell your clients to ignore the warning or use another browser
Telling Microsoft the site is safe will hopefully help change their minds. We're super sorry about this, but sometimes these giant corporations do what they do and we can't stop them.

Jun 17, 2025

Report: "Broken dashboard and event pages on sites"

Last update 2025-06-17T19:06:01.121Z

identified2025-06-17T19:06:00.789Z

We are continuing to work on a fix for this issue.

identified2025-06-17T19:05:51.850Z

We have identified the release that broke those pages and are working to resolve the issue ASAP.

May 7, 2025

Report: "AI Chat Bot on sites not working"

Last update 2025-05-07T13:51:10.889Z

resolved2025-05-07T13:51:10.575Z

AI Chat Bot on sites should be working again.

identified2025-05-07T12:56:18.114Z

Actually, it was our own boo-boo. We deleted a team-member who no longer works here from our OpenAI account, but our servers were using credentials associated with their user. We are changing the credentials right now. Expect everything to be working in 20-25 minutes.

identified2025-05-07T12:41:46.586Z

The service we use for chat bot (OpenAI) is currently down. As a result the AI chat bot on sites is currently not working.

Apr 24, 2025

Report: "Issues with email deliveries"

Last update 2025-04-24T06:05:03.200Z

resolved2025-04-24T06:05:02.890Z

This incident has been resolved.

identified2025-04-23T22:36:32.370Z

Our email provider Sendgrid is dealing with an incident which may delay email deliveries.

Mar 18, 2025

Report: "All pages listing course lessons are currently broken -- a fix is being deployed and should be out in 8 minutes..."

Last update 2025-03-18T13:00:02.591Z

resolved2025-03-18T13:00:02.285Z

All is working in the land of Simplero again.

investigating2025-03-18T12:43:40.392Z

We are currently investigating this issue.

Dec 26, 2024

Report: "Email Sending is Down"

Last update 2024-12-26T02:15:36.456Z

resolved2024-12-26T02:01:31.000Z

Our emails have been unsuspended and they should be up and running again. Emails sent during the suspension have now been delivered.

monitoring2024-12-26T01:49:19.552Z

Our emails have been unsuspended and they should be up and running again. We are working to confirm if emails sent during the suspension will still be sent or if they will need to be resent.

identified2024-12-25T21:00:23.378Z

We're in touch with several people at Twilio, but no one is able to actually do anything because it's Christmas here in the US. It's pretty remarkable that a $17Bn market cap ~10,000 person company cannot find a single person who's able to flip a simple switch to rectify an obvious mistake. But that's where we're at. We've also switched over transactional email (login information, receipts/invoices, forgotten password, etc.) to use the channel that does let emails go through. More details in the community: https://simplero.community/forum/posts/193635-email-down

identified2024-12-25T14:01:31.000Z

We now know the reason for the suspension (a phishing e-mail sent to one of our members which was forwarded by our systems to the same member as a notification email). We are still waiting on our email delivery system to restore our account.

investigating2024-12-25T11:28:01.000Z

Emails are not being delivered. Our email delivery system suddenly suspended our email sending without a clear reason. We have asked for urgent support from them and are waiting for a response.

Nov 17, 2024

Report: "Simplero is down"

Last update 2024-11-17T16:07:51.100Z

resolved2024-11-17T16:07:50.791Z

One of our webservers (out of 10) went down for ~45 minutes. We've restarted it so the problem should be fixed. Weirdly enough, our automatic alerts didn't catch this downtime. We'll continue to monitor and figure out a way to set up automatic alerts for this case so we're alerted early on.

investigating2024-11-17T15:40:46.439Z

Some people are unable to access Simplero. We are investigating the issue.

Nov 14, 2024

Report: "Simplero is down"

Last update 2024-11-14T10:13:05.260Z

resolved2024-11-14T10:13:04.846Z

We've resolved the issue and everything should be back to normal.

identified2024-11-14T07:53:03.125Z

Our engineers are working on a spam traffic attack that's bringing us down.

Nov 9, 2024

Report: "Simplero is down"

Last update 2024-11-09T00:32:43.811Z

resolved2024-11-09T00:32:43.509Z

This incident has been resolved.

monitoring2024-11-09T00:32:28.459Z

We are continuing to monitor for any further issues.

monitoring2024-11-09T00:16:22.708Z

A fix has been implemented and we are monitoring the results.

investigating2024-11-09T00:11:28.517Z

We are currently investigating this issue.

Oct 23, 2024

Report: "Background processing and API is down"

Last update 2024-10-23T11:02:57.179Z

resolved2024-10-23T11:02:55.850Z

The email stats and other stuff are still catching up and will be updated very soon.

monitoring2024-10-23T04:52:20.961Z

A fix has been implemented and we are monitoring the results.

investigating2024-10-23T02:20:27.923Z

We are continuing to investigate this issue.

investigating2024-10-23T01:32:05.415Z

We are investigating the issue

Oct 18, 2024

Report: "Email Delivery Delays"

Last update 2024-10-18T12:28:27.785Z

resolved2024-10-18T12:28:27.480Z

This incident has been resolved.

identified2024-10-18T09:58:13.768Z

We are currently experiencing an issue impacting our email delivery system. Users may notice delays in receiving emails sent through our platform. Current Status: Our engineering team has identified the root cause as an unexpected surge in email load, leading to a bottleneck in our processing queues. We are actively working on fixing it.

Oct 17, 2024

Report: "Simplero is down"

Last update 2024-10-17T08:30:13.985Z

resolved2024-10-17T08:30:13.662Z

This incident has been resolved.

investigating2024-10-17T08:09:47.856Z

We are currently investigating the issue.

Jul 28, 2024

Report: "Errors on Site admin pages"

Last update 2024-07-28T14:49:51.324Z

resolved2024-07-28T14:49:50.979Z

All done. So sorry about this.

identified2024-07-28T14:20:16.873Z

All admin pages for sites not on the new experience are throwing errors right now. A fix is going out. Should be all done within 15-20 minutes. (That's how long it takes to deploy and update.)

Jul 12, 2024

Report: "Course overview pages are currently broken"

Last update 2024-07-12T19:42:27.539Z

resolved2024-07-12T19:42:27.195Z

All fixed. So sorry about that.

monitoring2024-07-12T19:24:53.073Z

A fix is going out right now. The courses themselves are fine, but the overview page is throwing a 500 server error.

Jul 9, 2024

Report: "Search Functionality Disruption"

Last update 2024-07-09T12:32:16.582Z

resolved2024-07-09T12:32:16.221Z

This incident has been resolved.

monitoring2024-07-09T11:43:22.008Z

Search should be working as expected, we are monitoring for any issues.

identified2024-07-09T06:21:49.000Z

We are currently experiencing an issue with our search functionality. Our team is aware of the problem and is working diligently to resolve it as soon as possible. We apologize for any inconvenience this may cause and appreciate your patience.

Jun 13, 2024

Report: "Simplero is down"

Last update 2024-06-13T09:33:28.641Z

resolved2024-06-13T09:33:28.629Z

We have restarted our database which got us back!

investigating2024-06-13T08:24:57.000Z

Seems to be affecting our database which is causing all Simplero admin and user facing pages to be down. The Engineering team is investigating.

Apr 25, 2024

Report: "FontAwesome is Down 👎"

Last update 2024-04-25T21:53:23.439Z

resolved2024-04-25T21:53:23.052Z

FontAwesome is back as well as all the fabulous icons and texts 💃💪

investigating2024-04-25T19:39:58.136Z

FontAwesome is down 😞 This is affecting fonts and icons used in Simplero. Go here to see their status updates: https://status.fortawesome.com/ We'll do our best to update as we get more information 👷

Dec 1, 2023

Report: "Simplero is down right now.."

Last update 2023-12-01T00:04:59.705Z

resolved2023-12-01T00:04:59.247Z

This incident has been resolved.

monitoring2023-12-01T00:01:39.396Z

We are continuing to monitor for any further issues.

monitoring2023-12-01T00:01:04.604Z

A fix has been implemented and we are monitoring the results.

investigating2023-11-30T23:50:01.711Z

We are currently investigating this issue.

Nov 4, 2023

Report: "Email Delivery Delay"

Last update 2023-11-04T08:43:38.981Z

resolved2023-11-04T08:43:38.962Z

This incident has been resolved.

identified2023-11-04T06:31:54.477Z

We are currently experiencing issues with our email provider, which has resulted in delays in email delivery. Outgoing emails may be affected. Our technical team is actively working on resolving this issue and is in communication with the email provider.

Oct 26, 2023

Report: "Investigating issues accessing the platform"

Last update 2023-10-26T17:39:31.936Z

resolved2023-10-25T12:48:28.000Z

This incident has been resolved. What happened? We created a new API endpoint and this was used at a much higher rate than we were anticipating. This created a logjam amongst our backend processing which spilled over to page loads. We are so sorry for that! We have now added rate-limiting to this endpoint and are modifying it in a way that prevents this from happening again.
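The update above does not say how the rate limit was implemented. As an illustration only, here is a minimal sketch of one common approach, a token bucket. The class name, parameters, and injectable clock are all hypothetical and are not taken from Simplero's code.

```python
import time


class TokenBucket:
    """Allow bursts of up to `capacity` requests, refilled at `rate` tokens per second.

    Hypothetical sketch: the names and defaults here are assumptions.
    """

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock              # injectable so the limiter can be tested
        self.tokens = float(capacity)   # start with a full bucket
        self.updated = clock()

    def allow(self):
        """Return True if the request may proceed, False if it should be rejected."""
        now = self.clock()
        # Refill in proportion to elapsed time, never above capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

An endpoint handler would call `allow()` once per request and return an HTTP 429 response when it gets `False`, so that a burst against one endpoint cannot starve the shared backend workers.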

identified2023-10-25T12:22:03.757Z

We've identified an issue that may be the cause of the down time. We are deploying a fix and will continue to monitor.

investigating2023-10-25T11:31:05.058Z

We are currently investigating this issue.

May 9, 2023

Report: "We have a problem, we’re working on it, it seems to be affecting checkout pages and video assets"

Last update 2023-05-09T16:46:20.463Z

resolved2023-05-09T16:46:20.038Z

This incident has been resolved.

monitoring2023-05-09T16:36:54.839Z

A fix has been implemented and we are monitoring the results.

investigating2023-05-09T16:09:52.071Z

Our "pusher", which handles the "purchase processing" screen, video encoding, and transcript generation, is extremely busy at the moment. The following are expected to be affected:
1. The "purchase processing" screen will not automatically move on: the user purchasing from your site will need to click the link to force the redirect.
2. Video encoding status will not automatically update in your dashboard but the encoding will still process: you'll just need to refresh the page to see it updated.
3. Video transcription status will not automatically update in your dashboard but it will be generated in the background: you'll just need to refresh the page to see it updated.

Mar 17, 2023

Report: "Instagram feeds are down"

Last update 2023-03-17T15:23:19.475Z

resolved2023-03-17T15:23:19.461Z

Instagram feeds were working again as of Feb 16th. Did we remember to update this? No. No we did not.

monitoring2023-02-02T21:40:44.919Z

Our integration with Instagram is currently being reviewed by Meta. We’ve submitted the information we need to submit and the Instagram feed section should start working again within 2-3 days. Please hide your Instagram sections for now.

Feb 5, 2023

Report: "Attachments (image/file uploads/mentions) on comments/forum posts uploaded 2 days ago not being displayed"

Last update 2023-02-05T18:31:34.406Z

resolved2023-02-05T18:31:33.857Z

We've fixed attachments and mentions posted between February 3 and 5. All attachments and mentions should be functional again.

identified2023-02-05T17:33:25.371Z

We have fixed the issue for attachments and mentions posted before February 3 and all those posted going forward. We are working on a fix for those posted between February 3 and 5.

identified2023-02-05T17:04:58.752Z

The issue has been identified and a fix is being implemented.

Aug 26, 2022

Report: "Website degraded performance"

Last update 2022-08-26T21:15:46.209Z

resolved2022-08-26T19:30:00.000Z

Website performance was degraded for about 30 minutes. It has gone back to normal. We subsequently found the root cause and fixed it.

Aug 10, 2022

Report: "All sites showing error code"

Last update 2022-08-10T23:22:29.370Z

resolved2022-08-10T20:40:14.000Z

Fixed.

identified2022-08-10T20:28:20.394Z

Will show a message like "ERROR: undefined method `google?' for nil:NilClass" or show the site without any styles at all. A fix is currently being deployed. So sorry about this.

Jul 27, 2022

Report: "Database upgrade has stalled Broadcast and Email sendings"

Last update 2022-07-27T09:27:17.893Z

resolved2022-07-27T09:27:17.485Z

This incident has been resolved.

monitoring2022-07-27T09:18:12.017Z

A fix has been implemented and we are monitoring the results.

identified2022-07-27T08:58:55.046Z

The issue has been identified and a fix is being implemented.

investigating2022-07-27T07:29:08.849Z

We are currently investigating why Broadcasts and Emails are not sending after our Database upgrade. We will update you as soon as possible.

Dec 22, 2021

Report: "Looks like AWS is down"

Last update 2021-12-22T15:57:37.061Z

resolved2021-12-22T15:57:35.890Z

AWS should be back online

monitoring2021-12-22T13:13:08.641Z

We have disabled a part of our logging service that depends on the affected AWS region. Everything on Simplero should be working again. We are continuously monitoring for other issues that may come up - none so far.

investigating2021-12-22T12:39:50.335Z

We are continuing to investigate this issue.

investigating2021-12-22T12:39:15.052Z

Looks like Amazon Web Services is having issues causing outages on Simplero. We are investigating further...

Oct 13, 2021

Report: "Emails may not be sending"

Last update 2021-10-13T14:25:44.671Z

resolved2021-10-13T14:25:31.000Z

This issue appears to have been resolved. A small number of emails may not have been sent between 2:28 PM and 3:34 PM EST on October 12. If you sent messages around that time, please check to see if the broadcasts are marked as 'not delivered'.

monitoring2021-10-12T21:24:00.399Z

This issue appears to have been resolved, but we are monitoring to make sure no further issues occur.

investigating2021-10-12T19:09:50.666Z

We are experiencing an issue where some emails may not be delivering.

Jul 25, 2021

Report: "Instagram is having some problems with their API"

Last update 2021-07-25T02:18:14.799Z

resolved2021-07-25T01:52:07.000Z

The virtual hugs worked! Facebook/Instagram have announced that their API is back up. If your account's feed disconnected during this outage you will now be able to reconnect it in Settings > Integrations. Please check yours! Some disconnected and some didn't...

monitoring2021-07-24T22:16:33.330Z

We are monitoring to see when Instagram resolves their issue. Let's all send them virtual hugs...

investigating2021-07-24T14:57:16.000Z

At around 1AM ET on July 24, we started experiencing problems with our Instagram integration. After much digging on our end (go Owais!), it turns out to be an issue with the Instagram API itself, and not with Simplero. As a result, your Instagram integrations might not work as expected until Instagram resolves these issues. We will monitor and you can also follow along here: Facebook's status page: https://status.fb.com/graph-api

Jun 28, 2021

Report: "Auto-response and Automation email delivery stats are not complete"

Last update 2021-06-28T19:51:04.951Z

resolved2021-06-28T19:51:04.567Z

This issue is now resolved and emails sent during this issue should now show correct statistics.

monitoring2021-06-28T19:09:16.000Z

We have implemented a fix and new stats should now be recorded correctly. We are monitoring this fix and exploring methods to update the data for emails affected during the issue.

investigating2021-06-28T16:56:57.000Z

All emails are still being sent, don't worry! But we are currently investigating an issue where the email stats for these messages are zero or incomplete. The number of 'Delivered' emails is not correct, and thus the percentages of other things (like 'Opens') are also wonky. As far as we can tell, other number-based metrics like 'Opens' are accurate - only percentages are affected. The problem may lie with SendGrid, but we haven't fully identified the issue yet. We're on it, though! Our apologies for any inconvenience.

Feb 6, 2021

Report: "Saved cards not working for new purchases"

Last update 2021-02-06T20:29:34.902Z

resolved2021-02-06T20:29:34.361Z

We believe we have this all straightened out, and saved cards are again available for new purchases.

identified2021-02-06T14:56:41.843Z

Previously-saved cards are temporarily not working for making new purchases. Complete fix expected within a few hours. In the meantime, saved cards do not show as a payment option, so the checkout process for repeat customers is somewhat worse than normal but fully functional. (Only cards processed via Stripe are affected, no other processors.)

Jan 8, 2021

Report: "Something is Amiss"

Last update 2021-01-08T10:27:36.249Z

resolved2021-01-08T10:27:35.841Z

We had a backlog of running jobs. Jobs are running again and we are seeing emails and media files uploading again.

investigating2021-01-08T09:31:31.459Z

We are currently investigating an issue with emails not being sent and video encoding. We'll update as soon as we have the issue resolved.

Nov 18, 2020

Report: "Links in email briefly broken"

Last update 2020-11-18T22:17:08.507Z

resolved2020-11-18T21:00:00.000Z

For five minutes or so, anyone clicking a link in an email got an error page. If they tried again in a few minutes, it worked correctly. No change too small for going through the proper steps. Our head of engineering is having a stern talk about expectations and SOPs with...himself. Mea culpa. -Joshua

Nov 9, 2020

Report: "Certain email deliveries delayed"

Last update 2020-11-09T21:16:27.859Z

resolved2020-11-09T02:30:00.000Z

Some deliveries failed last night between 9:20 PM and 2:20 AM Eastern. We corrected the problem (one of our mail-sending servers rapidly ran out of disk space due to an unrelated series of unfortunate events) and re-sent all the failed emails—except where we could tell that the account owner had already re-sent them. This really was quite the freak combination of problems, but we're taking steps to make sure similar processes can't use up the disk again.

Nov 3, 2020

Report: "The case of the unreported deliveries"

Last update 2020-11-03T19:42:28.926Z

resolved2020-11-02T19:00:00.000Z

You may have noticed unusually low % delivered for mailings sent in the last day or so. For about 24 hours starting at 1:45 PM Eastern (18:45 UTC) on November 2, Simplero did not record deliveries or bounces for email addresses with capital letters in them. The mail still got delivered as it always does! Unfortunately we can't get those delivery events back, and affected mailings are going to have somewhat weird-looking reports. Opens and clicks were still tracked correctly.
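The report does not describe the underlying bug. One plausible cause, assumed here purely for illustration, is that delivery events came back with the address in a different letter case than the one stored, so an exact-match lookup missed them. The sketch below shows case-insensitive matching. The function names and data shapes are hypothetical.

```python
def normalize_email(address):
    """Return a lookup key that ignores letter case and surrounding whitespace."""
    return address.strip().lower()


def record_events(recipients, event_addresses):
    """Count delivery events per recipient, matching addresses case-insensitively.

    `recipients` are addresses as stored; `event_addresses` are addresses as
    reported back by the mail provider (assumed to possibly differ in case).
    """
    counts = {normalize_email(r): 0 for r in recipients}
    for address in event_addresses:
        key = normalize_email(address)
        if key in counts:
            counts[key] += 1
    return counts
```

Strictly speaking, the local part of an address is case-sensitive under RFC 5321, but mailbox providers almost universally treat it as case-insensitive, which is why normalizing the lookup key is the usual choice for matching events.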

Oct 26, 2020

Report: "Database upgrade"

Last update 2020-10-26T21:00:55.282Z

postmortem2020-10-26T20:41:53.216Z

Thursday we set in motion some infrastructure upgrades, very carefully, behind the scenes. But it turns out MariaDB has a bug that caused it to “leak” memory when using a certain kind of data compression, and over the course of several hours, it consumed all available memory, slowed down, and rebooted itself. That caused the few minutes of downtime on Thursday evening. We’ve never had a problem like that before, but there’s a first time for everything. Now we have alarms so we’ll be notified of any memory issues with the database long before they cause a problem. We also decided to upgrade MariaDB to a version that fixes the memory leak bug. It’s a so-called minor version upgrade, and Amazon even offers to do them for you automatically during a short regularly-scheduled maintenance window, so we expected a few minutes of downtime. Instead, as you know, there was over an hour Saturday night when the database (and hence the entire application) was inaccessible. And once the process started there was no stopping it: we were at the mercy of Amazon Web Services. Going forward, we’ll announce ahead of time on [status.simplero.com](http://status.simplero.com) and in our Facebook group any time we plan even a few minutes of downtime. And we’re implementing a plan to be able to upgrade the database with, for real, no more than a few minutes of downtime.

resolved2020-10-25T02:10:35.982Z

And, we're back! Sorry that took a bit longer than expected. All is safe and sound and operational.

identified2020-10-25T01:32:19.033Z

We're currently doing a database upgrade. We expect to be back online in a few minutes. Sorry for the wait.

Oct 25, 2020

Report: "More database upgrade"

Last update 2020-10-25T11:22:42.505Z

resolved2020-10-25T11:22:42.488Z

This incident has been resolved.

monitoring2020-10-25T03:59:51.000Z

Upgrade process is completely finished. No data was harmed in the upgrading of this database.

monitoring2020-10-25T03:01:44.268Z

We've been back up for a while now, and we're fairly sure we're out of the woods. But given that we thought we were done 30 minutes ago and then we weren't, we'll leave this Status as Monitoring. To be on the safe side.

investigating2020-10-25T03:00:09.908Z

We are continuing to investigate this issue.

investigating2020-10-25T02:42:54.517Z

Apparently our database wasn't quite done updating. This is still expected downtime, it's just taking longer than we'd expected. We're very sorry this is taking so long.

Oct 23, 2020

Report: "Something is amiss"

Last update 2020-10-23T00:59:36.192Z

resolved2020-10-23T00:59:35.760Z

And we're back in business! We'll post more details here about what happened after we do a full post-mortem.

investigating2020-10-23T00:47:05.781Z

As you've noticed, something is amiss in Simplero-land. We're on it and will get it fixed ASAP.

Sep 17, 2020

Report: "Site is down ... working on it"

Last update 2020-09-17T16:04:31.126Z

resolved2020-09-17T16:04:30.503Z

All good now. Thanks for your patience.

identified2020-09-17T15:28:38.000Z

Most stuff is back online now. Looks like it's just our own website (simplero.com) that's still borked. Your sites and services are working fine, and you can login to your account by going to youraccount.simplero.com/admin.

investigating2020-09-17T15:23:51.000Z

We made a boo-boo. We're working hard on restoring service. So sorry, guys. We know we screwed up.

Sep 9, 2020

Report: "Brief interruptions caused by maintenance"

Last update 2020-09-09T15:53:27.333Z

resolved2020-09-09T15:53:26.947Z

It's all cleaned up now. We apologize for the inconvenience. A few times the site was offline and everything got paused for a minute, but it's all back to normal, and there should be no lasting effects.

identified2020-09-09T13:44:00.684Z

We're experiencing a few brief interruptions in service this morning due to some unexpected problems during system maintenance. We're working on getting it all cleaned up.

Jul 28, 2020

Report: "Mail delayed by SendGrid outage"

Last update 2020-07-28T20:17:19.568Z

resolved2020-07-28T20:17:18.384Z

SendGrid is reporting that systems are back online. I still wouldn't be surprised if some inbound and outbound messages are delayed.

identified2020-07-28T15:28:09.580Z

According to https://status.sendgrid.com, SendGrid is having an outage across all capabilities. Mail sending will be delayed. Our architecture is designed so that mail will get delivered automatically as soon as Sendgrid is back online.

Report: "Temporary network error caused downtime for sites"

Last update 2020-07-28T19:33:22.399Z

postmortem2020-07-28T19:27:11.281Z

A temporary error with domain name resolution happened to coincide with our process that checks to see that domain names are still configured to point to Simplero, which caused our system to see many domains as no longer pointing to Simplero, which caused that process to mark them inactive. This kind of problem has never happened before in all the years we have supported custom domains, but the system design was still a mistake on our part: DNS systems _can_ fail, so we shouldn’t have had a system that deactivated sites based on a single check. We have improved the system so that an active domain must fail multiple checks over a couple days before it’s deactivated.
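The rule described in this postmortem (deactivate a domain only after it fails multiple checks spread over a couple of days) can be sketched roughly as follows. The class name, the threshold of three failures, and the two-day span are assumptions chosen for illustration; the postmortem gives no exact numbers.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class DomainHealth:
    """Tracks DNS check results for one custom domain (hypothetical sketch)."""

    required_failures: int = 3                       # assumed threshold
    min_failure_span: timedelta = timedelta(days=2)  # assumed "couple of days"
    failures: list = field(default_factory=list)
    active: bool = True

    def record_check(self, ok, when):
        """Record one check result and return whether the domain is still active."""
        if ok:
            # Any successful check resets the failure streak.
            self.failures.clear()
            return self.active
        self.failures.append(when)
        span = self.failures[-1] - self.failures[0]
        # Deactivate only after enough consecutive failures over a long enough period.
        if len(self.failures) >= self.required_failures and span >= self.min_failure_span:
            self.active = False
        return self.active
```

With this shape, a resolver outage lasting a few hours produces several failures in a row but never satisfies the time-span condition, so no site is deactivated by a single bad morning.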

resolved2020-07-28T10:00:00.000Z

Our systems suffered temporary, partial errors with domain name resolution this morning, which resulted in a number of customer websites temporarily failing to display. We're still investigating to determine exactly what went wrong and what sequence of events may have caused sites to be offline any longer than necessary.

May 16, 2020

Report: "Mail sending down"

Last update 2020-05-16T18:57:28.734Z

postmortem2020-05-16T18:51:49.374Z

Our mail sending partner decided to change all login credentials at 20 minutes past midnight US Eastern Time on a Saturday, without notice, in a way that broke all of our email sending completely. Emails just stopped going out. This is terrible on their part. We're going to reevaluate our business relationship with them, we're going to obviously do everything we can to make sure this won't happen again in the future, and we will create a system to catch a situation like this automatically, and immediately, going forward. I'm so sorry. This is absolutely horrific. Nothing like this has ever happened before in our 11+ year history, and I've never experienced a supplier behaving this irresponsibly before. We've definitely learned from this. With sincere apologies, –Calvin

resolved2020-05-16T18:50:06.569Z

Backlogged messages have been sent.

monitoring2020-05-16T14:49:27.176Z

Email sending is working again, and we are delivering all mail that should have been sent earlier today. We're monitoring to make sure everything gets sent.

identified2020-05-16T13:54:33.485Z

All mail sending from Simplero is currently failing. We have identified the problem and are working to correct it.

Feb 4, 2020

Report: "Brief downtime"

Last update 2020-02-04T15:54:31.733Z

resolved2020-02-04T05:30:00.000Z

We had nine minutes of downtime from 12:25 AM to 12:34 AM US Eastern time. To support a new feature, a developer made a configuration change of a kind we rarely need to make, and it didn't go well. We're improving our internal documentation, and this won't happen again.

Jun 11, 2019

Report: "Notification emails delayed"

Last update 2019-06-11T14:14:27.717Z

resolved2019-06-11T14:14:27.268Z

Notification emails and other one-at-a-time emails across Simplero were stalled from 11:43 PM US Eastern time last night until 8:52 AM this morning. All such emails were delivered starting at 8:52 AM. The problem was caused by a configuration error which is now fixed. Broadcasts and newsletters delivered normally and were not affected.

May 24, 2019

Report: "Some purchases failed during an hour due to network issues"

Last update 2019-05-24T20:27:13.858Z

resolved2019-05-24T20:27:13.428Z

From 1:22 PM to 2:32 PM US Eastern today, some of our servers were unable to make connections to outside services, including payment gateways. Some payments attempted during this window failed. Full connectivity has been restored. We sincerely apologize for the outage.

May 15, 2019

Report: "Intermittent connectivity"

Last update 2019-05-15T17:50:39.118Z

postmortem2019-05-15T16:42:10.177Z

This morning we had about 30 minutes of intermittent failures affecting the Simplero software and customer websites, including [simplero.com](http://simplero.com): we use our own stuff! That was followed by a few minutes of all services being down completely. We’re so sorry about that! Here’s what happened. We deploy a new version of Simplero every time we fix or improve something, typically several times a day. We keep a few previous versions around, and the oldest version gets cleaned up as a new version gets deployed. One of our deploys this morning failed, and the old version kept running. That’s as it should be, but the failure today was silent: we didn’t realize anything had gone wrong. A few more deploys later, the old version was still running, but it was old enough that the application files got cleaned up right out from under the running application on one of our servers. (All your media, images, text, customer data, and any other files you’ve added to your Simplero were just fine. Only the application itself was affected.) Another deploy trying to fix the problem meant the old, still-running version got cleaned up on every server, and we went from intermittently down to completely down. Finally, we realized the root cause and undid the changes that were causing new deploys to fail. Going forward, we’re changing our deploy process to make a silent failure like this visible so we can roll it back immediately. Sorry we let you down: we’ve learned from this error and we’ll make sure this kind of failure can’t happen again. Thank you for your patience and for your trust in Simplero.
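The failure mode in this postmortem is a cleanup step that removes the oldest release even though it is still the one running. A guard against that can be sketched as below. This is not Simplero's deploy tooling: the function name, the `keep` default, and the idea of passing in the set of running releases are assumptions for illustration.

```python
def releases_to_delete(releases, running, keep=3):
    """Return the old releases that are safe to remove.

    `releases` is ordered oldest to newest. The newest `keep` releases are
    retained, and any release listed in `running` is never deleted, however
    old it is. (Hypothetical helper; names are assumptions.)
    """
    if keep < 1:
        raise ValueError("keep must be at least 1")
    retained = set(releases[-keep:]) | set(running)
    return [r for r in releases if r not in retained]
```

For example, with releases `v1` through `v5`, `keep=3`, and `v1` still running after a silently failed deploy, only `v2` is eligible for deletion. A lingering old release in `running` is also a useful signal that a deploy did not take effect.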

resolved2019-05-15T15:24:30.734Z

We're back online.

investigating2019-05-15T15:23:39.172Z

We're completely down now. Deploying a fix we believe will solve it completely. Fingers crossed.

investigating2019-05-15T15:12:41.583Z

We've received reports of some sites experiencing intermittent connectivity issues. We're currently investigating the issue.

Mar 14, 2019

Report: "System Wide Outage"

Last update 2019-03-14T16:51:40.400Z

postmortem2019-03-14T16:39:22.250Z

Here’s what happened yesterday with our longest downtime in 5 years. First, our background jobs got stuck, and we got a notification about it. It was strange, because there hadn’t been a recent deploy or any other recent event that would correlate to that. Then, in an attempt to get them unstuck, a team member made a quick decision to run a full deploy. That turned out to be a mistake, because that ended up taking down EVERYTHING, including our web servers, so now the site was completely down. To be fair, though, given what turned out to be the cause, the web servers would probably have stopped responding fairly soon after, anyway. As soon as the site was down, it was all hands on deck. We spent the majority of the time just trying to figure out what the heck was going on. There was nothing in the logs, no indications of what could be causing this. We tried the logical route: It started with background jobs, it spread to the web servers when they were redeployed. We also, of course, tried the good old “turn it off and back on” method, but, predictably, it didn’t do anything to fix it. Finally we got a clue. Some requests did go through, and they threw an error from our PostgreSQL database saying the connection was bad. That pointed us in the direction of the logging server running PostgreSQL. As soon as we validated that, it was an easy fix to turn off logging to PostgreSQL, which is safe to do since we only use it for internal debugging purposes. Then the site was back up. But what had gone wrong with our PostgreSQL database? We keep stuff there for a limited period of time, and then delete it. It looks like the way we deleted things wasn’t very efficient, and we also never VACUUM’d our database. It’s been many years since I last used PostgreSQL, and that was something I’d forgotten you should do every so often.
One thing that threw us was that our system is designed such that if logging to PostgreSQL fails for some reason, the application should be able to keep serving requests. Clearly something about that wasn’t working quite right. We’ve now changed our process for how we delete old rows, and implemented a system to VACUUM the database more regularly, as well as split this process out from some other processes it was lumped in with. Again, I’m super sorry about this. The big factor was just how long it took us to figure out what was going on here. It was completely mystifying for the longest time, until we finally got a clue that put us on the right track. Thank you for being here with us. We’re grateful every day.
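For readers curious what the two remedies in the postmortem might look like in practice, here is a rough sketch. It is not our actual code, just an illustration under stated assumptions: the table `request_logs`, its `id` and `created_at` columns, and the names `purge_old_rows` and `FailOpenLogger` are all hypothetical, and Python's built-in `sqlite3` stands in for PostgreSQL so the example runs anywhere. The first part deletes expired rows in small batches and then runs VACUUM; the second wraps a logging sink so that a failing log database is skipped for a while instead of dragging request handling down with it.

```python
import sqlite3
import time


def purge_old_rows(conn, cutoff, batch_size=1000):
    """Delete rows older than `cutoff` in small batches; return rows deleted.

    Assumes a hypothetical table request_logs(id, created_at).
    """
    total = 0
    while True:
        cur = conn.execute(
            "DELETE FROM request_logs WHERE id IN ("
            "  SELECT id FROM request_logs WHERE created_at < ? LIMIT ?)",
            (cutoff, batch_size),
        )
        conn.commit()  # one short transaction per batch
        if cur.rowcount == 0:
            break
        total += cur.rowcount
    # No transaction is open after the final commit, so VACUUM is allowed here.
    conn.execute("VACUUM")
    return total


class FailOpenLogger:
    """Wrap a logging sink so that sink failures never reach the caller."""

    def __init__(self, sink, max_failures=3, cooldown=30.0, clock=time.monotonic):
        self._sink = sink                  # callable that writes one record
        self._max_failures = max_failures  # consecutive failures before pausing
        self._cooldown = cooldown          # seconds to skip the sink once paused
        self._clock = clock
        self._failures = 0
        self._paused_until = 0.0
        self.dropped = 0                   # records that were not written

    def log(self, record):
        """Try to write `record`. Returns True if written; never raises."""
        if self._clock() < self._paused_until:
            self.dropped += 1              # sink is paused: skip it entirely
            return False
        try:
            self._sink(record)
        except Exception:
            self._failures += 1
            self.dropped += 1
            if self._failures >= self._max_failures:
                self._paused_until = self._clock() + self._cooldown
                self._failures = 0
            return False
        self._failures = 0
        return True
```

Two caveats if you adapt this to PostgreSQL. VACUUM cannot run inside a transaction block there, so the connection issuing it needs autocommit, and the parameter placeholder style depends on the driver. Also, the wrapper only helps when the sink raises an error. A sink that hangs would still block the caller, so connection and statement timeouts (for example PostgreSQL's `statement_timeout`) would be needed as well.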

resolved2019-03-13T20:17:41.042Z

Everything's operational. Our specialized logging system is still offline, but that doesn't affect operations. We're doing some maintenance and cleanup on it, before putting it back in commission. This was the longest-running downtime in five years, and I'm terribly terribly sorry we let you down like this. We are, of course, fixing all of the issues that led to this downtime.

identified2019-03-13T19:01:29.129Z

Yup, that was it. We're back. Now on to figuring out what happened to our PostgreSQL installation. It seems like something's really screwed there.

investigating2019-03-13T19:00:53.147Z

We think we figured out what's going on. It's related to our logging infrastructure.

investigating2019-03-13T18:45:14.173Z

This is the strangest thing I've seen in my almost 40 years in software development. It's certainly the worst downtime we've had in over five years. We've got all hands on deck trying to figure this thing out, but at this point, we don't even know what's causing the processes to not respond correctly. I'm so so sorry. We take this stuff supremely seriously, and we're working as HARD as we can to bring everything back up.

investigating2019-03-13T18:16:20.245Z

We are currently experiencing a system-wide outage. We are looking into it and will update with details as soon as we can. Thanks for your patience!

Feb 4, 2019

Report: "Video Encoding issues at AWS"

Last update 2019-02-04T20:48:38.187Z

resolved2019-02-04T20:48:37.269Z

Hallelujah! Media files are encoding effectively now. Join me in raising a glass to our team of coders who figured out several challenging problems today. Thank you all for your patience.

monitoring2019-02-04T20:44:15.148Z

We are continuing to monitor for any further issues.

monitoring2019-02-04T18:53:47.969Z

There's a new twist in today's media file encoding challenge. Dev team is investigating as fast as their fingers can take them. Thanks for your continued patience.

monitoring2019-02-04T16:01:30.278Z

We've figured out a solution and things are catching up. This will be a permanent improvement going forward. That's the good news. Thank you so much for your patience!

identified2019-02-04T12:36:40.454Z

Network issues at AWS are affecting video encoding.

Report: "Site is down"

Last update 2019-02-04T02:23:19.546Z

resolved2019-02-04T02:23:18.651Z

We're back up and systems are operational again. Thanks all.

investigating2019-02-04T02:19:08.739Z

We are continuing to investigate this issue.

investigating2019-02-04T02:14:18.550Z

We implemented a change that resulted in an outage. We are working to resolve the situation and expect to be back up soon. Thank you for your patience.

Jan 31, 2019

Report: "Video encoding is backlogged at the moment"

Last update 2019-01-31T19:35:43.475Z

resolved2019-01-31T19:35:42.764Z

Everything's humming along nicely now. Thank you for your patience.

monitoring2019-01-31T18:19:41.103Z

Looks like AWS is behaving again. We still have a little bit of a backlog, but everything is moving forward as it should.

investigating2019-01-31T17:07:39.064Z

Things are progressing, but the network issues are making it slow. We'll get through it, but we need your patience here.

investigating2019-01-31T16:51:39.600Z

It looks like a network issue with Amazon's web services is affecting connections between our encoding servers and S3, where the video files are stored, making downloads and uploads very slow and unreliable.

investigating2019-01-31T16:45:00.785Z

Processing is stuck for a number of videos. We're working on getting it all cleared up as soon as we can.

Jan 18, 2019

Report: "Switching over Content Distribution Network"

Last update 2019-01-18T16:39:22.452Z

resolved2019-01-18T16:39:22.433Z

Almost everything's switched over now, and things seem to be working well.

investigating2019-01-18T14:33:55.240Z

We're switching over our Content Distribution Network. There may be breakage in the app while we do this, but we're monitoring closely. If you notice something, please let us know, but most likely we're already on it.