loader.io

Is loader.io Down Right Now? Check if there is a current outage ongoing.

loader.io is currently Operational

Last checked from loader.io's official status page

Historical record of incidents for loader.io

Report: "Web app and test running service unavailable"

Last update
resolved

On Jan 2, 2023 the loader.io website, API, and test running services became unavailable due to an expired TLS certificate in a backend service that manages service credentials. The certificate had been renewed, but was not distributed to all servers that needed it. When the certificate expired, Loader's orchestration system was unable to read credentials for internal connections to databases and other services, and several services failed as a result, including the web interface and the test running & scheduling jobs. Our team was not notified due to a separate failure of alerting systems, and the team was not in the office because of the observance of the New Years Day holiday on the Monday after new years day. As soon as a team member noticed the outage, service was restored by distributing the renewed certificate to the credential service, and restarting the other failed services. - Test settings, results, and other account information in the loader.io web interface was inaccessible - Tests that had been scheduled for Jan 2, 2023 were not run during their scheduled time, and instead would have run after service was restored, early on Jan 3 2023 - Some scheduled tests may have been scheduled twice in error when service was restored, due to retries and delays processing the backlog of tests We are reviewing our automation and monitoring systems to ensure that critical systems are better automated, and that our team receives alerts promptly!

Report: "Main database unreachable"

Last update
resolved

This incident has been resolved.

monitoring

A replica has been promoted after Loader's primary database failed. Systems are functional and we continue to monitor closely

investigating

We are investigating an issue connecting to Loader's primary database

Report: "loader.io website down"

Last update
resolved

This incident has been resolved.

monitoring

Systems are getting back to normal; we continue to monitor closely

investigating

We are investigating a problem with our website load balancers.

Report: "load test data collection errors"

Last update
resolved

This incident has been resolved.

monitoring

We implemented a temporary fix last night, so scheduled tests should have run as expected. A permanent fix is now in place and all systems are operational. We will continue to monitor all systems closely.

identified

We are investigating errors from our load test data collection service; systems are in maintenance mode while we address the underlying problem.

Report: "Bad Gateway errors"

Last update
resolved

All web traffic is stable, so we are marking this resolved.

monitoring

A recent configuration change caused some of our web servers to stop responding. You may have seen HTTP 502 "Bad Gateway" errors from our web interface and API. The problem has been fixed and we will continue to monitor closely.

Report: "test queue delays"

Last update
resolved

This incident has been resolved.

monitoring

The cause of test delays has been identified and fixed. We are continuing to monitor as the test queues catch up.

investigating

We are looking into an issue where some tests are not running

Report: "Test delays"

Last update
resolved

Tests should be running normally now

investigating

Some tests are being queued for longer than usual, we are investigating the cause of the slow-down

Report: "some load tests not running"

Last update
resolved

Tests are running normally now

monitoring

One of our load generation machines stopped responding this morning and has caused a few tests scheduled on it not to run. It has been removed from our fleet of load generators and affected tests should start running. We will be monitoring closely to make sure the issue is resolved.

Report: "tests are being delayed"

Last update
resolved

Tests are now running normally.

investigating

Delayed tests are starting to run, and new tests should run as expected. All systems operational :)

investigating

We are currently investigating this issue.

Report: "https tests not sending correct number of requests"

Last update
resolved

We rolled back a recent deploy, https tests should be behaving normally again.

identified

Some tests against https endpoints are not sending the correct number of requests. We are currently working to resolve this issue.

Report: "test results not live-updating"

Last update
resolved

test results are now coming through and updating live as the test runs

investigating

We are investigating an issue where test results do not update as the test runs. Tests are running and results do appear on page refresh.

Report: "Service down"

Last update
resolved

We've restored service.

investigating

We are currently experiencing an unexpected service outage. We are working on resolving the issue.

Report: "Networking issue"

Last update
resolved

One one our nodes inside of EC2, DNS was resolving EC2 hostnames to public IP addresses instead of internal ones, which prevented some of our internal systems from communicating properly. DNS is resolving properly again.

investigating

We are investigating a networking issue that is preventing some tests from running correctly.

Report: "Database server reboot"

Last update
resolved

This incident has been resolved.

monitoring

We're back up, but our queues are a little backed up, so tests may take a little longer to start for a bit.

monitoring

The server reboot is going to take longer than anticipated. We should be back around 8:15 AM EDT.

Report: "load generator issues"

Last update
resolved

This incident has been resolved.

monitoring

load generation is performing normally, but we will keep monitoring to make sure the issue is resolved

investigating

We are investigating an issue with our load generators causing a few tests to lose some results

Report: "unplanned maintenance"

Last update
resolved

And we're back. Tests should be verifying and running as usual now.

identified

Our web and API are down right now because of a deploy gone wrong. We're working on getting operational again as soon as possible.

Report: "stalled tests"

Last update
resolved

Some network issues around 10:20AM EST caused a few tests to stall. Those tests have been aborted and systems operational now.

investigating

Investigating some stalled tests from the past hour

Report: "isolated stalled tests"

Last update
resolved

A small number of users may have experienced the "preparing screen of death" intermittently over the last 12 hours, where a test shows a preparing message and even the "abort test" button couldn't get you out of it. This was caused by EC2 capacity issues at Amazon, combined with a bug in our handling of that error. A fix for the bug in our code has been deployed, and if you had a test stuck at the preparing screen, we have aborted it for you - instead of the preparing screen, you should now see a message indicating that your test has been aborted. You can run the test again from there.