Historical record of incidents for Airtame
Report: "Service disruption"
Last update: We have identified the issue in our systems. The system is back online.
We are currently experiencing a service disruption. Our engineering team is working to identify the root cause and implement a solution.
Report: "Problems connecting to Teams calls"
Last update: This incident has been confirmed resolved.
We have identified the issue and deployed a fix. We are now monitoring the situation.
We are currently investigating an issue where some Airtame Hub devices fail to connect to a Teams call.
Report: "Service disruption"
Last update: The incident has been resolved.
We are currently experiencing a service disruption with our email provider. Our Cloud team is working to identify the root cause and implement a solution. Emails from Cloud will not be delivered until the issue is resolved.
Report: "Google Platform experiencing degradation in the service"
Last update: Our monitoring system is no longer reporting errors in communication with Google services. We consider this issue resolved.
We are experiencing an increased error rate in some Airtame Cloud applications. So far it has been restricted to certain Google applications, and our current diagnosis points to issues on the Google Platform.
Report: "Backend API unstable"
Last update: The issue has been fixed and the Backend API is now behaving as expected.
We are currently experiencing degraded performance in the Backend API handling the requests from our Frontend.
Report: "Elevated number of 5xx errors"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
We are currently investigating an elevated number of 5xx errors. We will provide updates as necessary.
Report: "DNS resolution issue"
Last update: This incident has been resolved.
Due to an issue with domain renewal, airtame.cloud was unreachable. The issue with our domain provider has since been fixed and services are recovering. We're continuing to monitor the situation and will provide updates as necessary.
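As an illustration only (this is not part of Airtame's tooling), a failure like the one above can be confirmed with a plain standard-library DNS lookup; the sketch below checks whether airtame.cloud still resolves.

```python
# Minimal DNS resolution check using only the Python standard library.
import socket

try:
    addrs = socket.getaddrinfo("airtame.cloud", 443, proto=socket.IPPROTO_TCP)
    print("airtame.cloud resolves to:", sorted({a[4][0] for a in addrs}))
except socket.gaierror as exc:
    # A lapsed domain registration typically surfaces here as a resolution error.
    print("DNS resolution failed:", exc)
```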
Report: "Service disruption"
Last update:

# **Introduction**

On Saturday, 08.02.2020, Airtame Cloud suffered a service disruption from approximately 03:10 to 16:50 UTC, during which most users were unable to use Airtame Cloud. We apologise for the service disruption. With this postmortem we would like to explain how this service disruption was handled, and what we will do to minimise the risk of future service disruptions.

# **Timeline**

* **03:10** - We receive alerts of high CPU usage on our database instance.
* **12:14** - Engineering starts investigating the issue.
* **13:46** - A potential issue is identified with a performance test, and the performance test is stopped. Service is briefly restored. This was not due to the stopped performance test, but because we stopped backend services to prepare a failover of our database; this was not clear at the time, as metrics shipment turned out to be delayed.
* **14:03** - The issue starts to occur again even with the performance test stopped. Investigation of the root cause continues.
* **16:17** - The root cause is identified in Airtame device firmware 3.8.0-b3 and above. The issue relates to new logic in the device's Cloud component that communicates with the Cloud backend.
* **16:20** - A hotfix is deployed on the backend to stop these firmware versions from connecting to our Cloud.
* **16:47** - The issue is mitigated and the service is restored for devices running firmware 3.8.0-b2 and below.

On Monday, 10.02.2020, a backend fix is developed to also allow firmware versions 3.8.0-b3 and above to connect to our Cloud again. This fix is deployed by 15:00 UTC.

# **Explanation**

In the Cloud component of the affected firmware versions, a device UUID handler was introduced. This UUID handler would trigger a full table scan of our database each time a device connected to our backend, leading to high CPU usage on the database. On Friday, we saw a 25% increase in users with devices running firmware versions 3.8.0-b3 and above. While the absolute number of added devices was small (~200), this was enough to cause a cascading failure due to a combination of circumstances:

* The affected devices would connect to our backend.
* Each connected device would cause the backend to do a full table scan, causing high CPU usage on our database.
* This would result in an increase in query latencies, which in turn would result in WebSocket disconnections.
* The devices would try to reconnect the WebSockets, leading to an even higher database load, and thus higher latencies. The number of connections piling up then led to memory issues on our backends.
* Finally, our backends ran out of memory, causing all devices to disconnect from the Cloud entirely. After a random timeout, they would attempt to reconnect, meaning our backends were unable to recover while the database remained locked up.
* Once the affected versions were blocked from connecting to the Cloud, our database and backends were able to recover and service was restored.

# **Learnings**

We have recently added performance tests, and will continue to add further checks to these. Even though our monitoring system detected the database CPU usage increase, it did not report the increase in error rates on the WebSocket endpoints. Since the incident, we have implemented new checks that monitor error levels on our public load balancers. This is currently being validated.
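To make the failure mode concrete, here is a minimal, hypothetical sketch of the per-connection lookup pattern described in the explanation above, assuming a PostgreSQL `devices` table accessed from a Python backend via psycopg2; none of the table, column, or function names are taken from Airtame's actual codebase.

```python
# Hypothetical illustration only -- not Airtame's backend code.
# Assumes a PostgreSQL "devices" table with an indexed uuid column.
import psycopg2

conn = psycopg2.connect("dbname=cloud_example")

def lookup_device_slow(device_uuid: str):
    """Called on every device connection. Casting the indexed column to text
    prevents the planner from using the index, so each call becomes a full
    table scan -- harmless for one device, crippling when thousands reconnect."""
    with conn.cursor() as cur:
        cur.execute("SELECT id FROM devices WHERE uuid::text = %s", (device_uuid,))
        return cur.fetchone()

def lookup_device_fast(device_uuid: str):
    """Comparing against the uuid column directly lets the same lookup resolve
    via an index scan instead of touching every row."""
    with conn.cursor() as cur:
        cur.execute("SELECT id FROM devices WHERE uuid = %s::uuid", (device_uuid,))
        return cur.fetchone()
```

Running `EXPLAIN` on each query shows a sequential scan for the first form and an index scan for the second, which is one way this kind of regression can be caught in performance tests before it reaches production.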
This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
We continue to investigate the root cause of the issues.
The issue has been identified and a fix is being implemented.
We are continuing to investigate this issue.
We are currently experiencing a service disruption and are investigating the issue.
Report: "Service disruption"
Last update: This incident has been resolved.
A fix has been implemented and we are monitoring the results.
The issue has been identified and a fix is being implemented.
We are currently experiencing a service disruption. Our Infrastructure team is working to identify the root cause and implement a solution. Some users may be affected, with devices appearing offline.
Report: "RDS storage issue"
Last update: This incident has been resolved.
We are currently investigating an elevated rate of errors.