Is SAP LeanIX Down Right Now? Discover if there is an ongoing service outage.

SAP LeanIX is currently Operational

Last checked Jul 29, 2025 14:31 UTC from SAP LeanIX's official status page

Historical record of incidents for SAP LeanIX

Jul 29, 2025

Report: "Service Disruption in US"

Last update 2025-07-29T07:49:59.294Z

resolved2025-07-24T10:30:00.000Z

## Incident Description On July 24, 2025, the Pathfinder instance us-prod-1 was unavailable for approximately 6 minutes, from 12:30:50 UTC to 12:36:40 UTC. All customer requests targeting this instance failed. At least seven customers were affected. ## Incident Resolution The issue was mitigated by redeploying the affected instance. ## Root Cause Analysis During a routine update, one of our gateway servers reloaded its configuration. Due to connectivity issues, the server was unable to reach any DNS servers and therefore could not resolve the IP address of the upstream server. Consequently, the web server was not able to respond to requests to the respective upstream service. ## Preventative Measures We will use Azure’s internal DNS servers to reduce latency and dependency on internet connectivity. Furthermore, we will consider hot reloads in scenarios like this to improve recovery times.
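
The failure mode in the root cause analysis above (a reload that cannot reach any DNS server) can be illustrated with a small Python sketch. It shows one possible mitigation, keeping the last successfully resolved address, and is not the gateway's actual behavior. The `UpstreamResolver` class and the injectable `lookup` function are hypothetical names introduced only for illustration.

```python
import socket
from typing import Callable, Dict, Optional


def system_lookup(hostname: str) -> str:
    """Resolve a hostname with the system resolver (raises OSError on failure)."""
    return socket.gethostbyname(hostname)


class UpstreamResolver:
    """Hypothetical helper: remembers the last known-good address per hostname."""

    def __init__(self, lookup: Callable[[str], str] = system_lookup) -> None:
        self._lookup = lookup
        self._last_good: Dict[str, str] = {}

    def resolve(self, hostname: str) -> Optional[str]:
        try:
            address = self._lookup(hostname)
        except OSError:
            # DNS unreachable: fall back to the cached address, or None if
            # this hostname was never resolved successfully.
            return self._last_good.get(hostname)
        self._last_good[hostname] = address
        return address
```

With a cache like this, a reload during a DNS outage would keep serving the previous upstream address instead of failing outright. Whether a stale address is acceptable depends on how often the upstream IP changes.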

Jun 24, 2025

Report: "Unable to link individual fact sheets to reference catalogs"

Last update 2025-06-24T21:44:52.733Z

investigating2025-06-24T21:44:52.730Z

Users may be unable to link individual fact sheets to the reference catalog from the fact sheet page. Our team is working to identify the root cause and implement a solution. Workaround: Fact sheets can still be linked to the reference catalog using the bulk linking features. - https://docs-eam.leanix.net/docs/saas-catalog#bulk-linking-application-fact-sheets-from-the-inventory - https://docs-eam.leanix.net/docs/lifecycle-catalog#bulk-linking-it-component-fact-sheets-from-the-inventory We will send an additional update at 8 am UTC.

Report: "Degraded performance in Bookmarks"

Last update 2025-06-24T09:32:31.222Z

investigating2025-06-24T09:32:31.217Z

Users may experience degraded performance in Bookmarks. Our team is working to identify the root cause and implement a solution. We will send an additional update in 60 minutes.

Jun 23, 2025

Report: "Service Disruption in Surveys"

Last update 2025-06-23T08:28:19.346Z

investigating2025-06-23T08:28:19.343Z

We are currently experiencing a service disruption in Surveys for all workspaces. Our team is working to identify the root cause and implement a solution. We will send an additional update in 30 minutes.

Jun 18, 2025

Report: "Service Disruption in OData Integration for all regions"

Last update 2025-06-18T15:17:34.333Z

resolved2025-06-18T07:30:00.000Z

We are currently experiencing issues with our OData integration in all regions. The root cause has been identified and mitigated.

Jun 17, 2025

Report: "Service Disruption In EU region"

Last update 2025-06-17T09:42:51.138Z

investigating2025-06-17T09:42:51.135Z

We are currently experiencing a service disruption in the EU region. Our team is working to identify the root cause and implement a solution. We will send an additional update in 60 minutes.

Jun 16, 2025

Report: "Service Disruption in EU"

Last update 2025-06-16T11:50:23.574Z

postmortem2025-06-16T11:50:02.354Z

## Incident Description Between 8 Jun 2025 09:51 UTC and 8 Jun 2025 10:13 UTC, SAP LeanIX users might have experienced failed login attempts and errors when searching for users. ## Incident Resolution Our engineering team resolved a lock on the database, which fully restored the functionality. ## Root Cause Analysis The incident was caused by an Object-Relational Mapping (ORM) system, which created a new constraint on the database, leading to a lock. This lock was released at 8 Jun 2025 10:13 UTC. ## Preventative Measures To prevent similar incidents, we are disabling the database change functionality of the ORM. This ensures that database schema changes are only done via the database migration system.

resolved2025-06-08T09:51:00.000Z

This incident has been resolved. We appreciate your patience and understanding.

Report: "Service disruption affecting a limited number of customers in DE region"

Last update 2025-06-16T07:47:41.344Z

resolved2025-06-13T13:46:00.000Z

Between 15:46 and 15:57 UTC, a limited number of customer instances in the DE region were temporarily unavailable. Affected customers may have experienced issues accessing their workspaces during this time. The issue was automatically detected and resolved. There was no data loss.

Jun 11, 2025

Report: "Transformations Relation Sync Outage"

Last update 2025-06-11T15:59:33.108Z

postmortem2025-06-11T15:59:28.579Z

## Incident Description On `2025-06-06 12:40 UTC`, SAP LeanIX users started to experience issues with the automatic creation of relations during the setup of transformations on the Fact Sheet details page. These expected relations were not created. ## Incident Resolution An investigation into the issue revealed that it originated from a database problem that hindered the request from being processed. A fix was implemented on `2025-06-06 17:06 UTC`, allowing the job to resume its normal operations and successfully process all the missing relations. ## Root Cause Analysis The incident was caused by our system reaching a built-in database limit in the `westeurope` region, due to the growing number of customer workspaces. This led to failures in one of our services, temporarily interrupting the automatic relation creation. The issue was quickly identified and resolved, and no customer data was lost. ## Preventative Measures To prevent this kind of issue from happening again, we have updated our systems to process data in smaller batches. This change will help us avoid database limits as our customer base grows. We are also reviewing similar processes in other parts of our system to ensure continued reliability for our customers.
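
The preventative measure above (processing data in smaller batches) can be sketched in Python as follows. The postmortem does not say which database limit was reached or what batch size is now used, so `in_batches` and its `batch_size` parameter are illustrative assumptions only.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def in_batches(items: Sequence[T], batch_size: int) -> Iterator[List[T]]:
    """Yield consecutive slices of at most batch_size items."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(items), batch_size):
        # Each slice can be sent as its own query, so no single statement
        # grows with the total number of workspaces.
        yield list(items[start:start + batch_size])
```

A caller would loop over `in_batches(workspace_ids, batch_size)` and issue one database request per batch, which keeps each request below a fixed size regardless of how many workspaces exist.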

resolved2025-06-06T12:30:00.000Z

There were issues with the automatic creation of relations during the setup of transformations on the Fact Sheet details page.

Jun 4, 2025

Report: "Custom user roles having too restricted permissions on some workspaces"

Last update 2025-06-04T12:46:18.038Z

postmortem2025-06-04T12:45:21.690Z

## Incident Description At `2025-06-03 09:45 UTC`, SAP LeanIX users started experiencing issues accessing parts of the application with custom _customer roles_. Dashboard-related error messages were displayed in the Inventory, and users appeared to have fewer permissions than anticipated. ## Incident Resolution We conducted an investigation into the issue and traced it back to a recent release that targeted broken or invalid _customer roles_. The change that caused the problem was rolled back on `2025-06-03 15:11 UTC`, successfully restoring the expected behaviour. ## Root Cause Analysis The team launched an update on `2025-06-03 09:45 UTC` aimed at addressing ongoing issues with broken and invalid _customer roles_. This modification impacted users who employ custom _customer roles_ in general, resulting in them being assigned fewer permissions than intended. The lack of necessary permissions caused several components of the application to malfunction, leading to unclear error messages being displayed to users in the **Inventory**. The change has been completely reverted and will be reassessed by our engineering teams. ## Preventative Measures We are taking steps to improve how we test changes before they reach our live environment. This includes upgrading our developer tools so we can better simulate real-life scenarios and catch issues earlier. We’re also continuously looking into ways to release updates to a small group of users first, making it easier to identify and fix potential problems. These improvements will help us prevent similar outages in the future and ensure a smoother experience for our customers.

resolved2025-06-03T15:11:35.000Z

This incident has been resolved. We appreciate your patience and understanding.

monitoring2025-06-03T15:10:00.000Z

We are continuing to monitor for any further issues.

monitoring2025-06-03T09:45:19.000Z

We experienced a service degradation where users with custom user roles were seeing less data than they should in the inventory due to overly strict permissions. Our team fixed the root cause and is currently monitoring the service.

Jun 3, 2025

Report: "Service Disruption in EU"

Last update 2025-06-03T16:47:47.439Z

resolved2025-06-03T16:47:47.421Z

This incident has been resolved. We appreciate your patience and understanding.

identified2025-06-03T16:44:32.917Z

The issue has been identified and a fix is being implemented.

investigating2025-06-03T16:44:19.414Z

We are currently experiencing a service disruption in EU for several workspaces. Our team is working to identify the root cause and implement a solution. We will send an additional update in 15 minutes.

Report: "Custom user roles having too restricted permissions on some workspaces"

Last update 2025-06-03T10:59:00.000Z

Resolved2025-06-03T10:59:00.000Z

This incident has been resolved. We appreciate your patience and understanding.

Update2025-06-03T10:13:00.000Z

We are continuing to monitor for any further issues.

Monitoring2025-06-03T09:07:00.000Z

We experienced a service disruption where users with custom user roles were seeing less data than they should in the inventory due to overly strict permissions. Our team fixed the root cause and is currently monitoring the service.

Report: "Standard maintenance in JP"

Last update 2025-06-03T09:00:00.000Z

Completed2025-06-03T09:00:00.000Z

The scheduled maintenance has been completed.

In progress2025-06-03T08:00:00.000Z

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled2025-06-02T04:45:00.000Z

We will upgrade parts of our infrastructure. During this time, some functionality or workspaces will not be available. We appreciate your patience and understanding.

Jun 2, 2025

Report: "[EAM] Dashboards show error after login"

Last update 2025-06-02T09:23:41.508Z

postmortem2025-06-02T09:22:45.307Z

## Incident Description Between 21 May 2025 13:25 UTC and 22 May 2025 16:46 UTC, a subset of users experienced an issue when trying to open the following dashboard: Application Portfolio Management Onboarding Dashboard for Enterprise Architects. The result was an error popup that interrupted the loading of the dashboard. ## Incident Resolution The engineering team identified the problem in the code and deployed a fix on 22 May 2025 at 16:46 UTC, resolving the problem and restoring full functionality. ## Root Cause Analysis The root cause was a flaw in recently deployed code that incorrectly tried to access fact sheet fields in the application's data model. This resulted in an error when we tried to calculate the metrics in the dashboard component. ## Preventative Measures We are improving the error handling and the automated testing to prevent this from happening again in the future. We are also improving the development process to make sure we follow best practices that safeguard against similar situations.

resolved2025-05-22T17:14:16.747Z

This incident has been resolved.

monitoring2025-05-22T16:50:57.587Z

A fix has been implemented and we are monitoring the results.

identified2025-05-22T15:36:43.186Z

Some customers are experiencing issues in the dashboards after logging in. The issue has been identified and mitigation is ongoing.

May 31, 2025

Report: "Major upgrade in EU"

Last update 2025-05-31T06:57:00.000Z

Completed2025-05-31T06:57:00.000Z

The scheduled maintenance has been completed.

In progress2025-05-31T02:00:00.000Z

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled2025-05-23T03:11:00.000Z

We will upgrade major parts of our infrastructure. During this time, some workspaces will not be available. We appreciate your patience and understanding.

May 30, 2025

Report: "Signavio Integration UI outage"

Last update 2025-05-30T13:51:06.672Z

resolved2025-05-30T13:51:06.655Z

This incident has been resolved.

monitoring2025-05-30T12:43:18.614Z

A fix has been implemented and we are monitoring the results.

investigating2025-05-30T11:29:18.512Z

Customers are currently experiencing issues with accessing the configuration UI of the Signavio Integration.

Report: "Fact Sheet details page outage"

Last update 2025-05-30T12:39:05.899Z

postmortem2025-05-30T12:33:59.462Z

## Incident Description Starting on `2025-05-28 14:56 UTC`, LeanIX users experienced errors when attempting to access Fact Sheet details pages within the Inventory. The issue affected all non-admin users across all workspaces and regions. ## Incident Resolution Our engineering team identified the root cause at `2025-05-28 16:13 UTC` and ran a rollback of a recent frontend release. Full functionality was restored at `2025-05-28 16:16 UTC`. No customer data was affected during this incident. ## Root Cause Analysis The incident was caused by a frontend release that introduced a new internal API call. This call resulted in an authorization error for all non-admin users, leaving them unable to load the Fact Sheet details page. Since the new API call was receiving an `HTTP 401` status code response for all non-admin users, the application was starting the re-login procedure. This procedure is not treated as an error and thus does not trigger alerting mechanisms, which led to a prolonged detection time. ## Preventative Measures To prevent similar incidents, we are enhancing our pre-release testing protocols to include testing across various user roles and permission levels. We are also improving our frontend application's handling of authorization errors to increase resilience and trigger alerting mechanisms early.
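
The detection gap described above (repeated `HTTP 401` responses silently triggering re-login instead of alerts) can be illustrated with a minimal Python sketch. It counts 401 responses per endpoint and raises an alert once a threshold is reached. `AuthFailureMonitor`, its `threshold`, and the `alert` callback are hypothetical and do not describe the alerting SAP LeanIX actually uses.

```python
from collections import Counter
from typing import Callable


class AuthFailureMonitor:
    """Hypothetical monitor: alerts when one endpoint keeps answering 401."""

    def __init__(self, threshold: int, alert: Callable[[str], None]) -> None:
        self._threshold = threshold
        self._alert = alert
        self._counts: Counter = Counter()

    def record(self, endpoint: str, status: int) -> None:
        if status != 401:
            return
        self._counts[endpoint] += 1
        # Alert exactly once, when the count first reaches the threshold,
        # so a burst of failures does not flood the alert channel.
        if self._counts[endpoint] == self._threshold:
            self._alert(f"{self._threshold} HTTP 401 responses from {endpoint}")
```

A real implementation would likely reset the counters over time. The sketch only shows that authorization failures can feed an alert path even when the client reacts to them with a re-login.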

resolved2025-05-28T17:02:29.875Z

This incident has been resolved.

monitoring2025-05-28T16:16:55.764Z

We confirmed that the issue was related to a recent frontend release. The change was reverted and we're currently monitoring the application.

investigating2025-05-28T16:13:54.210Z

The issue seems to be related to a recent frontend release. No actual customer data is affected/lost.

investigating2025-05-28T16:03:21.236Z

Non-Admin users currently have issues viewing Fact Sheets.

May 28, 2025

Report: "Issues during creation of Fact Sheet relations"

Last update 2025-05-28T14:50:39.024Z

postmortem2025-05-28T14:50:35.570Z

## **Incident Description:** On `2025-05-16 12:51 UTC`, a subset of LeanIX users encountered an issue preventing them from adding certain relationships within Fact Sheets. Users received an error message in the application's user interface. ## **Incident Resolution:** Our engineering team investigated the problem and identified a bug in the application's code. A fix was deployed at `2025-05-16 19:29 UTC`, restoring full functionality for all affected users. ## **Root Cause Analysis:** The root cause was a flaw in recently deployed code that incorrectly handled missing data fields in the application's data model. This resulted in an error when users attempted to create certain relationships. ## **Preventative Measures:** We are implementing improvements to our development process to prevent similar issues in the future. These include enhanced code review practices and more robust error handling within our applications. We are also working on improving our monitoring and alerting systems to detect similar issues earlier.

resolved2025-05-16T11:00:00.000Z

Some users experienced issues creating new relations between Fact Sheet types.

May 27, 2025

Report: "Standard maintenance in EU"

Last update 2025-05-27T16:00:00.000Z

Completed2025-05-27T16:00:00.000Z

The scheduled maintenance has been completed.

In progress2025-05-27T15:00:00.000Z

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled2025-05-12T02:15:00.000Z

We will upgrade parts of our infrastructure. During this time, some functionality or workspaces will not be available. We appreciate your patience and understanding.

May 26, 2025

Report: "Login issues"

Last update 2025-05-26T15:12:47.513Z

postmortem2025-05-26T15:12:29.819Z

## Incident Description From May 17th, 04:50 UTC, we experienced a service disruption affecting multiple platform capabilities for a limited subset of our users in the EU region. The issue persisted until May 19, 2025, at 07:55 UTC, when full service was restored. During this period, affected users were unable to sign in and access various platform functionalities. ## Incident Resolution After receiving user reports, our engineering team identified network connectivity issues affecting a portion of our infrastructure. We resolved the incident by provisioning new virtual infrastructure to replace the affected components, which successfully restored all services and functionalities. ## Root Cause Analysis The incident was caused by network connectivity problems at the infrastructure level for a single server, originating from our cloud service provider. These connectivity issues prevented multiple services from functioning properly, including user authentication. The intermittent nature of the network problems allowed the system to pass health checks, which prevented our monitoring systems from detecting the issue earlier. ## Preventative Measures We are implementing enhanced infrastructure monitoring to detect similar issues earlier. Additionally, we're reviewing our alerting thresholds to ensure faster detection of service availability issues across our platform.
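
The root cause above notes that intermittent failures still passed point-in-time health checks. One common way to catch this pattern is to track the failure rate over a window of recent probes rather than only the latest result. The Python sketch below illustrates the idea. `FailureRateMonitor` and its parameters are hypothetical and are not taken from the postmortem.

```python
from collections import deque
from typing import Deque


class FailureRateMonitor:
    """Hypothetical check: flags a host whose recent probes fail too often."""

    def __init__(self, window_size: int, max_failure_rate: float) -> None:
        # deque(maxlen=...) drops the oldest result automatically.
        self._results: Deque[bool] = deque(maxlen=window_size)
        self._max_failure_rate = max_failure_rate

    def record(self, success: bool) -> None:
        self._results.append(success)

    def is_unhealthy(self) -> bool:
        if not self._results:
            return False
        failures = sum(1 for ok in self._results if not ok)
        return failures / len(self._results) > self._max_failure_rate
```

With a window of 10 probes and a maximum failure rate of 0.2, a server that fails 3 of its last 10 probes is flagged even if the most recent probe succeeded.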

resolved2025-05-19T08:34:01.530Z

This incident has been resolved.

monitoring2025-05-19T08:05:32.213Z

A fix has been implemented and we are monitoring the results.

investigating2025-05-19T07:55:03.990Z

We are currently experiencing network issues from our cloud provider and are actively working to resolve the problem.

investigating2025-05-19T07:09:47.919Z

Some users may experience problems while logging in to our application. Our team is working to identify the root cause and implement a solution.

Report: "[EAM] Diagrams are not loading"

Last update 2025-05-26T05:57:17.674Z

postmortem2025-05-26T05:56:40.454Z

## Incident Description Users in multiple regions, excluding UAE, were not able to use Diagrams for a few minutes. Service degradation lasted from 20 May 2025 14:48 UTC to 20 May 2025 15:12 UTC (24 min). There was no data loss. ## Incident Resolution We rolled back to the previous version and reverted the code that was causing the issue. ## Root Cause Analysis A significant change had been implemented in Diagrams. Unfortunately, the section of the application affected by this change was unable to manage it effectively, as the update was rolled out prematurely. ## Preventative Measures We will improve our release strategy to handle significant changes before they are released.

resolved2025-05-20T14:32:53.604Z

This incident has been resolved.

monitoring2025-05-20T13:35:06.935Z

A fix has been implemented and we are monitoring the results.

identified2025-05-20T13:17:31.158Z

Currently diagrams are not loading. We have identified the issue and are working on a fix.

May 24, 2025

Report: "Major upgrade in US"

Last update 2025-05-24T06:56:00.000Z

Completed2025-05-24T06:56:00.000Z

The scheduled maintenance has been completed.

In progress2025-05-24T02:01:00.000Z

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled2025-05-16T06:28:00.000Z

We will upgrade major parts of our infrastructure. During this time, some workspaces will not be available. We appreciate your patience and understanding.

Report: "Major upgrade in AE, AU, BR, CA, CH, and UK"

Last update 2025-05-24T06:56:00.000Z

Completed2025-05-24T06:56:00.000Z

The scheduled maintenance has been completed.

In progress2025-05-24T02:00:00.000Z

Scheduled maintenance is currently in progress. We will provide updates as necessary.

Scheduled2025-05-16T06:27:00.000Z

We will upgrade major parts of our infrastructure. During this time, some workspaces will not be available. We appreciate your patience and understanding.

May 21, 2025

Report: "[EAM] Accessing workspaces results in errors"

Last update 2025-05-21T08:12:27.137Z

postmortem2025-05-21T08:11:48.732Z

## Incident Description On May 19 between 14:56 UTC and 15:34 UTC, many parts of our application’s user interface were not loading, preventing users from accessing the functionality. APIs weren’t affected. No data was lost. ## Incident Resolution Upon receiving error alerts from our monitoring systems, we identified the software change causing the issue and reverted the change. The fix was fully deployed at 15:34 UTC, restoring full functionality for all affected users. ## Root Cause Analysis The change we released relied on a feature of our frontend platform that was not available in all parts of our application. Our automated tests and the code review were not able to catch this issue prior to delivery. ## Preventative Measures To prevent similar issues in the future, we will extend our frontend platform to ensure the missing feature is available in all parts of our application.

resolved2025-05-19T15:52:40.328Z

This incident has been resolved.

monitoring2025-05-19T15:39:18.478Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

identified2025-05-19T15:32:40.813Z

We are continuing to work on a fix for this issue.

identified2025-05-19T15:25:43.026Z

Users may experience errors in accessing their workspaces. The root cause has been identified. We are implementing a solution.

May 20, 2025

Report: "Service Disruption in AE"

Last update 2025-05-20T11:49:14.087Z

postmortem2025-05-06T06:17:54.045Z

## Incident Description SAP LeanIX was not available in the AE region on 2025-04-28 from 13:37 until 13:55 UTC. Our hyperscaler infrastructure experienced a disruption. ## Incident Resolution Our hyperscaler restored network connectivity in our AE region on April 28th, 2025, at 13:55 UTC. ## Root Cause Analysis Our hyperscaler faced a network outage in the AE region.

resolved2025-04-29T07:46:45.932Z

This incident has been resolved. We appreciate your patience and understanding.

monitoring2025-04-28T14:03:45.946Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

investigating2025-04-28T13:56:36.510Z

We are currently experiencing a service disruption in AE. Our team is working to identify the root cause and implement a solution. We will send an additional update in 15 minutes.

May 14, 2025

Report: "Login issues"

Last update 2025-05-14T14:12:39.476Z

postmortem2025-05-14T14:12:33.023Z

## Incident Description Between 2025-05-07 13:45 and 2025-05-08 08:00 UTC, some users in the EU region were not able to log in. Some login requests failed due to rate limiting on SAP LeanIX. ## Incident Resolution The incident was resolved by changing the logic that determines when the rate limiting should block requests from being processed. Once this change was deployed to the production environment, login requests were no longer rate limited. ## Root Cause Analysis We identified a faulty configuration of the rate limiting for logins. This configuration applied the same rate limit to all users. Our analysis shows that a single user exceeded the permitted number of calls, which affected other users who were well within the allowed number of calls per minute. ## Preventative Measures We improved the coverage of our automated tests to include the rate limiting configuration. Additionally, we are improving our monitoring to alert earlier on blocked login requests due to rate limiting.
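
The root cause above comes down to one shared counter for all users instead of one counter per user. The Python sketch below shows a fixed-window limiter keyed by user, so one user exceeding the limit does not block others. The `PerUserRateLimiter` class, its parameters, and the fixed-window strategy are assumptions for illustration and do not describe SAP LeanIX's actual rate-limiting logic.

```python
from collections import defaultdict
from typing import DefaultDict, Tuple


class PerUserRateLimiter:
    """Hypothetical fixed-window limiter with a separate budget per user."""

    def __init__(self, max_calls: int, window_seconds: int = 60) -> None:
        self._max_calls = max_calls
        self._window = window_seconds
        # (user_id, window index) -> calls seen in that window.
        # Old windows are never evicted in this sketch.
        self._counts: DefaultDict[Tuple[str, int], int] = defaultdict(int)

    def allow(self, user_id: str, now: float) -> bool:
        key = (user_id, int(now // self._window))
        if self._counts[key] >= self._max_calls:
            return False
        self._counts[key] += 1
        return True
```

Keying the counter on the user is the relevant design choice here. A limiter keyed only on the window index would reproduce the incident, because the first user to exhaust the budget would block everyone else for the rest of the window.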

resolved2025-05-07T16:00:00.000Z

Some users may experience problems while logging in to our application. Our team is working to identify the root cause and implement a solution.

Report: "Faulty support for DE, ES, FR, PT languages"

Last update 2025-05-14T11:53:50.407Z

postmortem2025-05-14T11:52:28.913Z

## Incident Description Between April 30 and May 9, customers using non-English display languages experienced unexpected changes in the translation of key product terminology. This affected the user experience by presenting unfamiliar translations for commonly used terms. ## Incident Resolution Upon receiving customer feedback about problematic translations, we immediately reverted all the translation changes. By May 9, all affected translations were restored to their original state. ## Root Cause Analysis The incident occurred during the transition to a new translation management system and external professional translators. Up to this point, all translations were done internally by native speakers within SAP LeanIX. Due to limited capacity, the number of untranslated strings grew significantly over time and the introduction of new UI languages was not possible. During this transition, existing terminology was modified despite the context of established terms that had been provided. This resulted in technically correct but contextually disruptive translations. ## Preventative Measures To prevent similar issues in the future, we will: * Improve our translation review process before releasing updates * Create clearer guidelines about preserving established product terms * Strengthen communication between our team and translation service providers

resolved2025-04-30T10:00:00.000Z

On April 30th we extended our support for German, Spanish, French and Portuguese languages in SAP LeanIX. We identified some faulty translations and reverted those languages to the previous state. We apologize for any inconvenience this may have caused.

Report: "Problem when navigating to Inventory"

Last update 2025-05-14T06:37:24.761Z

postmortem2025-05-14T06:36:44.774Z

## Incident Description On 2025-04-29, between 15:57 and 16:30 UTC, trying to go to Inventory within SAP LeanIX was causing an error, preventing users from accessing fact sheets. This was caused by a software update that prevented users from accessing their fact sheets. ## Incident Resolution The incident was resolved by reverting to the previous software version. ## Root Cause Analysis By releasing a change to consider customer roles when using workspace views, a bug was introduced that caused our application not to load properly for customers without customer roles. This bug was caused by an unexpected interaction between different parts of our code base. ## Preventative Measures We will analyze how to identify such problems upfront and will act accordingly to prevent such incidents in the future.

resolved2025-04-29T16:35:49.000Z

The problem is resolved and EAM is fully functional.

investigating2025-04-29T16:21:56.000Z

We are continuing to investigate this issue.

investigating2025-04-29T16:10:24.000Z

Users may experience problems when trying to access the Inventory tab within EAM. Our team has identified the issue and is working on resolving the problem.

May 12, 2025

Report: "invitation of new users to a workspace not possible"

Last update 2025-05-12T13:11:11.301Z

postmortem2025-05-12T13:09:56.681Z

## Incident Description Between 2025-05-07 13:55 and 2025-05-08 06:07 UTC, users of SAP LeanIX were not able to invite other users to a workspace. ## Incident Resolution The incident was resolved by a rollback, reverting the code change causing the issue. ## Root Cause Analysis The incident was traced back to a change in the frontend code, which switched the user search functionality to a new API endpoint. This switch was incompatible with the parameters used by user invitation functionality. ## Preventative Measures To avoid encountering similar issues in the future, we will enhance the scope of our automated tests to include the parameters used in API calls by the user invitation functionality.

resolved2025-05-08T06:30:54.751Z

This incident has been resolved, and the invitation of users to workspaces is working again.

investigating2025-05-07T20:48:23.570Z

It is currently not possible to invite new users into a workspace of our application. Our team is working to identify the root cause and implement a solution.

May 9, 2025

Report: "Degraded performance with non SSO logins."

Last update 2025-05-09T13:46:40.753Z

postmortem2025-05-09T13:46:14.121Z

## Incident Description On May 5th, 2025, at 11:58 AM UTC, users encountered login issues while trying to access SAP LeanIX. The issue only affected customers who authenticated using their LeanIX credentials. Customers who logged in via single sign-on (SSO) were not affected. ## Incident Resolution By 12:30 PM UTC on the same day, the problematic code was reverted, successfully restoring functionality. ## Root Cause Analysis The incident was traced to a bug within a newly released logging extension, which was intended to enhance monitoring capabilities in our authentication stack. The extension failed to validate certain properties during the login attempt, resulting in exceptions in the subsequent code. ## Preventative Measures The automated test suite will be extended to cover logging extension runtime issues on the application framework level.
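
The root cause above describes a logging extension whose exception broke the login path. A general defensive pattern is to wrap such hooks so that their failures are recorded but never propagate. The Python sketch below illustrates this under stated assumptions: `guarded`, the `auth.audit` logger name, and the shape of the `attempt` mapping are all hypothetical and do not reflect the actual authentication stack.

```python
import logging
from typing import Callable, Mapping

logger = logging.getLogger("auth.audit")

Hook = Callable[[Mapping[str, object]], None]


def guarded(hook: Hook) -> Hook:
    """Wrap a logging hook so its exceptions never reach the login flow."""

    def wrapper(attempt: Mapping[str, object]) -> None:
        try:
            hook(attempt)
        except Exception:
            # Record the failure with a traceback, then carry on with login.
            logger.exception("logging hook failed; continuing login")

    return wrapper
```

A hook that reads `attempt["sso_provider"]` would raise `KeyError` for credential-based logins. Wrapped with `guarded`, that error is logged and the login continues.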

resolved2025-05-05T12:00:00.000Z

On May 5th, 2025, at 11:58 AM UTC, users encountered login issues while trying to access SAP LeanIX. The issue only affected customers who authenticated using their LeanIX credentials. Customers who logged in via single sign-on (SSO) were not affected.

May 8, 2025

Report: "Service Disruption in EU region"

Last update 2025-05-08T12:26:21.151Z

postmortem2025-05-08T12:25:53.110Z

## Incident Description On April 30, 2025, from 07:01 to 08:25 UTC, users in the EU region experienced errors when trying to access the Inventory section in SAP LeanIX. This issue prevented access to fact sheets. The problem occurred because a faulty software component was automatically deployed during scheduled maintenance. ## Incident Resolution We resolved the issue by rolling back to the previous version of the software component. ## Root Cause Analysis The release process selected a faulty software component for deployment. The component had been manually excluded from the release but was still included due to a process oversight. ## Preventative Measures By reviewing our internal processes, we identified and started working on the following preventative measures: * Improving the deployment rollback process to prevent faulty software components from being selected for future deployments.

resolved2025-04-30T08:25:54.550Z

This incident has been resolved. We appreciate your patience and understanding.

monitoring2025-04-30T07:31:09.283Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

investigating2025-04-30T07:01:41.000Z

We are currently experiencing a service disruption for several customers in the EU region. Our team is working to identify the root cause and implement a solution. We will send an additional update in 30 minutes.

Report: "Service Disruption In EU region"

Last update 2025-05-08T12:24:38.180Z

postmortem2025-05-08T12:20:20.639Z

## Incident Description

On 2025-04-28, between 11:45 and 12:00 UTC, some users of SAP LeanIX in the EU region were unable to load dashboards or search for fact sheets.

## Incident Resolution

The incident was resolved by restarting the storage infrastructure within the EU region.

## Root Cause Analysis

The partial outage was caused by increased usage of the SAP LeanIX storage infrastructure.

## Preventative Measures

To prevent similar incidents in the future, we aim to improve in the following areas:

* We increased the resources allocated to the product storage infrastructure.
* We are investigating how our storage infrastructure is used and are working on optimizing queries and usage patterns.

resolved2025-04-28T12:00:00.000Z

Some services of EAM had a temporary disruption in the EU region between 11:45 AM and 12:00 PM UTC. All services are fully functional now.

Report: "Editing fact sheets through table view is not working"

Last update 2025-05-08T07:23:29.366Z

postmortem2025-05-08T07:23:19.910Z

## Incident Description

On 2025-04-28, between 12:40 and 14:14 UTC, editing fact sheets through the table view was not possible within SAP LeanIX. When users tried to save the changes made on multiple fact sheets through Inventory’s table view, they received an error.

## Incident Resolution

Our team identified and mitigated the issue. The degraded service experience was resolved at 14:14 UTC.

## Root Cause Analysis

A missing check for empty data caused an error when users tried to save changes made through the fact sheet table view, and an error message was shown to them.

## Preventative Measures

After the incident, we took steps to understand why we were unable to catch the error earlier. We have identified areas for improvement and are currently working on:

* Improving processes for faster time to recovery.
* Improving test coverage on our frontend components.

resolved2025-04-28T12:30:00.000Z

Editing fact sheets through the table view was not possible within SAP LeanIX. When users tried to save the changes made on multiple fact sheets through Inventory’s table view, they received an error.

May 7, 2025

Report: "Login issues"

Last update 2025-05-07T11:31:48.185Z

postmortem2025-05-07T11:30:18.387Z

## Incident Description

On April 30, 2025, at 11:30 AM, users encountered login issues while trying to access the LeanIX platform. The issue only affected non-SSO customers.

## Incident Resolution

By 11:53 AM on the same day, the problematic code was rolled back, successfully restoring functionality. The issue was subsequently resolved and redeployed without any further complications.

## Root Cause Analysis

The incident was traced back to a bug in the user validation process that was introduced in preparation for the LeanIX sign-in rollout and its enhanced capabilities. This bug caused our User Management system (MTM) to reject logins from the LeanIX sign-in process.

## Preventative Measures

The bug originated from an authentication workflow that involved multiple services. We will ensure that the tests in the involved services match the expected interface contract.

resolved2025-04-30T12:38:14.479Z

This incident has been resolved. We appreciate your patience and understanding.

monitoring2025-04-30T11:55:47.054Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

investigating2025-04-30T11:46:34.445Z

We are continuing to investigate this issue.

investigating2025-04-30T11:46:04.363Z

Some users may experience problems while logging in to our application. Our team is working to identify the root cause and implement a solution. We will send an additional update in 15 minutes.

Apr 28, 2025

Report: "Service Disruption in Teams Chat Bot"

Last update 2025-04-28T14:47:33.042Z

postmortem2025-04-28T14:47:23.963Z

## Incident Description

From 2025-04-16 13:10 UTC to 2025-04-17 11:36 UTC, users of the SAP LeanIX MS Teams Chatbot were unable to log in due to a missing configuration update on the Azure Bot resource. This issue affected all users of the Microsoft Teams application. During the incident, users were unable to authenticate, switch workspaces to enable notifications, or use search functionalities for fact sheets within the chatbot.

## Incident Resolution

We applied a configuration fix to the Azure Bot resource to restore proper authentication. Once the fix was implemented, the incident was resolved, and users regained full access to chatbot functionality.

## Root Cause Analysis

The root cause was a misalignment between the configuration of our internal system and the Azure resource, resulting from a missed synchronization step.

## Preventative Measures

We have enhanced internal documentation and configured updates to the MS Teams Chatbot Azure resource to be triggered automatically when infrastructure changes occur. This reduces the risk of future missed updates.

resolved2025-04-16T13:10:00.000Z

This incident has been resolved.

Apr 23, 2025

Report: "Service Disruption in the Integrations page"

Last update 2025-04-23T11:35:10.087Z

postmortem2025-04-23T11:34:46.871Z

## Incident Description

On April 04, 2025, between 12:33 and 13:18 UTC, the list of integrations did not load.

## Incident Resolution

We identified a failing service belonging to an integration that was under development and placed it behind a feature flag to prevent it from being called.

## Root Cause Analysis

We discovered that fetching one of the integrations, which is still under development, failed because the backend service was down. This issue prevented the display of other integrations as well.

## Preventative Measures

We improved the error handling for the integrations list so that if any specific request fails, it won't break the entire page; it will only prevent that particular integration from being displayed. This approach also gives us the ability to track failures for each integration, thereby enhancing observability and reducing the time to take action.
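
The per-integration error handling described above can be illustrated with a minimal Python sketch. This is not the actual SAP LeanIX implementation: the function name `load_integrations` and the idea of passing one fetch callable per integration are assumptions made purely for illustration. Each integration is fetched independently, so a failure is recorded and skipped instead of aborting the whole list.

```python
from typing import Any, Callable, Dict, List, Tuple


def load_integrations(
    fetchers: Dict[str, Callable[[], Any]],
) -> Tuple[Dict[str, Any], List[Tuple[str, str]]]:
    """Fetch every integration independently (hypothetical helper).

    Returns the integrations that loaded successfully and a list of
    (integration name, error description) pairs for those that failed.
    """
    loaded: Dict[str, Any] = {}
    failed: List[Tuple[str, str]] = []
    for name, fetch in fetchers.items():
        try:
            loaded[name] = fetch()
        except Exception as exc:  # isolate the failure to this one integration
            failed.append((name, repr(exc)))
    return loaded, failed
```

The `failed` list is what would feed per-integration failure tracking, while the page renders only the entries in `loaded`.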

resolved2025-04-04T13:49:08.190Z

This incident has been resolved. We appreciate your patience and understanding.

monitoring2025-04-04T13:22:26.261Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

identified2025-04-04T13:11:21.208Z

We are currently experiencing a service disruption in the Integrations page. Our team is working to identify the root cause and implement a solution. We will send an additional update in 60 minutes.

Apr 17, 2025

Report: "Default relations are ignored in Excel and OData export"

Last update 2025-04-17T08:00:49.043Z

postmortem2025-04-17T07:58:45.175Z

## Incident Description

On April 14, 2025, between 11:51 UTC and 16:09 UTC, customers experienced issues with Excel exports in which default relations were missing from the exported data. This issue was introduced by a change made to label placeholder handling in the `import-export` service, as a follow-up to a previous fix related to the OData integration.

## Incident Resolution

The issue was identified shortly after a customer report and was confirmed to be linked to a recent code change. The change was promptly reverted, and a fix was deployed across all regions by 16:09 UTC. The export functionality was restored, and the incident was marked as resolved after a brief monitoring period.

## Root Cause Analysis

The placeholder update introduced in the `import-export` service unintentionally affected the export of default relations. The change was protected by a feature flag, but the behaviour with default relations didn't have enough test coverage. This allowed the issue to pass unnoticed until reported by a customer.

## Preventative Measures

To prevent similar issues in the future, the following actions will be taken:

* Improve test coverage to include default relations in export scenarios.
* Conduct more thorough manual testing for critical features before deployment.

resolved2025-04-14T17:29:20.911Z

This incident has been resolved.

monitoring2025-04-14T16:11:59.592Z

A fix has been implemented and we are monitoring the results.

identified2025-04-14T15:48:26.483Z

The issue has been identified and a fix is being implemented.

investigating2025-04-14T15:32:24.485Z

We are currently investigating an issue in the Excel and OData export functionality. At the moment, certain relations are being ignored in the export result (e.g. Parents, Successors). We will send an update on the issue in 30 minutes.

Report: "Access of Menu items shows errors"

Last update 2025-04-17T07:58:53.835Z

postmortem2025-04-17T07:58:43.320Z

## Incident Description

Between April 8, 10:35 AM UTC and 11:48 AM UTC, there was a degraded experience of our user interface for Surveys, Diagrams, Presentations, Reports and Transformations.

## Incident Resolution

Our team identified the offending release and rolled it back. The degraded service experience was resolved at 11:48 UTC.

## Root Cause Analysis

A broken user image component release in the morning caused a flaw in the internal dependency linking, leading to error messages being shown to our users and user images not being rendered on the page.

## Preventative Measures

After the incident, we took steps to understand why we did not catch the error earlier in our deployment pipeline. We have identified areas for improvement and are currently working on:

* Improving our testing strategy to catch such dependency-related failures
* Improving processes for faster time to recovery

resolved2025-04-08T11:51:58.754Z

This incident has been resolved.

monitoring2025-04-08T11:41:01.327Z

A fix has been implemented and we are monitoring the results.

identified2025-04-08T11:23:55.859Z

The issue has been identified and a fix is being implemented.

investigating2025-04-08T11:17:50.451Z

We are continuing to investigate this issue.

investigating2025-04-08T11:14:12.905Z

Users may encounter errors when accessing Reports, Diagrams, Surveys, and other menu items. Our team is working to identify the root cause and implement a solution.

Report: "Degraded performance in Signavio Integration Configuration UI in EU"

Last update 2025-04-17T07:56:47.255Z

postmortem2025-04-17T07:56:23.784Z

## Incident Description

On April 07, 2025, between 01:40 PM and 03:10 PM UTC, users were unable to set up or update a Signavio Integration in the EU region.

## Incident Resolution

Redeploying the service successfully restored the availability of the functionality.

## Root Cause Analysis

Parts of the Signavio integration became unavailable due to an unexpected surge in load, which exhausted available connections and rendered the service unresponsive.

## Preventative Measures

We will scale up the resources allocated to this service to ensure it can reliably handle similar load levels in the future. We will continue to enhance our observability to detect and respond to connection issues more proactively and minimize impact.

resolved2025-04-07T15:27:41.953Z

This incident has been resolved.

monitoring2025-04-07T15:24:03.003Z

A fix has been implemented and we are monitoring the results.

identified2025-04-07T15:19:52.751Z

Users may not be able to access the Signavio Integration Configuration UI. The root cause has been identified. Our team is working on a solution.

Apr 14, 2025

Report: "Delay in delivering webhooks"

Last update 2025-04-14T15:11:08.445Z

postmortem2025-04-14T14:57:59.187Z

## Incident Description

On Wednesday, April 9, between 12:56 and 19:15 UTC, Webhooks could not deliver events. After 19:15 UTC, all events were processed and delivered without data loss.

## Incident Resolution

We identified the broken release and rolled it back. Webhooks resumed delivering events at 19:15 UTC.

## Root Cause Analysis

The broken release changed the order of initialisation for components in Webhooks. This caused events to be stuck without being processed.

## Preventative Measures

To prevent similar incidents in the future, we aim to improve in the following areas:

* We will improve our observability to react earlier when the event delivery is not working properly.
* We will enhance our existing tests to detect broken event delivery before it reaches production.

resolved2025-04-09T20:13:27.120Z

The incident has been resolved. All webhook events were processed and are being delivered.

identified2025-04-09T19:33:44.514Z

We have identified delays in webhook deliveries and are now processing the backlog of events. We will send an additional update in 30 minutes.

Apr 11, 2025

Report: "Search for tags unavailable in all regions"

Last update 2025-04-11T12:17:48.410Z

postmortem2025-03-25T07:41:01.741Z

## Incident Description

On March 9th, 2025, from 12:20 UTC to 13:15 UTC, the functionality of searching for all tags was disrupted across all regions. This issue primarily affected users' ability to list all available tags in the dropdown menu in the Fact Sheet details view while trying to add a tag to a Fact Sheet.

## Incident Resolution

The issue was discovered early after the deployment, and the change causing the problem was reverted.

## Root Cause Analysis

An update to an underlying service for the tag search introduced unintended side effects. The issue was not caught earlier because our automated tests did not account for this specific scenario.

## Preventative Measures

* Test coverage was improved by adding more integration tests based on this regression and on other frontend-inspired scenarios.
* For such critical changes, we will widen the use of Silent Releases to compare new implementations with current ones. This helps us catch failures before we finally switch to the new implementations.

resolved2025-03-19T12:20:00.000Z

From 12:20 UTC to 13:15 UTC, the functionality of searching for tags was disrupted across all regions. This issue primarily affected users' ability to add tags to Fact Sheets on the Fact Sheet details page. The responsible team swiftly identified the root cause and implemented a fix, restoring normal functionality by 13:15 UTC.

Apr 4, 2025

Report: "Service Disruption in Teams Chat Bot"

Last update 2025-04-04T09:39:08.665Z

postmortem2025-04-04T09:38:39.440Z

## Incident Description

On 2025-04-03, between 7:30 and 12:17 UTC, the SAP LeanIX MS Teams Chatbot experienced an outage due to a misconfiguration in the deployment. This impacted all users of the Microsoft Teams application, and during this period, users were unable to receive responses to queries submitted to the SAP LeanIX MS Teams Chatbot.

## Incident Resolution

We deployed a fix with the correct configuration, bringing the service back online. The SAP LeanIX MS Teams Chatbot is now functioning as expected.

## Root Cause Analysis

The outage occurred after we deployed changes to production that included a misconfiguration in the deployment configuration, which led to the service disruption and SAP LeanIX MS Teams Chatbot downtime.

## Preventative Measures

To prevent similar incidents in the future, we aim to improve in the following areas:

* We will enhance the tests to detect failures in the deployment configuration files
* We will invest more in monitoring and alerting to catch deployment configuration failures
* We will improve the mitigation process to ensure a quicker response time

resolved2025-04-03T12:21:36.513Z

This incident has been resolved.

investigating2025-04-03T07:30:39.000Z

We are currently experiencing a service disruption in **Teams Chat Bot**. Our team is working to identify the root cause and implement a solution. We will send an additional update in **60** minutes.

Report: "Service Disruption for OData Integration in All Regions"

Last update 2025-04-04T07:47:05.038Z

postmortem2025-04-04T07:46:27.905Z

## Incident Description

From March 24, 11:29 UTC, a change in the English translation model of Pathfinder introduced a new placeholder naming convention and modifications in the source language used for translations. These changes impacted:

* The OData API, where labels with specific placeholders were not resolved, and field and relation names disrupted customer integrations.
* Fact Sheet update notifications, where placeholders meant to render relation or field names were not properly resolved. While notifications were still sent, some contained unexpected values.

This led to issues for customers whose integrations relied on exact values. The issue was mitigated by implementing a transformation layer in our OData API to ensure compatibility with previous naming conventions and re-translating specific labels.

## Incident Resolution

* A fix was deployed on March 25 to address placeholder issues.
* Affected customers were identified and contacted.
* The initial change was not reverted; instead, a solution was implemented to maintain compatibility without requiring customer action.
* On March 26, a fix was applied to the notification system at 08:47 UTC, followed at 13:40 UTC by a fix for the OData API to ensure proper resolution of placeholders.

## Root Cause Analysis

The translation model change introduced non-backward-compatible placeholders, which were not accounted for in our OData API. Additionally, customers were unaware of the changes affecting their integrations. There was no monitoring in place for translation model changes impacting downstream services.

## Preventative Measures

To prevent similar incidents, the following improvements will be implemented:

* **Testing & Monitoring:** Ensure translation model changes are tested against all integrations and establish alerts for changes affecting OData and Notifications.
* **Incident Management:** Improve coordination between teams for faster response, treating functionality-breaking changes as incidents with clear internal and external communication.
* **Automation & Prevention:** Automate testing for translation model updates and strengthen the review process for transformation-related changes.
* **Detection:** Expand logging and alerting mechanisms to detect placeholder resolution issues earlier and proactively identify affected customers.

resolved2025-03-26T07:41:53.000Z

We have mitigated the problems with the OData integration in all regions.

identified2025-03-25T08:31:56.577Z

We are currently experiencing issues with our OData integration in all regions. The root cause has been identified, and we are in the process of mitigating the problem. Further updates regarding this issue will be communicated shortly.

Mar 26, 2025

Report: "Service Disruption in US"

Last update 2025-03-26T16:07:37.431Z

resolved2025-03-26T16:07:37.413Z

This incident has been resolved.

monitoring2025-03-26T13:07:43.422Z

A fix has been implemented and we are monitoring the results.

investigating2025-03-26T13:00:43.978Z

We are currently experiencing a service disruption in our US hosting region. Some customers might not be able to access their workspaces. Our team is working to identify the root cause and implement a solution. We will send an additional update in 60 minutes.

Report: "Degraded performance in "Creating a Fact Sheet in the Inventory" functionality"

Last update 2025-03-26T08:39:58.843Z

postmortem2025-03-26T08:39:17.270Z

## Incident Description

On March 20, 2025, between 9:25 UTC and 10:35 UTC, a limited number of SAP LeanIX customers across seven regions were unable to manually create fact sheets. An error message appeared, indicating that recommendation details could not be loaded, blocking the manual fact sheet creation process.

## Incident Resolution

The issue was successfully resolved by reverting the code changes that caused the problem.

## Root Cause Analysis

The recommendation system interacted with a downstream service utilizing features not yet launched.

## Preventative Measures

We will enhance our automated checks to detect the usage of features that have not yet been released before deploying to production. Additionally, we are refining our alerting system to detect the degradation of the recommendation system more swiftly.

resolved2025-03-20T09:30:00.000Z

On March 20, 2025, between 9:25 UTC and 10:35 UTC, a limited number of SAP LeanIX customers across seven regions were unable to manually create fact sheets. An error message appeared, indicating that recommendation details could not be loaded, blocking the manual fact sheet creation process.

Mar 25, 2025

Report: "Temporary disruption on the Fact Sheet details page"

Last update 2025-03-25T15:53:34.218Z

postmortem2025-03-25T15:53:25.903Z

## Incident Description

On March 14, 2025, between 12:41 UTC and 13:31 UTC, SAP LeanIX customers across all regions experienced an error message when opening the fact sheet details page. The message led users to assume that the page was broken, although the page remained operational once the error was acknowledged. The error message was only shown to users who accessed the page with a VIEWER-only role.

## Incident Resolution

The issue was resolved successfully by providing a fix for the broken sidebar component that caused the error to be shown.

## Root Cause Analysis

The issue was introduced when changing a sidebar component that did not initialize properly for users in the VIEWER role under certain conditions.

## Preventative Measures

We will enhance test coverage for users with least-privilege access and expand our efforts to identify improperly initialized components using static code analysis.

resolved2025-03-14T00:30:00.000Z

We had a temporary disruption on the Fact Sheet details page of all workspaces between 13:33 and 14:41 CET. This has been mitigated and resolved. We will thoroughly investigate how to avoid this affecting our customers in the future. Our apologies for the inconvenience.

Report: "Service Disruption in Dashboards that contain KPIs"

Last update 2025-03-25T13:55:15.979Z

postmortem2025-03-25T13:49:06.272Z

## Incident Description

On March 18, 2025, between 10:18 and 13:36 UTC, SAP LeanIX customers who have the KPI widget on their dashboards were unable to see the widget's content.

## Incident Resolution

The incident was resolved by reverting the code changes and deploying a previous version of the application. This allowed users to access their KPI widget content.

## Root Cause Analysis

The regression happened due to an unexpected side effect of Angular's dependency injection mechanism.

## Preventative Measures

We are enhancing the automated test suite to identify dependency injection issues before deployment to the production environment.

resolved2025-03-18T12:40:06.754Z

This incident has been resolved.

monitoring2025-03-18T12:36:35.121Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

identified2025-03-18T12:34:23.601Z

We are currently experiencing a service disruption in Dashboards that contain KPI panels. Our team is working to identify the root cause and implement a solution. We will send an additional update in 15 minutes.

Mar 18, 2025

Report: "Survey shows empty results"

Last update 2025-03-18T10:18:43.809Z

postmortem2025-03-18T10:17:10.397Z

## Incident description

Multiple customers in the EU, US, DE, UK, CA, AU, and CH regions noticed duplicated survey runs due to a bug in the fact sheet scope change logic. This logic keeps all relevant fact sheets and subscribers attached to a dynamic survey run. The bug was deployed for several days, from Feb 7, 2025 07:38 UTC to Feb 18, 2025 08:18 UTC, before we reverted to the previous version. There was no data loss, but email notifications about a changed survey scope were sent out as a result.

## Incident resolution

### Service

We reverted the changes that caused the issue. It was resolved at Feb 18, 2025 08:18 UTC.

### Data Preservation

Since duplicated database records were introduced, we started addressing them. The mitigation plan was rolled out in several steps from Feb 18, 2025 08:18 UTC to Mar 7, 2025 10:59 UTC.

## Root Cause Analysis

There were two issues at play:

* Moving to the new fact sheet scope change logic
* Prematurely applying performance improvements

### New fact sheet scope change logic

We switched from the original implementation of the fact sheet scope change to the new implementation to solve performance issues. Once we switched, we failed to notice the irregular behavior of one edge case in the business logic. As a result, duplicate fact sheets were added to the survey scope for numerous survey runs.

### Performance improvements

With the removal of the survey scope change bug, our service experienced a high load, which could potentially impact other services. We decided to quickly apply small adjustments to the same logic to improve performance. We did not consider the potential side effects of the change, and as a result, we had a duplicate fact sheet added to a survey run.

## Preventive measures

Four things we will take away from this:

* We will invest more in monitoring and alerting to catch anomalies that go against business logic, like allowing duplicated fact sheets within a survey run
* We will improve the mitigation process to ensure a quicker response time
* We will invest more in tests to cover edge cases
* We will improve the assessment of rollouts to identify how impactful a change is and where potential issues can occur
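
The first measure, catching anomalies that violate business logic, can be sketched as a simple invariant check. This is only an illustration under stated assumptions, not the monitoring SAP LeanIX actually uses: the function name `find_duplicate_fact_sheets` and the representation of a survey run's scope as a list of fact sheet IDs are hypothetical.

```python
from collections import Counter
from typing import List


def find_duplicate_fact_sheets(run_scope: List[str]) -> List[str]:
    """Return the fact sheet IDs that occur more than once in a survey run.

    `run_scope` is a hypothetical list of fact sheet IDs attached to one
    survey run; a non-empty result means the uniqueness invariant is broken.
    """
    counts = Counter(run_scope)
    return sorted(fs_id for fs_id, count in counts.items() if count > 1)
```

A periodic job could run such a check per survey run and raise an alert whenever the result is non-empty.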

resolved2025-03-05T10:12:37.967Z

This incident has been resolved. We appreciate your patience and understanding.

identified2025-02-28T08:51:08.682Z

Most regions are fully operational again now. We're continuously working on restoring functionality in the remaining regions.

identified2025-02-27T11:48:04.852Z

At the moment, changes in survey scope are not detected automatically and notification emails regarding such changes are not sent out. While we're working on restoring this functionality, please use the functionality to manually "Check for Changes" as described in the documentation: https://docs-eam.leanix.net/docs/managing-surveys-and-viewing-results#viewing-survey-results.

identified2025-02-27T08:30:11.354Z

Duplicate survey runs have been cleaned up in most workspaces. The team is working on finalizing the cleanup in the remaining workspaces and monitoring the overall situation.

identified2025-02-26T09:01:39.013Z

The root cause of the issue has been identified and fixed. The team is still working on cleaning up the remaining duplicate survey results that have been created erroneously.

identified2025-02-24T18:48:58.322Z

Customers may see empty survey runs shown as the current survey result, which show zero completion and no progress. These empty runs are duplicates and the actual survey results are still accessible through the survey history. No data was lost. The team has identified the root cause of the problem and is working to address the duplicate survey runs.

Mar 17, 2025

Report: "Service Disruption in the Diagrams Lucidchart integration"

Last update 2025-03-17T15:07:00.674Z

resolved2025-03-17T15:07:00.657Z

This incident has been resolved.

monitoring2025-03-13T17:32:24.879Z

We have implemented a fix and have seen full recovery of functionality. We will continue to monitor the situation.

investigating2025-03-13T08:25:47.232Z

We are currently experiencing a service disruption in the Diagrams Lucidchart integration. Our team is in contact with the service provider to mitigate the issue.

Mar 7, 2025

Report: "Login issues"

Last update 2025-03-07T12:51:22.775Z

postmortem2025-03-07T12:51:03.617Z

## Incident Description

On March 05, 2025, between 12:30 and 15:45 UTC, several workspaces hosted on the [http://us-9.leanix.net](http://us-9.leanix.net/) instance were not usable. Users were stuck on an unskippable error and couldn’t browse their workspace.

## Incident Resolution

The incident was resolved by upgrading our software to the latest version on the [http://us-9.leanix.net](http://us-9.leanix.net/) instance.

## Root Cause Analysis

A misconfiguration on the [http://us-9.leanix.net](http://us-9.leanix.net/) instance prevented it from getting automatically updated to the latest version. This version mismatch, followed by a new deployment of one of our services, caused the unskippable error in the workspaces.

## Preventative Measures

We are introducing new internal alerting to ensure that all of our instances are always running the expected version of our services.
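
A version-drift check of the kind described under Preventative Measures might look like the following minimal sketch. It rests on assumptions: the function name `find_version_drift`, the instance names, and the version strings are all hypothetical, and the real alerting is not described in the postmortem.

```python
from typing import Dict


def find_version_drift(expected_version: str, deployed: Dict[str, str]) -> Dict[str, str]:
    """Return every instance whose deployed version differs from the expected one.

    `deployed` maps a (hypothetical) instance name to the version it reports.
    """
    return {
        instance: version
        for instance, version in deployed.items()
        if version != expected_version
    }
```

An alert would fire whenever the returned mapping is non-empty, pointing directly at the instances that missed an update.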

resolved2025-03-04T01:30:00.000Z

Some users in the US may experience problems while logging in to our application. Our team is working to identify the root cause and implement a solution.

Report: "Service Disruption in CA region"

Last update 2025-03-07T10:30:29.502Z

postmortem2025-03-07T10:30:09.883Z

## Incident Description

On March 05, 2025, between 20:14 and 20:39 UTC, workspaces hosted on the [ca.leanix.net](http://ca.leanix.net/) instance were inaccessible via the UI or APIs.

## Incident Resolution

The incident was resolved by stopping two long-running background jobs.

## Root Cause Analysis

Two long-running background jobs were conflicting and blocking each other, leading to an exhaustion of server resources that caused the downtime.

## Preventative Measures

We introduced logic to prevent such conflicting jobs from running at the same time. Additionally, we have adjusted our alerting configuration to be notified early of resource shortage situations.
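
One common way to keep conflicting jobs from running at the same time is to have each job claim a shared "conflict group" before it starts. The sketch below illustrates that idea only; the class `JobGuard`, its methods, and the notion of a conflict group are assumptions for illustration and do not describe the logic SAP LeanIX introduced.

```python
import threading
from typing import Set


class JobGuard:
    """Allows at most one running job per conflict group (hypothetical helper)."""

    def __init__(self) -> None:
        self._mutex = threading.Lock()
        self._active: Set[str] = set()

    def try_start(self, conflict_group: str) -> bool:
        """Claim the group; return False if a conflicting job is already running."""
        with self._mutex:
            if conflict_group in self._active:
                return False
            self._active.add(conflict_group)
            return True

    def finish(self, conflict_group: str) -> None:
        """Release the group so that a later job may claim it."""
        with self._mutex:
            self._active.discard(conflict_group)
```

A scheduler would call `try_start` before launching a job and defer the job when it returns `False`, rather than letting two jobs block each other while holding server resources.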

resolved2025-03-05T20:30:00.000Z

We are currently experiencing a service disruption in the CA region. Our team is working to identify the root cause and implement a solution.

Mar 6, 2025

Report: "Duplicated fact sheets in surveys displayed in EU, AU, CA, US and DE"

Last update 2025-03-06T12:13:47.021Z

resolved2025-03-06T12:13:47.004Z

This incident has been resolved. We appreciate your patience and understanding.

identified2025-03-05T10:11:36.088Z

Users may see duplicated fact sheets in a Survey run. Our team is working on resolving the duplicates, preserving the survey answers, and making the data consistent.

Feb 26, 2025

Report: "Unable to use the diagram editor"

Last update 2025-02-26T16:31:06.380Z

resolved2025-02-26T08:30:00.000Z

Customers were unable to use the diagram editor. This was caused by a faulty version of our diagrams application that was deployed on our production instances. The team identified the faulty version and immediately reverted to the previous version. No data was lost. The issue was resolved at 11:00 GMT+1.

Feb 25, 2025

Report: "Degraded performance in DE region"

Last update 2025-02-25T08:55:36.629Z

postmortem2025-02-25T08:51:54.391Z

## Incident Description

On February 6th, the [Reference Catalog](https://docs-eam.leanix.net/docs/reference-catalog) in the DE region had inconsistent catalog items between `02:30` and `23:10 UTC`. Those items remained visible in the catalog with incorrect names. As a result, those items appeared in recommendations when creating new fact sheets, and also when linking fact sheets to the catalog. The incident impacted Reference Catalog items for Applications, IT Components and Providers in the following views:

* SAP & SaaS Discovery Inbox
* The Reference Catalog linking views
* The recommendations within the new fact sheet creation form

## Incident Resolution

The Reference Catalog of the affected region was appropriately fixed, removing the entries with incorrect names and restoring the Reference Catalog with the correct items.

## Root Cause Analysis

The indexing of the data in the Reference Catalog backend was not up-to-date, leading to incorrect synchronization steps that resulted in the creation of items with incorrect names.

## Preventative Measures

A safety mechanism has been introduced to avoid mass creation of catalog items.

resolved2025-02-06T23:15:01.926Z

This incident has been resolved.

identified2025-02-06T16:02:07.164Z

Mitigation is progressing. However, inconsistent names can still show up as we are working towards the final resolution.

identified2025-02-06T13:07:10.805Z

We are experiencing inconsistent names provided by our reference catalog for Applications, IT Components, and Providers. The following views are affected: SAP & SaaS Discovery Inbox, reference catalog pages, and the new fact sheet creation form (recommendations). Workspaces that have name synchronization configured will also get inconsistent names on synchronized fact sheets. These names will be corrected automatically. This issue affects only the DE region. Our team is actively working to resolve this issue as quickly as possible. We apologize for any inconvenience this may cause and appreciate your patience. We will send an additional update in 120 minutes.

identified2025-02-06T12:35:58.540Z

We are still working to resolve the issue.

identified2025-02-06T10:23:23.423Z

We are currently experiencing inconsistent synchronization of the reference catalog for Applications, IT Components, and Providers. This issue affects only the DE region. Our team is actively working to resolve this issue as quickly as possible. We apologize for any inconvenience this may cause and appreciate your patience. We will send an additional update in 120 minutes.

Feb 21, 2025

Report: "Login issues in EU"

Last update 2025-02-21T14:35:36.812Z

postmortem2025-02-21T14:34:46.391Z

## Incident Description

On February 18, 2025, between 10:15 and 10:30 UTC, users were not able to log in to certain workspaces in the EU region.

## Incident Resolution

The system recovered automatically once the underlying service was able to handle traffic again.

## Root Cause Analysis

An internal index refresh of our search service caused a period of high load on our infrastructure. As a result, the service could not handle any traffic until the load decreased and the systems recovered automatically.

## Preventative Measures

We’ve increased the resources available to that service so that it’s able to handle such load in the future. We’ll explore additional alerting options to prevent this situation from happening again.

resolved2025-02-18T13:55:41.616Z

This incident has been resolved.

monitoring2025-02-18T11:13:50.091Z

The issue has been resolved now and the team is actively monitoring the situation.

investigating2025-02-18T10:48:58.409Z

Some users may experience problems while logging in to our application. Our team is working to identify the root cause and implement a solution. We will send an additional update in 30 minutes.

Feb 20, 2025

Report: "Tag filters not working in Inventory, Reports & Diagrams"

Last update 2025-02-20T14:14:57.940Z

postmortem2025-02-20T14:12:37.944Z

## Incident Description

On February 17, 2025, between 10:30 and 13:00 UTC, tag group filters were not visible within the inventory filters. As a result, users were not able to use any tag group filters when filtering fact sheets on the inventory view during that period.

## Incident Resolution

The incident was caused by a software release. We reverted the release and redeployed the previous version.

## Root Cause Analysis

The root cause of the problem was an unexpected side effect of a change meant to prevent errors while creating tag group filters in the inventory.

## Preventative Measures

We’ve extended our test coverage to prevent introducing such regressions when working with tag group filters.

resolved2025-02-17T14:53:14.865Z

This incident has been resolved.

monitoring2025-02-17T13:39:15.555Z

A fix has been implemented and we are monitoring the results.

investigating2025-02-17T12:59:42.585Z

Currently, tag filters are not shown in the Inventory, Reports & Diagrams. The team is actively investigating the issue. We will send an additional update in 30 minutes.

Feb 10, 2025

Report: "Degraded performance in Collections"

Last update 2025-02-10T15:35:45.335Z

postmortem2025-02-10T15:34:15.408Z

## Incident Description

Multiple customers in EU, US, DE, UK and CH regions were not able to use [Collections](https://docs-eam.leanix.net/docs/collections) across Diagrams, Reports and Dashboards. Service degradation lasted from Jan 17, 2025 09:25 UTC to Jan 17, 2025 09:52 UTC (27 mins). There was no data loss.

## Incident Resolution

We reverted the changes that caused the issue. It was resolved at Jan 17, 2025 09:52 UTC.

## Root Cause Analysis

We introduced a new feature to the navigation item search API endpoint. Even though we had tests in place, one of the cases was not covered and caused an issue when a specific parameter was included in the API request.

## Preventative Measures

We will extend our testing strategy by including additional test types and canary releases to prevent similar regressions in the future.

resolved2025-01-17T08:30:00.000Z

Multiple customers in EU, US, DE, UK and CH regions were not able to use Collections across Diagrams, Reports and Dashboards. Service degradation lasted from Jan 17, 2025 09:25 UTC to Jan 17, 2025 09:52 UTC (27 mins). There was no data loss.

Report: "[EAM] Degraded performance in Automations in EU"

Last update 2025-02-10T14:24:59.629Z

resolved2025-02-10T14:24:59.614Z

This incident has been resolved.

monitoring2025-02-10T11:34:58.000Z

Users in EU may have experienced degraded performance in the Automations Service, with a processing delay of more than a day. The root cause has been identified, and a solution has been implemented. The processing delay is decreasing. We are monitoring the situation.

Feb 3, 2025

Report: "[EAM]Degraded performance in Survey on EU Region"

Last update 2025-02-03T16:13:52.397Z

resolved2025-02-03T16:13:52.381Z

This incident has been resolved.

monitoring2025-02-03T16:11:11.904Z

We are continuing to monitor for any further issues.

monitoring2025-02-03T16:11:01.404Z

A fix has been implemented and we are monitoring the results.

identified2025-02-03T15:18:52.138Z

Users may experience degraded performance in Survey. Our team is working to identify the root cause and implement a solution.

Report: "Service Disruption for eu-7 instance in EU"

Last update 2025-02-03T14:01:07.691Z

resolved2025-01-21T16:30:00.000Z

Between 16:39 and 17:21 UTC, all Pathfinder API endpoints (see https://app.leanix.net/openapi-explorer/) were unreachable in the eu-7 instance due to a depletion of server resources.

Jan 24, 2025

Report: "Degraded performance in EAM"

Last update 2025-01-24T14:03:04.870Z

postmortem2025-01-24T13:57:41.116Z

## Incident Description

On January 21st, between `16:06` and `17:19 UTC`, one of our database management systems (DBMS) in `westeurope` experienced multiple failovers due to high load. The load was caused by an event replay of our event-carried state transfer system. The repeated failovers led to a brief downtime of the DBMS. Several services simultaneously executed the replay process, inadvertently placing excessive pressure on the DBMS. The incident caused degraded performance and temporary service disruptions for our customers for the following business capabilities:

* `diagrams`
* `storage`
* `todos`
* `transformations`
* `automations`

## Incident Resolution

To address the issue, we increased the SKU of the affected DBMS to a higher capacity tier, providing additional resources to handle the increased load during event replay scenarios. This adjustment immediately stabilized the system and prevented further failovers.

## Root Cause Analysis

An event replay reprocesses historical events from an event log to rebuild the current state of a system. It is commonly used for synchronization in asynchronous systems. In this particular case, the event replay was necessary to enable new features in our product. However, we did not anticipate the capacity requirements ahead of time, as previous replay runs for partitions in the same region had not caused similar levels of load. The replay introduced unexpected pressure due to differences in partition size, leading to the incident at hand.

## Preventative Measures

To prevent similar incidents in the future, we aim to improve in the following areas:

1. **Capacity Management**: Better forecasting to ensure the DBMS can handle increased loads.
2. **Visibility**: Enhanced monitoring to detect potential issues earlier.
3. **Service Distribution**: More even distribution of load across DBMS instances.
4. **Replay Orchestration**: Smarter scheduling to avoid concurrent high-load events.

These actions will help us build a more resilient system and ensure reliable performance as we continue to scale.
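
To illustrate the event replay described in the root cause analysis, here is a minimal Python sketch. It is not SAP LeanIX's implementation: the `Event` type, its fields, and the `replay` function are hypothetical names chosen for illustration. The sketch only shows the general idea that applying every event in log order rebuilds the current state, which is also why a larger partition means more work for the data store.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """Hypothetical event record: one field change on one entity."""
    entity_id: str
    field: str
    value: str


def replay(events):
    """Rebuild the current state by applying every event in log order."""
    state = {}
    for event in events:
        # Later events overwrite earlier values for the same entity and field.
        state.setdefault(event.entity_id, {})[event.field] = event.value
    return state


# Illustrative event log (made-up data).
log = [
    Event("fs-1", "name", "Billing"),
    Event("fs-2", "name", "CRM"),
    Event("fs-1", "name", "Billing v2"),
]
state = replay(log)
```

Each replayed event results in at least one write, so the load grows with the number of events in the partition and with the number of services replaying at the same time.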

resolved2025-01-21T19:05:10.278Z

This incident has been resolved.

monitoring2025-01-21T17:21:00.972Z

Users may experience degraded performance in EAM. Our team is working to identify the root cause and implement a solution. We will send an additional update in 60 minutes.

Jan 22, 2025

Report: "Completion score on Business Capabilities temporarily wrong"

Last update 2025-01-22T11:30:54.541Z

postmortem2025-01-22T11:27:04.884Z

### Incident Description

Between 2025-01-14 15:03 UTC and 2025-01-15 05:55 UTC users experienced that completion scores on Business Capability fact sheets showed a lower value than expected.

### Incident Resolution

We deployed a fix that changed the completion score shown in the fact sheet details page, in reports, dashboards and Excel exports back to its expected value. However, for some workspaces it could take multiple hours to propagate the updated completion score to the secondary data store that feeds the search and the table view in the inventory.

### Root Cause Analysis

We rolled out a modification to the meta model of the Business Capability fact sheet to introduce [new fields that we announced before](https://updates.leanix.net/announcements/introduction-of-fields-to-business-capabilities). However, we missed detecting in our code review that a new field had a default [completion weight](https://docs-eam.leanix.net/docs/fact-sheet-completeness#modifying-the-completion-weights) of 1, which reduced the overall completion score of all Business Capability fact sheets in all workspaces.

### Preventative Measures

We plan to be more diligent when we introduce new fields to fact sheets. For meta model modifications governed by SAP LeanIX the default completion weight will be 0 instead of 1.
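
The effect of the default completion weight can be sketched with a small Python example. This assumes a simplified model in which the score is the summed weight of filled fields divided by the summed weight of all fields. The actual SAP LeanIX formula may differ, and the field names and the `completion_score` function are hypothetical.

```python
def completion_score(weights, filled):
    """Weighted share of filled fields (simplified, assumed formula)."""
    total = sum(weights.values())
    if total == 0:
        return 1.0  # nothing is required, so treat the fact sheet as complete
    done = sum(w for name, w in weights.items() if name in filled)
    return done / total


# Hypothetical fact sheet with three weighted fields, all filled in.
before = {"name": 1, "description": 1, "lifecycle": 1}
filled = {"name", "description", "lifecycle"}
score_before = completion_score(before, filled)

# An empty new field with weight 1 lowers the score.
score_w1 = completion_score({**before, "new_field": 1}, filled)

# The same field with weight 0 leaves the score unchanged.
score_w0 = completion_score({**before, "new_field": 0}, filled)
```

Under this model, the fully filled fact sheet drops from 1.0 to 0.75 when the new field carries a weight of 1, and stays at 1.0 when the weight is 0.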

resolved2025-01-15T05:55:46.976Z

All completion scores on business capabilities have been restored.

monitoring2025-01-14T20:42:43.740Z

75% of all workspaces have been reverted to the correct completion score in the inventory list and table view. We expect all workspaces to be corrected at 7 am UTC tomorrow.

identified2025-01-14T18:21:43.341Z

We fixed the completion scores on the fact sheet details page and in reports. We are currently rolling out the same fix for the inventory list and table view. We will send an update in approx. 2 hours.

identified2025-01-14T17:02:46.260Z

Users experience an unintended change of the completion score on Business Capability fact sheets. We are in the process of reverting this change. We will send an additional update in 2 hours.

Dec 23, 2024

Report: "Degraded performance in various product capabilities"

Last update 2024-12-23T14:11:07.641Z

postmortem2024-12-23T14:10:07.102Z

## Incident Description

On December 20th, the Survey and Navigation services faced a partial disruption in some regions for approximately 84 minutes `[09:33 UTC] - [10:57 UTC]`. Some customers experienced degraded performance, where requests to the database were either failing or taking longer to execute, resulting in certain actions, such as navigation and surveys, not being completed as expected.

## Incident Resolution

A fix was implemented on December 20, 2024, at 10:27 UTC, restoring the services to full operational status. No data was lost during the disruption.

## Root Cause Analysis

We implemented an enhancement in Surveys aimed at improving service monitoring and observability. However, this change consumed significant infrastructure resources, leading to performance degradation in some other services.

## Preventative Measures

* Enhanced our observability best practices to manage such cases better.

resolved2024-12-20T10:58:54.308Z

The incident has been resolved.

monitoring2024-12-20T10:53:39.755Z

The root cause has been identified, a fix has been deployed, and we are monitoring the result.

monitoring2024-12-20T10:50:39.293Z

The root cause has been found and we are monitoring the result.

identified2024-12-20T10:42:02.138Z

Users may experience degraded performance in various product capabilities. Our team is working to identify the root cause and implement a solution. We will send an additional update in 60 minutes.

Dec 18, 2024

Report: "Service disruption in Webhooks"

Last update 2024-12-18T09:45:28.400Z

postmortem2024-12-18T09:42:45.149Z

## Incident Description

On December 13th, the Webhooks service faced a disruption in all regions.

**Duration:** `[10:08 UTC] - [10:24 UTC]` ~ 16 minutes

Customers were not able to see existing webhooks, nor modify them. Event deliveries were also delayed during the disruption. The incident was related to the Webhooks disruption on December 12th.

## Incident Resolution

We rolled back the changes we made on both days. No data was lost during the disruption.

## Root Cause Analysis

We introduced an improvement in the Webhooks service, which was related to authentication against endpoints. This change broke the authentication with the service, and hence all inbound calls were failing.

## Preventative Measures

We have already improved our testing strategy for this case and will continue to improve it in the future.

resolved2024-12-13T10:00:00.000Z

The Webhooks service faced a disruption from 10:08 UTC to 10:24 UTC in all regions. Impact: Subscription management was not accessible and event delivery was delayed. No data was lost.

Report: "Service disruption in Webhooks"

Last update 2024-12-18T09:42:28.333Z

postmortem2024-12-18T09:38:03.841Z

## Incident Description

On December 12th, the Webhooks service faced a disruption in all regions.

**Duration:** `[15:01 UTC] - [15:25 UTC]` ~ 25 minutes

Customers were not able to see existing webhooks, nor modify them. Event deliveries were also delayed during the disruption.

## Incident Resolution

We rolled back the changes we made on both days. No data was lost during the disruption.

## Root Cause Analysis

We introduced an improvement in the Webhooks service, which was related to authentication against endpoints. This change broke the authentication with the service, and hence all inbound calls were failing.

## Preventative Measures

We have already improved our testing strategy for this case and will continue to improve it in the future.

resolved2024-12-12T15:00:00.000Z

The Webhooks service faced a disruption from 15:01 UTC to 15:25 UTC in all regions. Impact: Subscription management was not accessible and event delivery was delayed. No data was lost.
