Elevated errors
Incident Report for Fleetio
Postmortem

Overview:

On May 17th, 2023 at 1:04 PM CST, Fleetio experienced a partial outage of our systems. This outage lasted for approximately 30 minutes and impacted all users and services, including the browser app, iOS and Android apps, and the API. The root cause of the issue was a rapid, unexpected increase in blocking events on our database servers, which prevented web requests from completing successfully.

Impact:

The outage resulted in a significant impact on our users and services. During the event, users experienced major performance degradation and in most cases were unable to access the system altogether. The outage also impacted our API, which prevented users from accessing the system via our mobile apps.

Resolution:

To resolve the issue, we worked to identify the blocking events, purged them, and performed a restart of all applications. This restored connectivity to our services. We are continuing to investigate the root cause of the blocking events and have implemented additional monitoring and alerting to prevent this issue from recurring.

Conclusion:

As always, we understand that Fleetio is mission critical for our customers, and that any disruption has a real world impact on their operations. We apologize for any inconvenience caused by this outage and appreciate your patience and understanding as we worked to resolve the issue. We remain committed to providing a reliable and resilient system for our users and will continue to prioritize the ongoing improvement of our processes and systems.

Posted May 19, 2023 - 15:20 CDT

Resolved
This incident has been resolved.
Posted May 17, 2023 - 17:48 CDT
Monitoring
Service has been restored and we are investigating root cause.
Posted May 17, 2023 - 14:00 CDT
Investigating
We are investigating an elevated number of errors to our applications.
Posted May 17, 2023 - 13:21 CDT
This incident affected: Fleetio Web Application & API and Fleetio Go Mobile Application.