US1 - intermittent API errors

Incident Report for voucherify

Resolved

This incident has been resolved.
Posted Mar 10, 2025 - 17:00 CET

Monitoring

A fix has been implemented and we are monitoring the results.
Posted Mar 10, 2025 - 16:16 CET

Identified

We've been experiencing an elevated rate of API errors over the course of Saturday and Sunday. The issue would crash one of our virtual machines in the US1 cluster once every couple of hours, resulting in a burst of HTTP 500 errors until it would self-heal and stabilize just minutes later, affecting some percent of the incoming traffic. The root cause of the issue is now identified, we already have a temporary fix in place on production, and the ultimate fix is being implemented and expected to be deployed within the next hours.
Posted Mar 10, 2025 - 10:51 CET
This incident affected: Cluster US1 (N. Virginia) (US1 - API).