On September 9, 2024, at 8:11 PM (UTC), a change to the job orchestration service was deployed to production, causing failures in job runs that did not use private keys for cloning Git repositories.
Our monitoring system quickly identified a surge in errors, and the issue was mitigated by reverting the change, which restored system functionality.
A code change in the job orchestration service introduced a regression that led to exceptions when processing job runs that did not use private keys for cloning Git repositories.
This regression was not caught before the deployment due to missing test coverage for this specific code path.
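To illustrate the kind of coverage that was missing, here is a minimal sketch. It is not the actual service code: `build_clone_command` and its parameters are hypothetical names, and the sketch assumes the private key is an optional input to the clone step. The point is that the keyless path gets its own test alongside the keyed one.

```python
from typing import Optional


def build_clone_command(repo_url: str, private_key_path: Optional[str] = None) -> dict:
    """Hypothetical helper: build the argv and environment for cloning a repo.

    The private key is optional, so the keyless branch needs its own test.
    """
    env = {}
    if private_key_path is not None:
        # Only configure SSH when a key was actually supplied.
        env["GIT_SSH_COMMAND"] = f"ssh -i {private_key_path} -o IdentitiesOnly=yes"
    return {"argv": ["git", "clone", repo_url], "env": env}


def test_clone_with_private_key():
    cmd = build_clone_command("git@example.com:org/repo.git", "/tmp/key")
    assert "GIT_SSH_COMMAND" in cmd["env"]


def test_clone_without_private_key():
    # The keyless path must not raise and must not reference a key.
    cmd = build_clone_command("https://example.com/org/repo.git")
    assert cmd["env"] == {}
    assert cmd["argv"] == ["git", "clone", "https://example.com/org/repo.git"]
```

Both tests can be collected by a runner such as pytest or called directly as plain functions.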
The issue had a broad impact because the service was deployed using a new pipeline that was not configured for a gradual rollout.
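The effect of a gradual rollout on the scope of an incident can be sketched as follows. This is an illustrative model only, not our deployment pipeline: the stage percentages, the `error_rate_at` callback, and the 1% threshold are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class RolloutResult:
    completed: bool            # True if every stage stayed healthy
    max_exposure_percent: int  # largest share of traffic that saw the new version


def gradual_rollout(
    stages: Sequence[int],
    error_rate_at: Callable[[int], float],
    max_error_rate: float = 0.01,
) -> RolloutResult:
    """Advance through traffic stages, halting at the first unhealthy one."""
    exposure = 0
    for percent in stages:
        exposure = percent
        if error_rate_at(percent) > max_error_rate:
            # Stop here; the caller would revert the change at this point.
            return RolloutResult(completed=False, max_exposure_percent=exposure)
    return RolloutResult(completed=True, max_exposure_percent=exposure)


# A regression that fails at every stage is stopped at the first stage.
bad = gradual_rollout([5, 25, 100], lambda percent: 0.5)
# A healthy change proceeds to full traffic.
good = gradual_rollout([5, 25, 100], lambda percent: 0.0)
```

In this model, a regression that raises errors immediately is halted at the first stage, so only that stage's share of traffic is exposed rather than all of it.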
We sincerely apologize to every customer who was impacted by this outage. We understand that you rely on the dbt Cloud application, especially job execution, as a key tool. We are confident that these measures will effectively prevent similar incidents in the future and ensure a more stable experience for all our users.