On November 22, 2024, from 4:13 AM to 6:48 AM UTC, job runs relying on PrivateLink connections failed across multiple tenants. This was caused by a code change that prevented the job execution service from properly using PrivateLink for warehouse connections. The issue was detected via a monitoring alert, and a rollback resolved the problem.
A code change to environment variables caused incorrect PrivateLink configuration to be passed to the job execution service. This resulted in jobs runs not being able to connect to PrivateLink endpoints and ultimately failing with data warehouse connection errors.
The issue was detected by our monitoring system. Once the faulty code change was identified, the issue was resolved via a rollback.
Total resolution time was longer than usual due to the incident occurring during the holiday deployment freeze, which required additional processes to lift the freeze and initiate the rollback.
Completed
Longer-term (by end of Jan 2025):
We sincerely apologise for this outage and to every customer that was impacted. We understand that you rely on the dbt Cloud application, especially job execution, as a key tool. We are confident that these measures will effectively prevent similar incidents in the future and ensure a more stable experience for all our users.