Netflix Jan 24, 2022

Fixing Performance Regressions Before They Happen

Article Summary

Angus Croll from Netflix reveals how his team slashed false performance alerts by 90% while catching more real regressions. The secret? They stopped using static thresholds entirely.

Netflix's TVUI team runs performance tests for an app that serves 222 million members across 1,700+ device types. The old approach, based on static memory thresholds, raised constant false alarms and still missed subtle regressions. The team needed a smarter way to detect performance issues before code shipped to production.

Key Takeaways

Critical Insight

By replacing static thresholds with statistical anomaly and changepoint detection, Netflix now catches genuine performance regressions earlier with 90% fewer false alerts.
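
To make the two techniques concrete, here is a minimal sketch in Python. It is not Netflix's implementation: the function names, the 4-sigma cutoff, the minimum segment size, and the sample memory readings are all assumptions chosen for illustration, and the changepoint search is a deliberately simple mean-shift scan standing in for whatever algorithm the team actually uses.

```python
from statistics import mean, stdev


def is_anomaly(history, value, n_sigma=4.0):
    """Flag `value` if it sits more than n_sigma standard deviations
    above the mean of recent runs (n_sigma is an assumed default)."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    return value > mean(history) + n_sigma * stdev(history)


def find_changepoint(series, min_size=3, min_score=3.0):
    """Return the index where the series mean shifts most clearly,
    or None if no split scores at least min_score.

    Simplified stand-in: each candidate split is scored by the gap
    between segment means, scaled by the average segment spread.
    """
    best_idx, best_score = None, 0.0
    for i in range(min_size, len(series) - min_size + 1):
        left, right = series[:i], series[i:]
        # Guard against division by zero on perfectly flat segments.
        spread = (stdev(left) + stdev(right)) / 2 or 1e-9
        score = abs(mean(right) - mean(left)) / spread
        if score > best_score:
            best_idx, best_score = i, score
    return best_idx if best_score >= min_score else None


# Hypothetical memory readings (MB) from successive test runs.
recent_runs = [100, 101, 99, 100, 102, 98, 101, 100]
print(is_anomaly(recent_runs, 103))  # within normal noise
print(is_anomaly(recent_runs, 110))  # well outside normal noise

# A sustained step up that no single run would necessarily trip.
shifted = [100, 101, 99, 100, 102, 98, 106, 107, 105, 106, 108, 104]
print(find_changepoint(shifted))
```

Unlike a static threshold, both checks derive their limits from the data itself, so a noisy metric tolerates more variation and a stable one tolerates less. Anomaly detection looks at one new value against recent history; changepoint detection looks for a lasting shift in the level of the whole series.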

The team is now decoupling its detection logic so it can be released as an open-source library that works on any sequential quantitative data, not just performance metrics.
