Lyft JP Simard Sep 13, 2022

Productionizing Envoy Mobile at Lyft

Article Summary

JP Simard from Lyft reveals how they replaced URLSession and OkHttp across all their mobile apps with a single networking library—and the results weren't what anyone expected.

Lyft spent years migrating their iOS and Android apps to Envoy Mobile, an open-source networking library based on Envoy Proxy. After months of rigorous A/B testing and gradual rollouts starting in December 2021, they now handle billions of daily requests through this unified solution.

Key Takeaways

Critical Insight

Lyft successfully replaced platform-native networking libraries with Envoy Mobile across all apps, matching or exceeding previous performance while gaining unprecedented observability and cross-platform consistency.

The team discovered a hidden Android networking behavior that took months to uncover and explains why certain carriers had mysteriously lower connectivity rates.

About This Article

Problem

Lyft's mobile apps didn't have good visibility into network performance. The team had to use hooks into platform libraries with very low sampling rates, which meant they couldn't detect incidents quickly and missed production issues affecting billions of daily requests.

Solution

JP Simard's team used Envoy Mobile's stats system, which comes from Envoy Proxy. It sends comprehensive metrics directly to gRPC or statsd endpoints, giving them near real-time visibility across mobile operations.

Impact

Envoy Mobile's stats caught at least 3 production incidents in recent months that Lyft's existing observability solutions missed. Those older solutions relied on costly analytics events, so the faster detection enabled quicker incident response.