Uber May 10, 2023

DragonCrawl: Generative AI for High-Quality Mobile Testing

Article Summary

Uber's mobile testing was broken. Engineers spent 30-40% of their time maintaining test scripts that broke with every UI change. So they built an AI that tests apps like a human would.

Uber's Developer Platform team created DragonCrawl, a system that uses large language models to execute mobile tests across 3,000+ simultaneous experiments and 50+ languages. Instead of relying on brittle scripts, it adapts to UI changes on its own by interpreting the screen context and test goals expressed in natural language.

Key Takeaways

Critical Insight

DragonCrawl made it possible to test Uber's core trip flow across 85 cities and 50+ languages without manual maintenance, something that was not feasible by hand at Uber's scale.

The team's approach to preventing AI hallucinations and handling loops reveals clever guardrails that any team building LLM-powered automation should know about.
