DragonCrawl: Generative AI for High-Quality Mobile Testing
Article Summary
Uber's mobile testing was broken: engineers spent 30-40% of their time maintaining test scripts that failed with every UI change. So the company built an AI that tests apps the way a human would.
Uber's Developer Platform team created DragonCrawl, a system that uses large language models to execute mobile tests across 3,000+ simultaneous experiments and 50+ languages. Instead of following brittle scripts, it adapts to UI changes on its own by understanding screen context and test goals expressed in natural language.
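To make the idea concrete, here is a minimal sketch of goal-driven action selection: embed the test goal and every actionable element on the screen, then pick the element most similar to the goal. The bag-of-words "embedding" and the `choose_action` helper are illustrative stand-ins invented for this example; DragonCrawl uses a real MPNet sentence encoder, not word counts.

```python
from collections import Counter
from math import sqrt

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding" used only for illustration;
    # a stand-in for a real MPNet sentence encoder.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def choose_action(goal: str, screen_elements: list[str]) -> str:
    # Rank every actionable element on the current screen by its
    # similarity to the natural-language test goal, then pick the best.
    return max(screen_elements, key=lambda e: cosine(embed(goal), embed(e)))

elements = ["Enter destination", "View profile", "Promotions"]
print(choose_action("request a trip by entering a destination", elements))
# → Enter destination
```

Because the selection is driven by meaning rather than hard-coded element IDs, renaming or rearranging UI elements does not break the test as long as the intent of each element stays recognizable.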
Key Takeaways
- Blocked 10 high-priority bugs in 3 months while saving thousands of developer hours
- 99%+ stability with zero maintenance across 85 cities and multiple device types
- Uses a compact 110M-parameter MPNet model, roughly 1,000x smaller than GPT-3.5
- Handles adversarial cases: restarted the app when payments failed and retried going online for 5 minutes
- Achieves Precision@1 of 97.23% when choosing the correct UI action from screen context
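Precision@1 here means the fraction of screens where the model's top-ranked candidate action is the correct one. A small sketch of how such a metric is computed (the function name and the toy data are assumptions for illustration, not Uber's evaluation code):

```python
def precision_at_1(ranked_choices: list[list[str]], correct: list[str]) -> float:
    # Fraction of screens where the model's top-ranked UI action
    # matches the ground-truth action.
    hits = sum(1 for ranked, truth in zip(ranked_choices, correct)
               if ranked[0] == truth)
    return hits / len(correct)

# Hypothetical ranked action lists per screen vs. ground-truth actions.
ranked = [["tap_request", "tap_menu"], ["tap_confirm", "tap_back"],
          ["tap_menu", "tap_pay"], ["tap_pay", "tap_confirm"]]
truth = ["tap_request", "tap_confirm", "tap_pay", "tap_pay"]
print(precision_at_1(ranked, truth))  # → 0.75
```

At DragonCrawl's reported 97.23%, fewer than 3 in 100 screens get a wrong first choice, which is what allows long multi-step trip flows to complete reliably.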
Critical Insight
DragonCrawl made it possible to test Uber's core trip flow across 85 cities and 50+ languages without manual test maintenance, coverage that was previously infeasible at their scale.