Better Android Testing at Airbnb (Part 4)

Article Summary

Airbnb's Android tests were failing randomly. The culprit? Everything from cached drawables to delayed runnables creating unpredictable test behavior.

In Part 6 of their testing series, Airbnb's Eli Hart reveals the hidden sources of test flakiness that plague screenshot and interaction testing. With tests running in unpredictable order via Flank, even small state leaks compound into major reliability issues.

Key Takeaways

Forced drawable cache clearing after each screenshot eliminated pixel variation flakiness
Custom wrapper functions for postDelayed and async code enable deterministic test execution
Mocked date framework ensures JodaTime calls return consistent values across test runs
Disabled RecyclerView prefetching prevents non-deterministic view layout during screenshots
Centralized ImageView architecture allows synchronous local asset injection instead of network loads
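
To make the delayed-runnable takeaway concrete, here is a minimal sketch of what such a wrapper could look like. It is not Airbnb's actual code: the names `DelayedWork`, `Scheduler`, `RealScheduler`, `TestScheduler`, and `runAllPending` are all hypothetical, and plain Java executors stand in for Android's `Handler.postDelayed` so the example runs without the Android SDK. The idea is that app code always posts delayed work through one wrapper, and tests swap in a scheduler that holds the work and runs it on demand in virtual-time order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

/** Hypothetical single entry point for delayed work (illustrative, not Airbnb's API). */
final class DelayedWork {
    /** Strategy that decides how and when delayed work actually runs. */
    public interface Scheduler {
        void postDelayed(Runnable work, long delayMillis);
    }

    /** Production-style scheduler: really waits for the delay on a daemon thread. */
    public static final class RealScheduler implements Scheduler {
        private static final ScheduledExecutorService EXECUTOR =
                Executors.newSingleThreadScheduledExecutor(r -> {
                    Thread t = new Thread(r, "delayed-work");
                    t.setDaemon(true); // do not keep the JVM alive for pending work
                    return t;
                });

        @Override
        public void postDelayed(Runnable work, long delayMillis) {
            EXECUTOR.schedule(work, delayMillis, TimeUnit.MILLISECONDS);
        }
    }

    /** Test scheduler: never waits; queued work runs only when the test asks. */
    public static final class TestScheduler implements Scheduler {
        private static final class Task {
            final Runnable work;
            final long dueAt;

            Task(Runnable work, long dueAt) {
                this.work = work;
                this.dueAt = dueAt;
            }
        }

        private final List<Task> pending = new ArrayList<>();
        private long nowMillis = 0; // virtual clock, advanced only by runAllPending

        @Override
        public void postDelayed(Runnable work, long delayMillis) {
            pending.add(new Task(work, nowMillis + delayMillis));
        }

        /** Runs all queued work in due-time order (ties in posting order); returns the count. */
        public int runAllPending() {
            int ran = 0;
            while (!pending.isEmpty()) {
                int next = 0;
                for (int i = 1; i < pending.size(); i++) {
                    if (pending.get(i).dueAt < pending.get(next).dueAt) {
                        next = i;
                    }
                }
                Task task = pending.remove(next);
                nowMillis = task.dueAt; // jump virtual time forward instead of sleeping
                task.work.run();        // work posted here is picked up by the loop
                ran++;
            }
            return ran;
        }
    }

    private static volatile Scheduler scheduler = new RealScheduler();

    private DelayedWork() {}

    /** Tests call this once during setup to take control of delayed work. */
    public static void setScheduler(Scheduler replacement) {
        scheduler = replacement;
    }

    /** App code calls this instead of posting delayed runnables directly. */
    public static void postDelayed(Runnable work, long delayMillis) {
        scheduler.postDelayed(work, delayMillis);
    }
}
```

Under these assumptions, a test would call `DelayedWork.setScheduler(new DelayedWork.TestScheduler())` during setup and then `runAllPending()` just before taking a screenshot, so no delayed runnable can fire at an unpredictable moment or leak into the next test.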

Critical Insight

Airbnb achieved reliable Android testing by systematically eliminating flakiness sources at the framework level, from shared preferences to WebView mocking.

The team's handling of out-of-memory exceptions during 40,000-pixel screenshots shows how careful bitmap management keeps very large captures workable.
