DigiData: Training and Evaluating General-Purpose Mobile Control Agents

DigiData: Training and Evaluating General-Purpose Mobile Control Agents

Meet DigiData: AI that can use your phone

Imagine an assistant that taps, swipes, and navigates apps to finish tasks for you. This paper introduces DigiData—a large, diverse, multi-modal dataset built to train mobile control agents to do exactly that.

  • Richer goals: Instead of scraping random user logs, DigiData maps app features through systematic exploration, yielding harder, more human-relevant tasks.
  • Real-world testing: DigiData-Bench evaluates agents on complex mobile workflows, not toy demos.
  • Better metrics: The popular “step accuracy” score can mislead. The authors propose dynamic protocols and AI-powered reviews that judge whether an agent actually completes the task.

Why it matters: Stronger data and fairer evaluations speed up progress toward trustworthy, helpful phone agents—and safer automation of everyday digital chores.

Paper: http://arxiv.org/abs/2511.07413v1

Paper: http://arxiv.org/abs/2511.07413v1

Register: https://www.AiFeta.com

#AI #Mobile #Agents #Dataset #Benchmark #MachineLearning #HCI #UX #Evaluation #MobileAI

Read more