CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Carnegie Mellon University
Under Review in NAACL 2025 Demo Track

*Indicates Equal Supervision
A step-by-step illustration of how human intervention enables the agent to overcome a failure point during task execution.

Abstract

While much work on web agents emphasizes the promise of autonomously performing tasks on behalf of users, in practice agents often fall short on complex tasks in real-world contexts and in modeling user preferences. This presents an opportunity for humans to collaborate with the agent and leverage its capabilities effectively. We propose CowPilot, a framework that supports both autonomous and human-agent collaborative web navigation, along with evaluation of task success and task efficiency. CowPilot reduces the number of steps humans need to perform by letting the agent propose next steps, while users can pause, reject, or take alternative actions. During execution, users can interleave their actions with the agent's by overriding suggestions or resuming agent control when needed. We conducted case studies on five common websites and found that the human-agent collaborative mode achieves the highest success rate, 95%, while requiring humans to perform only 15.2% of the total steps. Even with human interventions during task execution, the agent successfully drives up to half of task success on its own. CowPilot can serve as a useful tool for data collection and agent evaluation across websites, which we believe will enable research on how users and agents can work together.
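
The efficiency figure above, humans performing 15.2% of the total steps, is a ratio of human-performed steps to all steps. A minimal sketch of how such a ratio might be computed for a single interleaved trajectory is shown below. It assumes a hypothetical log format in which each step records who performed it; the names `Step` and `human_step_ratio` are illustrative and are not part of CowPilot, and how the reported number is aggregated across tasks is not specified here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Step:
    actor: str   # "agent" or "human" (hypothetical labels, not CowPilot's schema)
    action: str  # free-form description of the action taken


def human_step_ratio(steps: List[Step]) -> float:
    """Fraction of steps in one trajectory that the human performed."""
    if not steps:
        return 0.0  # an empty trajectory has no human steps
    human = sum(1 for s in steps if s.actor == "human")
    return human / len(steps)


# Illustrative trajectory: the agent proposes steps, the human overrides one.
trajectory = [
    Step("agent", "click search box"),
    Step("agent", "type query"),
    Step("human", "override: pick a different filter"),
    Step("agent", "click result"),
]
ratio = human_step_ratio(trajectory)  # 1 human step out of 4
```

A lower ratio means the agent carried more of the task, so it can be read alongside task success to compare autonomous and collaborative modes.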

Latest Updates

  • [Jan 29, '25] We are looking for participants to test CowPilot. Use the extension for free and get $0.50 per task you annotate. Please fill out this screening form if you are interested, and one of our team members will get in touch with you!
  • [Jan 29, '25] CowPilot Chrome Extension (beta version) is released. Please feel free to test and share your feedback with us!
  • [Jan 28, '25] CowPilot is released on arXiv.
  • [Dec 15, '24] CowPilot is under review at the NAACL 2025 Demo Track.

Video Demonstrations

BibTeX

@misc{huq2025cowpilotframeworkautonomoushumanagent,
      title={CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation}, 
      author={Faria Huq and Zora Zhiruo Wang and Frank F. Xu and Tianyue Ou and Shuyan Zhou and Jeffrey P. Bigham and Graham Neubig},
      year={2025},
      eprint={2501.16609},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2501.16609}}