pull down to refresh

This paper introduces ASearcher, an open-source project for large-scale RL training of search agents. Our key contributions include:
  1. Scalable fully asynchronous RL training that enables long-horizon search while maintaining high training efficiency.
  2. A prompt-based LLM agent that autonomously synthesizes high-quality and challenging QAs, creating a large-scale QA dataset.