Search-R1: Training LLMs to Reason and Leverage Search Engines with RL

(arxiv.org)

95 points | by jonbaer a day ago

11 comments