Can RL Improve Generalization of LLM Agents? An Empirical Study

(arxiv.org)

3 points | by tsurg_dot_com 9 hours ago ago

1 comments