Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train

(arxiv.org)

149 points | by tcp_handshaker a day ago ago

40 comments