Theoretical Analysis of Positional Encodings in Transformer Models

(arxiv.org)

36 points | by PaulHoule 4 days ago

4 comments