Writing Speed-of-Light Flash Attention for 5090 in CUDA C++

(gau-nernst.github.io)

158 points | by dsr12 3 days ago

34 comments