FlashAttention-T: Towards Tensorized Attention

(dl.acm.org)

59 points | by matt_d 3 hours ago

21 comments