TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B

(github.com)

3 points | by trykhlieb a day ago

1 comment