vLLM: An Efficient Inference Engine for Large Language Models

(www2.eecs.berkeley.edu)

2 points | by matt_d 2 days ago ago

No comments yet.