1 points | by birdculture 13 hours ago ago
1 comments
Title: Serve an interactive language model app with latency-optimized TensorRT-LLM
Title: Serve an interactive language model app with latency-optimized TensorRT-LLM