Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing

(arxiv.org)

129 points | by omarsar 3 days ago ago

28 comments