Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

58 points | by khurdula 2 days ago ago

27 comments