LLM misalignment may stem from role inference, not corrupted weights

(echoesofvastness.substack.com)

3 points | by PinResearch 14 hours ago ago

1 comments