@LouisIngenthron @emilymbender It will not seem obvious that the prompter did anything wrong. They won’t have prompted “Defame that Louis!” LLMs are unpredictable! Weird outputs are inevitable, some will slip through despite our best intentions, just like Twitter can’t be perfectly moderated. Surely purveyors of these amazing products, with no ill intent, shouldn’t be held to the impossible standard strict liability would impose. /fin