Researchers Uncovered a New Flaw in ChatGPT to Turn Them Evil
LLMs are commonly trained on vast internet text data, often containing offensive content. To mitigate this, developers use “alignment” methods via finetuning to prevent harmful or objectionable responses in recent LLMs. ChatGPT and AI sibli…