Politeness Doesn't Work on Every LLM: The PLUM Verdict
A systematic study of politeness effects on five LLMs across three languages reveals that there is no universal benefit to being polite. The findings force a major rethinking of prompt engineering best practices.
- Researchers created the PLUM corpus to test how politeness and impoliteness affect LLM responses across English, Hindi, and Spanish.
- Five models were tested: Gemini-Pro, GPT-4o Mini, Claude 3.7 Sonnet, DeepSeek-Chat, and Llama 3.
- The study found that politeness effects are model-specific, language-specific, and history-dependent — there is no universal rule.
- This means that current prompt engineering advice ("always be polite") is dangerously oversimplified.
Discussion
Add a comment