Study: Researchers examine AI responses to a depressive persona
Researchers tested how LLMs respond to users describing depressive symptoms; Grok and Gemini gave riskier responses than GPT and Claude.
A study by German researchers benchmarked safety responses across models using a persona framework; response quality and risk assessment varied sharply by model and vendor. Researchers submitted identical prompts describing depression to multiple models and scored responses on harm reduction, appropriate escalation, and factual accuracy.




