Comparative Analysis of LLMs for Mental Health Counseling
Comprehensive evaluation of proprietary and open-source LLMs on counseling capabilities with systematic quality assessment.
Jan 2025 - Mar 2025
Technologies
PythonPyTorchHuggingface TransformersLLMsEvaluation Metrics
About This Project
Evaluated proprietary and open-source LLMs (ChatGPT, Claude, LLaMA, Deepseek) on counseling capabilities using classification and generation tasks with comprehensive metrics. Developed detailed annotation guidelines and analyzed inter-rater reliability to systematically assess therapeutic response quality across models. The study examines empathy, safety, clinical appropriateness, and actionability of AI-generated counseling responses, providing insights into the readiness of LLMs for mental health applications.
Complete Report
View the complete project report below or download PDF
Unable to display PDF viewer.
Download Report PDF