Comparative Analysis of LLMs for Mental Health Counseling

Comprehensive evaluation of proprietary and open-source LLMs on counseling capabilities with systematic quality assessment.

Jan 2025 - Mar 2025

Technologies

PythonPyTorchHuggingface TransformersLLMsEvaluation Metrics

About This Project

Evaluated proprietary and open-source LLMs (ChatGPT, Claude, LLaMA, Deepseek) on counseling capabilities using classification and generation tasks with comprehensive metrics. Developed detailed annotation guidelines and analyzed inter-rater reliability to systematically assess therapeutic response quality across models. The study examines empathy, safety, clinical appropriateness, and actionability of AI-generated counseling responses, providing insights into the readiness of LLMs for mental health applications.

Complete Report

View the complete project report below or download PDF

Unable to display PDF viewer.

Download Report PDF