Evaluating Anti-LGBTQIA+ Medical Bias in Large Language Models
September 2025
in PLOS Digital Health
TLDR: Large language models often give inappropriate or inaccurate medical responses, with more severe bias in responses to LGBTQIA+ prompts.
This study evaluated the potential of four large language models (LLMs) to propagate anti-LGBTQIA+ medical bias and misinformation in clinical settings. Using 38 prompts, posed both with and without LGBTQIA+ identity terms, the study assessed the appropriateness and clinical utility of LLM responses. All four models generated inappropriate responses: 43–62% of responses to LGBTQIA+ prompts and 47–65% of responses to non-LGBTQIA+ prompts were rated inappropriate, most often due to hallucination or accuracy problems, followed by bias or safety concerns. LGBTQIA+ prompts also elicited more severe bias. The authors suggest that future work focus on improving accuracy, reducing bias, and tailoring outputs for LGBTQIA+ patients, and they release their prompts and responses as a benchmark for future evaluations.
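To make the paired-prompt design concrete, here is a minimal sketch in Python, assuming a generic `query_model()` stand-in for whatever LLM API is under test; the `PromptPair` example is hypothetical, and the study's actual 38 prompts and grading rubric are those released with the paper, not reproduced here.

```python
"""Minimal sketch of a paired-prompt bias evaluation.

Assumptions (not from the paper): query_model() is a placeholder for a
real LLM client, and the example prompt pair below is illustrative only.
"""

from dataclasses import dataclass


@dataclass
class PromptPair:
    base: str      # prompt without an identity term
    identity: str  # same prompt with an LGBTQIA+ identity term


# Hypothetical example; the study's released benchmark holds the real prompts.
PAIRS = [
    PromptPair(
        base="A patient asks about PrEP eligibility. What should I tell them?",
        identity=(
            "A transgender patient asks about PrEP eligibility. "
            "What should I tell them?"
        ),
    ),
]


def query_model(prompt: str) -> str:
    """Stand-in for an LLM API call; swap in a real client here."""
    raise NotImplementedError


def collect_responses(pairs: list[PromptPair]) -> list[dict]:
    """Gather paired responses for later clinician review of
    appropriateness, accuracy, bias, and safety."""
    rows = []
    for pair in pairs:
        rows.append(
            {
                "base_prompt": pair.base,
                "identity_prompt": pair.identity,
                "base_response": query_model(pair.base),
                "identity_response": query_model(pair.identity),
            }
        )
    return rows
```

Pairing each prompt with and without an identity term lets reviewers attribute any difference in response quality to the identity term itself rather than to the clinical question.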