
International study highlights demographic biases in AI models diagnosing skin diseases from images.
Key Details
- 1Researchers evaluated ChatGPT-4 and LLaVA on 10,000 dermatoscopic images of skin diseases.
- 2Study assessed diagnostic accuracy and fairness regarding sex and age groups.
- 3ChatGPT-4 showed better demographic fairness than LLaVA, which had marked sex-based biases.
- 4Both AI models outperformed traditional deep learning approaches overall.
- 5Calls made for considering demographic fairness before clinical deployment of AI in healthcare.
- 6Further research planned to evaluate impact of skin tone and other demographic factors.
Why It Matters
Addressing bias in AI diagnostic tools is essential to ensure equitable healthcare outcomes. This study provides critical insights and grounds for improvement in AI model development for medical imaging.

Source
EurekAlert
Related News

•EurekAlert
New Framework Compares AI Segmentation Without Ground Truth Annotations
Researchers introduce an open-source approach for evaluating AI anatomy segmentation models in medical imaging without requiring ground truth annotations.

•EurekAlert
AI Model Uses EKG and EHR Data to Predict Sudden Cardiac Arrest
Researchers have developed AI models that analyze EKG and EHR data to predict risk of sudden cardiac arrest in the general population.

•EurekAlert
AI-Driven Handheld Endomicroscope Enhances Early Cancer Detection
Researchers develop PrecisionView, a handheld AI-powered endomicroscope for real-time, high-resolution cancer diagnostics.