AI-generated clinical summaries: errors and susceptibility to speech and speaker variability

April 24, 2026
, 1:22 pm

A VPN is an essential component of IT security, whether you’re just starting a business or are already up and running. Most business interactions and transactions happen online and VPN

Objectives

To evaluate whether variability in patients’ communication style (personality), international English-accents (human and synthetic) and speech impairments affects the accuracy of a Clinical AI Scribe (CAIS) and identify where performance degrades to inform pre-deployment validation and monitoring.

Methods

We conducted simulated primary-care consultations using trained actors. For personality types, four scenarios were enacted, each with five patient-personality types. For accents, transcripts of consultations were used to generate combinations of seven accents across five scenarios. The CAIS produced summaries that were compared with transcripts, and errors classified as omissions, factual inaccuracies or hallucinations. For speech impairments, public recordings representing five profiles were transcribed and word-recognition accuracy calculated.

Results

Personality types showed no statistically significant differences in errors (all p>0.05). Extraversion had the highest total errors (median 3.5). Across accents, comparisons were non-significant for both patient and doctor voices (patients: p=0.851; doctors: p=0.980). Omissions predominated, with low rates of hallucinations and factual inaccuracies. Omissions were slightly higher for Chinese-accented and Indian-accented doctors (both medians 3.0). Conversely, speech impairments differed: cleft palate and vowel disorders were near-perfect, whereas phonological impairment markedly reduced recognition (p<0.001).

Discussion

Operationally, CAIS deployment should include clinician-in-the-loop verification, subgroup performance monitoring (accents, impairments) and predefined ‘switch-off’ criteria for severe phonological patterns. High-quality synthetic voices are a pragmatic proxy for accent testing when balanced corpora are unavailable.

Conclusions

Under controlled conditions CAIS performance was broadly stable across communication styles and most accents, but vulnerable to specific speech characteristics, particularly phonological impairment, in this single-system simulation study.

Draper, T. C., Leake, J., Cox, T., Lamb-Riddell, K., Johns, B. E., McCormick, J., Trowell, S., Kiely, J., Luxton, R.

Leave a Replay

What Dentistry Is Teaching Healthcare About Operational AI

Patients are Turning to HCP Influencers in the Age of AI and Misinformation

Too Many Prescriptions, Too Many Fatal Falls

Sign up for our Newsletter

HealthGility provides healthcare organizations with a solution for patient protected health information disclosure and confidentiality that reduces workloads and ensures the safe and secure sharing of sensitive data between different providers.

AI-generated clinical summaries: errors and susceptibility to speech and speaker variability

Draper, T. C., Leake, J., Cox, T., Lamb-Riddell, K., Johns, B. E., McCormick, J., Trowell, S., Kiely, J., Luxton, R.

Leave a Replay

Related Posts

What Dentistry Is Teaching Healthcare About Operational AI

Patients are Turning to HCP Influencers in the Age of AI and Misinformation

Too Many Prescriptions, Too Many Fatal Falls

Follow Us

Sign up for our Newsletter

Our Company

Resources

Connect

© All rights reserved

Contact Us