Researchers enhance elderly ASR with LLM paraphrasing and speech synthesis

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have developed a novel data augmentation technique to improve automatic speech recognition (ASR) for elderly individuals. This method utilizes large language models to paraphrase existing transcripts, generating elderly-contextual variations. These paraphrased texts are then converted into synthetic speech using text-to-speech synthesis with elderly reference speakers. Experiments demonstrated a significant reduction in word error rate, with up to a 58.2% improvement compared to baseline models. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Enhances ASR performance for specific demographics, potentially improving accessibility of voice technologies for the elderly.

RANK_REASON Academic paper detailing a new method for data augmentation in ASR.

Read on arXiv cs.CL →

paper
other

COVERAGE [1]

arXiv cs.CL TIER_1 · Minsik Lee, Seoi Hong, Chongmin Lee, Sieun Choi, Jian Kim, Jua Han, Jihie Kim · 2026-04-29 04:00

Elderly-Contextual Data Augmentation via Speech Synthesis for Elderly ASR

arXiv:2604.24770v1 Announce Type: new Abstract: Despite recent progress in automatic speech recognition (ASR), elderly ASR (EASR) remains challenging due to limited training data and the distinct acoustic and linguistic characteristics of elderly speech. In this work, we address …

COVERAGE [1]

Elderly-Contextual Data Augmentation via Speech Synthesis for Elderly ASR

RELATED ENTITIES

RELATED TOPICS