Authors
Pierre Vinet, Pierre Dillenbourg, Amelieke Slot, Sharmila Selvanayakam, Sandra Giovanoli, Elisa Du, Julia Cardoso, Meret Branscheidt, Chris Easthope Awai, Christoph Michael Bauer
Published in
JMIR formative research. Volume 10. Pages e85230. Jul 03, 2026. Epub Jul 03, 2026.
Abstract
Dysarthria is a frequent motor speech disorder following a stroke, affecting up to 42% of survivors and resulting in reduced speech intelligibility and diminished quality of life. Clinical assessments, such as the Frenchay Dysarthria Assessment, Second Edition (FDA-2), rely heavily on the subjective judgment of speech-language pathologists (SLPs), which limits comparability and scalability. Telepractice solutions have the potential to extend access to care, but validated digital tools that combine automatic analysis with clinically usable interfaces remain scarce.
This study aimed to develop and evaluate a web-based application that integrates automatic speech recognition (ASR) and acoustic analysis into a user-centered dashboard for SLPs. Specifically, we investigated: (1) whether ASR can provide intelligibility scores comparable to those of human listeners; (2) the usability of the system in 2 iterative cycles with SLPs; and (3) the feasibility of presenting clinically relevant acoustic features to support telerehabilitation.
A user-centered design process was followed, involving contextual inquiry, requirements gathering, prototype development, and iterative testing with SLPs. The analytical core of the prototype included an ASR module (Whisper Large-v3) to compute intelligibility scores, combining word error rate-based accuracy with sentence-level and word-level alignment. Phoneme-level error highlighting was implemented to identify frequent substitution or deletion patterns. In parallel, an acoustic module extracted clinically relevant measures, including fundamental frequency (mean and range), intensity (mean and variability), and vowel formants (F1-F2 space), supplemented by sustained phonation duration. A pilot validation compared ASR-based intelligibility scores with transcriptions from 8 lay listeners for 3 patients with dysarthria performing the Frenchay Dysarthria Assessment-2 word and sentence tasks. Usability was evaluated in 2 cycles with 8 and 4 SLPs, respectively, using the System Usability Scale and structured questionnaires.
In the pilot validation, ASR performance was comparable to, and in some cases better than, untrained human listeners for individuals with mild and moderate dysarthria, though performance declined with severe cases. Both usability cycles yielded excellent System Usability Scale scores (cycle 1: mean 88.4, SD 4.6; cycle 2: mean 91.7, SD 4.1). Core workflow elements, including navigation, session upload, and intelligibility score presentation, were consistently rated highly. Feedback evolved from bug reports and requests for clearer terminology in cycle 1 to suggestions for advanced analytic features in cycle 2, such as additional voice-quality indices and integrated note-taking.
The prototype demonstrates that automatic intelligibility scoring and acoustic analysis can be integrated into a clinically usable, web-based dashboard. While current limitations include reliance on English-only phoneme analysis, limited advanced acoustic features, and lack of regulatory compliance, the application achieved excellent usability and shows promise for scalable telerehabilitation. Future work should expand multilingual support, incorporate additional acoustic measures, and validate the tool in larger clinical cohorts.
PMID:
42397858
Bibliographic data and abstract were imported from PubMed on 04 Jul 2026.
Read full publication at:
Please sign in
to see all details.
Advertisement
Stats
- Recommendations n/a n/a positive of 0 vote(s)
- Views 4
- Comments 0