A subscription to JoVE is required to view this content. Sign in or start your free trial.
Ultrasound imaging can be used to display the shape and movements of the tongue in real time during speech. The images can be used to determine the nature of speech sound errors. Visual feedback of the tongue can be used to facilitate improvements in speech sound production in clinical populations.
Diagnostic ultrasound imaging has been a common tool in medical practice for several decades. It provides a safe and effective method for imaging structures internal to the body. There has been a recent increase in the use of ultrasound technology to visualize the shape and movements of the tongue during speech, both in typical speakers and in clinical populations. Ultrasound imaging of speech has greatly expanded our understanding of how sounds articulated with the tongue (lingual sounds) are produced. Such information can be particularly valuable for speech-language pathologists. Among other advantages, ultrasound images can be used during speech therapy to provide (1) illustrative models of typical (i.e. "correct") tongue configurations for speech sounds, and (2) a source of insight into the articulatory nature of deviant productions. The images can also be used as an additional source of feedback for clinical populations learning to distinguish their better productions from their incorrect productions, en route to establishing more effective articulatory habits.
Ultrasound feedback is increasingly used by scientists and clinicians as both the expertise of the users increases and as the expense of the equipment declines. In this tutorial, procedures are presented for collecting ultrasound images of the tongue in a clinical context. We illustrate these procedures in an extended example featuring one common error sound, American English /r/. Images of correct and distorted /r/ are used to demonstrate (1) how to interpret ultrasound images, (2) how to assess tongue shape during production of speech sounds, (3), how to categorize tongue shape errors, and (4), how to provide visual feedback to elicit a more appropriate and functional tongue shape. We present a sample protocol for using real-time ultrasound images of the tongue for visual feedback to remediate speech sound errors. Additionally, example data are shown to illustrate outcomes with the procedure.
Both clinical and research settings have seen an increase in the use of ultrasound imaging to provide visual biofeedback intervention to individuals with speech disorders. One important use of ultrasound imaging for speech-language pathologists is as a visual biofeedback tool during intervention for individuals with speech disorders. With the guidance of a speech-language pathologist, learners can observe real-time video of the shape and movements of their tongue and discuss how these images may differ from the tongue movements needed to properly articulate a speech sound. To conduct such interventions, it is important for users to be competent in the interpretation of ultrasound images as the tongue moves in real time. Knowledge of the range of correct articulatory patterns used by typical speakers is foundational to recognizing erroneous tongue shapes.
The methods described herein address (a) collecting ultrasound images of the tongue, (b) interpreting ultrasound images associated with both correct and incorrect productions of speech sounds, and (c) using real-time ultrasound imaging as a source of visual biofeedback to facilitate speech production changes in individuals with speech sound errors. Although ultrasound can be used to visualize a variety of lingual phonemes, examples here will focus on ultrasound images of the tongue for the /r/ sound (as in red car), which is described as the most common residual error among children acquiring American English 1. It is also the sound that has been most extensively studied in clinical applications of ultrasound to date. 2-14
One important goal in speech (re)habilitation is to facilitate more intelligible speech by teaching articulatory routines that result in perceptually appropriate productions of a target sound or sequence. Therefore, it is critical to understand tongue actions during normal speech and during production of speech errors. Real-time visualization of the tongue can play a highly beneficial role in encouraging a speaker to modify articulatory movements, as it provides the clinician and client with a shared representation of what is actually happening during speech. Without real-time visualization of the tongue, only static pictures or verbal descriptions of target tongue configurations are available to facilitate understanding of the desired articulatory behaviors. In schema-based models of motor learning, visual information about the movements of the tongue during speech is considered a form of "knowledge of performance" feedback (i.e. it provides specific qualitative information about the movement that occurred)15. Previous research has indicated that detailed knowledge of performance feedback can facilitate acquisition of a novel motor routine16.
Ultrasound has several advantages over other technologies used to visualize speech. With ultrasound, the entire contour of the tongue can be visualized quickly from tip to root. Preparation for ultrasound imaging generally takes less than a minute.
In contrast, electropalatography (EPG) requires a dental impression and the creation of a customized pseudopalate (which may take weeks), and it can take time to adapt to speaking with the pseudo-palate 17. EPG also enables visualization of tongue-palate contact only in the region covered by the pseudopalate and cannot display the tongue root or the overall shape of the tongue. This limits the nature of what aspects of articulation can be effectively targeted with EPG.
Another alternative is electromagnetic articulography (EMA), which can provide general information about tongue shape and movement 18. However, EMA requires sensors to be glued to the tongue and other structures; thus, the set-up for this type of tongue imaging can take 20 - 30 min and may not be a viable method for frequent use. Thus, ultrasound may be viewed as more practical.
In the specific context of clinical research on the assessment and treatment of /r/ errors, the use of ultrasound has been reported in several studies for individuals with idiopathic speech sound disorders 2,10,11,13,19, hearing impairment 20, childhood apraxia of speech 12,21, and acquired apraxia of speech following a cerebral vascular accident 22. Studies have also reported the use of ultrasound to treat errors on other lingual phonemes such as /s k g l ʃ ʧ / 23,24. Additional populations that may be candidates include individuals with speech disorders related to cleft palate, or individuals learning pronunciation of sounds in a non-native language 25.
Ultrasound imaging may also be useful diagnostically, e.g., to characterize errors in lingual shapes,26,27, or to identify sub-perceptible or covert contrasts in disordered speech 28,29. If precise articulatory measurements are being obtained and compared, it is essential that the ultrasound be stabilized so that the coordinate space for measurement remains reasonably constant. However, it is generally agreed that an unstabilized probe yields information of sufficient quality for clinical diagnosis and treatment applications, which is the focus of the present paper.
Ethics Statement. When used in research, informed consent and/or assent from children is always required before collecting ultrasound images. When used clinically, clients should be informed of the purpose of the ultrasound imaging. Although diagnostic ultrasound imaging is considered "minimal risk" 30, users should always follow the ALARA principle when using ultrasound, meaning exposure to ultrasound should be as "As Low As Reasonably Achievable"31. This involves limiting acoustic power during imaging and also limiting exposure time. For example, if ultrasound is being used for visual feedback but the participant is not attending to the visual feedback, it would be prudent to discontinue imaging.
1. Collecting Ultrasound Images of the Tongue
NOTE: Technical Considerations. Diagnostic ultrasound probes are used to image the tongue. A frequency range between approximately 3 - 8 MHz with a frame rate of about 30 frames per second is recommended for clinical imaging the tongue 32.
NOTE: The instructions below apply to the diagnostic Ultrasound System (see Materials Table) with a C6-2 transducer, which was selected based on visual comparison of ultrasound images collected from several transducers available for this system. These instructions are adapted from the diagnostic ultrasound system reference manual for this device and are intended to be an illustrative example for one ultrasound. Many other ultrasound systems are in use, and users should consult the operating manuals of their specific device.
2. Interpreting Ultrasound Images of the Tongue
3. Using Real-time Ultrasound Images for Feedback to Remediate Speech Sound Errors
Figure 1 presents sample sagittal images of correct /r/ in a 9-year-old female. The ultrasound images are paired with magnetic resonance images from the same speaker to demonstrate the similar tongue shape that can be viewed with both technologies.
Figure 1: Sagittal View of a Magnetic Resonance Image during a Correctly Produced Ame...
Critical Steps within the Protocol
It is essential to obtain clear, interpretable images as described in steps 1.3 and 1.6. Poor image quality renders the procedures meaningless. Additionally, participants must be fully aware of what they are seeing on the screen. Therefore, orienting the participant to the image as described in 3.2 is a step that should be emphasized prior to providing visual feedback training. Additionally, step 3.10, which involves clearly describing differences in tongue shap...
Siemens Corporation provided a temporary loan of three Acuson X300 ultrasounds for research purposes at no cost to the authors.
The work was supported by NIH grants R01DC013668 (D. Whalen, PI) and R03DC013152 (J. Preston, PI).
Name | Company | Catalog Number | Comments |
ACUSON X300 ultrasound with C6-2 probe | Siemens | Acuson X300 | |
Trasceptic Spray | Parker labs | PLI 09-25 | |
Acquasonic 100 ultrasound gel | Parker labs | 01-08 |
Request permission to reuse the text or figures of this JoVE article
Request PermissionThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. All rights reserved