Clarity in Motion Phase-1 Perceptual Study of Speech Intelligibility
NCT07020728 · Status: COMPLETED · Phase: NA · Type: INTERVENTIONAL · Enrollment: 72
Last updated 2025-06-25
Summary
Participants received a bilateral pure-tone hearing screen administered by the research team. All potential participants who failed the hearing screen were given information about what the result meant and a referral for further audiological testing.
Participants who passed the hearing screen and met the other inclusion criteria were divided into 6 groups, each of which was presented with 144 stimuli equally distributed among processing conditions. Listeners chose a comfortable listening level using supplied headphones and were able to control the rate of presentation. Following a short practice session, listeners were asked to transcribe each target sentence. The intelligibility of each stimulus was estimated by determining the mean percentage of content words correctly transcribed. After transcription, listeners were asked for two qualitative judgments: (1) the "clarity" of the stimulus, and (2) the "listening effort" involved. The quality of each stimulus was estimated by the median clarity judgment, and the listening effort by the median effort judgment. Listening sessions were held in a quiet room, and presentation was controlled by the SuperLab presentation software.
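The scoring described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the study's actual scoring procedure: the function-word stop list, the exact-match rule for words, the example sentence, and the numeric rating scales are all hypothetical, since the record does not specify them.

```python
import re
from statistics import mean, median

# Hypothetical stop list: the record does not say which words count as content words.
FUNCTION_WORDS = {"a", "an", "the", "of", "to", "in", "on", "and", "is", "was"}

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def percent_content_words_correct(target, transcript):
    """Percentage of the target's content words found in the transcript."""
    content = [w for w in tokenize(target) if w not in FUNCTION_WORDS]
    if not content:
        raise ValueError("target has no content words")
    heard = tokenize(transcript)
    hits = 0
    for word in content:
        if word in heard:
            heard.remove(word)  # each transcribed word can match only once
            hits += 1
    return 100.0 * hits / len(content)

def summarize_stimulus(target, responses):
    """responses: one (transcript, clarity, effort) tuple per listener."""
    scores = [percent_content_words_correct(target, t) for t, _, _ in responses]
    return {
        "intelligibility": mean(scores),                 # mean % content words correct
        "clarity": median(c for _, c, _ in responses),   # median clarity judgment
        "effort": median(e for _, _, e in responses),    # median effort judgment
    }

# Made-up responses from three listeners to one stimulus.
example = summarize_stimulus(
    "Say the word baseball again",
    [
        ("say the word baseball again", 5, 2),
        ("say the word basketball again", 3, 4),
        ("say a word", 2, 5),
    ],
)
```

In practice a scoring rule would also need to decide how to treat misspellings and homophones, which this sketch ignores.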
The stimuli consisted of audio recordings of target spondaic words embedded in a carrier sentence, produced by a male and a female native speaker of American English and recorded under quiet conditions. Each stimulus presented to the listeners for identification was either unmasked pristine speech or speech that had been processed in one of five ways with different mixtures of noise and sensor movement. The latter are identified as QoS Levels 1-5.
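With 144 stimuli and six conditions (unmasked speech plus five processed conditions), an equal distribution gives 144 / 6 = 24 stimuli per condition for each group. One way such a balanced assignment could be built is a simple rotation across the 6 listener groups, sketched below. The rotation scheme is an assumption made for illustration; the record states only that stimuli were equally distributed among conditions and does not describe how stimuli were assigned or ordered. The condition labels are taken from the intervention names in this record.

```python
CONDITIONS = ["Solo", "Raw", "StatScrub", "SlideSpch", "SlideNoise", "SlideMic"]
N_STIMULI = 144
N_GROUPS = 6

def condition_for(group, stimulus):
    """Rotate conditions so each group hears a given stimulus in a different condition."""
    return CONDITIONS[(stimulus + group) % len(CONDITIONS)]

def schedule(group):
    """Return the (stimulus index, condition) pairs for one listener group."""
    return [(s, condition_for(group, s)) for s in range(N_STIMULI)]

# Stimuli per condition for group 0 (hypothetical assignment).
per_condition = {
    name: sum(1 for _, c in schedule(0) if c == name) for name in CONDITIONS
}
```

Under this rotation every group hears 24 stimuli in each condition, and across the six groups every stimulus is heard once in each condition. Randomizing the presentation order within a session is left out of the sketch.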
Collectively, the estimates of word intelligibility, clarity, and listening effort under the different conditions shed light on how effectively the tested algorithm preserves speech intelligibility for listeners with acceptable effort and quality.
Conditions
- Healthy
Interventions
- BEHAVIORAL
-
Solo: Unmasked Speech Stimuli
Speech stimuli recorded using non-moving speakers and mics. No masking sources present. No blind source separation (BSS) applied to multi-channel recordings. Very high output QoS values.
- BEHAVIORAL
-
Raw: Fully masked speech--no motion stimuli
Speech stimuli recorded using non-moving speakers and mics. All masking sources present. No speech separation or extraction methods applied to multi-channel recordings. Very low output QoS values.
- BEHAVIORAL
-
StatScrub: Extracted Speech--no motion stimuli
Speech stimuli recorded using non-moving speakers and mics. All masking sources present. Joint ACES scrubbing of both noise sources applied to multi-channel recordings. Very high output QoS values.
- BEHAVIORAL
-
SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli
Speech stimuli recorded using linearly moving speech source and stationary masking sources and mics. All masking sources present. Joint ACES scrubbing of both noise sources applied to multi-channel recordings. Moderately high output QoS values.
- BEHAVIORAL
-
SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli
Mixed speech and noise sources recorded using a stationary speech source, a stationary noise source, and a linearly moving noise source. A valid source hypothesis of the speech source is used to extract the speech source. High output QoS values.
- BEHAVIORAL
-
SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli
Mixed speech and noise sources recorded using all-stationary sources and a linearly moving microphone (Mic 1). Joint ACES scrubbing of both noise sources is used to reduce the response of Mic 1 to a residue of speech. Low output QoS values.
Sponsors & Collaborators
-
National Institute on Deafness and Other Communication Disorders (NIDCD)
collaborator NIH
-
University of Cincinnati
collaborator OTHER
-
Speech Technology and Applied Research Corp.
lead INDUSTRY
Principal Investigators
-
Richard S Goldhor, PhD · Speech Technology & Applied Research Corp.
Study Design
- Allocation
- RANDOMIZED
- Purpose
- BASIC_SCIENCE
- Masking
- SINGLE
- Model
- CROSSOVER
Eligibility
- Min Age
- 18 Years
- Sex
- ALL
- Healthy Volunteers
- Yes
Timeline & Regulatory
- Start
- 2024-08-01
- Primary Completion
- 2024-08-31
- Completion
- 2024-08-31
Countries
- United States
Study Locations