Your browser doesn't support javascript.
loading
Smartphone Recordings are Comparable to "Gold Standard" Recordings for Acoustic Measurements of Voice.
Awan, Shaheen N; Shaikh, Mohsin Ahmed; Awan, Jordan A; Abdalla, Ibrahim; Lim, Kelvin O; Misono, Stephanie.
Afiliación
  • Awan SN; University of South Florida, Dept. of Communication Sciences & Disorders, Tampa FL 33620. Electronic address: awans@usf.edu.
  • Shaikh MA; Commonwealth University of Pennsylvania, Dept. of Communication Sciences & Disorders, Bloomsburg PA 17815.
  • Awan JA; Purdue University, Dept. of Statistics, Mathematical Sciences Building, 150 N. University Street, West Lafayette, IN 47907.
  • Abdalla I; University of Minnesota Medical School, 420 Delaware Street SE, Minneapolis, MN 55455.
  • Lim KO; University of Minnesota Medical School, Dept. of Psychiatry and Behavioral Sciences, 420 Delaware Street SE, Minneapolis, MN 55455.
  • Misono S; University of Minnesota Medical School, Division of Laryngology, Department of Otolaryngology, Head and Neck Surgery, 420 Delaware Street SE, Minneapolis, MN 55455.
J Voice ; 2023 Apr 03.
Article en En | MEDLINE | ID: mdl-37019804
PURPOSE: The purpose of this study was to assess the relationship and comparability of cepstral and spectral measures of voice obtained from a high-cost "flat" microphone and precision sound level meter (SLM) vs. high-end and entry level models of commonly and currently used smartphones (iPhone i12 and iSE; Samsung s21 and s9 smartphones). Device comparisons were also conducted in different settings (sound-treated booth vs. typical "quiet" office room) and at different mouth-to-microphone distances (15 and 30 cm). METHODS: The SLM and smartphone devices were used to record a series of speech and vowel samples from a prerecorded diverse set of 24 speakers representing a wide range of sex, age, fundamental frequency (F0), and voice quality types. Recordings were analyzed for the following measures: smoothed cepstral peak prominence (CPP in dB); the low vs high spectral ratio (L/H Ratio in dB); and the Cepstral Spectral Index of Dysphonia (CSID). RESULTS: A strong device effect was observed for L/H Ratio (dB) in both vowel and sentence contexts and for CSID in the sentence context. In contrast, device had a weak effect on CPP (dB), regardless of context. Recording distance was observed to have a small-to-moderate effect on measures of CPP and CSID but had a negligible effect on L/H Ratio. With the exception of L/H Ratio in the vowel context, setting was observed to have a strong effect on all three measures. While these aforementioned effects resulted in significant differences between measures obtained with SLM vs. smartphone devices, the intercorrelations of the measurements were extremely strong (r's > 0.90), indicating that all devices were able to capture the range of voice characteristics represented in the voice sample corpus. Regression modeling showed that acoustic measurements obtained from smartphone recordings could be successfully converted to comparable measurements obtained by a "gold standard" (precision SLM recordings conducted in a sound-treated booth at 15 cm) with small degrees of error. CONCLUSIONS: These findings indicate that a variety of commonly available modern smartphones can be used to collect high quality voice recordings usable for informative acoustic analysis. While device, setting, and distance can have significant effects on acoustic measurements, these effects are predictable and can be accounted for using regression modeling.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: J Voice Asunto de la revista: OTORRINOLARINGOLOGIA Año: 2023 Tipo del documento: Article Pais de publicación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: J Voice Asunto de la revista: OTORRINOLARINGOLOGIA Año: 2023 Tipo del documento: Article Pais de publicación: Estados Unidos