UCL  IRIS
Institutional Research Information Service
UCL Logo
Please report any queries concerning the funding data grouped in the sections named "Externally Awarded" or "Internally Disbursed" (shown on the profile page) to your Research Finance Administrator. Your can find your Research Finance Administrator at https://www.ucl.ac.uk/finance/research/rs-contacts.php by entering your department
Please report any queries concerning the student data shown on the profile page to:

Email: portico-services@ucl.ac.uk

Help Desk: http://www.ucl.ac.uk/ras/portico/helpdesk
Publication Detail
Semi-automatic method for analysing vocal tract morphology from real-time magnetic resonance images
Abstract

Real-time magnetic resonance imaging (rtMRI) is a technique that provides high contrast videographic data of the vocal tract. These dynamic MRI scans capture 2D T1-weighted mid-sagittal slices that provide a rich description of complex movements inside the mouth and throat. These anatomical structures provide the physiological basis for behaviours such as speech, singing, expressions of emotion, and swallowing, which are otherwise not accessible for external observation. However, while this technology produces images of the dynamic morphological features of the vocal tract, taking quantitative measurements from these images is notoriously difficult. Traditionally, researchers have taken limited measurements of select morphological features, done so on a limited number of images, and in small sample sizes constrained by the extreme laboriousness of these procedures. We introduce a semi-automated processing pipeline that produces outlines of the vocal tract as a means to quantify the dynamic morphology of the vocal tract. Our approach uses simple tissue classification operating within researcher-specified constraints to facilitate feature extraction while retaining the involvement of a human analyst. We demonstrate that this pipeline generalises well to new datasets covering behaviours such as speech, vocal size exaggeration, laughter, and whistling as well as producing reliable outcomes across analysts, particularly among users with domain specific expertise. This approach provides considerable advantages in the ability to scale to larger datasets relative to manual measurements. We make this pipeline available for immediate use by the research community, and further suggest that it may contribute to the continued development of fully automated methods based on deep learning algorithms.

Publication data is maintained in RPS. Visit https://rps.ucl.ac.uk
 More search options
UCL Researchers
Author
Speech, Hearing & Phonetic Sciences
Author
Speech, Hearing & Phonetic Sciences
University College London - Gower Street - London - WC1E 6BT Tel:+44 (0)20 7679 2000

© UCL 1999–2011

Search by