School of Languages and Linguistics - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 5 of 5
  • Item
    Thumbnail Image
    Building Speech Recognition Systems for Language Documentation: The CoEDL Endangered Language Pipeline and Inference System (ELPIS)
    Foley, B ; Arnold, J ; Coto-Solano, R ; Durantin, G ; Ellison, TM ; van Esch, D ; Heath, S ; Kratochvíl, F ; Maxwell-Smith, Z ; Nash, D ; Olsson, O ; Richards, M ; San, N ; Stoakes, H ; Thieberger, N ; Wiles, J (ISCA, 2018)
    Machine learning has revolutionized speech technologies for major world languages, but these technologies have generally not been available for the roughly 4,000 languages with populations of fewer than 10,000 speakers. This paper describes the development of ELPIS, a pipeline which language documentation workers with minimal computational experience can use to build their own speech recognition models, resulting in models being built for 16 languages from the Asia-Pacific region. ELPIS puts machine learning speech technologies within reach of people working with languages with scarce data, in a scalable way. This is impactful since it enables language communities to cross the digital divide, and speeds up language documentation. Complete automation of the process is not feasible for languages with small quantities of data and potentially large vocabularies. Hence our goal is not full automation, but rather to make a practical and effective workflow that integrates machine learning technologies.
  • Item
    Thumbnail Image
    Nasal aerodynamics and coarticulation in Bininj Kunwok: Smoothing Spline Analysis of Variance
    STOAKES, H ; Fletcher, J ; Butcher, AR ; Carignan, C ; Tyler, M (ASSTA, 2016-12-06)
    Nasal phonemes are well represented within the lexicon of BininjKunwok.1 Thisstudyexaminesintervocalic,wordmedial nasals and reports patterns of coarticulation using a Smooth- ing Spline Analysis of Variance (SSANOVA). This allows for detailed comparisons of peak nasal airflow across six female speakers of the language. Results show that in a VNV sequence there is very little anticipatory vowel nasalisation and greater carryover into a following vowel. The maximum peak nasal flow is delayed for coronals when compared to the onset of oral closure in the nasal, indicating a delayed velum opening gesture. The velar place of articulation is the exception to this pattern with some limited anticipatory nasalisation. The SSANOVA has shown to be an appropriate technique for quantifying these patterns and dynamic speech data in general.
  • Item
    Thumbnail Image
    Building Speech Recognition Systems for Language Documentation: The CoEDL Endangered Language Pipeline and Inference System (ELPIS)
    Foley, B ; Arnold, J ; Coto-Solano, R ; Durantin, G ; Mark, E ; van Esch, D ; Heath, S ; Kratochvíl, F ; Maxwell-Smith, Z ; Nash, D ; Olsson, O ; Richards, M ; San, N ; Stoakes, H ; Thieberger, N ; Wiles, J (International Speech Communication Association, 2018-08-30)
    Machine learning has revolutionised speech technologies for major world languages, but these technologies have generally not been available for the roughly 4,000 languages with populations of fewer than 10,000 speakers. This paper describes the development of Elpis, a pipeline which language documentation workers with minimal computational experience can use to build their own speech recognition models, resulting in models being built for 16 languages from the Asia-Pacific region. Elpis puts machine learning speech technologies within reach of people working with languages with scarce data, in a scalable way. This is impactful since it enables language communities to cross the digital divide, and speeds up language documentation. Complete automation of the process is not feasible for languages with small quantities of data and potentially large vocabularies. Hence our goal is not full automation, but rather to make a practical and effective workflow that integrates machine learning technologies.
  • Item
    Thumbnail Image
    Intonational correlates of subject and object realisation in Mawng (Australian)
    FLETCHER, J ; Stoakes, H ; Singer, R ; Loakes, D ; BARNES, J ; VEILLEUX, N ; SHATTUCK-HUFNAGEL, S ; BRUGOS, A (ISCA, 2016)
    A range of intonational devices can be used in the grammar of information and corrective focus marking in languages with relatively free word order. In this paper we explore whether nouns in the Australian Indigenous language Mawng are realised differently depending on syntactic function and focus. Results show that the pitch level associated with Subjects is higher in conditions of corrective focus compared to other utterance contexts and there is a strong correlation between focus and utterance position. Placing a word in a corrective focus context does not appear to have an effect on word duration in this corpus confirming that pitch register variation and intonational phrasing are the major prosodic cues associated with corrective focus in Mawng.
  • Item
    Thumbnail Image
    Accentual prominence and consonant lengthening and strengthening in Mawng
    Fletcher, J ; Stoakes, H ; Loakes, D ; Singer, R ; Wolters, M ; Livingstone, J ; Beattie, B ; Smith, R ; MacMahon, M ; Stuart-Smith, J ; Scobbie, J (University of Glasgow, 2015)