School of Languages and Linguistics - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 7 of 7
  • Item
    Thumbnail Image
    “And healthcare will be cured too”: transformations of winged phrases from Soviet films in modern Russian newspaper headings
    Kabiak, N (Sveučilište Jurja Dobrile u Puli / Juraj Dobrila University of Pula, Croatia Zagrebačka 30, 52100 Pula, Hrvatska, 2022-08-29)
    This paper examines the usage of “winged phrases” from Soviet cinema as they make their way from their place as film quotations into modern Russian – more specifically, into newspaper headings. The focus of this study is the process of the transformation of winged phrases in article headings within three contemporary Russian newspapers: “Комсомольская правда – Москва” (Komsomolʹs Truth – Moscow), “Известия” (News) and “Литературная газета” (Literary Gazette). The paper draws on analysis of 151 newspaper headings for the period from 1st January 2017 until 1st July 2018. An interpretation of transformed winged phrases embedded into newspaper headings is presented, drawing upon Lotman’s writings on cultural semiotics. It is argued that when winged phrases travel across time and across a “semiotic border” (Lotman) they undergo transformations from Soviet films into a contemporary context while retaining a dialogic connection with the past.
  • Item
    No Preview Available
    Prosodic phrasing, pitch range and word order variation in Murrinhpatha
    Fletcher, J ; Kidd, E ; Stoakes, H ; Nordlinger, R ; Rosey, B (ASSTA, 2022)
    Like many Indigenous Australian languages, Murrinhpatha has flexible word order with no apparent configurational syntax. We analyzed an experimental corpus of Murrinhpatha utterances for associations between different thematic role orders, intonational phrasing patterns and pitch downtrends. We found that initial constituents (Agents or Patients) tend to carry the highest pitch targets (HiF0), followed by patterns of downstep and declination. Sentence-final verbs always have lower Hif0 values than either initial or medial Agents or Patients. Thematic role order does not influence intonational patterns, with the results suggesting that Murrinhpatha has positional prosody, although final nominals can disrupt global pitch downtrends regardless of thematic role.
  • Item
    Thumbnail Image
    Stop (de)gemination in Veneto Italian: The role of durational correlates.
    Dian, A ; Hajek, J ; Fletcher, J ; Billington, R (Australasian Speech Science and Technology Association, 2022-12-16)
    This preliminary study investigates a long-assumed but previously untested degemination of stops in the regional variety of Italian spoken in the Veneto, in North-East Italy. The durational parameters known to be affected by gemination in Italian – i.e., consonant duration, duration of the preceding vowel and the ratio between the two – are considered. The entire Italian stop series is investigated through an acoustic-phonetic production experiment involving six speakers reading a set of carrier sentences designed to elicit different prosodic patterns. Partial degemination is observed for most speakers in terms of (a) decreased geminate-singleton consonant duration differences compared to previous studies on other Italian varieties, and (b) considerable overlap between geminate and singleton consonant-to-vowel duration ratio categories. Possible sociophonetic effects are discussed.
  • Item
    Thumbnail Image
    The Pacific Expansion: Optimizing phonetic transcription of archival corpora
    Billington, R ; Stoakes, H ; Thieberger, N (ISCA-INT SPEECH COMMUNICATION ASSOC, 2021)
    For most of the world’s languages, detailed phonetic analyses across different aspects of the sound system do not exist, due in part to limitations in available speech data and tools for efficiently processing such data for low-resource languages. Archival language documentation collections offer opportunities to extend the scope and scale of phonetic research on low-resource languages, and developments in methods for automatic recognition and alignment of speech facilitate the preparation of phonetic corpora based on these collections. We present a case study applying speech modelling and forced alignment methods to narrative data for Nafsan, an Oceanic language of central Vanuatu. We examine the accuracy of the forced-aligned phonetic labelling based on limited speech data used in the modelling process, and compare acoustic and durational measures of 17,851 vowel tokens for 11 speakers with previous experimental phonetic data for Nafsan. Results point to the suitability of archival data for large-scale studies of phonetic variation in low-resource languages, and also suggest that this approach can feasibly be used as a starting point in expanding to phonetic comparisons across closely-related Oceanic languages.
  • Item
    Thumbnail Image
    Developing a workforce to support research reliant on data and compute
    Turpin, A ; Gruba, P ; Pozanenko, A ; Stupnikov, S ; Thalheim, B ; Mendez, E ; Kiselyova, N (CEUR, 2021-01-01)
    We describe the construction, operation and evaluation of the Melbourne Data Analytics Platform; a group of academics whose mission is to support research requiring non-trivial data analysis or compute at the University of Melbourne.
  • Item
    Thumbnail Image
    AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children's Speech
    Ahmed, B ; Ballard, KJ ; Burnham, D ; Sirojan, T ; Mehmood, H ; Estival, D ; Baker, E ; Cox, F ; Arciuli, J ; Benders, T ; Demuth, K ; Kelly, B ; Diskin-Holdaway, C ; Shahin, M ; Sethu, V ; Epps, J ; Lee, CB ; Ambikairajah, E (ISCA, 2021)
    Here we present AusKidTalk [1], an audio-visual (AV) corpus of Australian children’s speech collected to facilitate the development of speech based technological solutions for children. It builds upon the technology and expertise developed through the collection of an earlier corpus of Australian adult speech, AusTalk [2,3]. This multi-site initiative was established to remedy the dire shortage of children’s speech corpora in Australia and around the world that are sufficiently sized to train accurate automated speech processing tools for children. We are collecting ~600 hours of speech from children aged 3–12 years that includes single word and sentence productions as well as narrative and emotional speech. In this paper, we discuss the key requirements for AusKidTalk and how we designed the recording setup and protocol to meet them. We also discuss key findings from our feasibility study of the recording protocol, recording tools, and user interface.
  • Item
    No Preview Available
    The Language Documentation Quartet
    Musgrave, S ; Thieberger, N (University of Colorado at Boulder, 2021)
    As we noted in an earlier paper (Musgrave & Thieberger 2012), the written description of a language is an essentially hypertextual exercise, linking various kinds of material in a dense network. An aim based on that insight is to provide a model that can be implemented in tools for language documentation, allowing instantiation of the links always followed in writing a grammar or a dictionary, tracking backwards and forwards to the texts and media as the source of authority for claims made in an analysis. Our earlier paper described our initial efforts to encode Heath’s (1984) grammar, texts (1980), and dictionary (1982) of Nunggubuyu, an Australian language from eastern Arnhemland. We chose this body of work because it was written with many internal links between the three volumes. The links are all encoded with textual indexes which looked to be ready to be instantiated as automated hyperlinks once the technology was available. In this paper, we discuss our progress in identifying how the four component parts of a description (grammar, text, dictionary, media, henceforth the quartet) can be interlinked, what are the logical points at which to join them, and whether there are practical limits to how far this linking should be carried. We suggest that the problems which are exposed in this process can inform the development of an abstract or theoretical data structure for each of the components and these in turn can provide models for language documentation work which can feed into hypertext presentations of the type we are developing.