- Computing and Information Systems - Research Publications
Computing and Information Systems - Research Publications
Permanent URI for this collection
Search Results
Now showing
1 - 10 of 13
-
ItemTalkBank: Building an open unified multimodal database of communicative interactionMacWhinney, B ; Bird, S ; Cieri, C ; Martell, C (Evaluations and Language resources Distribution Agency, 2004-01-01)
-
ItemNLTK: The Natural Language ToolkitBIRD, SG ; LOPER, E (Association for Computational Linguistics, 2004)
-
ItemTowards a General Model for Linguistic ParadigmsPENTON, D ; BOW, C ; BIRD, S ; HUGHES, B (emeld.org, 2004)
-
ItemSecuring interpretability: The case of ega language documentationGibbon, D ; Bow, C ; Bird, S ; Hughes, B (Evaluations and Language resources Distribution Agency, 2004-01-01)
-
ItemManagement of metadata in linguistic fieldwork: Experience from the ACLA projectHughes, B ; Penton, D ; Bird, S ; Bow, C ; Wigglesworth, G ; McConvell, P ; Simpson, J (European Language Resource Association, 2004-01-01)
-
ItemFunctional requirements for an interlinear text editorHUGHES, BADEN ; BOW, CATHERINE ; BIRD, STEVEN (European Language Resources Association, 2004)Interlinear text has long been considered a valuable format in the presentation of multilingual data, and a variety of software tools have facilitated the creation and processing of such texts by researchers. Despite the diversity of tools, a common core of editorial functionality is provided. Identifying these core functions has important implications for software engineers who seek to efficiently build tools that support interlinear text editing. While few applications are specifically designed for the creation or manipulation of interlinear text, a number of tools offer varying degrees of incidental support for this modality. In this paper we provide a comprehensive set of critieria upon which the derivation of functional criteria can be based. We describe the basis on which a group of tools was selected for investigation, along with the evaluation criteria. Finally we consolidate our findings into a functional specification for the development of software applications for the editing of interlinear text.
-
ItemQuerying and Updating Treebanks: A Critical Survey and Requirements AnalysisLAI, C ; BIRD, SG (Australian Speech Science and Technology Association, 2004)
-
ItemRepresenting and Rendering Linguistic ParadigmsPENTON, DJ ; BIRD, SG (Australian Speech Science and Technology Association, 2004)
-
ItemTowards a General Model of Linguistic ParadigmsPenton, D. ; Bow, C. ; Bird, S. G. ; Hughes, B. ( 2004-07)Linguistic forms are inherently multi-dimensional. They exhibit a variety of phonological, orthographic, morphosyntactic, semantic and pragmatic properties. Accordingly, linguistic analysis involves multi-dimensional exploration, a process in which the same collection of forms are laid out in many ways until clear patterns emerge. Equally, language documentation usually contains tabulations of linguistic forms to illustrate systematic patterns and variations. In all such cases, multi-dimensional data is projected onto a two-dimensional table known as a linguistic paradigm, the most widespread format for linguistic data presentation. In this paper we survey a representative sample of paradigms and develop a simple relational data model. We show how XML technologies can be used to store and render paradigms. The result is a flexible and extensible model for the storage, interchange and delivery of linguistic paradigms.
-
ItemA Four-Level Model for Interlinear TextBow, C. ; Hughes, B. ; Bird, S. G. ( 2003)Interlinear text has long been a valuable device in language documentation and linguistic description. However, the task of creating, editing and publishing interlinear text is an onerous one. Interlinear text is governed by simple rules, yet laborious manual formatting in a word processor is the norm. A handful of specialized software tools facilitate the creation of interlinear text, permitting customizable views and alignment to audio and video. However, word processors and specialized software alike fail to deliver on a key promise of digitization, namely reusability. In order to facilitate reusability, we have developed a general-purpose conceptual model of interlinear text consisting of four levels: text, phrase, word and morph. The details of the model are informed by our analysis of a representative sample of current practice. We have implemented the model using standard XML technologies.