University Library
  • Login
A gateway to Melbourne's research publications
Minerva Access is the University's Institutional Repository. It aims to collect, preserve, and showcase the intellectual output of staff and students of the University of Melbourne for a global audience.
View Item 
  • Minerva Access
  • Engineering and Information Technology
  • Computing and Information Systems
  • Computing and Information Systems - Research Publications
  • View Item
  • Minerva Access
  • Engineering and Information Technology
  • Computing and Information Systems
  • Computing and Information Systems - Research Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

    Fast query for large treebanks

    Thumbnail
    Download
    Fast query for large treebanks (97.77Kb)

    Citations
    Altmetric
    Author
    GHODKE, SUMUKH; BIRD, STEVEN
    Date
    2010
    Source Title
    Human Language Technologies: Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics
    Publisher
    Association for Computational Linguistics
    University of Melbourne Author/s
    GHODKE, SUMUKH; Bird, Steven
    Affiliation
    Engineering - Computer Science and Software Engineering
    Metadata
    Show full item record
    Document Type
    Conference Paper
    Citations
    Ghodke, S., & Bird, S. (2010). Fast query for large treebanks. In Human Language Technologies: Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, USA.
    Access Status
    Open Access
    URI
    http://hdl.handle.net/11343/27681
    Description

    This is a pre-print of a paper from Human Language Technologies: Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics 2010 published by Association for Computational Linguistics. http://naaclhlt2010.isi.edu/

    Abstract
    A variety of query systems have been developed for interrogatingparsed corpora, or treebanks. With the arrival of efficient,wide-coverage parsers, it is feasible to create very largedatabases of trees.However, existing approaches that use in-memory search,or relational or XML database technologies, do not scale up.We describe a method for storage, indexing, and query oftreebanks that uses an information retrieval engine.Several experiments with a large treebank demonstrateexcellent scaling characteristics for a wide rangeof query types. This work facilitates the curation ofmuch larger treebanks, and enables them to be used effectivelyin a variety of scientific and engineering tasks.

    Export Reference in RIS Format     

    Endnote

    • Click on "Export Reference in RIS Format" and choose "open with... Endnote".

    Refworks

    • Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References


    Collections
    • Computing and Information Systems - Research Publications [1565]
    Minerva AccessDepositing Your Work (for University of Melbourne Staff and Students)NewsFAQs

    BrowseCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects
    My AccountLoginRegister
    StatisticsMost Popular ItemsStatistics by CountryMost Popular Authors