University Library
  • Login
A gateway to Melbourne's research publications
Minerva Access is the University's Institutional Repository. It aims to collect, preserve, and showcase the intellectual output of staff and students of the University of Melbourne for a global audience.
View Item 
  • Minerva Access
  • Engineering and Information Technology
  • Infrastructure Engineering
  • Infrastructure Engineering - Research Publications
  • View Item
  • Minerva Access
  • Engineering and Information Technology
  • Infrastructure Engineering
  • Infrastructure Engineering - Research Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

    A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences

    Thumbnail
    Download
    Published version (4.999Mb)

    Citations
    Altmetric
    Author
    Acharya, D; Singha Roy, S; Khoshelham, K; Winter, S
    Date
    2020-10-01
    Source Title
    Sensors
    Publisher
    MDPI
    University of Melbourne Author/s
    Acharya, Debaditya; Winter, Stephan; Khoshelham, Kourosh; Acharya, Debaditya
    Affiliation
    Infrastructure Engineering
    Metadata
    Show full item record
    Document Type
    Journal Article
    Citations
    Acharya, D., Singha Roy, S., Khoshelham, K. & Winter, S. (2020). A Recurrent Deep Network for Estimating the Pose of Real Indoor Images from Synthetic Image Sequences. SENSORS, 20 (19), https://doi.org/10.3390/s20195492.
    Access Status
    Open Access
    URI
    http://hdl.handle.net/11343/251374
    DOI
    10.3390/s20195492
    Open Access at PMC
    http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582800
    Abstract
    Recently, deep convolutional neural networks (CNN) have become popular for indoor visual localisation, where the networks learn to regress the camera pose from images directly. However, these approaches perform a 3D image-based reconstruction of the indoor spaces beforehand to determine camera poses, which is a challenge for large indoor spaces. Synthetic images derived from 3D indoor models have been used to eliminate the requirement of 3D reconstruction. A limitation of the approach is the low accuracy that occurs as a result of estimating the pose of each image frame independently. In this article, a visual localisation approach is proposed that exploits the spatio-temporal information from synthetic image sequences to improve localisation accuracy. A deep Bayesian recurrent CNN is fine-tuned using synthetic image sequences obtained from a building information model (BIM) to regress the pose of real image sequences. The results of the experiments indicate that the proposed approach estimates a smoother trajectory with smaller inter-frame error as compared to existing methods. The achievable accuracy with the proposed approach is 1.6 m, which is an improvement of approximately thirty per cent compared to the existing approaches. A Keras implementation can be found in our Github repository.

    Export Reference in RIS Format     

    Endnote

    • Click on "Export Reference in RIS Format" and choose "open with... Endnote".

    Refworks

    • Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References


    Collections
    • Minerva Elements Records [52369]
    • Infrastructure Engineering - Research Publications [1250]
    Minerva AccessDepositing Your Work (for University of Melbourne Staff and Students)NewsFAQs

    BrowseCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects
    My AccountLoginRegister
    StatisticsMost Popular ItemsStatistics by CountryMost Popular Authors