Computing and Information Systems - Theses

Permanent URI for this collection

Search Results

Now showing 1 - 1 of 1
  • Item
    Thumbnail Image
    Internet host geolocation based on probabilistic latency models
    Arif, Mohammed Jubaer ( 2010)
    The robust and scalable growth of the Internet has allowed value added services that provide enhanced user experience. Offering information based on geographic location to Internet users is one of the newest and notable advancements. Finding the geographical location of the user on the Internet, commonly referred to as geolocation, is one of the challenging problems currently addressed by the research community. Geolocation of Internet hosts is not straightforward, mainly due to the underlying behavior of the IP network. In the conventional telephone network each telephone number has a physical address associated with it. On the contrary, an IP address does not have any notion of location or any physical location associated with it. Dynamic assignment of IP addresses makes the task of geolocation even harder. This thesis focuses on scalable and accurate approaches to finding the physical location of Internet users. We consider the scenario of a user connected to the Internet through a machine, which we refer to as a host. Of the two commonly used approaches, repository-based and measurement-based, we primarily focus on geolocating Internet hosts using the measurement-based approach. Existing measurement-based geolocation approaches are based on simple bounds on latency measurements and do not take into consideration the uncertainty associated with latency measurements effectively. We focus on improving host geolocation based on probabilistic models for Internet latency. In this thesis, we address the following issues: • We provide a systematic study to understand the relationship between Internet latency and distance. The observations from this study motivate us to propose a probabilistic model for the latency to distance relationship that captures the high variability of latency against distance. • We propose two distinct probabilistic latency models for the latency to distance relationship- strict and adaptable. • We propose and analyze two novel measurement-based geolocation techniques based on the latency models derived earlier. • The scalability and reliability of the proposed approaches are investigated and further improvements are proposed taking into account the relative landmark location and latency data anomalies. • We illustrate and demonstrate the applicability of geolocation techniques with different accuracy levels; in particular, with a novel Voice over IP (VoIP) architecture Service Oriented VoIP (SOVoIP). The main contributions of the thesis are more accurate and scalable measurement-based geolocation techniques, suitable for Internet-scale geolocation.