direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Page Content

Scientific Publications

Human vs Machine: Establishing a Human Baseline for Multimodal Location Estimation
Citation key 1423Choi2013
Author Jaeyoung Choi and Venkatesan Ekambaram and Howard Lei and Pascal Kelm and Luke Gottlieb and Thomas Sikora and Kannan Ramchandran and Gerald Friedland
Title of Book Human vs Machine: Establishing a Human Baseline for Multimodal Location Estimation
Year 2013
Month oct
Organization ACM
Abstract Over the recent years, the problem of video location estimation (i.e., estimating the longitude/latitude coordinates of a video without GPS information) has been approached with diverse methods and ideas in the research community and significant improvements have been made. So far, however, systems have only been compared against each other and no systematic study on human performance has been conducted. Based on a human-subject study with over 11,000 experiments, this article presents a human baseline for location estimation for different combinations of modalities (au- dio, audio/video, audio/video/text). Furthermore, this article reports on the comparison of the accuracy of state-of-the-art location estimation systems with the human baseline. Although the overall performance of humans’ multimodal video location estimation is better than current machine learning approaches, the difference is quite small: For 41 % of the test set, the machine’s accuracy was superior to the humans. We present case studies and discuss why machines did better for some videos and not for others. Our analysis suggests new directions and priorities for future work on the improvement of location inference algorithms.
Download Bibtex entry

Zusatzinformationen / Extras

Quick Access:

Schnellnavigation zur Seite über Nummerneingabe

Auxiliary Functions