Publications

conference paper

Conference/Proceedings	ACM International Conference on Multimedia Retrieval (ICMR)
Start date	17.04.2011
End date	20.04.2011
Pages	8
Author(s)	Pascal Kelm, Sebastian Schmiedeke, Thomas Sikora
Title	Multi-modal, Multi-resource Methods for Placing Flickr Videos on the Map
Abstract	We present three approaches for placing videos in Flickr on the world map. The toponym extraction and geo lookup ap- proach makes use of external resources to identify toponyms in the metadata and associate them with geo-coordinates. The metadata-based region model approach uses a k-nearest- neighbour classiﬁer trained over geographical regions. Videos are represented using their metadata in a text space with re- duced dimensionality. The visual region model approach uses a support vector machine also trained over geographical re- gions. Videos are represented using low-level feature vectors from multiple key frames. Voting methods are used to form a single decision for each video. We compare the approaches experimentally, highlighting the importance of using appro- priate metadata features and suitable regions as the basis of the region model. The best performance is achieved by the geo-lookup approach used with fallback to the visual region model when the video metadata contains no toponym.
Key words	compression, geo-localization, gazetteers, probabilistic latent semantic analysis, MPEG-7 visual features
DOI	10.1145/1991996.1992048
URL	http://dl.acm.org/ft_gateway.cfm?id=1992048&ftid=980972&dwn=1&CFID=57606239&CFTOKEN=98200645

[BibTeX]