direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Page Content

Scientific Publications

Jang-Heon Kim and Thomas Sikora (2006). Robust Anisotropic Disparity Estimation with Perceptual Maximum Variation Modeling. IEEE Int. Conf. on Image Processing (ICIP'06)

Link to publication

Jang-Heon Kim and Thomas Sikora (2006). Color Image Noise Reduction using Perceptual Maximum Variation Modeling for Color Diffusion. 7th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2006)

Link to publication

Jang-Heon Kim and Thomas Sikora (2006). Anisotropic Scene Geometry Resampling with Occlusion Filling for 3DTV Applications. Conf. on Stereoscopic Displays and Applications XVII, IS&T/SPIE's Electronic Imaging 2006

Link to publication

Jang-Heon Kim and Matthias Kunter and Thomas Sikora (2005). Depth Diffusion Objects (DeDiO) - A Seamless Object-based Approach for TV Applications. 2nd Workshop on Immersive Communication and Broadcast Systems (ICOB '05)

Link to publication

Hyoung-Gook Kim and Nicolas Moreau and Thomas Sikora (2005). MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval. John Wiley & Sons, 304.


Jang-Heon Kim and Thomas Sikora (2005). Hybrid Recursive Energy-based Method for Robust Optical Flow on Large Motion Fields. IEEE Int. Conf. on Image Processing (ICIP '05)

Link to publication

Jang-Heon Kim and Thomas Sikora (2005). Gaussian Scale-space Dense Disparity Estimation with Anisotropic Disparity-field Diffusion. IEEE Int. Conf. on 3-D Digital Imaging and Modeling (3DIM '05)

Link to publication

Hyoung-Gook Kim and Daniel Ertelt and Thomas Sikora (2005). Hybrid Speaker-Based Segmentation System Using Model-Level Clustering. ICASSP 2005

Link to publication

Hyoung-Gook Kim and Steffen Roeber and Amjad Samour and Thomas Sikora (2005). Detection Of Goal Events In Soccer Videos. IS&T/SPIE's Electronic Imaging 2005

Link to publication

Hyoung-Gook Kim and Thomas Sikora (2004). Speech Enhancement based on Smoothing of Spectral Noise Floor. INTERSPEECH 2004 - ICSLP

Link to publication


Hyoung-Gook Kim and Juan José Burred and Thomas Sikora (2004). How efficient is MPEG-7 for General Sound Recognition?. 25th International AES Conference Metadata for Audio

Link to publication

Hyoung-Gook Kim and Martin Haller and Thomas Sikora (2004). Comparison of MPEG-7 Basis Projection Features and MFCC applied to Robust Speaker Recognition. ISCA - A Speaker Odyssey 2004

Link to publication


Hyoung-Gook Kim and Nicolas Moreau and Thomas Sikora (2004). Audio Classification Based on MPEG-7 Spectral Basis Representations. IEEE Transactions on Circuits and Systems for Video Technology 7, Special Issue on Audio and Video Analysis for Multimedia Interactive Services, 16–725.

Link to publication

Pascal Kelm and Vanessa Murdock and Sebastian Schmiedeke and Steven Schockaert and Pavel Serdyukov and Olivier Van Laere (2012). Georeferencing in Social Networks. Social Media Retrieval. Springer.


Pascal Kelm and Sebastian Schmiedeke and Thomas Sikora (2012). Multimodal Geo-tagging in Social Media Websites using Hierarchical Spatial Segmentation. Proceedings of the 20th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 8.

Link to publication

Pascal Kelm and Sebastian Schmiedeke and Thomas Sikora (2012). How Spatial Segmentation improves the Multimodal Geo-Tagging. Working Notes Proceedings of the MediaEval 2012 Workshop. CEUR-WS.org, 9–10.

Link to publication

Pascal Kelm and Sebastian Schmiedeke and Thomas Sikora (2011). A Hierarchical, Multi-modal Approach for Placing Videos on the Map using Millions of Flickr Photographs. ACM Multimedia 2011 (Workshop on Social and Behavioral Networked Media Access - SBNMA)

Link to publication

Pascal Kelm and Sebastian Schmiedeke and Kai Clüver and Thomas Sikora (2011). Automatic Geo-referencing of Flickr Videos. NEM Summit 2011. Sigma Orionis, 76–80.

Link to publication

Pascal Kelm and Sebastian Schmiedeke and Thomas Sikora (2011). Multi-modal, Multi-resource Methods for Placing Flickr Videos on the Map. ACM International Conference on Multimedia Retrieval (ICMR), 8.

Link to publication

Pascal Kelm and Sebastian Schmiedeke and Thomas Sikora (2010). Video2GPS: Geotagging using collaborative systems, textual and visual features. Video2GPS: Geotagging using collaborative systems, textual and visual features

Link to publication

Pascal Kelm and Sebastian Schmiedeke and and Thomas Sikora (2009). FEATURE-BASED VIDEO KEY FRAME EXTRACTION FOR LOW QUALITY VIDEO. Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2009), pp.25–28.

Link to publication

Pascal Kelm and Sebastian Schmiedeke and Steven Schockaert and Thomas Sikora and Michele Trevisiol and Olivier Van Laere (2014). Georeferencing Flickr resources based on multimodal features. Multimodal Location Estimation of Videos and Images. Springer International Publishing, 127–152.


Pascal Kelm and Sebastian Schmiedeke and Jaeyoung Choi and Gerald Friedland and Venkatesan Nallampatti Ekambaram and Kannan Ramchandran and Thomas Sikora (2013). A Novel Fusion Method for Integrating Multiple Modalities and Knowledge for Multimodal Location Estimation. Proceedings of the 2nd ACM International Workshop on Geotagging and Its Applications in Multimedia. ACM, New York, NY, USA, 7–12.

Link to publication

Mustafa Karaman and Lutz Goldmann and Thomas Sikora (2009). Improving object segmentation by reflection detection and removal. Visual Communications and Image Processing (VCIP), IS&T/SPIE's Electronic Imaging 2009


Mustafa Karaman and Lutz Goldmann and Thomas Sikora (2006). A New Segmentation Approach Using Gaussian Color Model and Temporal Information. Visual Communications and Image Processing (VCIP), IS&T/SPIE's Electronic Imaging 2006

Link to publication

Mustafa Karaman and Lutz Goldmann and Da Yu and Thomas Sikora (2005). Comparison of Static Background Segmentation Methods. Visual Communications and Image Processing (VCIP '05)

Link to publication

Andreas Kanbach and Andreas Körber (1999). ISDN - Die Technik; Schnittstellen - Protokolle - Dienste - Endsysteme. Hüthig, 549.


Andreas Kanbach and Andreas Körber (1991). ISDN - Die Technik. Hüthig, Heidelberg.


T. Kamceva and Detlef Hardt and G. Klassmeyer (1998). Mögliche Zusammenhänge zwischen den Ergebnissen einer Analyse von Formantverläufen und der Fehlerrate eines textabhängigen Sprecherverifizierungssystem bei unterschiedlichen Sprechweisen. 9. Konferenz "Elektronische Sprachsignalverarbeitung" und 5. ITG-Fachtagung "Sprachkommunikation", 41–44.


Florian Kaiser and Thomas Sikora (2011). Multi-Probe Histograms: A Mid-Level Harmonic Feature for Music Structure Segmentation. 14th International Conference on Digital Audio Effects (DAFx)


Florian Kaiser and Marina Georgia Arvanitidou and Thomas Sikora (2011). Audio Similarity Matrices Enhancement in an Image Processing Framework. 9th International Workshop on Content-Based Multimedia Indexing (CBMI)

Link to publication

Florian Kaiser (2011). Audio Signal Representations for Temporal Structure Segmentation. Dagstuhl Seminar on Multimodal Music Processing

Link to publication

Florian Kaiser and Thomas Sikora (2010). Détection de structure en musique par décomposition non-négative des matrices de similarités. Journées Jeunes Chercheurs en Audition, Acoustique musicale et Signal audio (JJCAAS)


Florian Kaiser and Thomas Sikora (2010). Music Structure Discovery in Popular Music using Non-negative Matrix Factorization. 11th International Society for Music Information Retrieval Conference (ISMIR 2010)

Link to publication

Ernst Kabot and R. Weber (1994). Der Einfluß der Bandbreite von Rauschsignalen auf die Schärfeempfindung. Fortschritte der Akustik - DAGA 94, 1029–1032.


Ernst Kabot and R. Weber (1993). Kategorial beurteilte Schärfe von künstlichen und natürlichen Schallen. Fortschritte der Akustik - DAGA 93, 840–843.


Carsten Jürgens and B. Wehen and Wiebke Johannsen (1995). Untersuchungen zur Auswahl von Sprechern für die Sprachsynthese im Zeitbereich. Elektronische Sprachsignalverarbeitung


Carsten Jürgens and M. Wunderlich (1995). A Comparison of Different Speech Units for the German TTS-System TUBSY. EUROSPEECH


Carsten Jürgens (1994). TUBSY - Sprachsynthese auf Clusterbasis nach dem PSOLA- Verfahren.. Elektronische Sprachsignalverarbeitung


Carsten Jürgens and Harald Klaus (1994). Speech Quality Assessment of Synthesized Speech Using Different Reference Systems.. Workshop on Speech Quality Assessment


Carsten Jürgens (1993). Sprachsynthese auf Clusterbasis nach dem PSOLA-Verfahren. Elektronische Sprachsignalverarbeitung


Carsten Jürgens (1992). Zur Klassifikation und Beurteilung von Sprachsyntheseverfahren. Elektronische Sprachsignalverarbeitung


Carsten Jürgens (1992). Arbeiten zur Sprachsynthese an der TU Berlin.. DEGA-ITG-Diskussionssitzung "Vergleich realisierter Sprachsynthese-Systeme"


Rolf Jongebloed and Erik Bochinski and Lieven lange and Thomas Sikora (2019). Quantized and Regularized Optimization for Coding Images Using Steered Mixtures-of-Experts. 2019 Data Compression Conference (DCC)

Link to publication

Rolf Jongebloed and Ruben Verhack and Lieven Lange and Thomas Sikora (2018). Hierarchical Learning of Sparse Image Representations using Steered Mixture-of-Experts. 2018 IEEE International Conference on Multimedia Expo Workshops (ICMEW)

Link to publication

Shan Jin and Thomas Sikora (2009). Combining Confusion Networks with probabilistic phone matching for open-vocabulary keyword spotting in spontaneous speech signal. 17th in a series of conferences organised by the European Association for Signal, Speech, and Image Processing (EUSIPCO 2009)


Shan Jin and Hemant Misra and Thomas Sikora and Joemon Jose (2009). Automatic Topic Detection Strategy for information retrieval in Spoken Document. Wiamis 2009

Link to publication

L. Jia and Mrinal Mandal and Thomas Sikora (2006). Efficient Disparity Estimation using Region based Segmentation and Multistage Feedback. WSEAS Transactions on Communications, 1577–1584.

Link to publication

Zusatzinformationen / Extras

Quick Access:

Schnellnavigation zur Seite über Nummerneingabe