TU Berlin

Compression and Transmission

In recent years our research group has been investigating algorithms that depart drastically from “Motion Compensated DPCM/Transform (MC-DPCM)” coding. Our prime intention is to take a completely fresh view of how non-linear dependencies between pixels and motion in images and video can be described and exploited for compression. To this end we employ non-linear machine learning algorithms that explore dependencies between vast numbers of pixels in images and video sequences. Our current approaches are based on non-linear kernel methods, Steered Mixture-of-Experts (SMoE) networks and Restricted Boltzmann Machines, and show strong resemblance to recent work on deep neural networks. Our experiments give hope that, in the long run, our networks may provide far better visual quality than DPCM/Transform approaches.

Research Activities

Image Compression via Sparse GMM Representation

Previous research has shown the interesting properties and potential of Steered Mixtures-of-Experts (SMoE) for image representation, approximation, and compression based on EM optimization. In this paper we introduce an MSE optimization method based on gradient descent for training SMoEs. This allows improved optimization towards PSNR and SSIM and the decoupling of experts and gates. As a consequence, we can now generate very high-quality SMoE models with significantly reduced model complexity compared to previous work and much improved edge representations. Based on this strategy, a block-based Mixture-of-Experts image coder was developed that uses very simple experts with very few model parameters. Experimental evaluations show that a significant compression gain over JPEG can be achieved at low bit rates.
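The idea of fitting a mixture-of-experts image model by gradient descent on the MSE can be illustrated with a heavily simplified sketch. It is not the group's SMoE implementation: it assumes isotropic (unsteered) Gaussian gates with fixed centers and a shared bandwidth, constant-valued experts, and a toy 8x8 test image, and it optimizes only the expert values. All function names and parameters below are hypothetical.

```python
import numpy as np

def gaussian_gates(coords, centers, bandwidth):
    """Soft-max gating weights from isotropic Gaussian kernels (each row sums to 1)."""
    d2 = ((coords[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    logits = -d2 / (2.0 * bandwidth ** 2)
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    return weights / weights.sum(axis=1, keepdims=True)

def fit_experts(gates, target, steps=500, lr=0.5):
    """Plain gradient descent on the MSE with respect to the constant expert values."""
    experts = np.zeros(gates.shape[1])
    for _ in range(steps):
        err = gates @ experts - target            # per-pixel reconstruction error
        grad = 2.0 * gates.T @ err / len(target)  # gradient of mean squared error
        experts -= lr * grad
    return experts

# Toy 8x8 grayscale image with a vertical step edge (assumed test data).
height, width = 8, 8
ys, xs = np.mgrid[0:height, 0:width]
coords = np.stack([ys.ravel(), xs.ravel()], axis=1).astype(float)
image = (xs >= width // 2).astype(float)
target = image.ravel()

# A 4x4 grid of fixed kernel centers (an assumption; real models place them adaptively).
cy, cx = np.mgrid[1:height:2, 1:width:2]
centers = np.stack([cy.ravel(), cx.ravel()], axis=1).astype(float)

gates = gaussian_gates(coords, centers, bandwidth=1.0)
initial_mse = float(np.mean(target ** 2))  # error of the all-zero starting model
experts = fit_experts(gates, target)
recon = (gates @ experts).reshape(height, width)
final_mse = float(np.mean((recon - image) ** 2))
print(f"MSE before: {initial_mse:.4f}, after: {final_mse:.4f}")
```

With the gates held fixed, the problem is a convex least-squares fit, so the chosen step size is safe. Training the gate parameters as well, as the paragraph above describes, would make the objective non-convex and is usually left to an automatic-differentiation framework.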

Video Coding Group

The "Video Coding" working group deals with approaches to global and local motion estimation and compensation, with the aim of improving existing video codecs such as H.264/AVC. Currently, the working group is particularly active within the framework of the ITU/ISO/IEC standardization effort "HEVC". This has already resulted in numerous publications and input documents for MPEG.

MPEG-4 Audio Lossless Coding (ALS)

The MPEG-4 ALS standard belongs to the family of MPEG-4 audio coding standards published by ISO (www.iso.org). In contrast to lossy methods such as MP3 and AAC, which only try to preserve the subjectively perceived quality, lossless coding allows exact reconstruction of every single bit of the original audio data. The underlying method of MPEG-4 ALS was developed at the Fachgebiet Nachrichtenübertragung (Communication Systems Group) of Technische Universität Berlin. The first version of the MPEG-4 ALS standard was published in 2006, and the current description is now part of the 4th edition (2009) of the overarching MPEG-4 audio standard (ISO/IEC 14496-3:2009). A new version (RM23) of the MPEG-4 ALS reference software and codec is now available. MPEG-4 ALS is now supported by FFmpeg, MPlayer, VLC Media Player and other applications.
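What bit-exact reconstruction means can be shown with a generic toy example. The sketch below is not the MPEG-4 ALS algorithm: it assumes a fixed first-order integer predictor (each sample is predicted by the previous one) and omits any entropy coding of the residuals. The function names and sample values are hypothetical.

```python
def encode(samples):
    """Return integer prediction residuals, predicting each sample by the previous one."""
    residuals = []
    prev = 0  # predictor state before the first sample (assumed to be zero)
    for s in samples:
        residuals.append(s - prev)
        prev = s
    return residuals

def decode(residuals):
    """Invert encode() exactly by accumulating the residuals."""
    samples = []
    prev = 0
    for r in residuals:
        prev += r
        samples.append(prev)
    return samples

# Short run of integer PCM samples (made-up values for illustration).
pcm = [0, 3, 7, 12, 14, 13, 9, 4, -2, -7]
res = encode(pcm)
restored = decode(res)
print(res)
print(restored == pcm)
```

Because only integer additions and subtractions are involved, decoding restores every sample exactly. The residuals are typically smaller in magnitude than the samples, which is what a subsequent entropy coder would exploit.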

Software

SMoE (TensorFlow implementation)
Code on GitHub

Related Publications

2012

  • Andreas Krutz, Alexander Glantz, Michael Tok, Marko Esche and Thomas Sikora
    Adaptive Global Motion Temporal Filtering for High Efficiency Video Coding
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), IEEE, 01.12.2012
  • Pascal Kelm, Vanessa Murdock, Sebastian Schmiedeke, Steven Schockaert, Pavel Serdyukov, Olivier Van Laere
    Georeferencing in Social Networks
    in Social Media Retrieval, Naeem Ramzan, Roelof van Zwol, Jong-Seok Lee, Kai Clüver, Xian-Sheng Hua (eds.), Springer, 30.11.2012
    ISBN 978-1-4471-4554-7
  • Ashfaq Mohammed
    Uplink Decoding Quality during Downlinks
    15.11.2012, master thesis tutored by Dr. Bruns (Baker Hughes GmbH), Dr. Krutz, Prof. Dr.-Ing. Thomas Sikora
  • Ruben Verhack
    Image Coding using Adaptive Sampling and Data-Adaptive Kernel Regression
    23.08.2012, master thesis tutored by Dr.-Ing. Andreas Krutz, Prof. Dr.-Ing. Thomas Sikora
  • Chen Liang
    Interpolation of dense motion vector fields from block-based motion vectors
    01.08.2012, bachelor thesis tutored by M.Sc. Marko Esche, Prof. Dr.-Ing. Thomas Sikora
  • Andreas Krutz, Alexander Glantz, Michael Tok, Thomas Sikora
    Adaptive Global Motion Temporal Filtering
    Proceedings of the 29th IEEE Picture Coding Symposium (PCS 2012), Kraków, Poland, 07.05.2012 - 09.05.2012
    ISBN: 978-1-4577-2048-2
  • Michael Tok, Andreas Krutz, Alexander Glantz, Thomas Sikora
    Lossy Parametric Motion Model Compression for Global Motion Temporal Filtering
    Proceedings of the 29th IEEE Picture Coding Symposium (PCS 2012), Kraków, Poland, 07.05.2012 - 09.05.2012
    ISBN: 978-1-4577-2048-2
  • Michael Tok, Alexander Glantz, Andreas Krutz, Thomas Sikora
    Parametric Motion Vector Prediction for Hybrid Video Coding
    Proceedings of the 29th IEEE Picture Coding Symposium (PCS 2012), Kraków, Poland, 07.05.2012 - 09.05.2012
    ISBN: 978-1-4577-2048-2
  • Marko Esche, Alexander Glantz, Andreas Krutz, Michael Tok, Thomas Sikora
    Weighted Temporal Long Trajectory Filtering for Video Compression
    Proceedings of the 29th IEEE Picture Coding Symposium (PCS 2012), Kraków, Poland, 07.05.2012 - 09.05.2012
    ISBN: 978-1-4577-2048-2
  • Fabian Cordes
    Generation of Layered Depth Images from Fused Multiview Depth Maps
    09.03.2012, diploma thesis tutored by Dipl.-Ing. Engin Kurutepe, Prof. Dr.-Ing. Thomas Sikora
  • Michael Andersch
    Lossy Parametric Motion Model Compression
    09.02.2012, bachelor thesis tutored by Dipl.-Ing. Michael Tok, Dipl.-Ing. Alexander Glantz, Prof. Dr.-Ing. Thomas Sikora

2008

  • Matthias Kunter, Philipp Krey, Andreas Krutz, Thomas Sikora
    Extending H.264/AVC with a Background Sprite Prediction Mode
    15th IEEE International Conference on Image Processing, Proceedings ICIP 2008, October, San Diego, California, USA, volume October 2008, United States, 12.10.2008 - 15.10.2008, pp. 2128 - 2131
    ISBN 978-1-4244-1764-3 ; ISSN 1522-4880
  • Engin Kurutepe, Thomas Sikora
    Multi-View Video Streaming over P2P Networks With Low Start-Up Delay
    15th IEEE International Conference on Image Processing, Proceedings ICIP 2008, October, San Diego, California, USA, volume October 2008, San Diego, USA, 12.10.2008 - 15.10.2008, pp. 3088 - 3091
    ISBN 978-1-4244-1764-3 ; ISSN 1522-4880
  • Lutz Goldmann, Antonio Rama, Thomas Sikora, Francesc Tarres
    On the Detection and Localization of Facial Occlusions and its Use within Different Scenarios
    IEEE 10th International Workshop on Multimedia Signal Processing, MMSP 8-10 October 2008, Proceedings, Cairns, Australia, volume October 2008, Australia, 08.10.2008 - 10.10.2008, pp. 592-597
    ISBN 978-1-4244-2295-1
  • Andreas Krutz, Sebastian Knorr, Matthias Kunter, Thomas Sikora
    Camera Motion-Constraint Video Codec Selection (invited)
    IEEE 10th International Workshop on Multimedia Signal Processing, MMSP 8-10 October 2008, Proceedings, Cairns, Australia, volume October 2008, Cairns, Queensland, Australia, 08.10.2008 - 10.10.2008, pp. 58 - 63
    Special Session on Global Motion Estimation and Mosaicing for Applications in Video Analysis and Coding ; ISBN 978-1-4244-2295-1
  • Dirk Farin, Martin Haller, Andreas Krutz, Thomas Sikora
    Recent Developments in Panoramic Image Generation and Sprite Coding (invited)
    Proceedings of the IEEE International Workshop on Multimedia Signal Processing (MMSP 2008), volume October 2008, Cairns, Queensland, Australia, 08.10.2008 - 10.10.2008, pp. 64-69
    Special Session on Global Motion Estimation and Mosaicing for Applications in Video Analysis and Coding ; ISBN 978-1-4244-2294-4
  • Andreas Krutz, Alexander Glantz, Thomas Sikora, Paulo Nunes, Fernando Pereira
    Automatic Object Segmentation Algorithms for Sprite Coding using MPEG-4
    50th International Symposium ELMAR-2008, September, Proceedings, Zadar, Croatia, Vol. 2 of 2, volume Vol. 2, September 2008, Department of Wireless Communications, Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia, 10.09.2008 - 12.09.2008, pp. 459 - 462
  • Sebastian Wegener, Martin Haller, Juan José Burred, Thomas Sikora, Slim Essid, Gaël Richard
    On the Robustness of Audio Features for Musical Instrument Classification
    16th European Signal Processing Conference, EUSIPCO 2008, Proceedings, Lausanne, Switzerland, volume August 2008, Lausanne, Switzerland, 25.08.2008 - 29.08.2008
    Oral presentation
  • Jesús Gutiérrez Sánchez
    Analysis for Video Codec Selection Considering Object-based Coding and H.264/AVC
    26.06.2008, master thesis tutored by Dipl.-Ing. Andreas Krutz, Prof. Dr.-Ing. Thomas Sikora
  • Andreas Krutz, Alexander Glantz, Martin Haller, Michael Droese, Thomas Sikora
    Multiple Background Sprite Generation using Camera Motion Characterization for Object-based Video Coding
    3DTV Conference 2008, The True Vision Capture, Transmission and Display of 3D Video, May 2008, Istanbul, Turkey, volume May 2008, Ankara, Turkey, 28.05.2008 - 30.05.2008, pp. 313 - 316
    ISBN 978-1-4244-1760-5, CD ISBN 978-1-4244-1755-1
  • Patrick Ndjiki-Nya
    Mid-Level Content-Based Video Coding using Texture Analysis and Synthesis
    2008
  • Matthias Kunter
    Advances in Sprite-based Video Coding - Towards Universal Usability
    2008
  • Shpend Mirta
    Ratenkontrolle für RTP-basierte IPTV Übertragung von H264/ Scalable Video Coding (SVC) Echtzeitdaten in mobilen, infrastrukturlosen Multihop-Systemen
    09.01.2008, diploma thesis tutored by Dipl.-Ing. Schierl (HHI), Dipl.-Ing. Engin Kurutepe, Prof. Dr.-Ing. Thomas Sikora

2000

  • Sila Ekmekci
    Encoding and Reconstruction of Incomplete 3D Video Objects
    Special Issue of IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, no. 7, October 2000, pp. 1198-1207
  • Peter Noll
    Speech and Audio Coding for Multimedia Communications (invited)
    Proceedings International Cost 254 Workshop on Intelligent Communication Technologies and Applications, Neuchâtel, Switzerland, 2000, pp. 253-263

1998

  • Guido Heising, D. Marpe, H. L. Cycon
    A Wavelet-Based Video Coding Scheme Using Image Warping Prediction
    IEEE Int. Conf. on Image Processing (ICIP'98), Chicago, IL, USA, 04.10.1998 - 07.10.1998
    D. Marpe, H. L. Cycon: Fachhochschule für Technik und Wirtschaft Berlin
  • Peter Noll
    MPEG Digital Audio Coding Standards
    in The Digital Signal Processing Handbook, V.K. Madisetti, D. B. Williams (eds.), IEEE Press/CRC Press, 1998, pp. 40/1-40/28
  • Peter Noll
    Audio Coding: From Broadcast Standard(s) to Advanced Audio Coding (invited)
    ITG-Fachbericht Nr. 146, "Codierung für Quelle, Kanal und Übertragung", 1998, pp. 13-22
  • Marcus Purat
    Zum Einsatz von Wavelet- und Waveletpacket-Transformationen in niederratigen, wahrnehmungsangepaßten Audiocodierverfahren
    1998

1997

  • Kai Uwe Barthel, Sven Brandau, W. Hermesmeier, Guido Heising
    Zerotree Wavelet Coding Using Fractal Prediction
    IEEE Int. Conf. on Image Processing (ICIP'97), Santa Barbara, CA, USA, 26.10.1997 - 29.10.1997
  • Guido Heising, Kai Uwe Barthel, Wiebke Johannsen, Christoph Steinbach
    Blocking Artefact Free Video Coding Based on a Bilinear Forward Image Warping Model
    IEEE Int. Conf. on Image Processing (ICIP'97), Santa Barbara, CA, USA, 26.10.1997 - 29.10.1997
  • Peter Noll
    MPEG Digital Audio Coding - Setting the Standard for High-Quality Audio Compression (invited)
    IEEE Signal Processing Magazine, Special Issue on MPEG Audio and Video Coding, vol. 14, no. 5, September 1997, pp. 59 - 81
  • Peter Noll
    Speech and Audio Coding (invited)
    internetaudio.org = 14th Conference of the Acoustical Engineering Society (AES), WWW-Proceedings, Seattle, 1997
  • Peter Noll
    MPEG-based Audio Coding
    in Handbook on Digital Consumer Electronics, McGrawHill, 1997
  • Peter Noll, Davis Pan
    ISO/MPEG Audio Coding (invited)
    International Journal of High-Speed Electronics and Systems, World Scientific Publ. Co., vol. 8, no. 1, 1997, pp. 69-118
    D. Pan: Digital Equipment Corp. (USA); also in: Signal Compression - Coding of Speech, Audio, Text, Image and Video (Ed.: N. Jayant), World Scientific Publ. Co., 1997.
  • Kai Uwe Barthel
    Entropy Constrained Zerotree Wavelet Image Coding Using Fractal Prediction
    Picture Coding Symposium PCS '97, 1997
  • Guido Heising
    Blocking Artefact Free Video Coding by Combining Warping Based Prediction with Wavelet Error Coding
    Picture Coding Symposium PCS '97, Berlin, Germany, 1997
  • Marcus Purat, Tilman Liebchen, Peter Noll
    Lossless Transform Coding of Audio Signals
    102nd AES Convention, Munich, 1997
    Preprint No. 4414

1996

  • Guido Heising, G. Ruhl
    Video Coding Using Spatial Extrapolation Based Motion Field Segmentation
    IEEE Int. Conf. on Image Processing (ICIP'96), volume II, Lausanne, Switzerland, 16.09.1996 - 19.09.1996, pp. 481-484
  • Marco Waller
    Design and Implementation of a graphical Demonstration Tool for ISO/MPEG-1 Audio Coding
    01.08.1996, diploma thesis tutored by Purat, Prof. Noll
  • Peter Noll
    Source Compression: Audio Coding
    in The Communications Handbook, J. Gibson, Texas A&M University (ed.), CRC, Inc., 1996, pp. 1475-1487
  • Marcus Purat, Peter Noll
    Audio Coding with a Dynamic Wavelet Packet Decomposition Based on Frequency-Varying Modulated Lapped Transforms
    IEEE Acoustics, Speech and Signal Processing Conference (ICASSP), Atlanta (USA), 1996
  • Kai Uwe Barthel
    Entropy constrained fractal image coding
    2. ITG-Diskussionssitzung, 1996, pp. 3-10
    ISBN 3-8007-2190-2

1995

  • Peter Noll
    Digital Audio Coding for Visual Communications (invited)
    Proceedings of the IEEE, vol. 83, no. 6, June 1995, pp. 925-943
  • Kai Barthel, Thomas Voyé
    Three-Dimensional Fractal Video Coding
    IEEE Int. Conf. on Image Processing (ICIP 95), Washington, D.C., USA, 1995
  • Kai Barthel
    Entropy constrained fractal image coding
    NATO ASI on Fractal Image Coding, Trondheim, Norway, 1995
  • Marcus Purat, Peter Noll
    A New Orthonormal Wavelet Packet Decomposition For Audio Coding Using Frequency-Varying Modulated Lapped Transforms
    IEEE 1995 Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, N.Y. (USA), 1995

1994

  • Kai Barthel, Jörg Schüttemeyer, Thomas Voyé, Peter Noll
    A New Image Coding Technique Unifying Fractal and Transform Coding
    ICIP, November 1994
  • Jens-Rainer Ohm
    Motion-compensated 3-D subband coding with multiresolution representation of motion parameters
    Proceedings IEEE 1st International Conference on Image Processing (ICIP-94), Austin, TX, November 1994
  • Hui Li, Peter Noll
    Comparative Study of two Rate-Selectable Channel Coding Techniques
    ITG-Fachtagung "Codierung für Quelle, Kanal und Übertragung", Munich, October 1994
  • Jens-Rainer Ohm
    Three-Dimensional Subband Coding with Motion Compensation
    IEEE Transactions on Image Processing, vol. IP-3, no. 5, September 1994, pp. 559-571
  • Kai Uwe Barthel, Thomas Voyé
    Adaptive fractal image coding in the frequency domain
    Proceedings of International Workshop on Image Processing, Budapest, June 1994

1993

  • Peter Noll
    Wideband Speech and Audio Coding (invited)
    IEEE Communications Magazine, vol. 31, no. 11, November 1993, pp. 34 - 44
  • Jens-Rainer Ohm
    Advanced Packet Video Coding Based on Layered VQ and SBC Techniques
    IEEE Transactions on Circuits and Systems for Video Technology, vol. CSVT-3, no. 3, June 1993, pp. 208-221
  • Jens-Rainer Ohm
    Three-Dimensional Motion-Compensated Subband Coding
    International Symposium on Video Communications and Fiber Optic Services, SPIE, volume 1977, Berlin, Germany, April 1993, pp. 188-197
  • Kai Barthel, Thomas Voyé, Peter Noll
    Improved Fractal Image Coding
    Picture Coding Symposium PCS '93, Lausanne, Switzerland, 1993
    Proceedings Section 1.5
  • Peter Noll
    High Quality Audio Coding: The ISO/MPEG Standard(s) (invited)
    Cost 229 Workshop on Intelligent Terminals and Source and Channel Coding, Budapest, Hungary, 1993, pp. 5 - 14
  • Peter Noll
    Speech Coding for Communications (invited)
    European Speech Processing Conference (EUSIPCO'93), volume 1, Berlin, Germany, 1993, pp. 479 - 488
  • Peter Noll, G. Stoll
    ISO/MPEG High Quality Audio Coding (invited)
    High Definition Television Conference (HDTV '93), Ottawa, Canada, 1993
    G. Stoll: Institut für Rundfunktechnik, Munich
  • Peter Noll
    ISO/MPEG Audio Coding: Status and Trends (invited)
    Workshop on Mobile Multimedia Communications, MoMuc-1, Tokyo, 1993
  • G. Schamel, H. Li
    Frequence Scanning in Digital Coding for HDTV Broadcasting
    SPIE EUROPTO-93, Berlin, Germany, 1993

1992

  • Jens-Rainer Ohm
    Temporal Domain Sub-Band Video Coding with Motion Compensation
    International Conference on Acoustics, Speech and Signal Processing (ICASSP-92), volume 3, San Francisco, CA, USA, March 1992, pp. III/229 - III/232
  • J. Kuang, Peter Noll, F. Fu, J. Liu
    A Typical Channel Model of Digital Mobile Communications Applied in Speech Coding (in Chinese)
    Journal of Beijing Institute of Technology (Beijing, China), vol. 12, no. 3, 1992, pp. 43 - 48
    J. Kuang, F. Fu, J. Liu: Beijing Institute of Technology, Beijing
  • Hui Li, Peter Noll
    Hybrid Phase Trellis-Coded Modulation for Unequal Error Protection Coding
    Proceedings 1992 URSI International Symposium on Signals, Systems, and Electronics (ISSSE'92), 1992, pp. 113 - 116
    H. Li: Shanghai Jiao Tong University

1991

  • Jens-Rainer Ohm
    Region-Oriented Predictive Tree-VQ- A New Approach for Image Coding Schemes Based on Segmentation Techniques
    Proceedings Picture Coding Symposium, Kyoto, Japan, 1991

1990

  • C. Podilchuk, N. S. Jayant, Peter Noll
    Sparse Codebooks for the Quantization of Non-Dominant Subbands in Image Coding
    IEEE Intern. Conference on Acoustics, Speech, and Signal Processing(ICASSP'90), 1990, pp. 2101 - 2104
    C. Podilchuk, N. S. Jayant: AT&T Bell Laboratories
  • Peter Noll
    Data Compression Techniques for New Standards in Speech and Image Coding
    VI. Internationales Weiterbildungsprogramm Berlin '90, TU Berlin, Zentrum für Technologische Zusammenarbeit, 1990, pp. 245 - 264
  • Jens-Rainer Ohm, Peter Noll
    Predictive Tree Encoding of Still Images with Vector Quantization
    Annales des Télécommunications, vol. 45, no. 9-10, 1990, pp. 465 - 470
  • N. S. Jayant, Peter Noll
    Digital Coding of Waveforms - Principles and Applications to Speech and Video
    Kai Fa Book Company, Taipei, Taiwan, 1990, 688 pages
    N. S. Jayant: AT&T Bell Laboratories
  • Jens-Rainer Ohm
    Still Image Coding Using Predictive Tree-VQ with Sub-Band Decomposition
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 90), Albuquerque, 1990
  • Jens-Rainer Ohm
    Classified Predictive Tree-VQ for Still Image Coding
    Picture Coding Symposium, MIT Media Lab, USA, 1990

1989

  • Jens-Rainer Ohm, Peter Noll
    Predictive Tree Encoding of Still Images with Vector Quantization
    URSI International Symposium on Signals, Systems, and Electronics (ISSSE'89), Erlangen, 1989, pp. 325 - 328

1984

  • N. S. Jayant, Peter Noll
    Digital Coding of Waveforms, Principles and Applications to Speech and Video
    Prentice-Hall, Englewood Cliffs NJ, USA, 1984, 688 pages
    N. S. Jayant: Bell Laboratories; ISBN 0-13-211913-7
