Michael Tschannen

I’m a Research Scientist at Google DeepMind Zurich (formerly Google Brain) broadly interested in multimodal learning for understanding and generation tasks.

Before that I was working on computer vision R&D at Apple Zurich for two years, and spent a year as a postdoc at Google Research Zurich (Brain Team) exploring topics in unsupervised representation learning, generative models, and neural compression. I completed my PhD at ETH Zurich under the supervision of Helmut Bölcskei in late 2018. Prior to that I obtained a MSc (with distinction) from ETH Zurich and a BSc from EPFL, both in Electrical Engineering and Information Technology. In fall 2017, I interned at Amazon AI in Palo Alto, CA, and in fall 2018 I was a part-time research consultant working with Google Research Zurich (Brain Team).

Contact: mi.<last name><at>gmail.com

News

May 4, 2024	Check out recent code releases for GIVT (link) and CapPa (link), and recent talks on CLIPPO (link) and CapPa (link).
Aug 15, 2022	I re-joined Google.
Oct 10, 2020	HiFiC brings generative image compression to the next level! Check out the demo page and the Hacker News Thread.
Mar 24, 2020	Two papers accepted for presentation at CVPR 2020!
Jan 25, 2020	I’m happy to announce that I obtained the ETH Medal (outstanding thesis award) for my PhD thesis!

Publications

*denotes equal contribution. See Google Scholar for a potentially more up-to-date list.

2024

LocCa: Visual Pretraining with Location-aware Captioners Bo Wan, Michael Tschannen, Yongqin Xian, Filip Pavetic, Ibrahim Alabdulmohsin, Xiao Wang, André Susano Pinto, Andreas Steiner, Lucas Beyer, and Xiaohua Zhai arXiv:2403.19596, 2024
PaLI-X: On scaling up a multilingual vision and language model Xi Chen, Josip Djolonga, Piotr Padlewski, Basil Mustafa, Soravit Changpinyo, Jialin Wu, Carlos Riquelme Ruiz, Sebastian Goodman, Xiao Wang, Yi Tay, Siamak Shakeri, Mostafa Dehghani, Daniel Salz, Mario Lucic, Michael Tschannen, Arsha Nagrani, Hexiang Hu, Mandar Joshi, Bo Pang, Ceslee Montgomery, Paulina Pietrzyk, Marvin Ritter, A. J. Piergiovanni, Matthias Minderer, Filip Pavetic, Austin Waters, Gang Li, Ibrahim Alabdulmohsin, Lucas Beyer, Julien Amelot, Kenton Lee, Andreas Peter Steiner, Yang Li, Daniel Keysers, Anurag Arnab, Yuanzhong Xu, Keran Rong, Alexander Kolesnikov, Mojtaba Seyedhosseini, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, and Radu Soricut In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Finite scalar quantization: VQ-VAE made simple Fabian Mentzer, David Minnen, Eirikur Agustsson, and Michael Tschannen In Proc. International Conference on Learning Representations (ICLR), 2024 [colab]
Towards truly zero-shot compositional visual reasoning with LLMs as programmers Aleksandar Stanić, Sergi Caelles, and Michael Tschannen Transactions on Machine Learning Research (TMLR), 2024

2023

GIVT: Generative Infinite-Vocabulary Transformers Michael Tschannen, Cian Eastwood, and Fabian Mentzer arXiv:2312.02116, 2023 [code] [colab]
Image captioners are scalable vision learners too Michael Tschannen, Manoj Kumar, Andreas Steiner, Xiaohua Zhai, Neil Houlsby, and Lucas Beyer In Advances in Neural Information Processing Systems (NeurIPS), 2023 oral presentation [talk] [code]
M2T: Masking transformers twice for faster decoding Fabian Mentzer, Eirikur Agustson, and Michael Tschannen In Proc. IEEE International Conference on Computer Vision (ICCV), 2023
Scaling vision transformers to 22 billion parameters Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Peter Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme Ruiz, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd Steenkiste, Gamaleldin Fathy Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Collier, Alexey A. Gritsenko, Vighnesh Birodkar, Cristina Nader Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetic, Dustin Tran, Thomas Kipf, Mario Lucic, Xiaohua Zhai, Daniel Keysers, Jeremiah J. Harmsen, and Neil Houlsby In Proc. International Conference on Machine Learning (ICML), 2023
CLIPPO: Image-and-Language Understanding from Pixels Only Michael Tschannen, Basil Mustafa, and Neil Houlsby In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 [code] [colab] [talk]
FlexiViT: One Model for All Patch Sizes Lucas Beyer, Pavel Izmailov, Alexander Kolesnikov, Mathilde Caron, Simon Kornblith, Xiaohua Zhai, Matthias Minderer, Michael Tschannen, Ibrahim Alabdulmohsin, and Filip Pavetic In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 [code]

2022

Neural Face Video Compression using Multiple Views Anna Volokitin, Stefan Brugger, Ali Benlalah, Sebastian Martin, Brian Amberg, and Michael Tschannen In Proc. IEEE Conf. on Computer Vision and Pattern Recognition Workshops (CVPRW), 2022 Workshop and Challenge on Learned Image Compression (CLIC) Best Student Paper Award

2021

On Robustness and Transferability of Convolutional Neural Networks Josip Djolonga*, Jessica Yung*, Michael Tschannen*, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D’Amour, Dan Moldovan, Sylvan Gelly, Neil Houlsby, Xiaohua Zhai, and Mario Lucic In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Representation learning from videos in-the-wild: An object-centric approach Rob Romijnders, Aravindh Mahendran, Michael Tschannen, Josip Djolonga, Marvin Ritter, Neil Houlsby, and Mario Lucic In Proc. IEEE Winter Conference on Applications of Computer Vision (WACV), 2021

2020

High-Fidelity Generative Image Compression Fabian Mentzer, George Toderici, Michael Tschannen, and Eirikur Agustsson In Advances in Neural Information Processing Systems (NeurIPS), 2020 oral presentation
Automatic shortcut removal for self-supervised representation learning Matthias Minderer, Olivier Bachem, Neil Houlsby, and Michael Tschannen In Proc. International Conference on Machine Learning (ICML), 2020
Weakly-supervised disentanglement without compromises Francesco Locatello, Ben Poole, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem, and Michael Tschannen In Proc. International Conference on Machine Learning (ICML), 2020
Self-supervised learning of video-induced visual invariances Michael Tschannen, Josip Djolonga, Marvin Ritter, Aravindh Mahendran, Xiaohua Zhai, Neil Houlsby, Sylvain Gelly, and Mario Lucic In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Learning better lossless image compression using lossy compression Fabian Mentzer, Luc Van Gool, and Michael Tschannen In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
On mutual information maximization for representation learning Michael Tschannen*, Josip Djolonga*, Paul K. Rubenstein, Sylvain Gelly, and Mario Lucic In Proc. International Conference on Learning Representations (ICLR), 2020
Disentangling factors of variation using few labels Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, and Olivier Bachem In Proc. International Conference on Learning Representations (ICLR), 2020

2019

Semantic bottleneck scene generation Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, and Mario Lucic arXiv:1911.11357, 2019
The visual task adaptation benchmark Xiaohua Zhai, Joan Puigcerver, Alexander Kolesnikov, Pierre Ruyssen, Carlos Riquelme, Mario Lucic, Josip Djolonga, Andre Susano Pinto, Maxim Neumann, Alexey Dosovitskiy, Lucas Beyer, Olivier Bachem, Michael Tschannen, Marcin Michalski, Olivier Bousquet, Sylvain Gelly, and Neil Houlsby arXiv:1910.04867, 2019
Generative adversarial networks for extreme learned image compression Eirikur Agustsson*, Michael Tschannen*, Fabian Mentzer*, Radu Timofte, and Luc Van Gool In Proc. IEEE International Conference on Computer Vision (ICCV), 2019
Practical full resolution learned lossless image compression Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, and Luc Van Gool In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 oral presentation
High-fidelity image generation with fewer labels Mario Lucic*, Michael Tschannen*, Marvin Ritter*, Xiaohua Zhai, Olivier Bachem, and Sylvain Gelly In Proc. International Conference on Machine Learning (ICML), 2019

2018

Noisy subspace clustering via matching pursuits Michael Tschannen, and Helmut Bölcskei IEEE Transactions on Information Theory, 2018
Deep generative models for distribution-preserving lossy compression Michael Tschannen, Eirikur Agustsson, and Mario Lucic In Advances in Neural Information Processing Systems (NeurIPS), 2018
Recent advances in autoencoder-based representation learning Michael Tschannen, Olivier Bachem, and Mario Lucic Bayesian Deep Learning Workshop at NeurIPS 2018, 2018
StrassenNets: Deep learning with a multiplication budget Michael Tschannen, Aran Khanna, and Anima Anandkumar In Proc. International Conference on Machine Learning (ICML), 2018 long oral presentation
Born-again neural networks Tommaso Furlanello, Zachary C. Lipton, Michael Tschannen, Laurent Itti, and Anima Anandkumar In Proc. International Conference on Machine Learning (ICML), 2018
Conditional probability models for deep image compression Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, and Luc Van Gool In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Towards image understanding from deep compression without decoding Róbert Torfason, Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, and Luc Van Gool In Proc. International Conference on Learning Representations (ICLR), 2018
Unsupervised learning: Model-based clustering and learned compression Michael Tschannen PhD thesis, ETH Zurich, 2018 ETH Medal (outstanding thesis award)

2017

Robust nonparametric nearest neighbor random process clustering Michael Tschannen, and Helmut Bölcskei IEEE Transactions on Signal Processing, 2017
Dimensionality-reduced subspace clustering Reinhard Heckel, Michael Tschannen, and Helmut Bölcskei Information and Inference: A Journal of the IMA, 2017
A unified optimization view on generalized matching pursuit and Frank-Wolfe Francesco Locatello, Rajiv Khanna*, Michael Tschannen*, and Martin Jaggi In Proc. International Conference on Artificial Intelligence and Statistics (AISTATS), 2017
Greedy algorithms for cone constrained optimization with convergence guarantees Francesco Locatello, Michael Tschannen, Gunnar Rätsch, and Martin Jaggi In Advances in Neural Information Processing Systems (NIPS), 2017
Soft-to-hard vector quantization for end-to-end learning compressible representations Eirikur Agustsson, Fabian Mentzer, Michael Tschannen, Lukas Cavigelli, Radu Timofte, Luca Benini, and Luc Van Gool In Advances in Neural Information Processing Systems (NIPS), 2017
Convolutional recurrent neural networks for electrocardiogram classification Martin Zihlmann, Dmytro Perekrestenko, and Michael Tschannen In Proc. Computing in Cardiology (CinC), 2017 5th place in the PhysioNet/CinC Challenge 2017
Deep structured features for semantic segmentation Michael Tschannen, Lukas Cavigelli, Fabian Mentzer, Thomas Wiatowski, and Luca Benini In Proc. European Signal Processing Conference (EUSIPCO), 2017

2016

Discrete deep feature extraction: A theory and new architectures Thomas Wiatowski, Michael Tschannen, Aleksandar Stanić, Philipp Grohs, and Helmut Bölcskei In Proc. International Conference on Machine Learning (ICML), 2016
Heart sound classification using deep structured features Michael Tschannen, Thomas Kramer, Gian Marti, Matthias Heinzmann, and Thomas Wiatowski In Proc. Computing in Cardiology (CinC), 2016
Regression forest-based automatic estimation of the articular margin plane for shoulder prosthesis planning Michael Tschannen, Lazaros Vlachopoulos, Christian Gerber, Gábor Székely, and Philipp Fürnstahl Medical Image Analysis, 2016

2015

Nonparametric nearest neighbor random process clustering Michael Tschannen, and Helmut Bölcskei In Proc. IEEE International Symposium on Information Theory (ISIT), 2015

2014

Subspace clustering of dimensionality-reduced data Reinhard Heckel, Michael Tschannen, and Helmut Bölcskei In Proc. IEEE International Symposium on Information Theory (ISIT), 2014
Dimensionality reduction for sparse subspace clustering Michael Tschannen MS thesis, ETH Zurich, 2014 ETH Medal (outstanding thesis award) and the SEW Eurodrive Foundation Graduate Award

2013

A learning-based approach for fast and robust vessel tracking in long ultrasound sequences Valeria De Luca, Michael Tschannen, Gábor Székely, and Christine Tanner In Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2013