Publications

X-VILA: Cross-Modality Alignment for Large Language Model. Hanrong Ye, De-An Huang, Yao Lu, Zhiding Yu, Wei Ping, Andrew Tao, Jan Kautz, Song Han, Dan Xu, Pavlo Molchanov, Hongxu Yin (2024). arXiv preprint.

DoRA: Weight-Decomposed Low-Rank Adaptation. Shih-Yang Liu, Chien-Yi Wang, Hongxu Yin, Pavlo Molchanov, Yu-Chiang Frank Wang, Kwang-Ting Cheng, Min-Hung Chen (2024). ICML 2024 (Oral).

AM-RADIO: Reduce All Domains Into One. Mike Ranzinger, Greg Heinrich, Jan Kautz, Pavlo Molchanov (2023). arXiv preprint; CVPR 2024.

VILA: On Pre-training for Visual Language Models. Ji Lin, Hongxu Yin, Wei Ping, Yao Lu, Pavlo Molchanov, Andrew Tao, Huizi Mao, Jan Kautz, Mohammad Shoeybi, Song Han (2023). arXiv preprint; CVPR 2024.

FasterViT: Fast Vision Transformers with Hierarchical Attention. A. Hatamizadeh, G. Heinrich, H. Yin, A. Tao, J. Alvarez, J. Kautz, P. Molchanov (2023). arXiv preprint; ICLR 2024.

Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models. P. Micaelli, A. Vahdat, H. Yin, J. Kautz, P. Molchanov (2023). In CVPR 2023.

Heterogeneous Continual Learning. D. Madaan, H. Yin, W. Byeon, J. Kautz, P. Molchanov (2023). In CVPR 2023 (Highlight).

Global context vision transformers. A. Hatamizadeh, H. Yin, J. Kautz, P. Molchanov (2023). In ICML 2023.

Structural pruning via latency-saliency knapsack. M. Shen, H. Yin, P. Molchanov, L. Mao, J. Liu, J. Alvarez (2022). In NeurIPS 2022.

LANA: Latency Aware Network Acceleration. P. Molchanov, J. Hall, H. Yin, J. Kautz, N. Fusi, A. Vahdat (2022). In ECCV 2022.

GradViT: Gradient inversion of vision transformers. Ali Hatamizadeh, Hongxu Yin, Holger Roth, Wenqi Li, Jan Kautz, Daguang Xu, Pavlo Molchanov (2022). In CVPR 2022.

DRaCoN - Differentiable Rasterization Conditioned Neural Radiance Fields for Articulated Avatars. Amit Raj, Umar Iqbal, Koki Nagano, Sameh Khamis, Pavlo Molchanov, James Hays, Jan Kautz (2022). arXiv preprint arXiv:2203.15798.

Do Gradient Inversion Attacks Make Federated Learning Unsafe? Ali Hatamizadeh, Hongxu Yin, Pavlo Molchanov, Andriy Myronenko, Wenqi Li, Prerna Dogra, Andrew Feng, Mona G Flores, Jan Kautz, Daguang Xu, et al. (2022). arXiv preprint arXiv:2202.06924.

A-ViT: Adaptive Tokens for Efficient Vision Transformer. H. Yin, A. Vahdat, J. Alvarez, A. Mallya, J. Kautz, P. Molchanov (2021). In CVPR 2022 (Oral).

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras. Ye Yuan, Umar Iqbal, Pavlo Molchanov, Kris Kitani, Jan Kautz (2021). In CVPR 2022 (Oral).

When to Prune? A Policy towards Early Structural Pruning. Maying Shen, Pavlo Molchanov, Hongxu Yin, Jose M Alvarez (2021). In CVPR 2022.

LANA: Latency Aware Network Acceleration. Pavlo Molchanov, Jimmy Hall, Hongxu Yin, Jan Kautz, Nicolo Fusi, Arash Vahdat (2021). arXiv preprint.

Deep Neural Networks are Surprisingly Reversible: A Baseline for Zero-Shot Inversion. Xin Dong, Hongxu Yin, Jose M Alvarez, Jan Kautz, Pavlo Molchanov (2021). In XAI4CV 2022.

Adversarial motion modelling helps semi-supervised hand pose estimation. Adrian Spurr, Pavlo Molchanov, Umar Iqbal, Jan Kautz, Otmar Hilliges (2021). arXiv preprint arXiv:2106.05954.

DexYCB: A benchmark for capturing hand grasping of objects. Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, et al. (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Towards Understanding the Risks of Gradient Inversion in Federated Learning. Ali Hatamizadeh, Hongxu Yin, Pavlo Molchanov, Andriy Myronenko, Wenqi Li, Prerna Dogra, Andrew Feng, Mona Flores, Jan Kautz, Daguang Xu, et al. (2021).

See through gradients: Image batch recovery via GradInversion. Hongxu Yin, Arun Mallya, Arash Vahdat, Jose M Alvarez, Jan Kautz, Pavlo Molchanov (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Optimizing Selective Protection for CNN Resilience. Abdulrahman Mahmoud, Siva Kumar Sastry Hari, Christopher Wardlaw Fletcher, Sarita V Adve, Charbel Sakr, Naresh Shanbhag, Pavlo Molchanov, Michael B Sullivan, Timothy Tsai, Stephen W Keckler (2021). 32nd IEEE International Symposium on Software Reliability Engineering, ISSRE 2021.

Optimal quantization using scaled codebook. Yerlan Idelbayev, Pavlo Molchanov, Maying Shen, Hongxu Yin, Miguel A Carreira-Perpiñán, Jose M Alvarez (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

KAMA: 3D keypoint aware body mesh articulation. Umar Iqbal, Kevin Xie, Yunrong Guo, Jan Kautz, Pavlo Molchanov (2021). 2021 International Conference on 3D Vision (3DV).

HALP: Hardware-Aware Latency Pruning. Maying Shen, Hongxu Yin, Pavlo Molchanov, Lei Mao, Jianna Liu, Jose M Alvarez (2021). arXiv preprint arXiv:2110.10811.

Global Vision Transformer Pruning with Hessian-Aware Saliency. H. Yang, H. Yin, M. Shen, P. Molchanov, H. Li, J. Kautz (2021). In CVPR 2023.

Data-free knowledge distillation for object detection. Akshay Chawla, Hongxu Yin, Pavlo Molchanov, Jose Alvarez (2021). Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion. H. Yin, P. Molchanov, J. M. Alvarez, Z. Li, A. Mallya, D. Hoiem, N. K. Jha, J. Kautz (2020). In CVPR 2020.

Weakly-supervised 3D human pose learning via multi-view images in the wild. Umar Iqbal, Pavlo Molchanov, Jan Kautz (2020). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Weakly supervised 3D hand pose estimation via biomechanical constraints. Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Otmar Hilliges, Jan Kautz (2020). European Conference on Computer Vision.

Measuring generalisation to unseen viewpoints, articulations, shapes and objects for 3D hand pose estimation under hand-object interaction. Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, et al. (2020). European Conference on Computer Vision.

HarDNN: Feature map vulnerability evaluation in CNNs. Abdulrahman Mahmoud, Siva Kumar Sastry Hari, Christopher W Fletcher, Sarita V Adve, Charbel Sakr, Naresh Shanbhag, Pavlo Molchanov, Michael B Sullivan, Timothy Tsai, Stephen W Keckler (2020). arXiv preprint arXiv:2002.09786.

SCOPS: Self-supervised co-part segmentation. Wei-Chih Hung, Varun Jampani, Sifei Liu, Pavlo Molchanov, Ming-Hsuan Yang, Jan Kautz (2019). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Importance estimation for neural network pruning. Pavlo Molchanov, Arun Mallya, Stephen Tyree, Iuri Frosio, Jan Kautz (2019). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Few-shot adaptive gaze estimation. Seonwook Park, Shalini De Mello, Pavlo Molchanov, Umar Iqbal, Otmar Hilliges, Jan Kautz (2019). Proceedings of the IEEE/CVF International Conference on Computer Vision.

Boosting segmentation with weak supervision from image-to-image translation. Eugene Vorontsov, Pavlo Molchanov, Wonmin Byeon, Shalini De Mello, Varun Jampani, Ming-Yu Liu, Samuel Kadoury, Jan Kautz (2019). arXiv preprint arXiv:1904.01636.

Making convolutional networks recurrent for visual sequence learning. Xiaodong Yang, Pavlo Molchanov, Jan Kautz (2018). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Improving landmark localization with semi-supervised learning. Sina Honari, Pavlo Molchanov, Stephen Tyree, Pascal Vincent, Christopher Pal, Jan Kautz (2018). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

IamNN: Iterative and adaptive mobile neural network for efficient image classification. Sam Leroux, Pavlo Molchanov, Pieter Simoens, Bart Dhoedt, Thomas Breuel, Jan Kautz (2018). arXiv preprint arXiv:1804.10123.

Hand Pose Estimation via Latent 2.5D Heatmap Regression. Umar Iqbal, Pavlo Molchanov, Thomas Breuel, Juergen Gall, Jan Kautz (2018). ECCV 2018.

Depth-based 3D hand pose estimation: From current achievements to future goals. Shanxin Yuan, Guillermo Garcia-Hernando, Björn Stenger, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee, Pavlo Molchanov, Jan Kautz, Sina Honari, Liuhao Ge, et al. (2018). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Budget-aware activity detection with a recurrent policy network. Behrooz Mahasseni, Xiaodong Yang, Pavlo Molchanov, Jan Kautz (2017). arXiv preprint arXiv:1712.00097.

A lightweight approach for on-the-fly reflectance estimation. Kihwan Kim, Jinwei Gu, Stephen Tyree, Pavlo Molchanov, Matthias Nießner, Jan Kautz (2017). Proceedings of the IEEE International Conference on Computer Vision.

Towards selecting robust hand gestures for automotive interfaces. Shalini Gupta, Pavlo Molchanov, Xiaodong Yang, Kihwan Kim, Stephen Tyree, Jan Kautz (2016). 2016 IEEE Intelligent Vehicles Symposium (IV).

Pruning convolutional neural networks for resource efficient inference. Pavlo Molchanov, Stephen Tyree, Tero Karras, Timo Aila, Jan Kautz (2016). ICLR 2017.

Online detection and classification of dynamic hand gestures with recurrent 3D convolutional neural network. Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, Jan Kautz (2016). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Multilayer and multimodal fusion of deep neural networks for video classification. Xiaodong Yang, Pavlo Molchanov, Jan Kautz (2016). Proceedings of the 24th ACM international conference on Multimedia.

Short-range FMCW monopulse radar for hand-gesture sensing. Pavlo Molchanov, Shalini Gupta, Kihwan Kim, Kari Pulli (2015). 2015 IEEE Radar Conference (RadarCon).

Range-Doppler surface: a tool to analyse human target in ultra-wideband radar. Yuan He, Pavlo Molchanov, Takuya Sakamoto, Pascal Aubry, Francois Le Chevalier, Alexander Yarovoy (2015). IET Radar, Sonar & Navigation.

Multi-sensor system for driver's hand-gesture recognition. Pavlo Molchanov, Shalini Gupta, Kihwan Kim, Kari Pulli (2015). 2015 11th IEEE international conference and workshops on automatic face and gesture recognition (FG).

Hand gesture recognition with 3D convolutional neural networks. Pavlo Molchanov, Shalini Gupta, Kihwan Kim, Jan Kautz (2015). Proceedings of the IEEE conference on computer vision and pattern recognition workshops.

Radar micro-Doppler feature extraction using the singular value decomposition. Jacco JM De Wit, Ronny IA Harmanny, Pavlo Molchanov (2014). 2014 International Radar Conference.

Classification of small UAVs and birds by micro-Doppler signatures. Pavlo Molchanov, Ronny IA Harmanny, Jacco JM de Wit, Karen Egiazarian, Jaakko Astola (2014). International Journal of Microwave and Wireless Technologies.

The use of automotive radars in video-based overtaking assistance applications. Evgeny Belyaev, Pavlo Molchanov, Alexey Vinel, Yevgeni Koucheryavy (2013). IEEE Transactions on Intelligent Transportation Systems.

On micro-Doppler period estimation. Pavlo Molchanov, Jaakko Astola, Karen Egiazarian, Alexander Totsky (2013). 2013 19th international conference on control systems and computer science.

Classification of ground moving radar targets by using joint time-frequency analysis. Pavlo Molchanov, Jaakko Astola, Karen Egiazarian, Alexander Totsky (2012). 2012 IEEE Radar Conference.

Ground moving target classification by using DCT coefficients extracted from micro-Doppler radar signatures and artificial neuron network. Pavlo Molchanov, Jaakko Astola, Karen Egiazarian, Alexander Totsky (2011). 2011 Microwaves, Radar and Remote Sensing Symposium.