Publications

X-VILA: Cross-Modality Alignment for Large Language Model. (2024). In Arxiv.

PDF Cite

DoRA: Weight-Decomposed Low-Rank Adaptation. (2024). ICML 2024.

PDF Cite Code Project Video ICML2024(Oral)

AM-RADIO: Reduce All Domains Into One. (2023). In Arxiv.

PDF Cite Code CVPR2024

VILA: On Pre-training for Visual Language Models. (2023). In Arxiv.

PDF Cite CVPR2024

FasterViT: Fast Vision Transformers with Hierarchical Attention. (2023). In Arxiv.

PDF Cite Code ICLR2024

Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models. (2023). In CVPR 2023.

PDF Cite Code Video CVPR2023

Heterogeneous Continual Learning. (2023). In CVPR 2023.

PDF Cite Code Video CVPR2023 Highlight

Global context vision transformers. (2023). In ICML 2023.

PDF Cite Code ICML2023

Structural pruning via latency-saliency knapsack. (2022). In NeurIPS2022.

PDF Cite Code Video NeurIPS2022

LANA: Latency Aware Network Acceleration. (2022). In CVPR 2022.

PDF Cite Video ECCV2022

Gradvit: Gradient inversion of vision transformers. (2022). In CVPR 2022.

PDF Cite Code CVPR2022

DRaCoN--Differentiable Rasterization Conditioned Neural Radiance Fields for Articulated Avatars. (2022). arXiv preprint arXiv:2203.15798.

Cite

Do Gradient Inversion Attacks Make Federated Learning Unsafe?. (2022). arXiv preprint arXiv:2202.06924.

PDF Cite

AViT: Adaptive Tokens for Efficient Vision Transformer. (2021). In CVPR 2022.

PDF Cite Code CVPR2022 (Oral)

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras. (2021). In CVPR 2022.

PDF Cite Code Project Video CVPR2022 (Oral)

When to Prune? A Policy towards Early Structural Pruning. (2021). In CVPR 2022.

PDF Cite Project CVPR2022

LANA: Latency Aware Network Acceleration. (2021). In Arxiv.

PDF Cite

Deep Neural Networks are Surprisingly Reversible: A Baseline for Zero-Shot Inversion. (2021). In XAI4CV 2022.

PDF Cite Poster

Adversarial motion modelling helps semi-supervised hand pose estimation. (2021). arXiv preprint arXiv:2106.05954.

PDF Cite

DexYCB: A benchmark for capturing hand grasping of objects. (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

PDF Cite Code Project Video

Towards Understanding the Risks of Gradient Inversion in Federated Learning. (2021).

Cite

See through gradients: Image batch recovery via gradinversion. (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Cite

Optimizing Selective Protection for CNN Resilience. (2021). 32nd IEEE International Symposium on Software Reliability Engineering, ISSRE 2021.

Cite

Optimal quantization using scaled codebook. (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Cite

Kama: 3d keypoint aware body mesh articulation. (2021). 2021 International Conference on 3D Vision (3DV).

Cite

HALP: Hardware-Aware Latency Pruning. (2021). arXiv preprint arXiv:2110.10811.

Cite

Global Vision Transformer Pruning with Hessian-Aware Saliency. (2021). In CVPR 2023.

PDF Cite Code Project Video CVPR2023

Data-free knowledge distillation for object detection. (2021). Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

PDF Cite Code

Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion. (2020). In CVPR 2022.

PDF Cite Code Slides Video

Weakly-supervised 3d human pose learning via multi-view images in the wild. (2020). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Cite

Weakly supervised 3d hand pose estimation via biomechanical constraints. (2020). European Conference on Computer Vision.

Cite

Measuring generalisation to unseen viewpoints, articulations, shapes and objects for 3d hand pose estimation under hand-object interaction. (2020). European Conference on Computer Vision.

Cite

Hardnn: Feature map vulnerability evaluation in cnns. (2020). arXiv preprint arXiv:2002.09786.

Cite

Scops: Self-supervised co-part segmentation. (2019). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Cite

Importance estimation for neural network pruning. (2019). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Cite

Few-shot adaptive gaze estimation. (2019). Proceedings of the IEEE/CVF International Conference on Computer Vision.

Cite

Boosting segmentation with weak supervision from image-to-image translation. (2019). arXiv preprint arXiv:1904.01636.

Cite

Making convolutional networks recurrent for visual sequence learning. (2018). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Cite

Improving landmark localization with semi-supervised learning. (2018). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Cite

Iamnn: Iterative and adaptive mobile neural network for efficient image classification. (2018). arXiv preprint arXiv:1804.10123.

Cite

Hand Pose Estimation via Latent 2.5 D Heatmap Regression. (2018). ECCV2018.

Cite

Depth-based 3d hand pose estimation: From current achievements to future goals. (2018). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

Cite

Budget-aware activity detection with a recurrent policy network. (2017). arXiv preprint arXiv:1712.00097.

Cite

A lightweight approach for on-the-fly reflectance estimation. (2017). Proceedings of the IEEE International Conference on Computer Vision.

Cite

Towards selecting robust hand gestures for automotive interfaces. (2016). 2016 IEEE Intelligent Vehicles Symposium (IV).

Cite

Pruning convolutional neural networks for resource efficient inference. (2016). ICLR2017.

Cite

Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural network. (2016). Proceedings of the IEEE conference on computer vision and pattern recognition.

Cite

Multilayer and multimodal fusion of deep neural networks for video classification. (2016). Proceedings of the 24th ACM international conference on Multimedia.

Cite

Short-range FMCW monopulse radar for hand-gesture sensing. (2015). 2015 IEEE Radar Conference (RadarCon).

Cite

Range-Doppler surface: a tool to analyse human target in ultra-wideband radar. (2015). IET Radar, Sonar & Navigation.

Cite

Multi-sensor system for driver's hand-gesture recognition. (2015). 2015 11th IEEE international conference and workshops on automatic face and gesture recognition (FG).

Cite

Hand gesture recognition with 3D convolutional neural networks. (2015). Proceedings of the IEEE conference on computer vision and pattern recognition workshops.

Cite

Radar micro-Doppler feature extraction using the singular value decomposition. (2014). In CVPR 2022.

Cite

Classification of small UAVs and birds by micro-Doppler signatures. (2014). International Journal of Microwave and Wireless Technologies.

Cite

The use of automotive radars in video-based overtaking assistance applications. (2013). IEEE Transactions on Intelligent Transportation Systems.

Cite

On micro-Doppler period estimation. (2013). 2013 19th international conference on control systems and computer science.

Cite

Classification of ground moving radar targets by using joint time-frequency analysis. (2012). 2012 IEEE Radar Conference.

Cite

Ground moving target classification by using DCT coefficients extracted from micro-Doppler radar signatures and artificial neuron network. (2011). 2011 Microwaves, Radar and Remote Sensing Symposium.

Cite