Our Computer Vision and Multimedia Laboratory (CVM-Lab) is affiliated with IIT (Illinois Institute of Technology) at Chicago, IL. Our research work is committed to enabling machines with the ability to intelligently perceive, understand and interact with humans and the real world via multiple sensory information from vision, language, audio, and other sources. Concretely, we aim to develop theoretical and practical research projects for multimodal learning and its various applications, biomedical image analysis, and neural network compression.
Our CVM-Lab activity collaborates with other laboratories and industries, and always looks for postdoc research fellows and prospective Ph.D. students to join us. Please contact Prof. Yan Yan at yyan34@iit.edu with your CV for opening positions.
07-2024: Two papers accepted to ECCV 2024.
02-2024: Five papers accepted to CVPR 2024.
12-2023: One paper accepted to FG 2024.
12-2023: One paper accepted to ICASSP 2024.
10-2023: One paper accepted to WACV 2024.
07-2023: Two papers accepted to ICCV 2023.
02-2023: One paper accepted to CVPR 2023.
01-2023: One paper accepted to ICLR 2023.
Ye Zhu, PhD student during 09/2020-09/2023. Postdoc Researcher, Princeton University.
Keshav Bhandari, PhD student during 09/2018-07/2022. Data Scientist, Tesla.
Gaowen Liu, visiting during 09/2018-09/2020. Data Scientist, Cisco.
Aihua Zheng, visiting during 09/2019-08/2020. Associate Professor, Anhui University.
Xianjing Han, visiting during 09/2019-08/2020. Ph.D. Student, Shandong University.
Na Zheng, visiting during 09/2019-08/2020. Ph.D. Student, Shandong University.
Songsong Wu, visiting during 08/2018-09/2019. Associate Professor, Guangdong University of Petrochemical Technology.
Hao Tang, visiting during 06/2018-05/2019. Postdoc, ETH CV Lab.
Yutian Lin, visiting during 09/2018-03/2019. Associate Professor, Wuhan University.
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Weitai Kang, Gaowen Liu, Mubarak Shah, Yan Yan
European Conference on Computer Vision (ECCV), 2024
Dataset Quantization with Active Learning based Adaptive Sampling
Zhenghao Zhao, Yuzhang Shang, Junyi Wu, Yan Yan
European Conference on Computer Vision (ECCV), 2024
Efficient Multitask Dense Predictor via Binarization
Yuzhang Shang, Dan Xu, Gaowen Liu, Ramana Kompella, Yan Yan
Computer Vision and Pattern Recognition (CVPR), 2024
Enhancing Post-training Quantization Calibration through Contrastive Learning
Yuzhang Shang, Gaowen Liu, Ramana Rao Kompella, Yan Yan
Computer Vision and Pattern Recognition (CVPR), 2024
On the Faithfulness of Vision Transformer Explanations
Junyi Wu, Weitai Kang, Hao Tang, Yuan Hong, Yan Yan
Computer Vision and Pattern Recognition (CVPR), 2024
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
Junyi Wu, Bin Duan, Weitai Kang, Hao Tang, Yan Yan
Computer Vision and Pattern Recognition (CVPR), 2024
Versatile Navigation under Partial Observability via Value-guided Diffusion Policy
Gengyu Zhang, Hao Tang, Yan Yan
Computer Vision and Pattern Recognition (CVPR), 2024
Causal-DFQ: Causality Guided Data-free Network Quantization
Yuzhang Shang, Bingxin Xu, Gaowen Liu, Ramana Rao Kompella, Yan Yan
International Conference on Computer Vision (ICCV), 2023
Towards Saner Deep Image Registration
Bin Duan, Ming Zhong, Yan Yan
International Conference on Computer Vision (ICCV), 2023
Post-training Quantization on Diffusion Models
Yuzhang Shang*, Zhihang Yuan*, Bin Xie, Bingzhe Wu, Yan Yan
Computer Vision and Pattern Recognition (CVPR), 2023
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu, Yu Wu, Kyle Olszewski, Jian Ren, Sergey Tulyakov, Yan Yan
International Conference on Learning Representations (ICLR), 2023
Lipschitz Continuity Retained Binary Neural Network
Yuzhang Shang, Dan Xu, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan
European Conference on Computer Vision (ECCV), 2022
Network Binarization via Contrastive Learning
Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, Yan Yan
European Conference on Computer Vision (ECCV), 2022
Quantized GAN for Complex Music Generation from Dance Videos
Ye Zhu, Kyle Olszewski, Yu Wu, Panos Achlioptas, Menglei Chai, Yan Yan, and Sergey Tulyakov
European Conference on Computer Vision (ECCV), 2022
Learning Omnidirectional Flow in 360-degree Video via Siamese Representation
Keshav Bhandari, Bin Duan, Gaowen Liu, Hugo Latapie, Ziliang Zong, Yan Yan
European Conference on Computer Vision (ECCV), 2022
Unsupervised Neural Tracing In Densely Labeled Multispectral Brainbow Images
Bin Duan, Logan Walker, DouglasRoossien, Fred Shen, Dawen Cai, Yan Yan
IEEE International Symposium on Biomedical Imaging (ISBI), 2021
Learning Audio-Visual Correlations From Variational Cross-Modal Generations
Ye Zhu, Yu Wu, Hugo Latapie, Yi Yang, and Yan Yan
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021
Revisiting Optical Flow Estimation in 360 Videos
K. Bhandari, Z. Zong, and Yan Yan
International Conference on Pattern Recognition (ICPR), 2020
Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention
Bin Duan, Hao Tang, Wei Wang, Ziliang Zong, Guowei Yang, Yan Yan
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2020
EGOK360: A 360 Egocentric Kinetic Human Activity Video Dataset
K. Bhandari, Mario A. DeLaGarza, Z. Zong, Hugo Latapie, Y. Yan
International conference on Image Processing (ICIP), 2020
Hierarchical HMM for Eye Movement Classifications
Ye Zhu, Yu Wu, Hugo Latapie, Yi Yang, and Yan Yan
European Conference on Computer Vision Workshop (ECCV Workshop), 2020
An IoT Edge Computing Framework Using Cordova Accessor Host
Ngu, A. H., Eyitayo, J., Yang, G., Campbell, C., Sheng, Q. Z., and Ni, J.
IEEE Internet of Things Journal