Publications
* denotes equal contribution.
2024
- Enhancing Vision-Language Pre-training with Rich SupervisionsIn Computer Vision and Pattern Recognition (CVPR) , 2024
2023
- Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation PromptsarXiv preprint arXiv:2305.07019, 2023