Kunyu Shi
I lead an LLM post-training team at Alibaba (2024–present), with groups in Hangzhou and Sunnyvale. We build agentic foundation models through continual pre-training, supervised fine-tuning, and reinforcement learning. Our models power AI services on Alibaba.com and Accio.com. We are hiring!
Previously, I was a Senior Applied Scientist at AWS AI Labs (2020–2024), where I worked in Stefano Soatto’s group at Caltech on document understanding and large foundation models (LLM and VLM). I contributed to several AWS AI services including Amazon Textract and Amazon Bedrock.
I received my M.S. in Computer Science from the University of California, San Diego (UCSD) in 2020 and my B.Eng. in Electrical Engineering from the University of Electronic Science and Technology of China (UESTC) in 2018. Prior to industry, I was a research assistant at the Machine Learning, Perception, and Cognition Lab (mlPC) at UCSD with Prof. Zhuowen Tu (2019–2020) and at the Big Data Research Center at UESTC with Prof. Tao Zhou (2016). I also interned at McMaster University with Prof. Paul W. Ayers (2017).
My academic and industrial research interests include detection, segmentation, large-scale multimodal learning, large language models, and document understanding.
News
| Dec 01, 2024 | I joined Alibaba as a Senior Staff Applied Scientist. |
|---|---|
| Feb 28, 2024 | 2 papers accepted to the CVPR 2024 conference! |
| Nov 06, 2023 | We launched Custom Queries in Amazon Textract, which lets customers customize a model trained specifically on their business-specific documents. (AWS Machine Learning Blog) |
| Aug 28, 2023 | Amazon Bedrock, the AWS generative AI service that I contributed to, is now generally available. (Bedrock Official Introduction Video) |
| Jun 07, 2023 | The AWS Textract Tables service has announced new features. (Official Release Note) |
Products
- Accio Work is a local desktop AI agent that works for you, featuring self-evolving agents, multi-agent collaboration, browser automation, and scheduled tasks.
- Accio is an agentic AI service for entrepreneurs, offering product sourcing, product design, business research, and more.
- Amazon Bedrock provides generative AI foundation models.
- Amazon Textract provides ML-based intelligent document processing services.
Selected Publications
- Hybrid Reward Normalization for Process-Supervised Non-Verifiable Agentic Tasks. arXiv preprint arXiv:2509.25598, 2025.
- Enhancing Vision-Language Pre-training with Rich Supervisions. In Computer Vision and Pattern Recognition (CVPR), 2024.



