Aston Zhang is a research scientist on the Llama team at Meta Generative AI and a core contributor to Llama 3. Previously, he was a scientist and manager at AWS AI Research. His accolades include the ICLR Outstanding Paper Award, the ACM UbiComp Distinguished Paper Award, and an ACM SenSys Best Paper Award nomination. His textbook, “Dive into Deep Learning,” is adopted worldwide. He holds a Ph.D. in Computer Science from the University of Illinois Urbana-Champaign.
Current research: pre-training architectures & scaling, long context (Llama 4).
News
- [Hiring] Join our Llama team for a 2025 research internship! Just email me if you are interested.
- Llama 3.1 405B is now openly available.
- Meet Llama 3, our state-of-the-art open source large language model. Check out my developer podcast.
Books
- A. Zhang, Z. C. Lipton, M. Li, and A. J. Smola
Dive into Deep Learning
Cambridge University Press, 2023
- Adopted at 500 universities from 70 countries
- Featured in the AWS re:Invent keynote by Swami Sivasubramanian, Head of AWS AI, Database, and Analytics
- A. Zhang, M. Li, Z. C. Lipton, and A. J. Smola
动手学深度学习 (Dive into Deep Learning, Chinese edition)
Posts & Telecom Press (人民邮电出版社); 2nd ed., 2023; 1st ed., 2019
- Best seller in China
Papers
M. Zhong*, A. Zhang*, X. Wang, R. Hou, W. Xiong, C. Zhu, Z. Chen, L. Tan, C. Bi, M. Lewis, S. Popuri, S. Narang, M. Kambadur, D. Mahajan, S. Edunov, J. Han, and L. van der Maaten (*equal contribution)
Law of the Weakest Link: Cross Capabilities of Large Language Models
“Cross-capability performance is limited by the weakest underlying capability.” In arXiv, 2024
llm-cross-capabilities.org
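A tiny worked example of the headline claim (my own illustration, not the paper's evaluation code); the capability names and scores below are made up:

```python
# "Weakest link": performance on a task mixing several capabilities tracks
# the weakest individual capability. Scores are invented for illustration.

def weakest_link_estimate(capability_scores: dict[str, float]) -> float:
    """Predict cross-capability performance as the minimum component score."""
    return min(capability_scores.values())

scores = {"coding": 0.82, "reasoning": 0.74, "long_context": 0.55}
print(weakest_link_estimate(scores))  # 0.55: bounded by the weakest capability
```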
Llama Team, AI@Meta (Core Contributor)
The Llama 3 Herd of Models
2024
J. Ji, A. Zhang, C. Zhu, S. Wang, M. Kambadur, S. Chang, and W. Xiong
Pruning Computations in Transformer Prefilling for Large Language Models
“Speed up Transformer prefilling for generation via a learnable router.” In arXiv, 2024
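A minimal sketch of the general idea, assuming a per-sequence learned gate that decides whether a block's computation can be skipped during prefilling; `RoutedLayer` and its gating scheme are my own hypothetical names, not the paper's design:

```python
import torch
import torch.nn as nn

class RoutedLayer(nn.Module):
    """Wrap a Transformer block with a learnable router that can skip the
    block for a prefill sequence. Illustrative only."""

    def __init__(self, block: nn.Module, d_model: int, threshold: float = 0.5):
        super().__init__()
        self.block = block
        self.router = nn.Linear(d_model, 1)  # scores how much the block is needed
        self.threshold = threshold

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # One routing decision per sequence, from the mean hidden state.
        gate = torch.sigmoid(self.router(x.mean(dim=1)))  # (batch, 1)
        if self.training:
            # A soft gate keeps the router differentiable during training.
            return gate.unsqueeze(1) * self.block(x) + (1 - gate.unsqueeze(1)) * x
        # At inference, hard-skip the block's computation when not needed.
        return self.block(x) if gate.mean().item() >= self.threshold else x

layer = RoutedLayer(nn.Sequential(nn.Linear(64, 64), nn.GELU()), d_model=64).eval()
h = torch.randn(2, 128, 64)  # (batch, prefill_len, d_model)
print(layer(h).shape)        # torch.Size([2, 128, 64])
```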
J. Kim, A. Goyal, A. Zhang, B. Xiong, R. Hou, M. Kambadur, D. Mahajan, H. Hajishirzi, and L. Tan
A Systematic Examination of Preference Learning through the Lens of Instruction-Following
“Understand preference learning with rejection sampling and Monte Carlo Tree Search.” In arXiv, 2024
Y. Yu, Z. Chen, A. Zhang, L. Tan, C. Zhu, R. Y. Pang, Y. Qian, X. Wang, S. Gururangan, C. Zhang, M. Kambadur, D. Mahajan, and R. Hou
Self-critiquing Improves Reward Modeling for Large Language Models
“Predicting both critiques and the scalar reward improves reward modeling.” In arXiv, 2024
Z. Zhang and A. Zhang
You Only Look at Screens: Multimodal Chain-of-Action Agents
“Perform a task on smartphones? Train an agent using screenshots.” In arXiv, 2023
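The loop below sketches what a chain-of-action agent could look like; `capture_screenshot`, `multimodal_model`, and the "DONE" convention are hypothetical placeholders, not the paper's API:

```python
# Hypothetical chain-of-action loop: the model sees the current screenshot plus
# its own previous actions and emits the next action until the task is done.

def capture_screenshot() -> bytes:
    return b""  # placeholder: would grab the current phone screen

def multimodal_model(goal: str, screenshot: bytes, history: list[str]) -> str:
    return "DONE"  # placeholder: would return an action such as "tap(120, 480)"

def run_agent(goal: str, max_steps: int = 10) -> list[str]:
    history: list[str] = []
    for _ in range(max_steps):
        action = multimodal_model(goal, capture_screenshot(), history)
        if action == "DONE":
            break
        history.append(action)  # the action chain conditions the next decision
    return history

print(run_agent("Open the settings app"))
```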
Z. Zhang, A. Zhang, M. Li, H. Zhao, G. Karypis, and A. J. Smola
Multimodal Chain-of-Thought Reasoning in Language Models
In Transactions on Machine Learning Research, 2023
[Idea Inspiration by Homeschooling]
S. Ren, A. Zhang, Y. Zhu, S. Zhang, S. Zheng, M. Li, A. J. Smola, and X. Sun
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
In Proceedings of the Conference on Neural Information Processing Systems (NeurIPS), 2023
Z. Zeng, C. Hawkins, M. Hong, A. Zhang, N. Pappas, V. Singh, and S. Zheng
Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
In Proceedings of the Conference on Neural Information Processing Systems (NeurIPS), 2023
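One way to read the title, sketched below: score each context token's importance and attend only to a fixed budget of the most important ones, so compute tracks the budget rather than the full context. The scoring rule here is my simplification; a real long-context system would avoid materializing the full score matrix:

```python
import torch

def topk_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor,
                   budget: int) -> torch.Tensor:
    """Single-head attention over only the `budget` most important key tokens."""
    scores = q @ k.T / k.shape[-1] ** 0.5   # (q_len, kv_len)
    importance = scores.mean(dim=0)         # average relevance of each key token
    idx = importance.topk(budget).indices   # keep the top-`budget` tokens
    attn = torch.softmax(scores[:, idx], dim=-1)
    return attn @ v[idx]

q = torch.randn(4, 32)      # a few queries
k = torch.randn(1000, 32)   # long context
v = torch.randn(1000, 32)
print(topk_attention(q, k, v, budget=64).shape)  # torch.Size([4, 32])
```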
J. Chen, A. Zhang, X. Shi, M. Li, A. J. Smola, and D. Yang
Parameter-Efficient Fine-Tuning Design Spaces
In Proceedings of the International Conference on Learning Representations (ICLR), 2023
Z. Zhang, A. Zhang, M. Li, and A. J. Smola
Automatic Chain of Thought Prompting in Large Language Models
In Proceedings of the International Conference on Learning Representations (ICLR), 2023
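The recipe, roughly as the paper describes it: cluster the question pool, pick one representative question per cluster for diversity, and let Zero-Shot-CoT ("Let's think step by step") write each demonstration's rationale. `embed` and `llm` below are placeholder stand-ins for a sentence encoder and an LLM call:

```python
import numpy as np
from sklearn.cluster import KMeans

def embed(texts: list[str]) -> np.ndarray:
    rng = np.random.default_rng(0)            # placeholder sentence encoder
    return rng.normal(size=(len(texts), 16))

def llm(prompt: str) -> str:
    return "..."                              # placeholder LLM call

def auto_cot_demos(questions: list[str], n_clusters: int = 2) -> list[str]:
    X = embed(questions)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(X)
    demos = []
    for c in range(n_clusters):
        # Representative question: the one closest to the cluster center.
        members = np.where(km.labels_ == c)[0]
        dists = np.linalg.norm(X[members] - km.cluster_centers_[c], axis=1)
        q = questions[members[dists.argmin()]]
        # Zero-Shot-CoT generates the rationale for the demonstration.
        rationale = llm(f"Q: {q}\nA: Let's think step by step.")
        demos.append(f"Q: {q}\nA: Let's think step by step. {rationale}")
    return demos

print(auto_cot_demos(["2+2?", "3*5?", "Capital of France?", "Largest ocean?"]))
```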
Z. Liu, Z. Tang, X. Shi, A. Zhang, M. Li, A. Shrivastava, and A. Wilson
Learning Multimodal Data Augmentation in Feature Space
In Proceedings of the International Conference on Learning Representations (ICLR), 2023
T. Yang, Y. Zhu, Y. Xie, A. Zhang, C. Chen, and M. Li
AIM: Adapting Image Models for Efficient Video Understanding
In Proceedings of the International Conference on Learning Representations (ICLR), 2023
C. Qin, A. Zhang, Z. Zhang, J. Chen, M. Yasunaga, and D. Yang
Is ChatGPT a General-Purpose Natural Language Processing Task Solver?
In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
J. Chen, A. Zhang, D. Yang, M. Li, and A. J. Smola
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
H. Wang, A. Zhang, Y. Zhu, S. Zheng, M. Li, A. J. Smola, and Z. Wang
Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition
In Proceedings of the International Conference on Machine Learning (ICML, Long Presentation), 2022
H. Wang, A. Zhang, S. Zheng, X. Shi, M. Li, and Z. Wang
Removing Batch Normalization Boosts Adversarial Training
In Proceedings of the International Conference on Machine Learning (ICML), 2022
A. Zhang, Y. Tay, S. Zhang, A. Chan, A. T. Luu, S. C. Hui, and J. Fu
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with 1/n Parameters
In Proceedings of the International Conference on Learning Representations (ICLR, Outstanding Paper Award), 2021
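The 1/n parameter saving in the title comes from building a dense weight as a sum of n Kronecker products; a worked sketch (the dimensions are illustrative, and both must be divisible by n):

```python
import numpy as np

n, d_in, d_out = 4, 64, 32
A = np.random.randn(n, n, n)                   # n small (n x n) matrices
S = np.random.randn(n, d_in // n, d_out // n)  # n (d_in/n x d_out/n) matrices

# W = sum_i kron(A_i, S_i) has the shape of a dense (d_in x d_out) weight.
W = sum(np.kron(A[i], S[i]) for i in range(n))
print(W.shape)                     # (64, 32)

dense_params = d_in * d_out        # 2048
phm_params = A.size + S.size       # n^3 + d_in*d_out/n = 576
print(phm_params / dense_params)   # ~1/n for large layers
```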
Tutorials
with A. J. Smola
Attention in Deep Learning [Keynote] [PDF] [Video]
In The 36th International Conference on Machine Learning (ICML), 2019
with H. Lin, X. Shi, L. Lausen, H. He, S. Zha, and A. J. Smola
Dive into Deep Learning for Natural Language Processing
In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
with H. Lin, L. Lausen, S. Zha, A. J. Smola, C. Wang, and M. Li
From Shallow to Deep Language Representations: Pre-training, Fine-tuning, and Beyond [Website]
In The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2019
with H. Zhang, T. He, Z. Zhang, Z. Zhang, H. Lin, and M. Li
Everything You Need to Know to Reproduce SOTA Deep Learning Models: Hands-on Tutorial
In International Conference on Computer Vision (ICCV), 2019
Services
- Area Chair
- Annual Meeting of the Association for Computational Linguistics (ACL)
- Conference on Empirical Methods in Natural Language Processing (EMNLP)
- International Conference on Computational Linguistics (COLING)