Unpublished
2024
APA
Wu*, W., Su*, M., Hu*, J. Y.-C., Song, Z., & Liu, H. (2024). Transformers are Deep Optimizers: Provable In-Context Learning for Deep Model Training.
Chicago/Turabian
Wu*, Weimin, Maojiang Su*, Jerry Yao-Chieh Hu*, Zhao Song, and Han Liu. “Transformers Are Deep Optimizers: Provable In-Context Learning for Deep Model Training,” 2024.
MLA
Wu*, Weimin, et al. Transformers Are Deep Optimizers: Provable In-Context Learning for Deep Model Training. 2024.
BibTeX
@unpublished{weimin2024a,
title = {Transformers are Deep Optimizers: Provable In-Context Learning for Deep Model Training},
year = {2024},
author = {Wu*, Weimin and Su*, Maojiang and Hu*, Jerry Yao-Chieh and Song, Zhao and Liu, Han}
}