Memory-Efficient Training
-
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Jiawei Zhao, Zhenyu Zhang, Beidi Chen, and 3 more authors
In International Conference on Machine Learning (ICML), 2024
Oral presentation (top 1.5%)
-
InRank: Incremental Low-Rank Learning
Jiawei Zhao*, Yifei Zhang*, Beidi Chen, and 2 more authors
ES-FoMo Workshop at International Conference on Machine Learning (ICML), 2023
Low-Precision Training
-
LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update
Jiawei Zhao, Steve Dai, Rangharajan Venkatesan, and 6 more authors
IEEE Transactions on Computers | US Patent 17/346,100, 2022
-
Learning compositional functions via multiplicative weight updates
Jeremy Bernstein, Jiawei Zhao, Markus Meister, and 3 more authors
In Advances in Neural Information Processing Systems (NeurIPS), 2020
Distributed Training
-
signSGD with Majority Vote is Communication Efficient and Fault Tolerant
Jeremy Bernstein*, Jiawei Zhao*, Kamyar Azizzadenesheli, and 1 more author
In International Conference on Learning Representations (ICLR), 2019
Understanding Training Dynamics
-
ZerO Initialization: Initializing Neural Networks with only Zeros and Ones
Jiawei Zhao, Florian Tobias Schaefer, and Anima Anandkumar
Transactions on Machine Learning Research (TMLR), 2022
Please refer to my Google Scholar for a complete list of publications.