Zixi (Charlie) Chen
I am a Computer Science PhD at NYU Courant, working with Prof. Andrew Gordon Wilson. My research interests include optimization, scaling laws, and structured matrices for efficiently training large neural networks. Prior to this, I finished my Bachelor and Master Degree in Math and Computer Science at NYU, where I worked with Yanjun Han and Stefano Martiniani.
selected publications
-
Hyperparameter Transfer Enables Consistent Gains of Matrix-Preconditioned Optimizers Across ScalesIn Advances in Neural Information Processing Systems (NeurIPS), 2025 -
Efficient Linear Layers over a Continuous Space of Structured MatricesIn Advances in Neural Information Processing Systems (NeurIPS), 2024 -
A unifying approach to self-organizing systems interacting via conservation lawsarXiv preprint arXiv:2507.02575, 2025