Zixi (Charlie) Chen
I am a Computer Science PhD at NYU Courant, working with Prof. Andrew Gordon Wilson. My research interest lies in efficiently training large neural networks. Prior to this, I finished my Bachelor and Master Degree in Math and Compute Science at NYU, when I worked with Yanjun Han and Stefano Martiniani.
selected publications
-
Hyperparameter Transfer Enables Consistent Gains of Matrix-Preconditioned Optimizers Across ScalesIn Advances in Neural Information Processing Systems (NeurIPS), 2025 -
Efficient Linear Layers over a Continuous Space of Structured MatricesIn Advances in Neural Information Processing Systems (NeurIPS), 2024