Zixi (Charlie) Chen

I am a Computer Science PhD student at NYU Courant, working with Prof. Andrew Gordon Wilson. My research focuses on efficiently training large neural networks. Before that, I completed my Bachelor's and Master's degrees in Math and Computer Science at NYU, where I worked with Yanjun Han and Stefano Martiniani.

selected publications

  1. Hyperparameter Transfer Enables Consistent Gains of Matrix-Preconditioned Optimizers Across Scales
    Shikai Qiu, Zixi Chen, Hoang Phan, Qi Lei, and Andrew Gordon Wilson
    In Advances in Neural Information Processing Systems (NeurIPS), 2025
  2. Efficient Linear Layers over a Continuous Space of Structured Matrices
    Andres Potapczynski, Shikai Qiu, Marc Finzi, Christopher Ferri, Zixi Chen, Micah Goldblum, Bayan Bruss, Christopher De Sa, and Andrew Gordon Wilson
    In Advances in Neural Information Processing Systems (NeurIPS), 2024