Publications

2025

  1. Preprint
    polar_express_meme.jpeg
    The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm
    2025
  2. Preprint
    HSS.png
    Quasi-optimal hierarchically semi-separable matrix approximation
    2025
  3. Preprint
    rock-paper-scissors.jpg
    Compositional Reasoning with Transformers, RNNs, and Chain of Thought
    Gilad Yehudai, Noah Amsel, and Joan Bruna
    2025
  4. ICML
    Customizing the Inductive Biases of Softmax Attention using Structured Matrices
    Yilun Kuang, Noah Amsel, Sanae Lotfi, Shikai Qiu, Andres Potapczynski, and Andrew Gordon Wilson
    International Conference on Machine Learning (ICML), 2025, 2025
  5. ICLR
    three_heads.jpg
    Quality over Quantity in Attention Layers: When Adding More Heads Hurts
    Noah Amsel, Gilad Yehudai, and Joan Bruna
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. Preprint
    Fixed-sparsity matrix approximation from matrix-vector products
    2024
  2. NeurIPS
    Cornelius.jpg
    Nearly Optimal Approximation of Matrix Functions by the Lanczos Method
    Noah Amsel, Tyler Chen, Anne Greenbaum, Cameron Musco, and Christopher Musco
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems. (Also check out this follow up work) , 2024

2023

  1. Inf Inference
    Spectral top-down recovery of latent tree models
    Yariv Aizenbud, Ariel Jaffe, Meng Wang, Amber Hu, Noah Amsel, Boaz Nadler, Joseph T Chang, and Yuval Kluger
    Information and Inference: A Journal of the IMA, Aug 2023

2021

  1. SIGCOMM
    Designing Data Center Networks Using Bottleneck Structures
    Jordi Ros-Giralt, Noah Amsel, Sruthi Yellamraju, James Ezick, Richard Lethin, Yuang Jiang, Aosong Feng, Leandros Tassiulas, Zhenguo Wu, Min Yee Teh, and Keren Bergman
    In Proceedings of the 2021 ACM SIGCOMM 2021 Conference, Virtual Event, USA, Aug 2021
  2. SIMODS
    Spectral Neighbor Joining for Reconstruction of Latent Tree Models
    Ariel Jaffe, Noah Amsel, Yariv Aizenbud, Boaz Nadler, Joseph T. Chang, and Yuval Kluger
    SIAM Journal on Mathematics of Data Science (SIMODS), Aug 2021

2020

  1. INDIS
    Computing Bottleneck Structures at Scale for High-Precision Network Performance Analysis
    Noah Amsel, Jordi Ros-Giralt, Sruthi Yellamraju, James Ezick, Brendan Hofe, Alison Ryan, and Richard Lethin
    2020 IEEE/ACM Innovating the Network for Data-Intensive Science (INDIS), Nov 2020

2019

  1. BlackboxNLP
    Finding Hierarchical Structure in Neural Stacks Using Unsupervised Parsing
    William Merrill, Lenny Khazan, Noah Amsel, Yiding Hao, Simon Mendelsohn, and Robert Frank
    In Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Aug 2019

2018

  1. BlackboxNLP
    Context-Free Transductions with Neural Stacks
    Yiding Hao, William Merrill, Dana Angluin, Robert Frank, Noah Amsel, Andrew Benz, and Simon Mendelsohn
    In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Nov 2018