Publications

(*) denotes equal contribution

2024

  1. Mini-batch Coresets for Memory-efficient Training of Large Language Models
    Dang NguyenWenhan Yang, Rathul Anand, and 2 more authors
    arXiv preprint arXiv:2407.19580, 2024
  2. Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution Generalization
    Dang Nguyen, Paymon Haddad, Eric Gan, and 1 more author
    Advances in Neural Information Processing Systems, 2024
  3. Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift
    Yihao XueSiddharth JoshiDang Nguyen, and 1 more author
    International Conference on Learning Representations (ICLR), 2024
    Data-centric Machine Learning Research (DMLR) Workshop at ICLR 2024

2023

  1. Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction
    Khai Nguyen*Dang Nguyen*, and Nhat Ho
    International Conference on Machine Learning (ICML), 2023
  2. On Cross-Layer Alignment for Model Fusion of Heterogeneous Neural Networks
    Dang NguyenTrang NguyenKhai Nguyen, and 3 more authors
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
    Top 3%

2022

  1. Improving Mini-batch Optimal Transport via Partial Transportation
    Khai Nguyen*Dang Nguyen*The-Anh Vu-Le, and 2 more authors
    International Conference on Machine Learning (ICML), 2022
  2. On Transportation of Mini-batches: A Hierarchical Approach
    Khai NguyenDang NguyenQuoc Nguyen, and 5 more authors
    International Conference on Machine Learning (ICML), 2022