Publications

(*) denotes equal contribution

2025

  1. Do We Need All the Synthetic Data? Towards Targeted Synthetic Image Augmentation via Diffusion Models
    Dang Nguyen* , Jiping Li*, Jinghao Zheng, and 1 more author
    arXiv preprint arXiv:2505.21574, 2025
  2. Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity
    Dang NguyenAli Payani, and Baharan Mirzasoleiman
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025
  3. Synthetic Text Generation for Training Large Language Models via Gradient Matching
    Dang Nguyen*Zeman Li*Mohammadhossein Bateni, and 3 more authors
    International Conference on Machine Learning (ICML), 2025
  4. Mini-batch Coresets for Memory-efficient Language Model Training on Data Mixtures
    Dang NguyenWenhan YangRathul Anand, and 2 more authors
    International Conference on Learning Representations (ICLR), 2025

2024

  1. Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution Generalization
    Dang NguyenPaymon HaddadEric Gan, and 1 more author
    Advances in Neural Information Processing Systems, 2024
  2. Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift
    Yihao XueSiddharth JoshiDang Nguyen, and 1 more author
    International Conference on Learning Representations (ICLR), 2024
    Data-centric Machine Learning Research (DMLR) Workshop at ICLR 2024

2023

  1. Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction
    Khai Nguyen*Dang Nguyen*, and Nhat Ho
    International Conference on Machine Learning (ICML), 2023
  2. On Cross-Layer Alignment for Model Fusion of Heterogeneous Neural Networks
    Dang NguyenTrang NguyenKhai Nguyen, and 3 more authors
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
    Top 3%

2022

  1. Improving Mini-batch Optimal Transport via Partial Transportation
    Khai Nguyen*Dang Nguyen*The-Anh Vu-Le, and 2 more authors
    International Conference on Machine Learning (ICML), 2022
  2. On Transportation of Mini-batches: A Hierarchical Approach
    Khai NguyenDang NguyenQuoc Nguyen, and 5 more authors
    International Conference on Machine Learning (ICML), 2022