ICLR 2025 result

Our paper "Mini-batch Coresets for Memory-efficient Language Model Training on Data Mixtures" has been accepted to ICLR 2025.



