AI Research
Apr 9, 2026
New AI Training Method Enhances Model Efficiency by Compressing During Learning
AI Summary
Researchers have developed a technique called CompreSSM that compresses AI models during training, improving efficiency without sacrificing performance. The method lets models discard unnecessary components early in the training process, yielding faster training times and accuracy comparable to that of larger models.

- Training large AI models is resource-intensive, often requiring a trade-off between size and performance.
- Researchers from MIT, Max Planck Institute, and others created CompreSSM, a method that compresses models during training rather than after.
- CompreSSM focuses on state-space models, using control theory to identify and remove less important components early in the training process.
- The technique allows models to maintain high accuracy while training up to 1.5 times faster than traditional methods.
- For example, a compressed model achieved 85.7% accuracy on the CIFAR-10 benchmark, outperforming a smaller model trained from scratch.
- CompreSSM is distinct from conventional pruning and knowledge distillation, as it makes compression decisions during training, reducing overall computational costs.
- The method has been shown to be over 40 times faster than existing spectral techniques while achieving higher accuracy.
- The researchers proved that the importance of model states changes smoothly during training, providing confidence in early compression decisions.
- CompreSSM is particularly effective for multi-input, multi-output models, with potential applications in various AI architectures, including linear attention.
- Future research aims to extend the technique to other architectures used in industry today, enhancing its applicability in modern AI systems.
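The control-theory idea behind the bullets above can be illustrated with a small sketch. This is not the paper's code, and all values below are made-up example numbers: for a diagonal continuous-time state-space model (x' = Ax + Bu, y = Cx with A = diag(a_i), a_i < 0), the states decouple and each state's Hankel singular value, a standard importance measure from control theory, is sigma_i = |b_i * c_i| / (2|a_i|). Pruning the states with the smallest sigma_i barely changes the model's impulse response.

```python
import math

# Hypothetical diagonal state-space model: stable poles a_i < 0,
# input weights b_i, output weights c_i (illustrative values only).
a = [-1.0, -2.0, -10.0, -50.0]
b = [1.0, 0.5, 0.1, 0.05]
c = [1.0, 0.8, 0.1, 0.02]

# Per-state importance: Hankel singular value of each decoupled state,
# sigma_i = |b_i * c_i| / (2 * |a_i|).
sigma = [abs(bi * ci) / (2 * abs(ai)) for ai, bi, ci in zip(a, b, c)]

# Keep only the two most important states (the "compression" step).
keep = sorted(range(len(a)), key=lambda i: -sigma[i])[:2]

def impulse(t, idx):
    # Impulse response of the subsystem built from states in idx:
    # h(t) = sum_i c_i * exp(a_i * t) * b_i
    return sum(c[i] * math.exp(a[i] * t) * b[i] for i in idx)

# Compare full vs pruned impulse response on a time grid.
ts = [k * 0.05 for k in range(200)]
err = max(abs(impulse(t, range(len(a))) - impulse(t, keep)) for t in ts)
print(sorted(keep), round(err, 4))  # the pruned model tracks the full one closely
```

CompreSSM's contribution, per the summary, is making this kind of importance-based decision *during* training rather than on a finished model; the sketch only shows why the control-theoretic importance score is a sensible pruning criterion.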
ai models · control theory · training efficiency · compute costs · performance