Abstract: The pretrain-finetune paradigm has emerged as a powerful learning approach, achieving remarkable accuracy gains across a variety of domains. However, its substantial computational cost limits its adoption in broader settings. To address this challenge, we develop MixTraining, a novel training framework that, for the first time, incorporates asynchronous computation into the standard pretrain-finetune pipeline. At a high level, our MixTraining...