Machine Learning Parallelism Could Be Adaptive, Composable and Automated

Abstract: In recent years, researchers in SysML have created algorithms and systems that parallelize ML training over multiple devices or computational nodes. As ML models become more structurally complex, many systems have struggled to provide all-round performance across a variety of models. In particular, ML scale-up is usually underestimated in terms of the amount [...]