log in  |  register  |  feedback?  |  help  |  web accessibility
Logo
Large-Scale Machine Learning Model Computing and Compression
Heng Huang
IRB 0318 or Zoom: https://umd.zoom.us/j/92721031800?pwd=dGhidU13dzl0cmI2eUM4SzJLNTZrZz09
Friday, October 6, 2023, 11:00-11:55 am Calendar
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)
Abstract

Machine learning and artificial intelligence are gaining fresh momentum, and have helped us enhance not only many industrial and professional processes but also our everyday living. The recent success of machine learning heavily relies on the surge of big data, big models, and big computing. However, the inefficient algorithms often restrict the applications of machine learning to very large-scale tasks. In terms of big data, serious concerns, such as communication overhead and convergence speed, should be rigorously addressed when we train learning models using large amounts of data located at multiple computers or devices. In terms of the big model, it is still an underexplored research area if a model is too big to train on a single computer or device. To address these challenging problems, we focused on designing new ultra-scale machine learning algorithms, efficiently optimizing and training models for big data problems, and studying new discoveries in both theory and applications. I will present our recent research results on developing model parallelization algorithms to solve the big model problem in deep neural networks, and distributed learning methods to address the big data computing issues. To deploy the big models in real-world applications with limited computational budget, we also designed new model pruning approaches to compress the big machine learning models to much smaller size with maintaining good performance. I will introduce our newly developed model pruning techniques including interpretation enhanced model pruning and multimodal transformer pruning.

Bio

Dr. Heng Huang is a Brendan Iribe Endowed Professor in Computer Science and Electrical and Computer Engineering at the University Maryland College Park. Dr. Huang received the PhD degree in Computer Science at Dartmouth College. His research areas include machine learning, AI, data mining, and biomedical data science. Dr. Huang has published more than 280 papers in top-tier conferences and many papers in premium journals, such as ICML, NeurIPS, KDD, IJCAI, AAAI, RECOMB, ISMB, ICCV, CVPR, Nature Machine Intelligence, Nucleic Acids Research, Bioinformatics, Medical Image Analysis, Journal of Machine Learning Research, IEEE TPAMI, TMI, TIP, TKDE, TNNLS, etc. As PI, Dr. Huang currently is leading NIH R01s, U01, and multiple NSF funded projects on machine learning, AI, imaging-omics, precision medicine, electronic medical record data analysis and privacy-preserving, smart healthcare, and cyber physical system. He is a Fellow of AIBME and served as the Program Chair of ACM SIGKDD Conference 2020.

This talk is organized by Samuel Malede Zewdu