talks/teaching

Please see here for earlier talks.

teaching

Probabilistic Graphical Models

[Course Website]

Many of the problems in artificial intelligence, statistics, computer systems, computer vision, natural language processing, and computational biology, among many other fields, can be viewed as the search for a coherent global conclusion from local information. The probabilistic graphical models framework provides a unified view of this wide range of problems and enables efficient inference, decision-making, and learning in problems with a very large number of attributes and huge datasets. This graduate-level course will provide you with a strong foundation both for applying graphical models to complex problems and for addressing core research topics in graphical models.

Introduction to Machine Learning

[Course Website]

Machine Learning is concerned with computer programs that automatically improve their performance through experience (e.g., programs that learn to recognize human faces, recommend music and movies, and drive autonomous robots). This course covers the theory and practical algorithms for machine learning from a variety of perspectives. Topics include Linear Regression, SVMs, Neural Networks, Graphical Models, and Clustering. Programming assignments include hands-on experiments with various learning algorithms. This course is designed to give a PhD-level student a thorough grounding in the methodologies, technologies, mathematics, and algorithms currently needed by people who do research in machine learning.

talks

  1. CIAI
    On the Utility of Gradient Compression in Distributed Training Systems
    CIAI Colloquium, MBZUAI 2022
  2. Baidu
    From Learning, to Meta-Learning, to "Lego-Learning" – theory, system, and applications
    Baidu 2021
  3. KDD DLD
    It is time for deep learning to understand its expense bills
  4. ACL Meta-NLP
    Learning-to-learn through Model-based Optimization: HPO, NAS, and Distributed Systems
  5. ICML ML4Data
    A Data-Centric View for Composable Natural Language Processing