Essays

Recent talks

Reducing Longform Errors in End2End Speech Recognition
(an overview of my work in Google Cloud Speech)
Invited talk at Rutgers University
April 2022
An extended version

Courses

I taught at Columbia University from 2013 to 2018.

Columbia E6894 (fall 2018)
Deep Learning for Computer Vision, Speech, and Language
Co-taught with Xiaodong Cui and Kapil Thadani

Columbia E6894 (spring 2017)
Deep Learning for Computer Vision, Speech, and Language
Co-taught with Xiaodong Cui and Kapil Thadani

Columbia EECS 6894 (spring 2015)
Deep Learning for Computer Vision and Natural Language Processing
Co-taught with James Fan

Columbia EECS 6890 (spring 2014)
Visiual recognition and search
Co-taught with Rogerio Feris and Jun Wang

Columbia EECS 6890 (spring 2013)
Visiual recognition and search
Co-taught with Rogerio Feris

Tutorials

Tutorial at ICME 2019 (Shanghai, July 8-12)
Visual Search and Question Answering
Co-presented with Lu Jiang and Yannis Kalant

Tutorial at Google AI Winter Camp, Beijing 2019
Convolutional Networks and Their Applications

Tutorial at CVPR 2014:
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications
Co-presented with Shih-Fu Chang, John R. Smith, and Rogerio Feris

Tutorial at ACM Multimedia 2013:
Massive-Scale Multimedia Semantic Modeling
Co-presented with John R. Smith