Essays

Recent talks

Scale Learning in Image Semantics: A 15-Year Review and A Case Study
Invited talk at IEEE CVPR Workshop on Fair, Data Efficient, Trusted Computer Vision
June 2024


Reducing Longform Errors in End2End Speech Recognition
(an overview of my work in Google Cloud Speech)
Invited talk at Rutgers University
April 2022
An extended version

Courses

I taught at Columbia University from 2013 to 2018.

Columbia E6894 (fall 2018)
Deep Learning for Computer Vision, Speech, and Language
Co-taught with Xiaodong Cui and Kapil Thadani

Columbia E6894 (spring 2017)
Deep Learning for Computer Vision, Speech, and Language
Co-taught with Xiaodong Cui and Kapil Thadani

Columbia EECS 6894 (spring 2015)
Deep Learning for Computer Vision and Natural Language Processing
Co-taught with James Fan

Columbia EECS 6890 (spring 2014)
Visiual recognition and search
Co-taught with Rogerio Feris and Jun Wang

Columbia EECS 6890 (spring 2013)
Visiual recognition and search
Co-taught with Rogerio Feris

Tutorials

Tutorial at ICME 2019 (Shanghai, July 8-12)
Visual Search and Question Answering
Co-presented with Lu Jiang and Yannis Kalant

Tutorial at Google AI Winter Camp, Beijing 2019
Convolutional Networks and Their Applications

Tutorial at CVPR 2014:
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications
Co-presented with Shih-Fu Chang, John R. Smith, and Rogerio Feris

Tutorial at ACM Multimedia 2013:
Massive-Scale Multimedia Semantic Modeling
Co-presented with John R. Smith