The world is changing rapidly, and I haven’t updated this page frequently since the pandemic. I apologize if some links are no longer functional. For my recent papers, please visit arXiv.

Quick links to code and datasets

  1. MMComposition (code and data coming soon)
  2. Ferret: Refer and Ground Anything Anywhere (ICLR'24)
  3. ImageNet Adversarial Text Regions (ImageNet-Atr) dataset
  4. Switchboard Speech Sentiment (ICASSP'20)
  5. Memex QA (CVPR'18)
  6. Tumblr GIF (CVPR'16)
  7. Video2GIF (CVPR'16)
  8. GPU-FV (ICMR'16)
  9. CUDA-based Matrix Factorization (HPDC'16)
  10. LADF for Person Verification: Exp 1, Exp 2 (CVPR'13)
  11. Delta SimRank (BigDataMining'12, Best Paper Award)
  12. Gender from Body (ACM MM'08)

Papers grouped by project (before 2022)

Long-Form Speech Recognition and SSL

[SLT'21: RNN-T model long form errors] [ICASSP'21: Non-Streaming Model Distillation On Unsupervised Data] [Interspeech'21: Targeted Universal Adversarial Perturbations] [Interspeech'21: Bridging the gap between streaming and non-streaming ASR] [IEEE JSTSP'22: BigSSL] [Input Length Matters]
Learning from Noisy Labels

[CVPR 2019: Automatic Adaptation of Object Detectors] [ICCV 2017: Learning from Noisy Labels with Distillation]
Photo Classification, Search and Understanding

[TPAMI 2019: Memex QA] (Junwei's CVPR'20 talk) [TMM 2017: Fashion Images] [TMM 2017: Real Estate Images] [WSDM 2017] [CVPR 2013] [CVPR 2011] [ACMMM 2008] [CVPR 2008] [ICCV 2007]
GIFs and Short Videos

[CVPR 2016: TGIF] [CVPR 2016: Video2GIF]
Video Understanding and Action Detection

[IBMR&D 2015] [ACMMM 2012b] [ACMMM 2012a] [ECCV 2012] [CVPR 2010] [ICCV 2009]
Social Media Mining

[ACMMM 2016] [TMM 2015] [SDM 2011] [WWW 2011] [NeuroComputing 2011] [ACMMM 2010]
Understanding Line Drawings

[ICCV 2005] [ECCV 2006] [ACMMM 2006] [TPAMI 2008a] [TPAMI 2008b]
If you are looking for papers published before 2019, see here.

Patents

US Patents 8,639,042    9,014,420    9,165,217    9,251,434    9,471,851    9,659,258    9,659,560    9,672,814    9,734,166    9,911,223    9,947,073    11,265,271