I am a principal scientist at Apple in Cupertino, California. Previously I worked as a scientist/engineer at Google, Yahoo!, and IBM, as well as an adjunct associate professor at Columbia University and UMass. I was a recipient of the ACM SIGMM Rising Star Award. I won 1st place in the ImageNet LSVRC Challenge in 2010. In 2016, I co-founded a startup named Switi Inc and worked as the CTO. After the startup was acquired, I worked as the tech lead for Google Cloud speech modeling and then the tech lead for Cloud vision modeling. Here is my CV (updated June 2024).
Liangliang Cao
Scientist at Apple Inc.
Personal email: llcao[at]cs.umass.edu
[LinkedIn], [X], [Google Scholar], [DBLP], [arXiv]
Bio
News
- Apple Intelligence was announced at WWDC'24! It was a great experience to act as a modeling lead and engineering lead supporting a number of AI features.
- New paper and code for "Ferret: Refer and Ground Anything Anywhere" are available.
- The ImageNet Adversarial Text Regions (ImageNet-Atr) dataset is available. It is similar to the ImageNet eval set, but challenging for typical CLIP models. For example, the Open-CLIP B-16 model trained on the LAION dataset reached a top-1 zero-shot accuracy of 29.4%.
Recent Essays
- Modern AI R&D Differs from Classic Research
- Does your AI need a (good) screen?
- Book reading notes: The heartbeats of AI development
- Understanding LLMs through the analysis of search engines
- The struggles of New Bing: insights for AI products in the genAI era.
- Can content generation AI become the next Web search?
Recent Projects
- Apple Intelligence: modeling and engineering lead, 2023-2024
- Google Cloud Vision: proposed the Cloud Multimodal Services, 2022
- Google Speech: team lead for Cloud speech modeling, 2018-2021
- Google-Verizon Contact Center AI: tech lead and sales engineer lead to win one of the largest contracts in the history of Cloud AI, 2019