About me

I am a Research Scientist at PRIOR, the Computer Vision team at the Allen Institute for Artificial Intelligence. I received my PhD from UIUC where I was advised by Prof. Derek Hoiem and closely collaborated with Prof. Alex Schwing. Before that, I studied Electrical Engineering at IIT Kanpur and began vision and learning research with Prof. Aditya K. Jagannatham.

Research Interests


I am interested in building “agents” that help us with the myriad chores we perform every day in both the digital and the physical world. My recent work includes:

  • Neuro-symbolic visual reasoning framework - Visual Programming (CVPR 2023 Best Paper)
  • Some of the first instruction-following General Purpose Vision systems - GPV-1, GPV-2
  • Benchmark for evaluation of GPVs - GRIT

What’s new?

Nov 2023 Serving as an Area Chair for CVPR 2024
Oct 2023 Invited talk at Mitsubishi Electric Research Labs (MERL) Seminar Series on "Visual Programming"
Sep 2023 Recognized as Outstanding Reviewer at ICCV 2023!
Aug 2023 Serving as an Area Chair for NeurIPS 2023
June 2023 VisProg received the Best Paper Award at CVPR 2023!
June 2023 Hosted the GRIT Challenge at CVPR 2023 as part of the VPLOW Workshop
Jan 2023 Invited talk at the DNOW workshop at WACV 2023 on "Novelty in the Open World: A generalist & multimodal perspective"
Nov 2022 Check out VisProg - a neuro-symbolic system that uses GPT-3 to generate programs for solving complex visual tasks described in natural language. No backprop required!
Sep 2022 Serving as an Area Chair for CVPR 2023
Sep 2022 My thoughts on Meta's new text-to-video model (Make-A-Video) in an MIT Tech Review article
May 2022 GRIT Benchmark is ready to test generality, robustness, and calibration of your models for 7 diverse vision and vision-language tasks!
March 2022 GPV-1 accepted to CVPR 2022!
Feb 2022 GPV-2, a stronger GPV model that learned 10,000 concepts from the web across 5 skills, released on arXiv.
Feb 2022 Invited guest speaker at IIT Kanpur ML School
May 2021 Recognized as an "Outstanding Reviewer" for CVPR 2021!
May 2021 Striving towards General Purpose Vision! Check out the GPV-1 demo.
May 2021 Create learning curves to analyze deep classifiers using our ICML 2021 work.
April 2021 The VidSitu dataset and the VidSRL challenge at CVPR 2021 are now live.
Aug 2020 Contrastive learning approach to weakly supervised phrase grounding presented at ECCV 2020.
Aug 2020 Recognized as an "Outstanding Reviewer" for ECCV 2020!
July 2020 Joined PRIOR @ AI2 as a Research Scientist.
May 2020 Defended my thesis! Thesis & Slides
Sept 2019 Lecture material for guest lecture at CS 598RK: HCI for ML (Fall 2019).
Sept 2019 Code and data released for ICCV 2019 papers:
- ViCo: Word Embeddings from Visual Co-occurrences
- No-Frills Human-Object Interaction Detection

Education

Ph.D. (CS) | UIUC | 2014-2020
B. Tech. (EE) | IIT Kanpur | 2010-2014

Research Internships

Nvidia | Santa Clara | 2019
AI2 | Seattle | 2017
A9.com | Palo Alto | 2015
Cornell | Ithaca | 2013

Teaching

Professional services

  • Have served as a reviewer for TPAMI, CVPR, ICCV, ECCV, and NeurIPS since 2016
  • Recognized as an Outstanding Reviewer for ECCV 2020, CVPR 2021, and ICCV 2023