Prajwal Gatti

I am a second-year Ph.D. student in the Machine Learning and Computer Vision Group at the University of Bristol, advised by Prof. Dima Damen. My research interests are in AI for video understanding.

Prior to this, I was a Research Assistant at the Indian Institute of Technology Jodhpur, advised by Prof. Anand Mishra, where I explored vision-augmented table-to-text generation and cross-modal image retrieval, among other vision-language problems.

I also briefly interned at the Center for Neuroscience, Indian Institute of Science, where I worked on EEG brain-computer interfaces under the guidance of Prof. Sridharan Devarajan.

I received my B.E. in Information Science and Engineering from Dayananda Sagar College of Engineering in 2020.

Email  /  LinkedIn  /  Google Scholar  /  GitHub

Selected Publications
HD-EPIC: A Highly-Detailed Egocentric Video Dataset
Toby Perrett, Ahmad Darkhalil, Saptarshi Sinha, Omar Emara, Sam Pollard, Kranti Kumar Parida, Kaiting Liu, Prajwal Gatti, Siddhant Bansal, Kevin Flanagan, Jacob Chalk, Zhifan Zhu, Rhodri Guerrier, Fahd Abdelazim, Bin Zhu, Davide Moltisanti, Michael Wray, Hazel Doughty, Dima Damen
CVPR 2025
paper / project page / dataset / video teaser / explore samples

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions
Tomáš Souček, Prajwal Gatti, Michael Wray, Ivan Laptev, Dima Damen, Josef Sivic
CVPR 2025
paper / project page / code / dataset

Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation
Shreyas Vaidya*, Arvind Kumar Sharma*, Prajwal Gatti, Anand Mishra
ICPR 2024
paper / project page

Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
Prajwal Gatti, Kshitij Parikh, Dhriti Paul, Manish Gupta, Anand Mishra
AAAI 2024
paper / poster / project page

Towards Making Flowchart Images Machine Interpretable
Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, Anand Mishra
ICDAR 2023
paper / pre-print / project page

VisToT: Vision-Augmented Table-to-Text Generation
Prajwal Gatti, Anand Mishra, Manish Gupta, Mithun Das Gupta
EMNLP 2022
paper / poster / project page

COFAR: Commonsense and Factual Reasoning in Image Search
Prajwal Gatti, Abhirama Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani
AACL-IJCNLP 2022
paper / poster / project page
