Prajwal Gatti

I am a third year Ph.D. student at the University of Bristol advised by Prof. Dima Damen, in the Machine Learning and Computer Vision Group. My research interests are in video understanding and generation.

Prior to this, I was a Research Assistant at the Indian Institute of Technology Jodhpur advised by Prof. Anand Mishra where I explored the problems of vision-augemented table-to-text generation, cross-modal image retrieval among other vision-language problems.

I also briefly interned at the Center for Neuroscience, Indian Institute of Science where I worked on EEG Brain-Computer Interfaces under the advise of Prof. Sridharan Devarajan.

I received my B.E. in Information Science and Engineering at Dayananda Sagar College of Engineering in 2020.

Email / LinkedIn / Google Scholar / GitHub

News

June 2026: Our work Gen2Balance is accepted at ECCV 2026. Paper and code available.
Feb 2025: Introducing HD-EPIC: a richly-annotated egocentric dataset. Accepted at CVPR 2025. Paper now available!
Dec 2024: Our work ShowHowTo is accepted at CVPR 2025. Check it out!
Jul 2024: Attending the ICVSS 2024 summer school.
Jan 2024: Started my Ph.D. in Computer Science at the University of Bristol, advised by Prof. Dima Damen! 🥳
Dec 2023: Our work CSTBIR has been accepted at AAAI 2024.
Nov 2023: Speaking at the AI-ML 2023 (All India Track) Conference about my work on VisToT.
Aug 2023: Our work Towards Making Flowchart Images Machine Interpretable has been accepted at ICDAR 2023.
Apr 2023: In the organizing team of Summer Challenge on Writer Verification at NCVPRIPG'23. Consider participating!
Nov 2022: Will be attending AACL 2022 virtually, and EMNLP 2022 in-person at Abu Dhabi.
Oct 2022: Our work VisToT has been accepted at EMNLP 2022.
Sep 2022: Our work COFAR has been accepted at AACL-IJCNLP 2022.

Selected Publications

	Gen2Balance: Generative Balancing for Long-Tailed Video Action Recognition Prajwal Gatti, Simon Jenni Fabian Caba Heilbron Dima Damen ECCV 2026 paper / project page / code
	HD-EPIC: A Highly-Detailed Egocentric Video Dataset Toby Perrett, Ahmad Darkhalil, Saptarshi Sinha, Omar Emara, Sam Pollard, Kranti Kumar Parida, Kaiting Liu, Prajwal Gatti, Siddhant Bansal, Kevin Flanagan, Jacob Chalk, Zhifan Zhu, Rhodri Guerrier, Fahd Abdelazim, Bin Zhu, Davide Moltisanti, Michael Wray, Hazel Doughty, Dima Damen CVPR 2025 paper / project page / dataset / video teaser / explore samples
	ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions Tomáš Souček, Prajwal Gatti, Michael Wray, Ivan Laptev, Dima Damen, Josef Sivic CVPR 2025 paper / project page / code / dataset
	Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation Shreyas Vaidya^, Arvind Kumar Sharma^, Prajwal Gatti, Anand Mishra ICPR 2024 paper / project page
	Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions Prajwal Gatti, Kshitij Parikh, Dhriti Paul, Manish Gupta, Anand Mishra AAAI 2024 paper / poster / project page
	Towards Making Flowchart Images Machine Interpretable Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, Anand Mishra ICDAR 2023 paper / pre-print / project page
	VisToT: Vision-Augmented Table-to-Text Generation Prajwal Gatti, Anand Mishra, Manish Gupta, Mithun Das Gupta EMNLP 2022 paper / poster / project page
	COFAR: Commonsense and Factual Reasoning in Image Search Prajwal Gatti, Abhirama Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani AACL-IJCNLP 2022 paper / poster / project page

Template credits: this and this