|
HD-EPIC: A Highly-Detailed Egocentric Video Dataset
Toby Perrett,
Ahmad Darkhalil,
Saptarshi Sinha,
Omar Emara,
Sam Pollard,
Kranti Kumar Parida,
Kaiting Liu,
Prajwal Gatti,
Siddhant Bansal,
Kevin Flanagan,
Jacob Chalk,
Zhifan Zhu,
Rhodri Guerrier,
Fahd Abdelazim,
Bin Zhu,
Davide Moltisanti,
Michael Wray,
Hazel Doughty,
Dima Damen
CVPR 2025
paper /
project page /
dataset /
video teaser /
explore samples
|
|
ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions
Tomáš Souček,
Prajwal Gatti,
Michael Wray,
Ivan Laptev,
Dima Damen,
Josef Sivic
CVPR 2025
paper /
project page /
code /
dataset
|
|
Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation
Shreyas Vaidya*,
Arvind Kumar Sharma*,
Prajwal Gatti,
Anand Mishra
ICPR 2024
paper /
project page
|
|
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
Prajwal Gatti,
Kshitij Parikh,
Dhriti Paul,
Manish Gupta,
Anand Mishra
AAAI 2024
paper /
poster /
project page
|
|
Towards Making Flowchart Images Machine Interpretable
Shreya Shukla,
Prajwal Gatti,
Yogesh Kumar,
Vikash Yadav,
Anand Mishra
ICDAR 2023
paper /
pre-print /
project page
|
|
VisToT: Vision-Augmented Table-to-Text Generation
Prajwal Gatti,
Anand Mishra,
Manish Gupta,
Mithun Das Gupta
EMNLP 2022
paper /
poster /
project page
|
|
COFAR: Commonsense and Factual Reasoning in Image Search
Prajwal Gatti,
Abhirama Penamakuri,
Revant Teotia,
Anand Mishra,
Shubhashis Sengupta,
Roshni Ramnani
AACL-IJCNLP 2022
paper /
poster /
project page
|
|