Research
I am interested in understanding and improving multimodal AI systems through interpretability.
|
|
Investigating Mechanisms for In-Context Vision Language Binding
Darshana Saravanan, Makarand Tapaswi, Vineet Gandhi
Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), MIV, 2025
Oral
arxiv /
|
|
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment
Darshana Saravanan*, Varun Gupta*, Darshan Singh*, Zeeshan Khan, Vineet Gandhi, Makarand Tapaswi
Conference on Computer Vision and Pattern Recognition (CVPR), 2025
arxiv /
website /
|
|
Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning
Darshana Saravanan, Naresh Manwani, Vineet Gandhi
Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), FGVC-12, 2025
Best Paper Award
arxiv /
code /
|
|