Shravan Venkatraman

I am an M.Sc. computer vision student at MBZUAI. I am fortunate to be a part of the Intellectual and Visual Analytics Lab, where I am advised by Dr. Fahad Khan and Dr. Salman Khan. I completed my undergrad in computer science at VIT University, advised by Dr. Joe Dhanith and Dr. Pandiyaraju.

My research focuses on developing unified large-scale models for image understanding and generation, within the broader context of multimodal representation learning for reasoning. I also work on self-questioning reasoning in LMMs, and controllable generation of extended, coherent video sequences.

Prior to this, I was a research intern at Nagasaki University advised by Dr. Muthu Subash Kavitha. In the summer of 2024, I interned at MedxAI under the mentorship of Dr. Susan Elias and Dr. Sheena Pravin. I also work on industry- and consultancy-funded projects through SPORIC at VIT.

Email  /  CV  /  Google Scholar  /  GitHub

News

Sep 18, 2025 – I'm honored to serve as a Student Representative for the Computer Vision department at MBZUAI!
Aug 10, 2025 – I'm excited to start my MSc. in Computer Vision at MBZUAI!
Jul 27, 2025 – SAG-ViT has been accepted to Complex and Intelligent Systems!
Jul 14, 2025 – UGPL is accepted to ICCV'25 Workshops: CVAMD! Paper and Code are available!
Apr 17, 2025 – I have successfully defended my bachelor's thesis (titled: Making NeRF See Structure, Not Just Light) at VIT Chennai!
Feb 28, 2025 – FUSION is accepted to CVPR'25 Workshops: NTIRE!
Apr 07, 2025 – Honored to receive the Sir C. V. Raman Award from VIT Chennai for the second time in recognition of my research!
Feb 28, 2025 – We showcased and presented CerviLens at IInvenTiv'25 @IIT Madras, representing MedxAI Innovations!
Jan 25, 2025 – I am honored to have been admitted to the MSc. Computer Vision program at MBZUAI!
Dec 12, 2024 – Proud to have been selected as a recipient of the Sir C. V. Raman Award by VIT Chennai for my research!
Jun 23, 2024 – I presented our paper on attention-fused deep CNNs at ICRAS 2024 in Tokyo, Japan!
Selected Publications
Bone Heatmap
UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based Classification in Computed Tomography
Shravan Venkatraman*, Pavan Kumar S*, Rakesh Raj M*, Chandrakala S
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Workshops
project page / paper / code / abs / bibtex

Guiding CT image classification by leveraging uncertainty estimates to focus analysis on ambiguous regions through progressive, multi-scale refinement.

Bone Heatmap
FUSION: Frequency-guided Underwater Spatial Image recOnstructioN
Jaskaran Singh Walia*, Shravan Venkatraman*, Pavithra L K
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops
paper / code / project page / abs / bibtex

Fusing spatial detail with frequency-guided attention cues enables perceptual underwater image enhancement across color-distorted environments.

Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Shravan Venkatraman, Pandiyaraju V
Submitted: Pattern Recognition
code & paper: post acceptance

Enforcing physical surface properties through PDE constraints yields geometrically accurate neural scene representations from sparse views.

SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R
Complex and Intelligent Systems
code / paper / Hugging Face

Structuring attention through multi-scale graphs enable transformers to reason across visual hierarchies.

Teaching
dragon

BCSE332P - Deep Learning Lab (Fall 2024)


Yep it's another Jon Barron and Ben Mildenhall website.
Last updated July 2025.