Shravan Venkatraman

I am an M.Sc. computer vision student at MBZUAI. I am fortunate to be a part of the Intellectual and Visual Analytics Lab, where I am advised by Dr. Fahad Khan and Dr. Salman Khan. I completed my undergrad in computer science at VIT University, advised by Dr. Joe Dhanith and Dr. Pandiyaraju.

My research focuses on developing self-evolving large multimodal models for generalizable multimodal intelligence, within the broader context of multimodal representation learning for reasoning. I also work on unified large-scale models for image understanding and generation, and controllable generation of extended, coherent video sequences.

Prior to this, I was a research intern at Nagasaki University advised by Dr. Muthu Subash Kavitha. In the summer of 2024, I interned at MedxAI under the mentorship of Dr. Susan Elias and Dr. Sheena Pravin. I also work on industry- and consultancy-funded projects through SPORIC at VIT.

Email  /  CV  /  Google Scholar  /  GitHub

News

Nov 21, 2025 – Our paper on EvoLMM, a purely self-evolving framework for LMMs, is now on arXiv!
Nov 27, 2025 – SPROUT has been accepted to Neurocomputing!
Oct 27, 2025 – RG-ViT has been accepted to Computers in Biology and Medicine!
Sep 18, 2025 – I'm honored to serve as a Student Representative for the Computer Vision department at MBZUAI!
Aug 10, 2025 – I'm excited to start my MSc. in Computer Vision at MBZUAI!
Jul 27, 2025 – SAG-ViT has been accepted to Complex and Intelligent Systems!
Jul 14, 2025 – UGPL is accepted to ICCV'25 Workshops: CVAMD! Paper and Code are available!
Apr 17, 2025 – I have successfully defended my bachelor's thesis (titled: Making NeRF See Structure, Not Just Light) at VIT Chennai!
Feb 28, 2025 – FUSION is accepted to CVPR'25 Workshops: NTIRE!
Apr 07, 2025 – Honored to receive the Sir C. V. Raman Award from VIT Chennai for the second time in recognition of my research!
Feb 28, 2025 – We showcased and presented CerviLens at IInvenTiv'25 @IIT Madras, representing MedxAI Innovations!
Jan 25, 2025 – I am honored to have been admitted to the MSc. Computer Vision program at MBZUAI!
Dec 12, 2024 – Proud to have been selected as a recipient of the Sir C. V. Raman Award by VIT Chennai for my research!
Jun 23, 2024 – I presented our paper on attention-fused deep CNNs at ICRAS 2024 in Tokyo, Japan!
Selected Publications
Bone Heatmap
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
Omkar Thawakar*, Shravan Venkatraman*, Ritesh Thawkar*, Abdelrahman M Shaker, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan
Submitted: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26)
paper / code / project page / abs / bibtex

EvoLMM is a fully unsupervised self-evolving framework for large multimodal models (LMMs) that improves visual reasoning from raw images only by coupling a Proposer and a Solver trained via continuous self-consistency rewards.

Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Shravan Venkatraman, Pandiyaraju V
Submitted: Pattern Recognition
code & paper: post acceptance

Enforcing physical surface properties through PDE constraints yields geometrically accurate neural scene representations from sparse views.

SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R
Complex and Intelligent Systems
code / paper / Hugging Face

Structuring attention through multi-scale graphs enable transformers to reason across visual hierarchies.

Teaching
dragon

BCSE332P - Deep Learning Lab (Fall 2024)


Yep it's another Jon Barron and Ben Mildenhall website.
Last updated July 2025.