Shravan Venkatraman

My research focuses on advancing large multimodal systems that bridge visual perception, representation learning, and the unification of discriminative and generative tasks in open-world environments requiring continual learning and self-evolution.

Email  /  CV  /  Scholar  /  GitHub

Publications

Hover over publications for quick preview

* indicates equal contribution \(\dagger\) denotes my role as mentor

2025
EvoLMM: Self-Evolving Large Mu
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
CVPR 2026 (Findings)
Bone Heatmap
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
Omkar Thawakar*, Shravan Venkatraman*, Ritesh Thawkar*, Abdelrahman M Shaker, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan
CVPR 2026 (Findings)
paper / code / project page / abs / bibtex

EvoLMM is a fully unsupervised self-evolving framework for LMMs that improves visual reasoning from raw images only, by coupling a Proposer and a Solver trained via continuous self-consistency rewards.

TIDE: Two-Stage Inverse Degrad
TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration
arXiv
Bone Heatmap
TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration
Shravan Venkatraman*, Rakesh Raj M*, Pavan Kumar S*, Chandrakla S
arXiv
paper / code / project page / abs / bibtex

Two-stage framework that adaptively restores underwater images by identifying local degradation patterns and applying specialized corrections through inverse degradation mapping and progressive refinement.

UGPL: Uncertainty-Guided Progr
UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based Classification in Computed Tomography
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV'25) Workshops
Bone Heatmap
UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based Classification in Computed Tomography
Shravan Venkatraman*, Pavan Kumar S*, Rakesh Raj M*, Chandrakla S
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV'25) Workshops
project page / paper / code / abs / bibtex

Guiding CT image classification by leveraging uncertainty estimates to focus analysis on ambiguous regions through progressive, multi-scale refinement.

PCM-NeRF: Probabilistic Camera
PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty
Under Review
Bone Heatmap
PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty
Shravan Venkatraman*, Rakesh Raj M*, Pavan Kumar S*
Under Review
abs / project page / code and paper: out soon

Explicitly modeling camera poses as probability distributions with learnable uncertainties rather than fixed points in SE(3) achieves high-quality reconstruction even with significant pose errors.

Can We Go Beyond Visual Featur
Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology
arXiv
Bone Heatmap
Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology
Shravan Venkatraman, Muthu Subash Kavitha, Joe Dhanith P R, V Manikandarajan, Jia Wu
arXiv
paper / code / project page / abs / bibtex

Neural encoding of inter-tissue dependencies enables structurally coherent predictions in boundary-dense regions for histopathology segmentation.

SPROUT:Symptom-centricPrototyp
SPROUT:Symptom-centricPrototypicalRepresentationOptimization andUncertainty-awareTuning for Few-Shot Precision Agriculture
Neurocomputing
Bone Heatmap
SPROUT: Symptom-centric Prototypical Representation Optimization and Uncertainty-aware Tuning for Few-Shot Precision Agriculture
Shravan Venkatraman, Pavan Kumar S, Pandiyaraju V, Abeshek A, Aravintakshan S A, Kannan A
Neurocomputing
paper / code / abs / bibtex

Dynamically weighting symptom-representative samples enhances few-shot plant disease recognition in regionally diverse scenarios.

Bayesian Uncertainty Propagati
Bayesian Uncertainty Propagation for Bone Fracture Diagnosis via Region-Aware Adaptive Label Refinement
Under Review
Bone Heatmap
Bayesian Uncertainty Propagation for Bone Fracture Diagnosis via Region-Aware Adaptive Label Refinement
Shravan Venkatraman, Pandiyaraju V, Abeshek A, Pavan Kumar S, Aravintakshan S A, Kannan A
Under Review
abs / code & paper: out soon

Entropy-guided label pruning and region-aware uncertainty estimation enables fracture diagnosis models to reason under ambiguity.

FUSION: Frequency-guided Under
FUSION: Frequency-guided Underwater Spatial Image recOnstructioN
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'25) Workshops
Bone Heatmap
FUSION: Frequency-guided Underwater Spatial Image recOnstructioN
Jaskaran Singh Walia*, Shravan Venkatraman*, Pavithra L K
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'25) Workshops
paper / code / project page / abs / bibtex

Fusing spatial detail with frequency-guided attention cues enables perceptual underwater image enhancement across color-distorted environments.

Making NeRF See Structure, Not
Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Under Review
Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Shravan Venkatraman, Pandiyaraju V
Under Review
abs / code & paper: out soon

Enforcing physical surface properties through PDE constraints yields geometrically accurate neural scene representations from sparse views.

SAG-ViT: A Scale-Aware, High-F
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Complex and Intelligent Systems
Graph Construction
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R
Complex and Intelligent Systems
code / paper / Hugging Face / abs / bibtex

Structuring attention through multi-scale graphs enable transformers to reason across visual hierarchies.

Hierarchical Graph-Guided Cont
Hierarchical Graph-Guided Contextual Representation Learning for Neurodegenerative Pattern Recognition in MRI
Computers in Biology and Medicine
Hierarchical Graph-Guided Contextual Representation Learning for Neurodegenerative Pattern Recognition in MRI
Shravan Venkatraman, Joe Dhanith P R, Muthu Subash Kavitha
Computers in Biology and Medicine
paper / bibtex / abs

Bridging local-global brain patterns and transforming disconnected MRI patches into spatially-coherent disease markers through residual graphs.


2024
Targeted Neural Architectures
Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI
Under Review
Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI
Shravan Venkatraman Pandiyaraju V, Abeshek A, Aravintakshan S A, Pavan Kumar S, Kannan A, Madhan
Under Review
paper / abs / bibtex

Augmenting encoder-decoder architectures with attention-guided feature extraction helps in highly effective localization, segmentation, and classification of brain tumors.

Multimodal Emotion Recognition
Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
Under Review
Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
Joe Dhanith P R, Shravan Venkatraman, Vigya Sharma, Santhosh Malarvannan, Modigari Narendra
Under Review
code / paper / abs / bibtex

Cross-modal attention enables synchronized audio-visual feature extraction through Transformer fusion for emotion recognition.

Traffic Sign Classification Us
Traffic Sign Classification Using Attention Fused Deep Convolutional Neural Networks
8\(^{th}\) International Conference on Robotics and Automation Sciences (ICRAS)
Traffic Sign Classification Using Attention Fused Deep Convolutional Neural Networks
Shravan Venkatraman, Abeshek A, Santhosh Malarvannan, Shriyans A, Jashwanth R, Joe Dhanith P R
8\(^{th}\) International Conference on Robotics and Automation Sciences (ICRAS)
paper / abs / bibtex

Attention-fused deep convolutional neural networks improve the ability to classify diverse traffic signs through parallel hierarchical and multi-scale feature emphasis.


2023
Improved Tomato Leaf Disease C
Improved Tomato Leaf Disease Classification Through Adaptive Ensemble Models with Exponential Moving Average Fusion and Enhanced Weighted Gradient Optimization
Frontiers in Plant Science
Improved Tomato Leaf Disease Classification Through Adaptive Ensemble Models with Exponential Moving Average Fusion and Enhanced Weighted Gradient Optimization
Pandiyaraju V, Senthil Kumar A M, Praveen Joe I R, Shravan Venkatraman, Pavan Kumar S, Aravintakshan S A, Abeshek A, Kannan A
Frontiers in Plant Science
paper / abs / bibtex

Ensemble deep learning with optimized weighted gradient techniques enables early and accurate detection of tomato leaf diseases.

Last updated February 2026