Shravan Venkatraman

My research focuses on advancing large multimodal systems that bridge visual perception, representation learning, and the unification of discriminative and generative tasks in open-world environments requiring continual learning and self-evolution.

Email  /  CV  /  Scholar  /  GitHub

Publications

Hover over publications for quick preview

* indicates equal contribution \(\dagger\) denotes my role as mentor

2025
EvoLMM: Self-Evolving Large Mu
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
CVPR 2026 (Findings)
EvoLMM is a fully unsupervised self-evolving framework for LMMs that improves visual reasoning from raw images only, by coupling a Proposer and a Solver trained via continuous self-consistency rewards.
Bone Heatmap
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
Omkar Thawakar*, Shravan Venkatraman*, Ritesh Thawkar*, Abdelrahman M Shaker, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan
CVPR 2026 (Findings)

TIDE: Two-Stage Inverse Degrad
TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26) Workshops [NTIRE]
Two-stage framework that adaptively restores underwater images by identifying local degradation patterns and applying specialized corrections through inverse degradation mapping and progressive refinement.
Bone Heatmap
TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration
Shravan Venkatraman*, Rakesh Raj M*, Pavan Kumar S*, Chandrakla S
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26) Workshops [NTIRE]

UGPL: Uncertainty-Guided Progr
UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based Classification in Computed Tomography
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV'25) Workshops [CVAMD]
Guiding CT image classification by leveraging uncertainty estimates to focus analysis on ambiguous regions through progressive, multi-scale refinement.
Bone Heatmap
UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based Classification in Computed Tomography
Shravan Venkatraman*, Pavan Kumar S*, Rakesh Raj M*, Chandrakla S
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV'25) Workshops [CVAMD]

PCM-NeRF: Probabilistic Camera
PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26) Workshops [GenRecon3D]
Explicitly modeling camera poses as probability distributions with learnable uncertainties rather than fixed points in SE(3) achieves high-quality reconstruction even with significant pose errors.
Bone Heatmap
PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty
Shravan Venkatraman*, Rakesh Raj M*, Pavan Kumar S*
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26) Workshops [GenRecon3D]

Can We Go Beyond Visual Featur
Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26) Workshops [PHAROS-AIF-MIH]
Neural encoding of inter-tissue dependencies enables structurally coherent predictions in boundary-dense regions for histopathology segmentation.
Bone Heatmap
Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology
Shravan Venkatraman, Muthu Subash Kavitha, Joe Dhanith P R, V Manikandarajan, Jia Wu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'26) Workshops [PHAROS-AIF-MIH]

SPROUT:Symptom-centricPrototyp
SPROUT:Symptom-centricPrototypicalRepresentationOptimization andUncertainty-awareTuning for Few-Shot Precision Agriculture
Neurocomputing
Dynamically weighting symptom-representative samples enhances few-shot plant disease recognition in regionally diverse scenarios.
Bone Heatmap
SPROUT: Symptom-centric Prototypical Representation Optimization and Uncertainty-aware Tuning for Few-Shot Precision Agriculture
Shravan Venkatraman, Pavan Kumar S, Pandiyaraju V, Abeshek A, Aravintakshan S A, Kannan A
Neurocomputing

Bayesian Uncertainty Propagati
Bayesian Uncertainty Propagation for Bone Fracture Diagnosis via Region-Aware Adaptive Label Refinement
Under Review
Entropy-guided label pruning and region-aware uncertainty estimation enables fracture diagnosis models to reason under ambiguity.
Bone Heatmap
Bayesian Uncertainty Propagation for Bone Fracture Diagnosis via Region-Aware Adaptive Label Refinement
Shravan Venkatraman, Pandiyaraju V, Abeshek A, Pavan Kumar S, Aravintakshan S A, Kannan A
Under Review
code & paper: out soon

FUSION: Frequency-guided Under
FUSION: Frequency-guided Underwater Spatial Image recOnstructioN
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'25) Workshops [NTIRE]
Fusing spatial detail with frequency-guided attention cues enables perceptual underwater image enhancement across color-distorted environments.
Bone Heatmap
FUSION: Frequency-guided Underwater Spatial Image recOnstructioN
Jaskaran Singh Walia*, Shravan Venkatraman*, Pavithra L K
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'25) Workshops [NTIRE]

Making NeRF See Structure, Not
Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Bachelor's Thesis
Enforcing physical surface properties through PDE constraints yields geometrically accurate neural scene representations from sparse views.
Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Shravan Venkatraman, Pandiyaraju V
Bachelor's Thesis

SAG-ViT: A Scale-Aware, High-F
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Complex and Intelligent Systems
Structuring attention through multi-scale graphs enable transformers to reason across visual hierarchies.
Graph Construction
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R
Complex and Intelligent Systems
Hugging Face

Hierarchical Graph-Guided Cont
Hierarchical Graph-Guided Contextual Representation Learning for Neurodegenerative Pattern Recognition in MRI
Computers in Biology and Medicine
Bridging local-global brain patterns and transforming disconnected MRI patches into spatially-coherent disease markers through residual graphs.
Hierarchical Graph-Guided Contextual Representation Learning for Neurodegenerative Pattern Recognition in MRI
Shravan Venkatraman, Joe Dhanith P R, Muthu Subash Kavitha
Computers in Biology and Medicine


2024
Targeted Neural Architectures
Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI
Under Review
Augmenting encoder-decoder architectures with attention-guided feature extraction helps in highly effective localization, segmentation, and classification of brain tumors.
Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI
Shravan VenkatramanPandiyaraju V, Abeshek A, Aravintakshan S A, Pavan Kumar S, Kannan A, Madhan
Under Review

Multimodal Emotion Recognition
Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
Under Review
Cross-modal attention enables synchronized audio-visual feature extraction through Transformer fusion for emotion recognition.
Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
Joe Dhanith P R, Shravan Venkatraman, Vigya Sharma, Santhosh Malarvannan, Modigari Narendra
Under Review

Traffic Sign Classification Us
Traffic Sign Classification Using Attention Fused Deep Convolutional Neural Networks
8\(^{th}\) International Conference on Robotics and Automation Sciences (ICRAS)
Attention-fused deep convolutional neural networks improve the ability to classify diverse traffic signs through parallel hierarchical and multi-scale feature emphasis.
Traffic Sign Classification Using Attention Fused Deep Convolutional Neural Networks
Shravan Venkatraman, Abeshek A, Santhosh Malarvannan, Shriyans A, Jashwanth R, Joe Dhanith P R
8\(^{th}\) International Conference on Robotics and Automation Sciences (ICRAS)


2023
Improved Tomato Leaf Disease C
Improved Tomato Leaf Disease Classification Through Adaptive Ensemble Models with Exponential Moving Average Fusion and Enhanced Weighted Gradient Optimization
Frontiers in Plant Science
Ensemble deep learning with optimized weighted gradient techniques enables early and accurate detection of tomato leaf diseases.
Improved Tomato Leaf Disease Classification Through Adaptive Ensemble Models with Exponential Moving Average Fusion and Enhanced Weighted Gradient Optimization
Pandiyaraju V, Senthil Kumar A M, Praveen Joe I R, Shravan Venkatraman, Pavan Kumar S, Aravintakshan S A, Abeshek A, Kannan A
Frontiers in Plant Science

Last updated February 2026