Shravan Venkatraman

My research focuses on advancing large multimodal systems that bridge visual perception, representation learning, and the unification of discriminative and generative tasks in open-world environments requiring continual learning and self-evolution.

Email  /  CV  /  Scholar  /  GitHub

Publications

* indicates equal contribution; \(\dagger\) denotes my role as mentor

2025
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
Omkar Thawakar*, Shravan Venkatraman*, Ritesh Thawkar*, Abdelrahman M Shaker, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan
arXiv
paper / code / project page / abs / bibtex

EvoLMM is a fully unsupervised self-evolving framework for LMMs that improves visual reasoning from raw images only, by coupling a Proposer and a Solver trained via continuous self-consistency rewards.

TIDE: Two-Stage Inverse Degradation Estimation with Guided Prior Disentanglement for Underwater Image Restoration
Shravan Venkatraman*, Rakesh Raj M*, Pavan Kumar S*, Chandrakala S
Winter Conference on Applications of Computer Vision (WACV 2026)
paper / code / project page / abs / bibtex

Two-stage framework that adaptively restores underwater images by identifying local degradation patterns and applying specialized corrections through inverse degradation mapping and progressive refinement.

UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based Classification in Computed Tomography
Shravan Venkatraman*, Pavan Kumar S*, Rakesh Raj M*, Chandrakala S
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV'25) Workshops
project page / paper / code / abs / bibtex

Guiding CT image classification by leveraging uncertainty estimates to focus analysis on ambiguous regions through progressive, multi-scale refinement.

PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty
Shravan Venkatraman*, Rakesh Raj M*, Pavan Kumar S*
The 36th British Machine Vision Conference (BMVC 2025)
abs / project page / code and paper: out soon

Explicitly modeling camera poses as probability distributions with learnable uncertainties rather than fixed points in SE(3) achieves high-quality reconstruction even with significant pose errors.

Can We Go Beyond Visual Features? Neural Tissue Relation Modeling for Relational Graph Analysis in Non-Melanoma Skin Histology
Shravan Venkatraman, Muthu Subash Kavitha, Joe Dhanith P R, V Manikandarajan, Jia Wu
Medical Image Computing and Computer-Assisted Intervention (MICCAI'25) Workshops
paper / code / project page / abs / bibtex

Neural encoding of inter-tissue dependencies enables structurally coherent predictions in boundary-dense regions for histopathology segmentation.

Rethinking Knowledge Retrieval for Generation: A Survey on RAG Architectures and Applications
Meghana Sunil*, Shravya V*, Shravan Venkatraman \(^{\dagger}\), Joe Dhanith P R
Artificial Intelligence Review
abs / paper: out soon

Survey of modular Retrieval-Augmented Generation frameworks that improve LLM reliability, grounding, and controllability in open-domain tasks.

A Lightweight Continual Learning Approach via Retrieval-Augmented Generation for Personalized AI Assistants
Shravan Venkatraman*, Pavan Kumar S*, Jayasankar K S*, Meghana Sunil*, Gowri Ajith*, Santhosh Malarvannan*, Joe Dhanith P R
In Progress
abs / paper: out soon

A lightweight continual learning pipeline for efficient RAG workflows in AI agents.

SPROUT: Symptom-centric Prototypical Representation Optimization and Uncertainty-aware Tuning for Few-Shot Precision Agriculture
Shravan Venkatraman, Pavan Kumar S, Pandiyaraju V, Abeshek A, Aravintakshan S A, Kannan A
Neurocomputing
paper / code / abs / bibtex

Dynamically weighting symptom-representative samples enhances few-shot plant disease recognition in regionally diverse scenarios.

Bayesian Uncertainty Propagation for Bone Fracture Diagnosis via Region-Aware Adaptive Label Refinement
Shravan Venkatraman, Pandiyaraju V, Abeshek A, Pavan Kumar S, Aravintakshan S A, Kannan A
Knowledge-Based Systems
abs / code & paper: out soon

Entropy-guided label pruning and region-aware uncertainty estimation enable fracture diagnosis models to reason under ambiguity.

FUSION: Frequency-guided Underwater Spatial Image recOnstructioN
Jaskaran Singh Walia*, Shravan Venkatraman*, Pavithra L K
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR'25) Workshops
paper / code / project page / abs / bibtex

Fusing spatial detail with frequency-guided attention cues enables perceptual underwater image enhancement across color-distorted environments.

Making NeRF See Structure, Not Just Light: Enforcing PDE-Based Surface Constraints for 3D Consistency
Shravan Venkatraman, Pandiyaraju V
Pattern Recognition
abs / code & paper: out soon

Enforcing physical surface properties through PDE constraints yields geometrically accurate neural scene representations from sparse views.

SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R
Complex and Intelligent Systems
code / paper / Hugging Face / abs / bibtex

Structuring attention through multi-scale graphs enables transformers to reason across visual hierarchies.

Hierarchical Graph-Guided Contextual Representation Learning for Neurodegenerative Pattern Recognition in MRI
Shravan Venkatraman, Joe Dhanith P R, Muthu Subash Kavitha
Computers in Biology and Medicine
paper / bibtex / abs

Bridging local and global brain patterns and transforming disconnected MRI patches into spatially coherent disease markers through residual graphs.


2024
Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI
Shravan Venkatraman, Pandiyaraju V, Abeshek A, Aravintakshan S A, Pavan Kumar S, Kannan A, Madhan
Applied Soft Computing
paper / abs / bibtex

Augmenting encoder-decoder architectures with attention-guided feature extraction improves localization, segmentation, and classification of brain tumors.

Statistical and Multivariate Feature Selection with Dynamic Graph Learning and Domain-Informed Fusion for Histopathological Image Classification
Shravan Venkatraman, Pandiyaraju V
Biomedical Signal Processing and Control
abs / code and paper: out soon

Dynamic graph construction based on tissue-specific nuclear spatial distributions helps neural networks better understand heterogeneous histopathological structures.

Leveraging Bi-Focal Perspectives and Granular Feature Integration for Accurate and Reliable Early Alzheimer’s Detection
Shravan Venkatraman, Pandiyaraju V, Abeshek A, Pavan Kumar S, Aravintakshan S A
IEEE Access
paper / abs / bibtex

Bi-focal perspectives guide neural networks to focus on subtle brain abnormalities, while granular feature extraction at multiple scales identifies neurofibrillary tangles and amyloid plaques in MRI scans for accurate Alzheimer's detection.

Exploiting Precision Mapping and Component-Specific Feature Enhancement for Breast Cancer Segmentation and Identification
Pandiyaraju V, Shravan Venkatraman, Saraswathi D, Pavan Kumar S, Santhosh Malarvannan, Kannan A
Ain Shams Journal
paper / abs / bibtex

Dynamic spatial mapping and component-specific feature enhancement overcome boundary delineation challenges in breast ultrasound imaging.

Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
Joe Dhanith P R, Shravan Venkatraman, Vigya Sharma, Santhosh Malarvannan, Modigari Narendra
IEEE Transactions on Affective Computing
code / paper / abs / bibtex

Cross-modal attention enables synchronized audio-visual feature extraction through Transformer fusion for emotion recognition.

Traffic Sign Classification Using Attention Fused Deep Convolutional Neural Networks
Shravan Venkatraman, Abeshek A, Santhosh Malarvannan, Shriyans A, Jashwanth R, Joe Dhanith P R
8\(^{th}\) International Conference on Robotics and Automation Sciences (ICRAS)
paper / abs / bibtex

Attention-fused deep convolutional neural networks improve the ability to classify diverse traffic signs through parallel hierarchical and multi-scale feature emphasis.


2023
Enhancing Traffic Sign Classification in Autonomous Vehicular Technology Using Weather-Conditioned Synthetic Data and Xception-Enhanced Vision Transformers
Joe Dhanith P R, Shravan Venkatraman, Raja Soosaimarian Peter Raj, Abeshek A, Santhosh Malarvannan, Jashwanth R, Shriyans A
IEEE Transactions on Intelligent Vehicles
code / paper / abs

Physically grounded weather conditioning through GANs, combined with adaptive Transformer tokenization, preserves high-frequency sign details under adverse conditions.

Improved Tomato Leaf Disease Classification Through Adaptive Ensemble Models with Exponential Moving Average Fusion and Enhanced Weighted Gradient Optimization
Pandiyaraju V, Senthil Kumar A M, Praveen Joe I R, Shravan Venkatraman, Pavan Kumar S, Aravintakshan S A, Abeshek A, Kannan A
Frontiers in Plant Science
paper / abs / bibtex

Ensemble deep learning with optimized weighted gradient techniques enables early and accurate detection of tomato leaf diseases.

Last updated January 2026