VLLM Inferencing for Long Videos
CVIT, IIIT Hyderabad · 2026
Investigated the VLLM Codebase to ensure zero to minimal quality degradation in video inference.
RoadTones
CVIT · IIIT Hyderabad · 2025
Built a dataset-model-evaluation stack to enable tone-controllable video captioning (accepted at CVPR Findings 2026).
Image Tampering Detector
Bachelor's Thesis · TBVL, IISER Bhopal · 2025
Integrated Discrete Wavelet Transform with a sub-band attention layer over an existing transformer architecture backbone (SAFIRE) for higher accuracy within fewer training epochs.
Monocular Depth Estimation in Dim Light Road Scenes
IISER Bhopal · 2024
Added denoising and deblurring layers ahead of a DepthAnything-V2 backbone to enhance depth estimation in noisy, blurry night-time footage.
3D Medical Super-Resolution
MiRL, IIT Madras · 2024
Investigated GANs, Diffusion, and NeRF-based super-resolution methods to enhance z-axis resolution in CT and MRI volumes for sharper diagnostics.
ICU Mortality Prediction
IISER Bhopal · 2023
Binary-classification pipeline using classical supervised ML to flag critical mortality indicators for heart-failure patients admitted in ICU.