Medha Sawhney

I am a PhD student at the Knowledge Guided Machine Learning Lab advised by Dr. Anuj Karpatne.

I work on generative AI (Diffusion models, VLMs) to tackle complex scientific tasks such as solving and discovering equations through vision-based and multimodal learning.

Broadly, my research lies at the intersection of AI for Science: developing machine learning to accelerate scientific discovery, and Science for AI: using scientific principles to enhance model robustness and generalization. I’ve contributed to diverse domains including physics, geophysics, biology, and aquatic sciences through both independent and collaborative projects. My research works include:

Diffusion models for solving PDEs in sparse, noisy, and out-of-distribution settings via physics-informed learning, with extensions to super-resolution and inpainting.
Invertible neural networks & normalizing flows for full waveform inversion in geophysics, exploring latents for reconstruction vs. manifold-learning. (GFI’25)
Object tracking for biomedical applications, including tiny object tracking and motion detection from noisy complex images, videos for cancer research, and inverse modeling of cellular forces.(MEMTRACK’24 , DLFM’22)
Foundation models for aquatic science, modeling lake dynamics from real-world sensor data, with a focus on pretraining strategies and handling highly sparse observational data. (LakeFM’25)
Vision-language models for open-world scene graph generation (OW-SGG’25), biological trait prediction (VLM4Bio’24), and equation discovery from visual inputs, addressing challenges in prompting and generating structured outputs.

During my PhD, I have interned as a Machine Learning Engineer Intern (MLE) at Twitter and NVIDIA. Prior to starting my PhD, I worked as a full-time ML Engineer at HP, and hold a Bachelor’s in Electronics and Communication Engineering from Manipal Institute of Technology, Manipal, India.

Feel free to get in touch if you would like discuss, collaborate and connect :)

News

Jun 03, 2025	Our work on “Scientific Foundation models” has been accepted at ICML 2025 Workshop - Toward Scientific Foundation Models for Aquatic Ecosystems!
May 29, 2025	Our paper on “Open World Scene Graph Generation using Vision Language Models” got accepted at CV in Wild workshop at CVPR 2025!
May 22, 2025	Two posters accepted at CVPR 2025 Workshop - 1) “Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs” , 2) “Scientific Equation Discovery using Modular Symbolic Regression via Vision-Language Guidance”! Excited to share that we’ll also be giving a lightning talk at the workshop—looking forward to the discussion!
Jan 22, 2025	I finally have an ICLR paper! Thrilled to share that our paper, “A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations”, has been accepted to ICLR 2025! [ paper] [ project page]
Oct 26, 2024	Excited to announce that our paper, “VLM4Bio: A New Benchmark Dataset for Vision-Language Models”, has been accepted to NeurIPS 2024! This work introduces a comprehensive benchmark tailored for evaluating Vision-Language Models in the context of biological data. [ paper] [ code]
Oct 15, 2024	New preprint available “A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations.” [preprint]
May 20, 2024	Excited to be interning at NVIDIA as a Deep Learning Automation Intern this Summer!
Feb 16, 2024	I’ll be delivering a lightning talk at the Imageomics workshop, AAAI’24 Conference in Vancouver, Canada!
Jan 17, 2024	Our paper “MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments was accepted to Advanced Intelligent Systems”! [paper] [code]
Oct 18, 2023	New preprint available “MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments.” [preprint] [code]
May 20, 2023	Excited to have received the opportunity to present my work on Bacteria Tracking at CVPR 2023 as part of the CV4Animals Workshop! [poster] [workshop website]
Apr 06, 2023	Embarked on a Ph.D. Journey at Virginia Tech!
Oct 25, 2022	New preprint available “Deep Learning Enabled Label-free Cell Force Computation in Deformable Fibrous Environments.” [preprint]
Jun 21, 2022	Interning at Twitter as a Machine Learning Engineering Intern this Summer!
Jun 06, 2022	Honored to be the recipient of the Grace Hopper Celebration (GHC) 2022 Student Scholarship awarded by CS@VT and AnitaB.org!
Aug 21, 2021	Joined the MS in Computer Science Program at Virginia Tech

Selected Publications

ICLR 2025

A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations

Naveen Gupta^*, Medha Sawhney^*, Arka Daw, and 2 more authors

In The Thirteenth International Conference on Learning Representations, 2025

arXiv PDF Code Website
CVPR 2025

Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs

Medha Sawhney, Abhilash Neog, Mridul Khurana, and 3 more authors

2025

Oral + Poster Presentation at CV4Science Workshop
CVPR 2025

Open World Scene Graph Generation using Vision Language Models

Amartya Dutta, Kazi Sajeed Mehrab^*, Medha Sawhney^*, and 8 more authors

2025

CVPR 2025 Workshop (CV in the Wild)

arXiv
ICML 2025

Toward Scientific Foundation Models for Aquatic Ecosystems

Abhilash Neog, Medha Sawhney, K.S. Mehrab, and 11 more authors

2025

Oral + Poster Presentation at ICML 2025 Workshop (Foundation Models for Structured Data)

PDF