Medha Sawhney
Computer Science Ph.D. Student at Virginia Tech
I am a PhD student at the Knowledge Guided Machine Learning Lab advised by Dr. Anuj Karpatne.
My research focuses on generative AI for complex scientific systems. In particular, I work primarily with diffusion models and vision-language systems, studying how generative models can learn and enforce domain-specific structures, such as physical laws or motion dynamics, from images and videos.
I’ve contributed to diverse domains through both independent and collaborative projects. My research works include:
- Diffusion models for solving PDEs in sparse, noisy, and out-of-distribution settings via physics-informed learning, with extensions to super-resolution and inpainting.
- Invertible neural networks & normalizing flows for full waveform inversion, exploring latents for reconstruction vs. manifold-learning. (GFI-ICLR’25)
- Object tracking for biomedical applications, including tiny object tracking and motion detection from noisy complex images, videos for cancer research, and inverse modeling of cellular forces.(MEMTrack-AISY’24, DLFM-PNAS’25)
- Foundation models for aquatic science, modeling lake dynamics from real-world sensor data, with a focus on pretraining strategies and handling highly sparse observational data. (LakeFM-ICMLW’25)
- Vision-language models for open-world scene graph generation (OWSGG-CVPRW’25), biological trait prediction (VLM4Bio-NeurIPS’24), and equation discovery from visual inputs.
During my PhD, I have interned as a Machine Learning Engineer Intern (MLE) at Twitter, NVIDIA and Amazon. Prior to starting my PhD, I worked as a full-time ML Engineer at HP, and hold a Bachelor’s in Electronics and Communication Engineering from Manipal Institute of Technology, Manipal, India.
Feel free to get in touch if you would like discuss, collaborate, and connect :)
News
| Oct 13, 2025 | |
|---|---|
| Oct 06, 2025 | Our work “Investigating PDE Residual Attentions in Frequency Space for Diffusion Neural Operators” has been accepted at ML4Physics Workshop, NeurIPS 2025! |
| Jun 03, 2025 | Our work on “Scientific Foundation models” has been accepted at ICML 2025 Workshop - Toward Scientific Foundation Models for Aquatic Ecosystems! |
| May 29, 2025 | Our paper on “Open World Scene Graph Generation using Vision Language Models” got accepted at CV in Wild workshop at CVPR 2025! |
| May 22, 2025 | Two posters accepted at CVPR 2025 Workshop - 1) “Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs” , 2) “Scientific Equation Discovery using Modular Symbolic Regression via Vision-Language Guidance”! Excited to share that we’ll also be giving a lightning talk at the workshop—looking forward to the discussion! |
| Jan 22, 2025 | |
| Oct 26, 2024 | |
| Oct 15, 2024 | |
| May 20, 2024 | |
| Feb 16, 2024 | |
| Jan 17, 2024 | |
| Oct 18, 2023 | |
| May 20, 2023 | |
| Apr 06, 2023 | |
| Oct 25, 2022 | |
| Jun 21, 2022 | |
| Jun 06, 2022 | |
| Aug 21, 2021 | |
Selected Publications
- Under Review
Beyond Loss Guidance: Using PDE Residuals as Spectral Attention in Diffusion Neural OperatorsUnder Review, 2025 - CVPR W 2025
Open World Scene Graph Generation using Vision Language ModelsCV in the Wild Workshop, CVPR, 2025CVPR 2025 Workshop (CV in the Wild) - NeurIPS W 2025
Investigating PDE Residual Attentions in Frequency Space for Diffusion Neural OperatorsML4Physics Workshop, NeurIPS, 2025Poster Presentation at ML4Physics Workshop - ICML W 2025
Toward Scientific Foundation Models for Aquatic EcosystemsUnder Review, 2025ICML 2025 Workshop (Foundation Models for Structured Data)