Medha Sawhney
Computer Science Ph.D. Student at Virginia Tech

I am a PhD student at the Knowledge Guided Machine Learning Lab advised by Dr. Anuj Karpatne.
I work on generative AI (Diffusion models, VLMs) to tackle complex scientific tasks such as solving and discovering equations through vision-based and multimodal learning.
Broadly, my research lies at the intersection of AI for Science: developing machine learning to accelerate scientific discovery, and Science for AI: using scientific principles to enhance model robustness and generalization. I’ve contributed to diverse domains including physics, geophysics, biology, and aquatic sciences through both independent and collaborative projects. My research works include:
- Diffusion models for solving PDEs in sparse, noisy, and out-of-distribution settings via physics-informed learning, with extensions to super-resolution and inpainting.
- Invertible neural networks & normalizing flows for full waveform inversion in geophysics, exploring latents for reconstruction vs. manifold-learning. (GFI’25)
- Object tracking for biomedical applications, including tiny object tracking and motion detection from noisy complex images, videos for cancer research, and inverse modeling of cellular forces.(MEMTRACK’24 , DLFM’22)
- Foundation models for aquatic science, modeling lake dynamics from real-world sensor data, with a focus on pretraining strategies and handling highly sparse observational data. (LakeFM’25)
- Vision-language models for open-world scene graph generation (OW-SGG’25), biological trait prediction (VLM4Bio’24), and equation discovery from visual inputs, addressing challenges in prompting and generating structured outputs.
During my PhD, I have interned as a Machine Learning Engineer Intern (MLE) at Twitter and NVIDIA. Prior to starting my PhD, I worked as a full-time ML Engineer at HP, and hold a Bachelor’s in Electronics and Communication Engineering from Manipal Institute of Technology, Manipal, India.
Feel free to get in touch if you would like discuss, collaborate and connect :)
News
Jun 03, 2025 | Our work on “Scientific Foundation models” has been accepted at ICML 2025 Workshop - Toward Scientific Foundation Models for Aquatic Ecosystems! |
---|---|
May 29, 2025 | Our paper on “Open World Scene Graph Generation using Vision Language Models” got accepted at CV in Wild workshop at CVPR 2025! |
May 22, 2025 | Two posters accepted at CVPR 2025 Workshop - 1) “Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs” , 2) “Scientific Equation Discovery using Modular Symbolic Regression via Vision-Language Guidance”! Excited to share that we’ll also be giving a lightning talk at the workshop—looking forward to the discussion! |
Jan 22, 2025 | ![]() ![]() ![]() |
Oct 26, 2024 | ![]() ![]() ![]() |
Oct 15, 2024 | ![]() |
May 20, 2024 | ![]() |
Feb 16, 2024 | ![]() |
Jan 17, 2024 | ![]() |
Oct 18, 2023 | ![]() |
May 20, 2023 | ![]() ![]() |
Apr 06, 2023 | ![]() |
Oct 25, 2022 | ![]() |
Jun 21, 2022 | ![]() |
Jun 06, 2022 | ![]() |
Aug 21, 2021 | ![]() |
Selected Publications
- CVPR 2025Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs2025Oral + Poster Presentation at CV4Science Workshop
- CVPR 2025Open World Scene Graph Generation using Vision Language Models2025CVPR 2025 Workshop (CV in the Wild)
- ICML 2025Toward Scientific Foundation Models for Aquatic Ecosystems2025Oral + Poster Presentation at ICML 2025 Workshop (Foundation Models for Structured Data)