Medha Sawhney

Computer Science Ph.D. Student at Virginia Tech

prof_pic.jpg

I am a PhD student at the Knowledge Guided Machine Learning Lab advised by Dr. Anuj Karpatne.

I work on generative AI (Diffusion models, VLMs) to tackle complex scientific tasks such as solving and discovering equations through vision-based and multimodal learning.

Broadly, my research lies at the intersection of AI for Science: developing machine learning to accelerate scientific discovery, and Science for AI: using scientific principles to enhance model robustness and generalization. I’ve contributed to diverse domains including physics, geophysics, biology, and aquatic sciences through both independent and collaborative projects. My research works include:

  • Diffusion models for solving PDEs in sparse, noisy, and out-of-distribution settings via physics-informed learning, with extensions to super-resolution and inpainting.
  • Invertible neural networks & normalizing flows for full waveform inversion in geophysics, exploring latents for reconstruction vs. manifold-learning. (GFI’25)
  • Object tracking for biomedical applications, including tiny object tracking and motion detection from noisy complex images, videos for cancer research, and inverse modeling of cellular forces.(MEMTRACK’24 , DLFM’22)
  • Foundation models for aquatic science, modeling lake dynamics from real-world sensor data, with a focus on pretraining strategies and handling highly sparse observational data. (LakeFM’25)
  • Vision-language models for open-world scene graph generation (OW-SGG’25), biological trait prediction (VLM4Bio’24), and equation discovery from visual inputs, addressing challenges in prompting and generating structured outputs.

During my PhD, I have interned as a Machine Learning Engineer Intern (MLE) at Twitter and NVIDIA. Prior to starting my PhD, I worked as a full-time ML Engineer at HP, and hold a Bachelor’s in Electronics and Communication Engineering from Manipal Institute of Technology, Manipal, India.

Feel free to get in touch if you would like discuss, collaborate and connect :)

News

Jun 03, 2025 Our work on “Scientific Foundation models” has been accepted at ICML 2025 Workshop - Toward Scientific Foundation Models for Aquatic Ecosystems!
May 29, 2025 Our paper on “Open World Scene Graph Generation using Vision Language Models” got accepted at CV in Wild workshop at CVPR 2025!
May 22, 2025 Two posters accepted at CVPR 2025 Workshop - 1) “Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs” , 2) “Scientific Equation Discovery using Modular Symbolic Regression via Vision-Language Guidance”! Excited to share that we’ll also be giving a lightning talk at the workshop—looking forward to the discussion!
Jan 22, 2025 :tada: I finally have an ICLR paper! Thrilled to share that our paper, “A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations”, has been accepted to ICLR 2025! [:page_facing_up: paper] [:computer: project page]
Oct 26, 2024 :rocket: Excited to announce that our paper, “VLM4Bio: A New Benchmark Dataset for Vision-Language Models”, has been accepted to NeurIPS 2024! This work introduces a comprehensive benchmark tailored for evaluating Vision-Language Models in the context of biological data. [:page_facing_up: paper] [:computer: code]
Oct 15, 2024 :scroll: New preprint available “A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations.” [preprint]
May 20, 2024 :star_struck: Excited to be interning at NVIDIA as a Deep Learning Automation Intern this Summer!
Feb 16, 2024 :sparkles: I’ll be delivering a lightning talk at the Imageomics workshop, AAAI’24 Conference in Vancouver, Canada!
Jan 17, 2024 :tada: Our paper “MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments was accepted to Advanced Intelligent Systems”! [paper] [code]
Oct 18, 2023 :scroll: New preprint available “MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments.” [preprint] [code]
May 20, 2023 :scroll: Excited to have received the opportunity to present my work on Bacteria Tracking at CVPR 2023 as part of the CV4Animals Workshop! :smile: [poster] [workshop website]
Apr 06, 2023 :woman_student: Embarked on a Ph.D. Journey at Virginia Tech!
Oct 25, 2022 :scroll: New preprint available “Deep Learning Enabled Label-free Cell Force Computation in Deformable Fibrous Environments.” [preprint]
Jun 21, 2022 :sparkles: Interning at Twitter as a Machine Learning Engineering Intern this Summer!
Jun 06, 2022 :trophy: Honored to be the recipient of the Grace Hopper Celebration (GHC) 2022 Student Scholarship awarded by CS@VT and AnitaB.org!
Aug 21, 2021 :books: Joined the MS in Computer Science Program at Virginia Tech

Selected Publications

  1. ICLR 2025
    openfwi.jpg
    A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations
    Naveen Gupta*Medha Sawhney*, Arka Daw, and 2 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  2. CVPR 2025
    Physics-guided Diffusion Neural Operators for Solving Forward and Inverse PDEs
    Medha Sawhney, Abhilash Neog, Mridul Khurana, and 3 more authors
    2025
    Oral + Poster Presentation at CV4Science Workshop
  3. CVPR 2025
    owsgg.png
    Open World Scene Graph Generation using Vision Language Models
    Amartya Dutta, Kazi Sajeed Mehrab*Medha Sawhney*, and 8 more authors
    2025
    CVPR 2025 Workshop (CV in the Wild)
  4. ICML 2025
    LakeFM.png
    Toward Scientific Foundation Models for Aquatic Ecosystems
    Abhilash Neog, Medha Sawhney, K.S. Mehrab, and 11 more authors
    2025
    Oral + Poster Presentation at ICML 2025 Workshop (Foundation Models for Structured Data)