cv

resume

Basics

Name Gaurav Sahu
Label AI Researcher
Email gaurav.sahu@mila.quebec
Url https://scholar.google.com/citations?user=nMAt7UMAAAAJ&hl=en
Summary Machine Learning Researcher with expertise in building AI systems that bridge cutting-edge research with real-world applications. Proven track record in developing agentic systems and LLMs for complex scientific workflows, with demonstrated experience in both exploratory research and production ML infrastructure. Passionate about transforming AI research into impactful products that deliver value at scale.

Work

  • 2025.01 - ongoing
    Postdoctoral Fellow
    Mila - Quebec AI Institute
    Focus: Enhancing Scientific Workflows with AI. Advisors: Prof. Chris Pal, Prof. Laurent Charlin
    • Led LitLLM and deployed a novel RAG pipeline for automated literature discovery with a "Deep Research" variant that boosts retrieval coverage by over 5x and a planning-based generation framework that cuts content hallucination by 18-26%
    • Developed Essence, an AI-based framework for literature-grounded paper analysis and claim verification during peer review
    • Probing frontier AI models to evaluate their ability to rediscover complex scientific concepts in AI/ML from first principles
    • Building an agentic pipeline to curate a knowledge base of Wiki-style articles for Physics-focused RAG applications
  • 2021.09 - 2024.12
    Visiting Researcher
    ServiceNow Research
    Focus: Data Augmentation & Agentic Data Analysis. Advisors: Dzmitry Bahdanau, Issam Laradji
    • Proposed PromptMix, a 2-shot data augmentation strategy outperforming 5-shot and 100-shot text classification methods
    • Developed MixSumm and PPSL for text summarization, matching fully supervised methods with 5% of the labeled data
    • Spearheaded the creation of InsightBench, a comprehensive benchmark with 100 diverse tasks to evaluate data analytics agents, and developed AgentPoirot, a multi-step reasoning agent that autonomously discovers complex insights from data
  • 2018.09 - 2024.12
    Graduate Research
    University of Waterloo
    Focus: Computational Creativity & Multi-modal AI. Advisor: Prof. Olga Vechtomova
    • Proposed a novel, interpretable framework to computationally measure artistic inspiration in poetry and predict aesthetic preferences of creatives, outperforming a 450-shot LLaMA classifier by 18 points on the curated EvocativeLines dataset
    • Trained a CVAE+LSTM architecture with an adversarial loss for LyricJam Sonic, a real-time, bi-modal system for music and lyric co-creation; also implemented a BERT filter to improve the coherence of the served lyrics at runtime
    • Designed fusion mechanisms (Auto-Fusion and GAN-Fusion) to adaptively combine multi-modal data sources for improved emotion recognition and machine translation; also worked on the Multi-Modal Discussion Transformer (mDT), which integrates text, image, and graph transformer data to more effectively detect hate speech in Reddit discussions
    • Proposed adversarial alignment for multi-turn dialogs [16] and studied racial and ethnic bias in bots vs. humans
  • 2016.07 - 2018.06
    Undergraduate Research
    IIT Kharagpur
    Focus: Linguistics-grounded NLP. Advisor: Prof. Pawan Goyal
    • Designed program synthesis models for morphological inflection in English, and the first energy-based model for Sanskrit

Education

  • 2020 - 2024

    Waterloo, ON, Canada

    PhD
    University of Waterloo
    Computer Science
    • Thesis: Harnessing Generalist LLMs for Diverse Objective and Subjective NLP Tasks
  • 2018 - 2020

    Waterloo, ON, Canada

    M.Math
    University of Waterloo
    Computer Science
    • Thesis: Adaptive Fusion Techniques for Effective Multimodal Deep Learning
  • 2014 - 2018

    Kharagpur, WB, India

    B.Tech (Hons)
    IIT Kharagpur
    Manufacturing Science and Engineering (Mechanical Engineering)
    • Thesis (in Computer Science): Program Synthesis for Natural Language

Publications

Volunteer

  • 2019 - Present
    Mentor & 3x World Finalist
    Technovation Girls
    Mentored girls aged 8 to 18 in solving societal problems with technology; teams reached the world finals three times.
  • 2017 - 2018
    Mentor
    Student Welfare Group, IITKGP
    Helped 7 undergraduates successfully navigate their initial years at IIT Kharagpur.

Awards

Skills

Technical Skills
PyTorch
Python
HuggingFace
Scikit-learn
Git
Linux
NLTK
Flask
Pandas
Numpy
vLLM
Wandb
Professional Services
Area Chair: NAACL, EMNLP, ACL
Reviewer: ACL, NAACL, EMNLP, COLM, ACM Multimedia, AAAI

Languages

Hindi
Native Speaker
English
Fluent
Japanese
Intermediate
French
Beginner

Interests

Research Interests
Natural Language Processing
Large Language Models
Generative AI
Synthetic Data Generation