resume

Basics

Name Gaurav Sahu
Label AI Researcher
Email gaurav.sahu@mila.quebec
Url https://scholar.google.com/citations?user=nMAt7UMAAAAJ&hl=en
Summary Machine Learning Researcher specializing in self-improving AI agents and scalable ML systems. Experienced in bridging cutting-edge research with production-ready applications, including automated literature discovery, evaluation of scientific works, enterprise data analytics, and multimodal AI. Passionate about transforming research into real-world impact and advancing science.

Work

  • 2025.01 - ongoing
    Postdoctoral Fellow
    Mila - Quebec AI Institute
    Focus: Enhancing Scientific Workflows with AI. Advisors: Prof. Chris Pal, Prof. Laurent Charlin
    • Led the LitLLM project, a novel RAG pipeline for literature search; its "Deep Research" implementation improved recall from 8.2% to 84.4% and reduced hallucinations by 18-26%, enabling researchers to access relevant papers more reliably
    • Created ReviewerToo, a modular, training-free AI-assisted peer review framework achieving near-human accuracy for the task of accepting/rejecting a scientific paper (81.8% vs. 83.9%), in addition to providing systematic assessments
    • Developed AInstein, a self-reflective research agent that autonomously solves AI problems; independently rediscovers 20% of ICLR 2025 approaches and generates 60% novel solutions, demonstrating frontier potential for automated scientific discovery
    • Built an agentic pipeline to curate Wiki-style articles for physics-focused RAG applications, covering over 300k physics papers
  • 2021.09 - 2024.12
    Visiting Researcher
    ServiceNow Research
    Focus: Data Augmentation & Agentic Data Analysis. Advisors: Dzmitry Bahdanau, Issam Laradji
    • Proposed PromptMix, a 2-shot data augmentation strategy outperforming 5-shot and 100-shot text classification methods
    • Developed MixSumm and PPSL for text summarization, matching fully supervised methods with 5% of the labeled data
    • Spearheaded the creation of InsightBench, a comprehensive benchmark with 100 diverse tasks to evaluate data analytics agents, and developed AgentPoirot, a multi-step reasoning agent that autonomously discovers complex insights from data
  • 2018.09 - 2024.12
    Graduate Research
    University of Waterloo
    Focus: Computational Creativity & Multi-modal AI. Advisor: Prof. Olga Vechtomova
    • Proposed a novel, interpretable framework to computationally measure artistic inspiration in poetry and predict aesthetic preferences of creatives, outperforming a 450-shot LLaMA classifier by 18 points on the curated EvocativeLines dataset
    • Trained a bi-modal CVAE+LSTM architecture with an adversarial loss for LyricJam Sonic, a real-time, bi-modal system for music and lyric co-creation; also implemented a BERT filter to improve the coherence of the served lyrics at runtime
    • Designed fusion mechanisms (Auto-Fusion and GAN-Fusion) to adaptively combine multi-modal data sources for improved emotion recognition and machine translation; also worked on the Multi-Modal Discussion Transformer (mDT), which integrates text, image, and graph transformer data to more effectively detect hate speech in Reddit discussions
    • Proposed adversarial alignment for multi-turn dialogs [16] and studied racial and ethnic bias in bots vs. humans
  • 2016.07 - 2018.06
    Undergraduate Research
    IIT Kharagpur
    Focus: Linguistics-grounded NLP. Advisor: Prof. Pawan Goyal
    • Designed program synthesis models for morphological inflection in English, and the first energy-based model for Sanskrit

Education

  • 2020 - 2024
    Waterloo, ON, Canada
    PhD
    University of Waterloo
    Computer Science
    • Thesis: Harnessing Generalist LLMs for Diverse Objective and Subjective NLP Tasks
  • 2018 - 2020
    Waterloo, ON, Canada
    M.Math
    University of Waterloo
    Computer Science
    • Thesis: Adaptive Fusion Techniques for Effective Multimodal Deep Learning
  • 2014 - 2018
    Kharagpur, WB, India
    B.Tech (Hons)
    IIT Kharagpur
    Manufacturing Science and Engineering (Mechanical Engineering)
    • Thesis (in Computer Science): Program Synthesis for Natural Language

Publications

Volunteer

  • 2019 - Present
    Mentor & 3x World Finalist
    Technovation Girls
    Mentored girls aged 8-18 in solving societal problems using technology; teams reached the world finals three times.
  • 2017 - 2018
    Mentor
    Student Welfare Group, IITKGP
    Helped 7 undergraduates successfully navigate their initial years at IIT Kharagpur.

Awards

Skills

Technical Skills
PyTorch
Python
HuggingFace
Scikit-learn
Git
Linux
NLTK
Flask
Pandas
NumPy
vLLM
Wandb
Professional Services
Area Chair: NAACL, EMNLP, ACL
Reviewer: ACL, NAACL, EMNLP, COLM, ACM Multimedia, AAAI

Languages

Hindi
Native Speaker
English
Fluent
Japanese
Intermediate
French
Beginner

Interests

Research Interests
Natural Language Processing
Large Language Models
Generative AI
Synthetic Data Generation