cv
Basics
Name | Gaurav Sahu |
Label | AI Researcher |
Email | gaurav.sahu@mila.quebec |
Url | https://scholar.google.com/citations?user=nMAt7UMAAAAJ&hl=en |
Summary | Machine Learning Researcher specializing in self-improving AI agents and scalable ML systems. Experienced in bridging cutting-edge research with production-ready applications, including automated literature discovery, evaluation of scientific works, enterprise data analytics, and multimodal AI. Passionate about transforming research into real-world impact and advancing science. |
Work
-
2025.01 - ongoing Postdoctoral Fellow
Mila - Quebec AI Institute
Focus: Enhancing Scientific Workflows with AI. Advisors: Prof. Chris Pal, Prof. Laurent Charlin
- Led the LitLLM project, a novel RAG pipeline for literature search, improving recall from 8.2% to 84.4% with a "Deep Research" implementation and reducing hallucinations by 18-26%, enabling researchers to access relevant papers more reliably
- Created ReviewerToo, a modular, training-free AI-assisted peer review framework that achieves near-human accuracy on accept/reject decisions for scientific papers (81.8% vs. 83.9%) and also provides systematic assessments
- Developed AInstein, a self-reflective research agent that autonomously solves AI problems; independently rediscovers 20% of ICLR 2025 approaches and generates 60% novel solutions, demonstrating frontier potential for automated scientific discovery
- Built an agentic pipeline to curate Wiki-style articles for Physics-focused RAG applications covering >300k Physics papers
-
2021.09 - 2024.12 Visiting Researcher
ServiceNow Research
Focus: Data Augmentation & Agentic Data Analysis. Advisors: Dzmitry Bahdanau, Issam Laradji
- Proposed PromptMix, a 2-shot data augmentation strategy outperforming 5-shot and 100-shot text classification methods
- Developed MixSumm and PPSL for text summarization, matching fully supervised methods with 5% of the labeled data
- Spearheaded the creation of InsightBench, a comprehensive benchmark with 100 diverse tasks to evaluate data analytics agents, and developed AgentPoirot, a multi-step reasoning agent that autonomously discovers complex insights from data
-
2018.09 - 2024.12 Graduate Research
University of Waterloo
Focus: Computational Creativity & Multi-modal AI. Advisor: Prof. Olga Vechtomova
- Proposed a novel, interpretable framework to computationally measure artistic inspiration in poetry and predict aesthetic preferences of creatives, outperforming a 450-shot LLaMA classifier by 18 points on the curated EvocativeLines dataset
- Trained a CVAE+LSTM architecture with an adversarial loss for LyricJam Sonic, a real-time, bi-modal system for music and lyric co-creation; also implemented a BERT filter to improve the coherence of the served lyrics at runtime
- Designed fusion mechanisms (Auto-Fusion and GAN-Fusion) to adaptively combine multi-modal data sources for improved emotion recognition and machine translation; also worked on the Multi-Modal Discussion Transformer (mDT), which integrates text, image, and graph transformer data to more effectively detect hate speech in Reddit discussions
- Proposed adversarial alignment for multi-turn dialogs [16] and studied racial and ethnic bias in bots vs. humans
-
2016.07 - 2018.06 Undergraduate Research
IIT Kharagpur
Focus: Linguistics-grounded NLP. Advisor: Prof. Pawan Goyal
- Designed program synthesis models for morphological inflection in English, and the first energy-based model for Sanskrit
Education
-
2020 - 2024 Waterloo, ON, Canada
PhD
University of Waterloo
Computer Science
- Thesis: Harnessing Generalist LLMs for Diverse Objective and Subjective NLP Tasks
-
2018 - 2020 Waterloo, ON, Canada
M.Math
University of Waterloo
Computer Science
- Thesis: Adaptive Fusion Techniques for Effective Multimodal Deep Learning
-
2014 - 2018 Kharagpur, WB, India
B.Tech (Hons)
IIT Kharagpur
Manufacturing Science and Engineering (Mechanical Engineering)
- Thesis (in Computer Science): Program Synthesis for Natural Language
Publications
-
2025 Balancing indeterminacy and structure: Neural text generation for artistic inspiration
International Conference on Computational Intelligence in Music, Sound, Art and Design (Part of EvoStar)
Olga Vechtomova and Gaurav Sahu.
-
2025 InsightBench: Evaluating business analytics agents through multi-step insight generation
ICLR 2025
Gaurav Sahu, Abhay Puri, Juan Rodriguez, Amirhossein Abaskohi, Mohammad Chegini, Alexandre Drouin, Perouz Taslakian, Valentina Zantedeschi, Alexandre Lacoste, David Vazquez, et al.
-
2025 A guide to effectively leveraging LLMs for low-resource text summarization: Data augmentation and semi-supervised approaches
Findings of the Association for Computational Linguistics: NAACL 2025
Gaurav Sahu, Olga Vechtomova, and Issam H Laradji.
-
2025 LitLLMs, LLMs for literature review: Are we there yet?
Transactions on Machine Learning Research
Shubham Agarwal*, Gaurav Sahu*, Abhay Puri*, Issam H Laradji, Krishnamurthy Dj Dvijotham, Jason Stanley, Laurent Charlin, and Christopher Pal.
-
2024 Computational modeling of artistic inspiration: A framework for predicting aesthetic preferences in lyrical lines using linguistic and stylistic features
arXiv preprint arXiv:2410.02881
Gaurav Sahu and Olga Vechtomova.
-
2023 LLM aided semi-supervision for efficient extractive dialog summarization
Findings of the Association for Computational Linguistics: EMNLP 2023
Nishant Mishra, Gaurav Sahu, Iacer Calixto, Ameen Abu-Hanna, and Issam Laradji.
-
2023 PromptMix: A class boundary augmentation method for large language model distillation
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, and Issam Laradji.
-
2022.05 Data augmentation for intent classification with off-the-shelf large language models
Proceedings of the 4th Workshop on NLP for Conversational AI
Gaurav Sahu, Pau Rodriguez, Issam Laradji, Parmida Atighehchian, David Vazquez, and Dzmitry Bahdanau.
Volunteer
-
2019 - Present Mentor & 3x World Finalist
Technovation Girls
Mentored girls aged 8-18 to solve societal problems using technology, with teams reaching the world finals three times.
-
2017 - 2018 Mentor
Student Welfare Group, IITKGP
Helped 7 undergraduates successfully navigate their initial years at IIT Kharagpur.
Awards
- 2025
Best Paper Award
International Conference on Computational Creativity (ICCC)
Built an interpretable framework for modeling inspiration in creative individuals.
- 2023 - 2025
- 2024
- 2020 - 2024
- 2014 - 2018
Recipient of Prime Minister Scholarship
Government of India
For passing the JEE Advanced 2014 examination (< 0.1% acceptance rate)
-
State-level Silver medalist
6th International Mathematics Olympiad (conducted by the Science Olympiad Foundation)
Skills
Technical Skills | PyTorch, Python, HuggingFace, Scikit-learn, Git, Linux, NLTK, Flask, Pandas, NumPy, vLLM, Wandb |
Professional Services | Area Chair: NAACL, EMNLP, ACL; Reviewer: ACL, NAACL, EMNLP, COLM, ACM Multimedia, AAAI |
Languages
Hindi | Native Speaker |
English | Fluent |
Japanese | Intermediate |
French | Beginner |
Interests
Research Interests | Natural Language Processing, Large Language Models, Generative AI, Synthetic Data Generation |