WattBot 2025: Estimating AI Emissions with RAG

Projects

ML Marathon

MLM25

RAG

Retrieval

LLM

NLP

Sustainability

Energy

GenAI

Author

Chris Endemann

Published

September 9, 2025

WattBot was a challenge in the 2025 Machine Learning Marathon (MLM25). Teams built retrieval-augmented generation (RAG) systems to extract credible, citation-backed emissions and cost estimates for AI workloads from a corpus of 35+ peer-reviewed papers and 300+ curated Q&A pairs. Systems were expected to return citation-grounded answers or explicitly abstain when evidence was missing – promoting transparency and reproducibility in sustainability reporting.

Challenge design

Task: Given a natural-language question about AI energy use, water consumption, or carbon emissions, retrieve relevant passages from the provided corpus and generate a citation-backed answer.
Evaluation: Answers were scored on factual accuracy, proper citation, and appropriate abstention when evidence was insufficient.
Corpus: 35+ academic papers covering AI sustainability, energy benchmarking, and environmental impact reporting.

Winning approach

The winning solution by KohakuBlueleaf used a RAG pipeline that was later replicated and deployed in both AWS Bedrock and locally with open-source Hugging Face models. See the follow-up talk below for deployment details.

WattBot 2025: Estimating AI Emissions with RAG

Challenge design

Winning approach

Links

Comments

Challenge design

Winning approach

Links

Related resources

Comments