Calculating AI emissions for CSRD

In this guide

Why AI emissions belong in your CSRD report
Step 1 — Pull token usage data from provider APIs
Step 2 — Apply an emissions methodology
Step 3 — Apply grid carbon intensity by data centre region
Step 4 — Format as a Scope 3 Category 1 line item
The transparency principle: modelled, not measured
Automating the pipeline

Why AI emissions belong in your CSRD report

Under the EU Corporate Sustainability Reporting Directive (CSRD), companies report against the European Sustainability Reporting Standards — specifically ESRS E1 (Climate change), which requires disclosure of gross Scope 1, 2 and 3 greenhouse gas emissions using GHG Protocol methodology.

CSRD creates no separate "AI emissions" category. AI workloads slot into existing scopes:

Scope 2 — electricity purchased to run AI on infrastructure you control (your own data centre, a co-location lease).
Scope 3, Category 1 — Purchased goods and services — the most common case for enterprise AI users who consume models via API or SaaS.

This matters now because AI electricity consumption is growing faster than most corporate energy budgets, regulators are beginning to ask pointed questions about it, and third-party assurance providers are starting to require traceable methodology for IT-related Scope 3 estimates. Getting this right in 2025–2026 is early-mover positioning, not compliance theatre.

Important transparency note No major AI provider — including OpenAI, Anthropic, Google, or Microsoft — currently publishes per-query or per-token energy or emissions data. Every number you produce for CSRD reporting is a modelled estimate. Stating this clearly is not a weakness; it is the credibility signal that separates a defensible disclosure from a made-up figure.

Step 1 — Pull token usage data from provider APIs

Token and request counts are the most granular activity data you can actually obtain. They are the starting point for all downstream calculations.

What each major provider exposes

Provider	Where to get usage data	Granularity	Notes
OpenAI	/v1/usage endpoint; dashboard export; per-request headers	Input + output tokens, model, timestamp	Usage API gives daily aggregates; per-call tokens returned in API response body
Anthropic	Console Usage page; x-anthropic-* response headers; Billing API (beta)	Input + output tokens, model ID, timestamp	Cache-read tokens reported separately — lower energy cost; account for this
Azure OpenAI	Azure Monitor Metrics; Cost Management + Billing; Log Analytics workspace	Processed Inference Tokens metric; request count	Model deployment name links to specific model version — important for emissions factors
Microsoft Copilot	Microsoft 365 admin centre; Viva Insights; Purview Audit	Activity counts (not tokens); user-level in some plans	No token data exposed publicly; use request counts × assumed average token length per product feature
Google Vertex AI / Gemini	Cloud Monitoring; BigQuery billing export	Character count (not tokens); request count	Divide characters by ~4 to approximate tokens for emission factor application

Recommended data collection approach

For a defensible annual CSRD disclosure, collect data at three levels of precision, working down from what you have:

Per-call token logs — ideal; capture input tokens, output tokens, model name, and Unix timestamp from the API response for every call your application makes. Store in a data warehouse.
Monthly API billing exports — acceptable fallback; most providers show token volume in billing breakdowns even when the usage API is not integrated.
Cost-based back-calculation — last resort only; divide spend by published pricing per million tokens to estimate token volume. Document clearly as an approximation.

# Example: pull Anthropic monthly token totals via console export
# Then aggregate by model for the reporting period

import pandas as pd

df = pd.read_csv("anthropic_usage_export.csv")

# Group by model, sum input and output tokens over fiscal year
summary = (
  df
  .groupby("model")[["input_tokens", "output_tokens"]]
  .sum()
  .reset_index()
)
print(summary)

Record the reporting period, the data source, and any gaps or exclusions. This documentation is what an assurance provider will ask for first.

Step 2 — Apply an emissions methodology

Two credible open methodologies exist for translating token counts into energy and CO₂e estimates. Use them in combination — or choose the one best suited to your provider mix.

Methodology A: EcoLogits (open-source)

EcoLogits is an open-source Python library developed by Boavizta that wraps major AI provider clients and returns energy and GHG estimates alongside every API response. It uses a hardware-based model of GPU energy consumption, normalised by model size and utilisation assumptions.

The core inference energy formula it applies:

EcoLogits core formula E_inference = (T_output / S_tokens) × P_GPU × n_GPU × PUE

Where:

T_output = output tokens generated
S_tokens = model throughput (tokens/second) per GPU
P_GPU = TDP of GPU type (watts)
n_GPU = number of GPUs serving the model
PUE = data centre Power Usage Effectiveness

EcoLogits ships with default parameter tables for GPT-4, Claude 3, Gemini, Mistral, and Llama variants, derived from published model cards, academic benchmarks, and hardware specs. You can override these for proprietary models.

from ecologits import EcoLogits
from anthropic import Anthropic

EcoLogits.init()  # patches supported clients automatically
client = Anthropic()

response = client.messages.create(
  model="claude-3-5-sonnet-20241022",
  max_tokens=1024,
  messages=[{"role": "user", "content": "Summarise our Q3 risk register."}]
)

# EcoLogits attaches impacts to the response object
print(response.impacts.energy.value)   # kWh
print(response.impacts.gwp.value)      # kg CO2e

For batch CSRD calculations where you have historical token data rather than live API calls, use the functional interface directly:

from ecologits.tracers.utils import llm_impacts

impacts = llm_impacts(
  provider="anthropic",
  model_name="claude-3-5-sonnet-20241022",
  output_token_count=850,       # average per call
  request_latency=2.4,            # seconds
  electricity_mix_zone="US"     # ISO 3166-1 or region code
)

# Scale to annual volume
annual_calls = 2_400_000
total_kwh = impacts.energy.value * annual_calls
total_co2e_kg = impacts.gwp.value * annual_calls

Methodology B: Green Software Foundation Software Carbon Intensity (SCI) for AI — 2025 specification

The Green Software Foundation ratified its SCI for AI specification in 2025. It standardises how to express the carbon intensity of an AI service as a rate — carbon emitted per functional unit — making it consistent with ESRS E1's emphasis on intensity metrics alongside absolute figures.

The SCI equation:

GSF SCI for AI SCI = (E × I) + M
────────────────
R

E = energy consumed (kWh) by the AI workload
I = location-based marginal carbon intensity of electricity (kg CO₂e / kWh)
M = embodied carbon of hardware, amortised over use share (kg CO₂e)
R = functional unit — e.g. per 1,000 tokens, per API call, per user per month

The SCI's value for CSRD is that it gives you a per-unit intensity figure you can disclose and track over time — useful in the qualitative narrative of your ESRS E1 report to show efficiency improvements even if absolute usage grows.

For absolute Scope 3 totals (what ESRS E1-6 actually mandates), you still need E × I + M summed across all calls in the reporting year. SCI and EcoLogits are complementary, not competing.

Which to use when

Situation	Recommended approach
Live production system, integrating now	EcoLogits SDK — attach to every API call, log impacts to data warehouse
Retrospective annual calculation from token exports	EcoLogits functional interface (llm_impacts)
Intensity metric for year-on-year benchmarking	GSF SCI — report as kg CO₂e per 1,000 output tokens
Multi-vendor normalisation	GSF SCI framework with EcoLogits providing E values per vendor

Step 3 — Apply grid carbon intensity by data centre region

The single biggest variable in AI emissions calculations is where the inference runs. A query routed to a coal-heavy grid emits roughly 8–10× more CO₂e than one running on a predominantly renewables grid. Getting the regional factor right matters.

Data sources for grid intensity

Source	Coverage	Update frequency	Use for
Electricity Maps API	70+ countries, sub-national US/Europe	Hourly marginal & average	Live or historical location-based intensity; best accuracy
IEA World Energy Statistics	Country-level	Annual	Annual average figures; ESRS-acceptable as a recognised source
EPA eGRID (US)	US sub-regional (NERC regions)	Annual (2-year lag)	US-domiciled workloads on Azure/AWS/GCP US regions
Cloud provider carbon tools	Their own region fleet	Varies (often quarterly)	Market-based estimates, useful for Scope 2 analogue; less good for Scope 3

Mapping AI providers to regions

Most providers do not tell you which physical data centre handled your request. Use these defaults, documented clearly in your methodology:

Provider / product	Primary inference region	Suggested grid factor (location-based)	Source
OpenAI API (default)	US-East (Virginia / Iowa)	~0.35–0.38 kg CO₂e/kWh	EPA eGRID RFCE / MROW average
Anthropic API	US-East (AWS us-east-1)	~0.35 kg CO₂e/kWh	EPA eGRID RFCE
Azure OpenAI (Sweden Central)	Sweden	~0.008 kg CO₂e/kWh	IEA / Electricity Maps SE
Azure OpenAI (East US)	US-East	~0.35 kg CO₂e/kWh	EPA eGRID RFCE
Google Vertex AI (us-central1)	Iowa	~0.35 kg CO₂e/kWh	EPA eGRID MROW

ESRS E1 requirement ESRS E1-6 requires you to disclose both a location-based and a market-based Scope 2 figure. For AI consumed as a Scope 3 service, location-based methodology is the relevant analogue — use the grid intensity of the region where the model physically runs, not a RECs-adjusted figure you do not control.

Handling multi-region workloads

If your AI usage spans multiple regions or providers, calculate emissions separately per region-model combination, then sum:

Multi-region sum Total CO₂e = Σ (Tokens_i × EF_{model_i} × GridIntensity_{region_i})

For each provider–model–region combination

Step 4 — Format as a Scope 3 Category 1 line item

The output of Steps 1–3 needs to be shaped into something that sits cleanly inside your ESRS E1 disclosure. Here is how to structure it.

Where it sits in ESRS E1

For most companies using AI as a purchased service:

ESRS E1-6, Disclosure Requirement E1-6 — Gross Scope 3 GHG emissions, Category 1 (Purchased goods and services)
AI emissions are part of Category 1 — they do not need their own ESRS line item, but you may break them out in supplementary disclosures or the narrative sections of ESRS E1
If AI electricity runs on infrastructure you control, classify as Scope 2 instead

What to document

An auditable CSRD disclosure for AI Scope 3 Category 1 emissions should contain:

Activity data — total tokens consumed by provider, model, and reporting period
Emissions methodology — e.g. "EcoLogits v0.x functional interface, applying hardware-based GPU energy model; GSF SCI specification for intensity metrics"
Emission factors — per-query energy factors by model (from EcoLogits parameter tables, cited by version); grid intensity by region (source, year)
PUE assumption — the data centre PUE applied (EcoLogits default is 1.2; document if you override it)
Confidence / uncertainty range — provide a range (e.g. ±40%) reflecting uncertainty in hardware parameters and region allocation
Explicit modelled-not-measured statement — required for credibility; see section below

Worked example disclosure entry

Illustrative ESRS E1-6 supplementary note — AI (Scope 3 Cat. 1) Reporting period: FY 2025 (1 Jan – 31 Dec)

Activity data:
OpenAI GPT-4o → 1.24 billion tokens (input + output)
Anthropic Claude 3.5 → 0.87 billion tokens
Azure Copilot → ~14 million requests (no token data; avg assumed 1,200 tokens)

Methodology: EcoLogits v0.6 hardware-based model
Grid factors: EPA eGRID RFCE 2024 (0.352 kg CO₂e/kWh)
PUE: 1.20 (EcoLogits default)

Result:
Estimated operational emissions → 18.4 t CO₂e
Embodied hardware (amortised) → 2.1 t CO₂e
Total AI — Scope 3, Cat. 1 → 20.5 t CO₂e (range: 12–34 t)

Note: All figures are modelled estimates. No provider supplied measured per-query energy data for this reporting period.

The transparency principle: modelled, not measured

This is not a caveat to hide in footnotes. It should be stated plainly in the body of your methodology disclosure.

Here is why this is the right approach — and why experienced assurance providers will respect it:

Every estimate in corporate GHG accounting involves assumptions. Scope 3 Category 1 for physical goods already relies on spend-based or average emission factors that are rougher than most AI estimates.
Claiming measured precision you do not have is an audit failure waiting to happen. Stating uncertainty ranges is standard GHG Protocol practice.
The existence of recognised open methodologies (EcoLogits, GSF SCI) means your estimates are not arbitrary — they are traceable to published, peer-reviewed parameter sets.
Providers' own sustainability reports (where they exist) use the same modelling approaches for AI workloads. You are doing what the industry does.

Suggested disclosure language "AI-related greenhouse gas emissions are estimated using modelled methodology (EcoLogits v0.x / GSF SCI AI specification). No AI service provider currently supplies measured per-query energy or emissions data. Estimates carry an uncertainty range of approximately ±40%, primarily driven by assumptions about hardware utilisation rates and data centre location. Methodology details are provided in the Appendix."

Regulators and assurance bodies accept methodological uncertainty when it is disclosed honestly, bounded quantitatively, and accompanied by a credible improvement plan — such as committing to integrate live EcoLogits instrumentation into production systems by the following reporting year.

Automating the pipeline

The four steps above are entirely automatable. Once the pipeline runs in production, the marginal cost of producing CSRD-ready AI emissions data approaches zero.

A minimal production architecture

Instrumented API wrapper
Wrap all outbound AI API calls (OpenAI, Anthropic, Azure) in a thin middleware layer. On each response, call llm_impacts() and log impact metadata — kWh, kg CO₂e, model, timestamp, region — to a structured log or your data warehouse alongside the business transaction ID.
Monthly aggregation job
A scheduled query (SQL or dbt model) aggregates logged impact data by reporting period, provider, and model. Output: a flat table with columns [period, provider, model, region, total_kwh, total_co2e_kg, input_tokens, output_tokens, call_count].
Grid intensity refresh
An annual (or more frequent) job pulls updated grid intensity factors from Electricity Maps or EPA eGRID for each region in use and writes them to a reference table. Calculation jobs always join to the reference table rather than hard-coding factors.
CSRD reporting export
A parameterised report template (SQL + Python or a BI tool) takes the aggregation table, applies embodied hardware factors, outputs the Scope 3 Category 1 figure with confidence range, and generates the methodology appendix text in structured form ready for the sustainability report.

This architecture also produces the data lineage trace that CSRD's assurance requirements are increasingly demanding — a clear, auditable path from raw API response to reported CO₂e figure.

The tooling landscape

Several commercial platforms now offer to automate this pipeline with provider integrations, pre-built emission factor libraries, and CSRD/ESRS reporting templates. If you are evaluating tooling, the questions to ask are: does it use a recognised, versioned methodology (EcoLogits or equivalent)? Can you export the underlying calculation for audit? Does it produce both absolute and intensity metrics? And does it handle multi-provider, multi-region workloads in a single view?

The open-source approach described in this guide remains the most transparent option for companies that need full auditability — and the pipeline, once built, requires only modest maintenance as model versions and grid factors change annually.

Summary

Calculating AI emissions for CSRD is a four-step process: collect token usage data from provider APIs, convert tokens to energy and CO₂e using a recognised methodology (EcoLogits for hardware-based estimates, GSF SCI for intensity metrics), apply the appropriate grid carbon intensity for each region, and report the result as a Scope 3 Category 1 line item under ESRS E1-6 with an explicit uncertainty range and a clear statement that all figures are modelled, not measured.

The companies that will handle this most effectively in 2026 and beyond are those instrumenting their AI usage now — building the data infrastructure to move from rough retrospective estimates to real-time, per-call accounting, and using that pipeline as the foundation for both compliance and genuine emissions reduction.