Carbon Tracking & Green Routing
Bauxite Enterprise provides real-time visibility into the environmental impact of your AI operations. By combining hardware telemetry with regional grid data, Bauxite allows you to monitor, report, and optimize the carbon footprint of every LLM request.
The “Green AI” Challenge
LLM inference is energy-intensive. A single complex query can consume as much electricity as charging a smartphone. In a global deployment, the Carbon Intensity of that energy varies significantly depending on whether the data center is powered by coal, gas, or renewables.
How It Works
Bauxite calculates emissions using a multi-factor model that integrates with your inference fleet:
1. Energy Estimation
Bauxite tracks the duration and token count of each request. For self-hosted vLLM clusters, it can ingest GPU power metrics (TDP/Watts) to calculate actual Kilowatt-hours (kWh) consumed per prompt.
2. Grid Intensity Integration
Bauxite maps your vllm-prod endpoints to geographic regions
(e.g., us-east-1 vs. eu-west-1) and pulls real-time Grid Carbon Intensity (gCO2eq/kWh) from providers like Electricity Maps or
Cloud provider sustainability APIs.
3. Green Routing (Beta)
When multiple clusters are available, Bauxite can be configured to favor “Green Regions” during periods of high renewable energy availability, provided latency requirements are met.
Configuration
Enable carbon tracking in your enterprise.yaml:
enterprise:
carbon_tracking:
enabled: true
provider: "google_cloud" # or "aws", "electricity_maps"
reporting_interval: "1h"
green_routing:
enabled: true
priority: "low_carbon" # Balanced with 'latency'
threshold_gco2: 200 # Maximum carbon intensity allowed Reporting & Dashboards
Bauxite exports carbon metrics directly to Prometheus/Grafana, allowing you to build sustainability dashboards that show:
- Total CO2e: Cumulative emissions across all LLM providers.
- Carbon per Department: Track which internal teams are driving the highest emissions.
- Efficiency Gains: Visualize how KV-Aware Routing reduces total compute time and associated carbon.
Example Metric
bauxite_request_carbon_grams_total{model="gpt-4o", region="us-east-1"} 1.24
Zero-Trust Compliance
In keeping with the Straitjacket model, carbon metadata is purely architectural.
No PII Leaks: Carbon reports only contain model names, token counts, and regions. Memory-Mapped: Telemetry is buffered in a fixed 2MB volatile ring buffer before being scraped, ensuring it never hits the disk.