With this post, we conclude the series dedicated to Green Quality Assurance. In the previous articles, we explained what Green QA means and its impact on companies as well as a possible framework for its implementation.

Now that we already understand what this methodology is and how to implement it, the only thing left is to understand how we can measure it. In this final post, we focus on the foundation of the entire process: measurement, addressing the KPIs that can be used to turn requirements into data and to track progress as accurately as possible based on objective information.

It is not an easy task to have all the tools and frameworks required to perform these measurements, which is why the process is progressive and requires commitment from every member of the organization.

In Green QA, metrics are the thermometer of our efficiency. We do not measure for bureaucracy’s sake, but to identify “energy leaks” and “digital waste.”

When defining the KPIs and OKRs needed to evaluate the different aspects of GQA, we grouped them into the following measurement categories:

Energy and Software KPIs (Technical Efficiency)

Here, we measure the physical effort hardware performs to run our software. The KPIs used for this area are the following:

KPI Definition Suggested Tool Technical Metric
QA Energy Intensity Energy consumed per execution of the test suite. Scaphandre (via Prometheus) kWh \Tests
Green Code Quality Detection of inefficient code patterns (loops, API calls). SonarQube (Eco-Code) # of Green Smells
Idle Energy Rate Energy consumed by Staging environments while not being tested. AWS CCFT / Azure Dashboards Idle kWh
CPU, Memory, and Idle Cycles Physical resource usage during the test lifecycle. Prometheus / Netdata % CPU + % RAM) \Idle Time
Frontend Efficiency Carbon footprint generated on the end-user device. GreenFrame.io / Lighthouse gCO2 per session

Carbon and Waste KPIs (Planetary Impact)

We transform watts into real environmental impact. The KPIs used for this area are the following:

KPI Definition Suggested Tool Technical Metric
Release Carbon Footprint CO2 emissions generated by deploying a new version. Cloud Carbon Footprint gCO2 per release
Compute Efficiency (LCA) Environmental impact of QA hardware (manufacturing + usage). SimaPro / GaBi Hardware Carbon Debt
Annual Reduction Rate Emission savings compared to the previous period. Watershed / Persefoni % Annual Reduction
Hardware Circularity Index % of testing devices reused, repaired, or recycled. Internal ERP / Snipe-IT Refurbished Equipment\Total
Compliance Deviation Rate Releases exceeding the established carbon budget. Jenkins / GitHub Actions # Blocked Releases

ESG Data Quality KPIs (Trust and Compliance)

If sustainability data is unreliable, the strategy fails. The KPIs used to measure this area are the following:

KPI Definition Suggested Tool Technical Metric
ESG Data Health Index % of sustainability data that is auditable and real. Persefoni / Plan A Verifiable Data \Total
% of Auditable ESG Data Proportion of QA metrics with verifiable technical evidence. MS Sustainability Manager Auditable Metrics\Total

Operational Efficiency and “Digital Waste” KPIs (Process Improvement)

It is not enough for a test to consume little; the key is not executing what is unnecessary. Digital waste is silent pollution. The data used to measure this area includes the following:

KPI Definition Suggested Tool Technical Metric

“Zombie Tests” Rate
% of automated tests that run but provide no value (duplicated tests, tests that always pass without validating real logic, or tests for deprecated functionality).
Manual

Zombie Tests / Total
Test Data Density Measures the size of datasets used. Do we really need a 1TB database for an integration test, or can we use smart subsetting? Less storage = less server energy consumption.
Manual

Used Data / Total
Time-to-Feedback The longer a pipeline takes to fail, the more resources (cloud compute minutes) are wasted. Optimizing execution order to fail fast is an energy-saving strategy.
Manual

Optimized Pipelines / Total

Standards and Compliance

It is not enough to “be green”; you must prove it to international regulators. Green QA is the final filter ensuring that a company does not incur legal risks by reporting inaccurate data.

In the previous post, we discussed legal aspects where applying Green QA within a company is beneficial. Here, we will look at how to ensure compliance through data and how to verify whether the quality process we followed has contributed to regulatory compliance using the KPIs defined above.

CSRD (EU Directive) + ESRS

The CSRD (Corporate Sustainability Reporting Directive) is the European regulation (in force since 2024) that requires large companies and publicly traded organizations to report detailed sustainability information under Environmental, Social, and Governance (ESG) criteria.

In Spain, the Corporate Sustainability Reporting Bill was approved on October 29, 2024, as a transposition of the CSRD. The ESRS (European Sustainability Reporting Standards) are the mandatory technical standards for complying with the CSRD.

From a QA perspective, we can audit sustainability data. Here, Green QA acts as the “Data Auditor.” It must ensure that ESG data (such as server consumption in Staging) is not based on rough estimates but has technical traceability. If reporting software fails, the company may face sanctions for Greenwashing. We can use the following checklist to verify compliance:

GHG Protocol (Scope 3 - Software)

The GHG Protocol (Scope 3) is the most widely used global standard for measuring and reporting indirect greenhouse gas emissions (GHG) occurring across a company’s value chain, excluding purchased energy emissions (Scope 2).

From a QA perspective, it is necessary to validate that third-party tools (testing SaaS platforms, CDNs, paid APIs) provide real emissions data. Quality Gates can be created to block deployments if the “carbon budget” of a microservice exceeds protocol limits. We can use the following checklist to verify compliance:

ISO/IEC 21031 (Software Carbon Intensity - SCI)

This is the Unit Testing of the carbon footprint. It is an international standard for calculating software carbon intensity (SCI).

The role of Green QA here is to integrate carbon footprint measurement into the testing pyramid. Just as we validate response times, QA validates the energy cost per transaction. If a database change increases CPU cycles, QA acts as the gatekeeper preventing that “energy waste” from reaching production.

Consumer Empowerment Directive (Anti-Greenwashing)

The EU is banning generic environmental claims without evidence (“100% eco-friendly software”). From a QA perspective, evidence must be certified. If marketing claims the app consumes 30% less battery, QA should have run energy regression tests (using tools such as GreenFrame or Lighthouse) that support the claim with empirical and repeatable data.

Web Accessibility (WAD / WCAG) as Sustainability

There is a direct correlation: an accessible and lightweight website is also a low-consumption website. From a QA perspective, the DOM must be validated for efficiency. Fewer unnecessary elements and redundant requests mean fewer CPU cycles on the client device. Here, QA combines social impact (inclusion) with environmental impact (efficiency).

Carbon Footprint Measurement Table

At this stage, we present a table with useful data commonly used to automate carbon footprint calculations. These values evolve as technologies improve and become more efficient, so they should be considered estimates.

QA / IT Activity Estimated Consumption / Emissions CO2e Equivalent Visual Impact Source
EC2 Instance (AWS m5.large) - 24h ~0.105 kWh ~2.52 kg CO2 Charging a smartphone 300 times. AWS Customer Carbon Footprint Tool
Azure VM (D2s v3) - 24h ~0.088 kWh ~2.10 kg CO2 10 cold-water laundry cycles. Azure Emissions Impact Dashboard
Cloud SQL / BigQuery (1h) ~0.008 kWh ~0.18 kg CO2 Watching 4 hours of HD streaming. Google Cloud Carbon Footprint
S3 Storage (1 TB/month) ~0.05 kWh ~0.12 kg CO2 Driving 0.5 km in a gasoline car. Cloud Carbon Footprint Methodology
Smartphone Usage (1h testing) ~0.00077 kWh ~5.8g CO2 Keeping an LED bulb on for 45 minutes. ADEME / Scope3 Lifecycle Data
Tablet Usage (1h testing) ~0.003 kWh ~2.9g CO2 Similar to 1 hour of portable radio usage. ADEME “Numérique 2.0” (2025)
AI Query (LLM / Gemini) ~0.003 kWh ~0.15g - 0.75g CO2 50-90 times more than a search engine query. Joule / ITU-WBA Report 2025
Smartphone Manufacturing (embedded) N/A ~50 kg CO2 / year Emissions equivalent to producing 250 hamburgers. Öko-Institut (Life Cycle Study)
1 Hour of Cloud Server Usage (standard) 0.5 kWh ~125g CO2 Charging a smartphone 15 times.
Suite of 1,000 Automated Tests 2.5 kWh ~625g CO2 Driving a gasoline car for 2.5 km.
Staging Environment Running (24h) 12 kWh ~3 kg CO2 The amount of CO2 absorbed by 0.15 trees in a year.
Storing 1 TB of Logs/Data (1 month) 10 kWh ~2.5 kg CO2 Keeping an LED bulb on for 4 months.

Conclusions

Throughout this three-part series on Green Quality Assurance, we have explored the possibilities within the technology world, and specifically within the software industry, to take action and improve energy efficiency.

Tell us what you think.

Comments are moderated and will only be visible if they add to the discussion in a constructive way. If you disagree with a point, please, be polite.

Subscribe