This code was originally forked from eci-io/climategpt-evaluation, with significant modifications and additions.
ClimateEval is a comprehensive benchmark designed to evaluate large language models (LLMs) across a wide range of climate change–related NLP tasks. It aggregates 13 datasets into 25 tasks spanning text classification, question answering, information extraction, and misinformation detection, all integrated into the lm-eval-harness framework.
This benchmark enables standardized, reproducible assessment of LLMs for climate-focused tasks.
1. Clone ClimateEval

```bash
git clone https://github.com/NLP-RISE/ClimateEval.git
cd ClimateEval
```

2. Install lm-eval-harness

```bash
git clone https://github.com/EleutherAI/lm-evaluation-harness.git
cd lm-evaluation-harness
pip install -e .
```

Example command (5-shot evaluation on the claim_binary task):
```bash
lm_eval \
    --model hf \
    --model_args pretrained=eci-io/climategpt-7b \
    --tasks claim_binary \
    --output_path /results/climategpt-7b.jsonl \
    --show_config --log_samples \
    --num_fewshot 5 \
    --include_path <path-to-ClimateEval>/
```

To evaluate the full ClimateEval suite, use the tag:

```bash
--tasks ClimateEval
```

Or run by subsets, e.g.:
```bash
--tasks CheapTalk,climatebert,climabench
```

| Tag | Description |
|---|---|
| `ClimateEval` | Full ClimateEval benchmark suite |
| `CheapTalk` | Corporate climate discourse tasks based on this paper |
| `climatebert` | Tasks used to evaluate ClimateGPT |
| `climabench` | Tasks from the ClimaBench benchmark |
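For scripted sweeps over several subsets, the tags above can be assembled into an `lm_eval` invocation from Python. A minimal sketch, assuming lm-eval-harness is installed; the `build_command` helper and the model/path values are hypothetical, only the tag names and CLI flags come from the examples above:

```python
import subprocess

# Subset tags from the table above.
SUBSETS = ["CheapTalk", "climatebert", "climabench"]

def build_command(model, tasks, include_path):
    """Assemble an lm_eval command line for the given ClimateEval task tags."""
    return [
        "lm_eval",
        "--model", "hf",
        "--model_args", f"pretrained={model}",
        "--tasks", ",".join(tasks),
        "--include_path", include_path,
    ]

cmd = build_command("eci-io/climategpt-7b", SUBSETS, "/path/to/ClimateEval/")
print(" ".join(cmd))
# subprocess.run(cmd, check=True)  # uncomment to actually launch the evaluation
```

This keeps the subset list in one place, so adding or removing a tag changes every run consistently.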
If you use ClimateEval in your work, please cite:
```bibtex
@inproceedings{ClimateEval2025,
  title={ClimateEval: A Comprehensive Benchmark for NLP Tasks Related to Climate Change},
  author={Murathan Kurfali and Shorouq Zahra and Joakim Nivre and Gabriele Messori},
  booktitle={Proceedings of the 2nd Workshop of Natural Language Processing meets Climate Change (ClimateNLP 2025) at ACL 2025},
  year={2025}
}
```