Skip to content

NLP-RISE/swesat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SweSAT-1.0: A Benchmark for Swedish LLMs

SweSAT-1.0 is a benchmark dataset created from the Swedish university entrance exam (Högskoleprovet) to assess large language models (LLMs) in Swedish. It includes 867 multiple-choice questions from the last 8 exams (2020-10-25 to 2024-04-13), covering the following six different question types:

  • ORD (Vocabulary) - Tests the understanding of in-domain words and synonyms.
  • LÄS (Reading Comprehension) - Assesses the ability to make inferences from a text.
  • MEK (Sentence Completion) - Evaluates the ability to complete sentences via cloze tests.
  • XYZ (Mathematical Problem-Solving) - Includes questions on arithmetic, algebra, geometry, statistics, and functions.
  • KVA (Quantitative Comparisons) - Measures the ability to compare quantities in mathematical concepts.
  • NOG (Data Sufficiency) - Determines the ability to assess whether given data is sufficient to solve a problem.

Install dependencies

  • With poetry (recommended):

    poetry shell
    poetry install # installs all dependencies from lockfile
  • With pip:

    python -m venv .swe_sat_venv
    source .swe_sat_venv/bin/activate
    pip3 install pdfplumber requests

Dataset

The Swe-SAT-1.0 dataset is partially available exams with the exception of reading passages for the LÄS section. To obtain the full dataset including the LÄS section, run the following script:

python3 process_verbal_sections/get_pdfs.py

After running this script, a newly created directory exam_pdfs will be populated. Now, run the script to process the verbal sections.

python3 process_verbal_sections/parse_exam_pdf.py exam_pdfs

Tip

The LÄS section may contain copyrighted reading passages. You can inspect the sources of these passages on studera.nu when clicking on a specific exam year (found under the "Källor" section). ⚠️ The site is in Swedish!

Citation

If you use SweSAT-1.0 in your research, please cite:

@article{SweSAT2024,
  title={SweSAT-1.0: The Swedish University Entrance Exam as a Benchmark for Large Language Models},
  author={Kurfalı, Murathan and Zahra, Shorouq and Gogoulou, Evangelia and Dürlich, Luise and Carlsson, Fredrik and Nivre, Joakim},
  booktitle = "Proceedings of The Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)",
  month = march,
  year = "2025",
  address = "Talinn, Estonia"
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages