Results outputs for "Beyond the Hype: Identifying and Analyzing Math Word Problem-Solving Challenges for Large Language Models"

  1. Albornoz De Luise, Romina Soledad 1
  2. Arnau, David 1
  3. Arnau-González, Pablo 1
  4. Arevalillo-Herráez, Miguel 1
  1. 1 Universitat de València
    info

    Universitat de València

    Valencia, España

    ROR https://ror.org/043nxc105

Argitaratzaile: Zenodo

Argitalpen urtea: 2024

Mota: Dataset

CC BY-NC-ND 4.0

Laburpena

The provided files contain outputs generated by various Large Language Models (LLMs) for solving problems in the SVAMP dataset. Additionally, they include tagged statements of problems that LLMs incorrectly resolved. This repository includes the following two files: all_data.json --> Contains the generated samples for the SVAMP dataset. df_combined.pkl --> Contains the tagged SVAMP statements of problems that CodeLlama failed to resolve.

Erreferentzia bibliografikoak

  • 10.5281/zenodo.11126655