add cocoteros_es dataset
rsepulveda911112 committed Feb 21, 2025
1 parent 1ba35e6 commit 6d31c18
Showing 3 changed files with 25 additions and 0 deletions.
2 changes: 2 additions & 0 deletions lm_eval/tasks/spanish_bench/README.md
@@ -15,6 +15,7 @@ The datasets included in SpanishBench that have been made public in previous pub
| Task | Category | Paper title | Homepage |
|:-------------:|:-----:|:-------------:|:-----:|
| Belebele_es | Reading Comprehension | [The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants](https://arxiv.org/abs/2308.16884) | https://huggingface.co/datasets/facebook/belebele |
| Cocoteros_es | Commonsense Reasoning | [COCOTEROS: A Spanish Corpus with Contextual Knowledge for Natural Language Generation](https://besaya.infor.uva.es/sepln24/paper04.pdf) | https://huggingface.co/datasets/gplsi/cocoteros |
| EsCoLA | Linguistic Acceptability | [EsCoLA: Spanish Corpus of Linguistic Acceptability](https://aclanthology.org/2024.lrec-main.554/) | https://huggingface.co/datasets/nbel/EsCoLA |
| FLORES_es | Translation | [The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation](https://arxiv.org/abs/2106.03193) | https://huggingface.co/datasets/facebook/flores |
| MGSM_es | Math | [Language Models are Multilingual Chain-of-Thought Reasoners](https://arxiv.org/abs/2210.03057) | https://huggingface.co/datasets/juletxara/mgsm |
@@ -44,6 +45,7 @@ Paper for SpanishBench coming soon.

The following tasks evaluate tasks on SpanishBench dataset using various scoring methods.
- `belebele_spa_Latn`
- `cocoteros_es`
- `copa_es`
- `escola`
- `flores_es`
22 changes: 22 additions & 0 deletions lm_eval/tasks/spanish_bench/cocoteros_es.yaml
@@ -0,0 +1,22 @@
task: cocoteros_es
dataset_path: gplsi/cocoteros
dataset_name: null
output_type: generate_until
doc_to_text: "Genera una frase corta con estas palabras: {{keywords}}. El contexto es: {{context}} \n\nRespuesta:"
doc_to_target: "{{text}}"
training_split: train
test_split: test
target_delimiter: ' '
generation_kwargs:
  max_gen_toks: 40
  until:
    - "\n"
metric_list:
  - metric: bleu
    aggregation: bleu
    higher_is_better: true
  - metric: !function utils.rouge1
    aggregation: !function utils.rouge1_agg
    higher_is_better: true
metadata:
  version: 1.0
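
The config above can be read as three steps: build a prompt from `doc_to_text`, cut the model output at the first `until` sequence, and score the result against `doc_to_target`. The following is a minimal Python sketch of those steps under several assumptions. The `render` helper is a simplified stand-in for the Jinja2 templating the harness applies, `unigram_f1` only illustrates the kind of unigram overlap a ROUGE-1 style metric measures (the `utils.rouge1` function is not part of this commit and may work differently), and the example record and model output are invented. The final `.strip()` is a choice made for this illustration only.

```python
import re
from collections import Counter

# Template strings and stop sequences copied from cocoteros_es.yaml.
DOC_TO_TEXT = (
    "Genera una frase corta con estas palabras: {{keywords}}. "
    "El contexto es: {{context}} \n\nRespuesta:"
)
DOC_TO_TARGET = "{{text}}"
STOP_SEQUENCES = ["\n"]  # generation_kwargs.until


def render(template, doc):
    """Fill {{field}} placeholders. Simplified stand-in for Jinja2 (assumption)."""
    return re.sub(
        r"\{\{\s*(\w+)\s*\}\}",
        lambda match: str(doc[match.group(1)]),
        template,
    )


def truncate_at_stop(generation, stops):
    """Cut the generation at the earliest stop sequence, if one occurs."""
    cut = len(generation)
    for stop in stops:
        index = generation.find(stop)
        if index != -1:
            cut = min(cut, index)
    return generation[:cut]


def unigram_f1(prediction, reference):
    """Illustrative unigram-overlap F1; NOT the harness's utils.rouge1."""
    pred_counts = Counter(prediction.lower().split())
    ref_counts = Counter(reference.lower().split())
    overlap = sum((pred_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical record: field names follow the templates, values are invented.
doc = {
    "keywords": "perro, parque, correr",
    "context": "Una tarde soleada en la ciudad.",
    "text": "El perro corre por el parque.",
}

prompt = render(DOC_TO_TEXT, doc)
target = render(DOC_TO_TARGET, doc)

# Hypothetical raw model output that runs past the first newline.
raw_output = " El perro corre feliz por el parque.\nGenera otra frase"
answer = truncate_at_stop(raw_output, STOP_SEQUENCES).strip()
score = unigram_f1(answer, target)
```

Because the only stop sequence is a newline and `max_gen_toks` is 40, the task effectively asks for a single short sentence per prompt, which is then compared with the reference `text` field.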
1 change: 1 addition & 0 deletions lm_eval/tasks/spanish_bench/spanish_bench.yaml
@@ -13,5 +13,6 @@ task:
  - mgsm_direct_es_spanish_bench
  - flores_es
  - phrases_es
  - cocoteros_es
metadata:
  version: 1.0
