raptorkwok/cantonese-chinese-parallel-corpus-base
Viewer โข Updated โข 130k โข 105 โข 15
How to use raptorkwok/cantonese-chinese-translation with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "translation" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# 'pip install "transformers<5.0.0'
from transformers import pipeline
pipe = pipeline("translation", model="raptorkwok/cantonese-chinese-translation") # Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("raptorkwok/cantonese-chinese-translation")
model = AutoModelForSeq2SeqLM.from_pretrained("raptorkwok/cantonese-chinese-translation")This model is a fine-tuned version of fnlp/bart-base-chinese on raptorkwok/cantonese-traditional-chinese-parallel-corpus dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Bleu | Chrf | Gen Len |
|---|---|---|---|---|---|---|
| 0.3606 | 0.48 | 1000 | 0.2592 | 60.9844 | 58.8851 | 12.8446 |
| 0.3059 | 0.96 | 2000 | 0.2291 | 61.9606 | 60.1201 | 12.8621 |
| 0.2296 | 1.44 | 3000 | 0.2254 | 61.9458 | 60.0434 | 12.8578 |
| 0.2231 | 1.92 | 4000 | 0.2176 | 61.9617 | 59.9299 | 12.8827 |
| 0.174 | 2.39 | 5000 | 0.2290 | 61.9661 | 59.8844 | 12.9068 |
| 0.171 | 2.87 | 6000 | 0.2258 | 62.1085 | 60.1854 | 12.8755 |
| 0.1346 | 3.35 | 7000 | 0.2334 | 61.4554 | 59.5055 | 12.8175 |
| 0.1285 | 3.83 | 8000 | 0.2408 | 61.3332 | 59.3276 | 12.8412 |
| 0.1061 | 4.31 | 9000 | 0.2530 | 61.6505 | 59.614 | 12.8566 |
Base model
OpenMOSS-Team/bart-base-chinese