Summarization
Models
Model |
Lang |
|---|---|
mrm8488/bert-mini2bert-mini-finetuned-cnn_daily_mail-summarization |
EN |
EN |
|
mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization |
EN |
RU |
|
RU |
Datasets
-
Lang: EN
Rows: 973
Preprocess:
Select
testsplit.Rename column
reporttosource.Rename column
summarytotarget.Reset indexes.
-
Lang: EN
Rows: 11490
Preprocess:
Select
1.0.0subset.Select
testsplit.Drop columns
id.Rename column
articletosource.Rename column
highlightstotarget.Delete duplicates in dataset.
Remove substring
(CNN)for eachsourcerow.Reset indexes.
-
Lang: EN
Rows: 6658
Preprocess:
Select
testsplit.Rename column
articletosource.Rename column
abstracttotarget.Reset indexes.
-
Lang: RU
Rows: 6793
Preprocess:
Select
testsplit.Drop columns
title,date,url.Rename column
texttosource.Rename column
summarytotarget.Reset indexes.
-
Lang: RU
Rows: 30454
Preprocess:
Select
trainsplit.Drop columns
title,date,url.Rename column
article_contenttosource.Rename column
summarytotarget.Reset indexes.
-
Lang: RU
Rows: 7609
Preprocess:
Select
testsplit.Rename column
infotosource.Rename column
summarytotarget.Reset indexes.
-
Lang: RU
Rows: 95
Preprocess:
Select
trainsplit.Rename column
Reviewstosource.Rename column
Summarytotarget.Reset indexes.
Supervised Fine-Tuning (SFT) Parameters
Note
Set the parameter target_modules as
["query", "key", "value", "dense"] for the
mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization,
mrm8488/bert-mini2bert-mini-finetuned-cnn_daily_mail-summarization and
dmitry-vorobiev/rubert_ria_headlines models, and as
["q", "k", "wi", "wo"] for the
UrukHan/t5-russian-summarization model.
Note
Set the parameters fine_tuning_steps=150, rank=24, alpha=36 for the
mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization
as SFT parameters.
In pair with
cnn_dailymail
dataset, set fine_tuning_steps=150, rank=24, alpha=48, learning_rate=1e-5
Metrics
BLEU
ROUGE
Note
Use the rougeL metric and set seed=77 parameter
when loading the rouge metric.