HI @moma1820, I saw you posted a similar question in another thread, so I replied there. See the link below:
arteagac
7
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Is there a pre-trained BERT model with the sequence length 2048? | 2 | 2176 | November 5, 2020 | |
| Token Classification Models on (Very) Long Text | 8 | 11664 | March 9, 2023 | |
| Modeling long sequences | 0 | 486 | June 9, 2022 | |
| Flan-T5 - Finetuning to a Longer Sequence Length (512 -> 2048 tokens): Will it work? | 3 | 4434 | January 9, 2024 | |
| Passing Inputs Longer Than 512 Tokens After Pretraining a T5 Model: Is It Safe? | 3 | 218 | November 20, 2025 |