本文最后更新于：2 年前

T5

T5: Transfer Text-to-Text Transformer

C4: Colossal Clean Crawled Corpus

架构择优

其中两层数据之间的关联如下

顺次在以下并列选项中找到“最优解”，蓝色代表“胜出”

10%, 15%, 25%, 50% 的 MASK 比例

2, 3, 5, 10 的长度

NLP > 论文

#NLP

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

https://lr-tsinghua11.github.io/2023/02/09/NLP/T5/

作者

Learning_rate

发布于

2023年2月9日

许可协议