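Conditional Layer Normalization
Comparison of pre-trained models
Overview of pre-trained models
word2vec (plain-language edition)
fasttext text classifier
TextCNN
RNN
Seq2Seq model
attention model
transformer (attention is all you need)
BERT
transformers language models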