This commit is contained in:
Duzeyao
2019-10-18 11:50:52 +08:00

View File

@@ -4,7 +4,7 @@
- Chinese version of GPT2 training code, using BERT tokenizer or BPE tokenizer. It is based on the extremely awesome repository from HuggingFace team [Pytorch-Transformers](https://github.com/huggingface/pytorch-transformers). Can write poems, news, novels, or train general language models. Support char level, word level and BPE level. Support large training corpus.
- 中文的GPT2训练代码使用BERT的Tokenizer或GPT2自带的BPE Tokenizer或Sentencepiece的BPE model感谢[kangzhonghua](https://github.com/kangzhonghua)的贡献。可以写诗新闻小说或是训练通用语言模型。支持字为单位或是分词模式或是BPE模式。支持大语料训练。
- 微信交流群:请加微信duzeyao拉你入群
- 微信交流群:请见Issue第一条
## 项目状态