nlp_datasets Collecting datasets for Natural Language Processing tasks. Pretrained Word Embeddings Tencent AI Lab Chinese Embedding You can save a binary word2vec file to speed up the loading procedure after the first time loading its original text file. more info Named Entity Recognition MSRA dataset More information will be added in the future.