(01) Stop Words

2023-11-24 00:37:24

1. The gensim package
2. The nltk package

1. The gensim package

from gensim.parsing.preprocessing import STOPWORDS
ls_stopwords = list(STOPWORDS)  # STOPWORDS is a frozenset; convert it to a list
print(ls_stopwords)
print(len(ls_stopwords))  # jy: 337
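
Because STOPWORDS is a frozenset, it cannot be modified in place; adding or dropping words means building a new set. The following is a minimal sketch of that idea. It assumes a small hand-written frozenset (`base_stopwords`, a hypothetical stand-in) so that it runs without gensim installed; in practice `STOPWORDS` would take its place.

```python
# Stand-in for gensim's STOPWORDS (assumption: a tiny hand-picked subset,
# used only so this sketch runs without gensim installed).
base_stopwords = frozenset({"the", "a", "is", "of", "not"})

# A frozenset is immutable, so union/difference return new sets
# and leave the original untouched.
custom_stopwords = base_stopwords.union({"rt", "via"})   # add domain-specific words
custom_stopwords = custom_stopwords.difference({"not"})  # keep the negation word "not"

print(sorted(custom_stopwords))
```

Whether negation words such as "not" should be kept depends on the task; for sentiment analysis they often carry meaning.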

2. The nltk package

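import nltk
nltk.download('stopwords')  # fetch the stopwords corpus (only needed once)
stop_words = nltk.corpus.stopwords.words('english')  # list of English stop words
print(len(stop_words))  # number of English stop words
print(stop_words[:7])  # first seven entries
print([sw for sw in stop_words if len(sw) == 1])  # single-character stop words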
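
A stop word list is typically used to filter tokens out of text before further processing. The following is a minimal sketch under stated assumptions: `filter_stopwords` is a hypothetical helper, tokenization is a plain `str.split`, and `demo_stopwords` is a small hand-written set so the example runs without downloading the corpus. The nltk list, converted with `set(...)`, could be passed instead.

```python
def filter_stopwords(text, stopwords):
    """Return the lowercase tokens of `text` that are not in `stopwords`."""
    # Naive whitespace tokenization (assumption; a real tokenizer
    # would also handle punctuation).
    tokens = text.lower().split()
    return [tok for tok in tokens if tok not in stopwords]

# Hypothetical stand-in for set(nltk.corpus.stopwords.words('english')).
demo_stopwords = {"this", "is", "a", "of", "the"}

filtered = filter_stopwords("This is a list of the stop words", demo_stopwords)
print(filtered)
```

Passing a set rather than a list keeps each membership test constant-time, which matters when filtering large corpora.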
