-
带有中英文的垃圾邮件分类数据集
资源介绍
There are two corpora - mostly English (trec06p) and Chinese (trec06c).
trec06p/full/ -- Ideal feedback English corpus
trec06p/full-delay/ -- Delayed feedback English corpus
trec06c/full/ -- Ideal feedback Chinese corpus
trec06c/delay/ -- Delayed feedback Chinese corpus