"Jieba" word segmentation for Node.js: nodejieba.zip download
Resource description
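NodeJieba: the Node.js version of "Jieba" Chinese word segmentation

Introduction
NodeJieba is a Node extension that simply wraps CppJieba to perform Chinese word segmentation.
See NodeJiebaBlog for details.

Install
    npm install nodejieba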
Because npm can be slow and, from behind the Great Firewall, often fails with puzzling errors, using the cnpm registry is strongly recommended:
    npm --registry=http://r.cnpmjs.org install nodejieba

Default segmentation algorithm

Initialization
    var segment = require("nodejieba");
    // Load the main dictionary and the HMM model shipped with the package.
    segment.loadDict("./node_modules/nodejieba/dict/jieba.dict.utf8", "./node_modules/nodejieba/dict/hmm_model.utf8");

Blocking call
    var wordList = segment.cutSync("阻塞模式分词");
    // Just for the tutorial; this check is always true.
    if (wordList.constructor == Array) {
        wordList.forEach(function(word) {
            console.log(word);
        });
    }

Non-blocking call
    segment.cut("非阻塞模式分词", function(wordList) {
        wordList.forEach(function(word) {
            console.log(word);
        });
    });

The query-prefixed functions (queryLoadDict, queryCutSync, queryCut) are used in the same way.

Initialization
    var segment = require("nodejieba");
    // Same dictionary and model files as above, loaded for the query functions.
    segment.queryLoadDict("./node_modules/nodejieba/dict/jieba.dict.utf8", "./node_modules/nodejieba/dict/hmm_model.utf8");

Blocking call
    var wordList = segment.queryCutSync("阻塞模式分词");
    // Just for the tutorial; this check is always true.
    if (wordList.constructor == Array) {
        wordList.forEach(function(word) {
            console.log(word);
        });
    }

Non-blocking call
    segment.queryCut("非阻塞模式分词", function(wordList) {
        wordList.forEach(function(word) {
            console.log(word);
        });
    });
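
The non-blocking calls above hand the word list to a callback as its only argument, so Node's error-first helpers do not apply to them directly. The following is a minimal sketch of wrapping such a function in a Promise, assuming a Node.js version with Promise support (newer than the v0.10.2 mentioned in this README). The names cutAsync and fakeSegmenter are hypothetical and not part of nodejieba; the stand-in segmenter only splits on whitespace so that the sketch runs without the package installed.

```javascript
// Hypothetical helper: wraps a callback-style cut(text, callback) function,
// whose callback receives only the word list, in a Promise.
function cutAsync(segmenter, text) {
  return new Promise(function (resolve, reject) {
    try {
      segmenter.cut(text, function (wordList) {
        resolve(wordList);
      });
    } catch (err) {
      // Covers exceptions thrown synchronously by cut().
      reject(err);
    }
  });
}

// Stand-in used only for illustration. It splits on whitespace and is
// NOT a real Chinese segmenter; it mimics the callback shape shown above.
var fakeSegmenter = {
  cut: function (text, callback) {
    setTimeout(function () {
      callback(text.split(/\s+/).filter(Boolean));
    }, 0);
  }
};

cutAsync(fakeSegmenter, "non blocking segmentation").then(function (words) {
  words.forEach(function (word) {
    console.log(word);
  });
});
```

With the real module, the object returned by require("nodejieba") would presumably be passed in place of fakeSegmenter after the dictionary has been loaded.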
For concrete usage, see test/segment.js and test/query_segment.js.

Testing
Tested on node v0.10.2.

Web demo: http://cppjieba-webdemo.herokuapp.com/ (Chrome is suggested)

Thanks
Jieba Chinese word segmentation