File name: lucene_indexer
Uploaded: 2012-11-16
File size: 4.78 MB
Description:
Denoises and preprocesses web pages, then builds an inverted index over them with Lucene. HTMLParser is used to parse the pages and to improve the noise removal.
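The archive ships only compiled classes, so the author's code is not visible here. As a rough illustration of the two steps named in the description (noise removal, then inverted indexing), the following is a minimal sketch that uses only the Java standard library. It does not use Lucene or HTMLParser: regular expressions stand in for the HTML parser and a sorted map stands in for the Lucene index. The class name `InvertedIndexSketch` and its methods `stripNoise` and `buildIndex` are hypothetical and are not taken from the archive.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;
import java.util.regex.Pattern;

// Hypothetical sketch: regex-based denoising plus an in-memory inverted index.
// This is NOT the Lucene/HTMLParser implementation shipped in the archive.
public class InvertedIndexSketch {
    // <script> and <style> blocks are dropped with their contents: they are noise, not page text.
    private static final Pattern SCRIPT_OR_STYLE =
        Pattern.compile("(?is)<(script|style)\\b[^>]*>.*?</\\1\\s*>");
    // Any remaining tag is replaced by a space so adjacent words do not fuse.
    private static final Pattern TAG = Pattern.compile("(?s)<[^>]*>");

    // Returns the visible text of an HTML fragment with whitespace collapsed.
    public static String stripNoise(String html) {
        String text = SCRIPT_OR_STYLE.matcher(html).replaceAll(" ");
        text = TAG.matcher(text).replaceAll(" ");
        return text.replaceAll("\\s+", " ").trim();
    }

    // Maps each lowercased term to the sorted set of page numbers (list positions) containing it.
    public static Map<String, SortedSet<Integer>> buildIndex(List<String> pages) {
        Map<String, SortedSet<Integer>> index = new TreeMap<>();
        for (int docId = 0; docId < pages.size(); docId++) {
            String text = stripNoise(pages.get(docId)).toLowerCase(Locale.ROOT);
            // Split on anything that is not a letter or digit.
            for (String term : text.split("[^\\p{L}\\p{N}]+")) {
                if (term.isEmpty()) {
                    continue;
                }
                index.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
            }
        }
        return index;
    }

    public static void main(String[] args) {
        List<String> pages = Arrays.asList(
            "<html><head><style>p{color:red}</style></head>"
                + "<body><p>Lucene builds an index</p></body></html>",
            "<html><body><script>var x = 1;</script>"
                + "<p>An inverted index maps terms to pages</p></body></html>");
        Map<String, SortedSet<Integer>> index = buildIndex(pages);
        System.out.println("index -> " + index.get("index"));   // index -> [0, 1]
        System.out.println("lucene -> " + index.get("lucene")); // lucene -> [0]
    }
}
```

A run of Chinese characters would come out of this tokenizer as a single term. The archive bundles lucene-smartcn-3.0.2.jar, which suggests the original relies on Lucene's Chinese analyzer for word segmentation; that step is not reproduced in this sketch.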
File list (generated automatically):
lucene_indexer/indexer/Indexer.class
lucene_indexer/indexer/Parserhrml.class
lucene_indexer/lib/filterbuilder.jar
lucene_indexer/lib/htmllexer.jar
lucene_indexer/lib/htmlparser.jar
lucene_indexer/lib/lucene-analyzers-3.0.2.jar
lucene_indexer/lib/lucene-core-3.0.2.jar
lucene_indexer/lib/lucene-smartcn-3.0.2.jar
lucene_indexer/lib/sitecapturer.jar
lucene_indexer/lib/thumbelina.jar
lucene_indexer/indexer
lucene_indexer/lib
lucene_indexer