搜索资源列表
searchhtml
- java做得html页面内容搜索的程序-done html page content search procedures
jspider-src-0.5.0-dev
- 一个JAVA的网络爬虫源码,可以爬取包括PDF,DOC,HTML等内容,相当不错!-A JAVA source network reptiles can climb check, including PDF, DOC, HTML and other content, very good!
joyhtml-0.2.2
- html正文提取,利用匹配来进行正文的抽取-html text extraction, the use of matching to carry out the extraction of the body