搜索资源列表
Craw
- 一个简单的Java爬虫框架,需要对自己要爬的网站写分析规则,可以自动设定下载线程数量,限制最大网速-A simple robot to catch content from site.
webmagic-master
- 一个爬虫框架,除了不会反爬虫外(当然可以自己加)其他都很牛逼,用java写的。-A crawler frame, besides will not reverse the crawler themselves are added (of course) other are very cow force, written in Java.
crawler
- 轻量级爬虫框架,可控制抓取深度 跟踪最初站源 可配置线程池 可配置UserAgent 可决定是否要抽取链接 Bloom Filter 可控制爬取速度 内置UserAgent池 支持Proxy池(Lightweight crawler framework)
crawler4j-3.5-src
- 一款不错的用于java语言的爬虫框架,编程简单方便,编程人员不需具备较好的功底也能轻松使用(A good for Java language crawler framework, programming simple and convenient, programmers need not have a good foundation, but also easy to use)
weibo3.2
- WebCollector是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫。WebCollector-Hadoop是WebCollector的Hadoop版本,支持分布式爬取。(WebCollector is a JAVA crawler framework (kernel) that does not need to be configured and easy to develop for two times. It prov
WebCollector
- java爬虫框架,在eclipse编程环境中,可以良好运行(Java reptilian frame)
WebCollector
- WebCollector爬虫框架源码,对于学习爬虫有很大的帮助(WebCollector crawler framework source code)
webcollector-2.32-bin
- WebCollector是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫。(WebCollector is a JAVA crawler framework (kernel) that does not need to be configured and is easy to develop for two times. It provides a streamlined API that requires a small nu