搜索资源列表
SegmentRAM
- 1.基于luncene倒排索引格式的高性能索引数据访问接口。 2. 实现若干中文分词的的分词算法。-1. Based on luncene inverted index format of high-performance data access interface Index. 2. A number of Chinese word segmentation algorithm of.
InverseIndex
- 使用B+树实现文件倒排索引,查找关键词染色,根据出现频率排序
hlink.031023-1010.tar
- 用c++写的搜索引擎中建索引的程序,实现了倒排索引。
倒排索引的实现
- 超高效倒排索引的实现算法
倒排近似查找
- 用倒排索引进行近似查找,根据ED值...
SearchEngine1.0
- 实现搜索引擎最基本的下载网页、建立倒排索引、关键词查询功能。程序的实现借助了libcurl库。-Search engine to achieve the most basic functionality of downloading page, seting up inverted index, keyword querying. Program implementation with the libcurl library.
src
- 采用倒排索引做的相似搜索的源码 。 技术点:倒排索引、归一化、k-临近相似计算-Inverted index used to do similar searches for source
VSM
- 向量空间模型算法,给定一个经过分词的文档集,可以输出向量空间模型、特征词典、倒排索引表等功能,很经典的VSM算法源代码-Vector space model algorithm, given a segmentation of the document set, you can output vector space model, the characteristics of dictionaries, inverted index table functions, it is the clas
IndexDemo
- 一个学习数据结构的好例子,通过此代码能够学习倒排索引时如何创建的,本代码在vc6.0下运行通过-A good example of learning data structure, through this code to learn how to create the inverted index, and the code vc6.0 run through! ! !
EasyXSpider
- 一个Linux下的爬虫,倒排序索引,多条件检索,二元切词以及Google PageRank算法的示例程序。包括CGI的查询界面。Cool!~-Linux under a reptile, invert index, multi-condition searches, Chinese binary segmentation, as well as an example of Google PageRank algorithm procedures. Including the CGI query
CluceneTest
- CLucene 例程 是信息检索的初步学习参考代码,可以了解倒排索引的文件结构以及查询方法-CLuene Demo is a reference code for learning information retrival,which can help to learn the inverted index data structure and query method
ir
- 本系统实现了分词和倒排索引,分词采用正向最大匹配,-The system achieved a sub-word and the inverted index, the biggest being the use of sub-word match,
1
- 建立倒排索引的程序雏型.主要采用链表的数据结构.-Inverted index to establish the procedures for the prototype. The main use of the linked list data structure.
mifluz-0.24.0.tar
- mifluz 的目的是提供一个存储倒排索引c++库,允许存放关键词以便事后进行搜索。-The purpose of mifluz is to provide a C++ library to store a full text inverted index. To put it briefly, it allows storage of occurrences of words in such a way that they can later be searched. The basic id
lucene_indexer
- 网页的除噪和预处理,利用lucene建立一个倒排索引,另外利用了HTMLparser对网页的解析进行了优化除噪。-In addition to web pages and pre-noise, using lucene an inverted index, another advantage of HTMLparser analysis on pages optimized denoising.
text_mining
- 本文讨论了在关系数据库下倒排索引的建立、编码、压缩、存储、获取以及并行化等问题。-We focused our discusion on the building,encoding,compressing,storing,retrievingand parallelization of inverted index.
sphinx-performance
- 搜索引擎开发这使用的倒排索引开源工具sphinx-This search engine developed using open source tools sphinx inverted index
Lying-data-structure9
- 《大话数据结构》主要内容包含:数据结构介绍、算法推导大O阶的方法;顺序结构与链式结构差异、栈与队列的应用;串的朴素模式匹配、KMP模式匹配算法;二叉树前中后序遍历、赫夫曼树及应用;图的深度、广度遍历;最小生成树两种算法、最短路径两种算法;拓扑排序与关键路径算法;折半查找、插值查找、斐波那契查找等静态查找;稠密索引、分块索引、倒排索引等索引技术;冒泡、选择、插入等简单排序;希尔、堆、归并、快速等改进排序-" Westward the data structure," the m
search_engine
- 搜索引擎课程的几次作业,第一次作业实现分词算法,使用正向最大匹配原则。第二次作业实现倒排索引。第三次作业实现正排索引。第四次作业实现索引功能。实现的比较简单,但均可正常运行。-Several job search engine programs, the first job segmentation algorithm, using forward maximum matching principle. The second job inverted index. The third job t
irCode
- 倒排索引的实现 通过倒排索引的方式实现文档集合上的搜索功能(The realization of inverted index)