搜索资源列表
wawatextcluster
- 蛙蛙的中文文本聚类,主要采用k-means算法。wawa s text cluster using C#.
java-cluster.zip
- 用java语言实现文本聚类,包括聚类前的数据预处理:分词、降维、建立向量空间模型等,Implementation using java language text clustering, including clustering of the data pre-processing before: segmentation, dimensionality reduction, set up, such as Vector Space Model
textcluster
- 文本聚类算法源码,包含tf.idf计算的实现,采用java语言编写-text cluster algorithm, including the computation of tf.idf ,written by Java
knn
- k最邻近算法,经典的分类算法,绝对有帮助-k-nearest neighbour algorithm,it is a classical algorithm for text cluster
lingpipe-3.6.0
- 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of Speech Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character
dlucene.tar
- 分布式全文搜索工具包 可以支持集群 主要使用java開發 比較方便使用-Distributed full-text search tool kit can support the main use of cluster development of more convenient use of java
AP
- 最新AP聚类算法以及演示程序,算法内容参照affinity appropagation in science。-AP latest clustering algorithm as well as the demo program, algorithm reference content affinity appropagation in science.
automatic-classification-cluster
- 一、问题描述若要在n个城市之间建役通信网络,只福要架设n-1条级路即可.如何以最低的经济代价建设这个通信网,是一个网的最小生成树问题。二、基本要求 (1)利用克鲁斯卡尔算法求图的最小生成树。 (2)能实现教科书6.5节中定义的抽象数据类型MFSet.以此表示构造生成树过程中的连通分量。 (3 ) 以文本形式输出生成树中各条边以及他们的权值.三、需求分析 1、构造图结构。 2、利用克鲁斯卡尔算法求图的最小生成树。 3、完成生成树的输出。 -I. Descr iption of the proble
Discover
- 一个相对很完善的数据挖掘系统(少部分功能欠缺),不仅包括了按类Association(内含Apriori,C4.5,GrowTree),Classification(ID3),Cluster划分常用的算法及个人优化后的算法。同时包含了多种数据源(MS Access,Excel,SQL Server,TEXT)的界面直接连接方式。界面方面更像一个专业的系统,采用了类VC浮动多框架界面,也可以作为界面编程的参考。这个工程的分数和其他小工具的分数一样,感觉亏大了:)-A relatively well
text_data_mining
- java编写的数据挖掘方面的代码,里面包含有文本分类,作者身份识别方面的java源码,本人亲自参与编写-java code about data mining;include:text cluster ,authorship identification,
File10
- Web文档聚类系统的设计与实现:数据挖掘;聚类分柝:文本挖掘;预处理;聚类组合;可 视化;欧氏距离-Web Document Clustering Design and Implementation: Data mining Clustering Hierarchical: text mining pretreatment cluster combinations visualization Euclidean distance
vsm-and-cluster-kmeans
- 对文本聚类的向量空间模型机制VSM以及Kmeans等讲解比较详细,是研究文本聚类的好书籍-Clustering of text vector space model to explain the mechanism of VSM and Kmeans and other more detailed, study of text clustering is a good book
tomcat--Cluster
- tomcat集群的详细配置,有详细配置方法,带文字说明。-tomcat cluster configure, configuration, with the text.
K-Means_Text_Cluster
- K-Means文本聚类python实现,文本聚类算法,人名排除歧义-Text Cluster by the algorithm of K-means(include texts), discrimination of name ambiguity.
AP-Cluster
- AP聚类算法的C++代码实现,其中数据是文本读入iris数据,P值选取欧式矩阵最小值。显示结果为聚类后结果-AP clustering algorithm C++ code, in which data is read into the text iris data, P values selected European matrix minimum. Showing results clustering results after
cluster
- python语言实现k-means算法和Fast Search And Find Of Density Peaks算法用于文本聚类,-python language implements k-means algorithm and Fast Search And Find Of Density Peaks for text clustering algorithm,
Edge-based-text-region-extraction-from-natural-im
- The basic steps of the edge-based text extraction algorithm are given below 1. Create a Gaussian pyramid by convolving the input image with a Gaussian kernel and successively down-sample each direction by half. (Levels: 4) 2. Create directiona
cluster-test-VO.2
- 基于C++语言,利用K均值及其改进方法实现短文本的聚类,其中利用最远距离法实现的聚类中心初始化-Based on the C++ language, the use of K-means clustering and improved methods to achieve short text clustering centers which use the law to achieve the most remote initialization
base-on-WEKA-text-cluster-apllying
- weka平台的文本分类测试,源代码为java-Text categorization test weka platform, the source code for the java
cluster
- 提出了一种基于语义内积空间模型的文本 聚类算法. -Text proposed clustering algorithm within the semantic model based on the product space.