搜索资源列表
多种分词方法
- 含有多个分词算法。毕业设计的时候获得的,希望对大家在汉字处理中能用的到。 ICTCLAS算法,中科院,对名字识别能力很强。VC开发。 CSharp分词,向前匹配加向后最大匹配,C#开发,容易扩展。 小叮咚分词,由后向前最大匹配,C#开发。 xerdoc分词,基于ICTCLAS的Java版本分词。 文本分词词典,分词的词典,可以提供分词数据源。-containing more than Segmentation. Graduation was the time of hope
Xerdoc XDMF
- 一个比较好用的中文分词模块,是车东等人开发的,比较有名,java语言实现-a better quality of the Chinese word module, East cars were developed, more famous, java language!
FIFO与LRU 算法实现(java)
- java编的两种算法,word写的,看不懂,大家帮忙-java series of the two algorithms, write the word, fail to understand, we look at the help
word_segment
- 基于Java的全文文本檢測與分割 (word segmentation)-Java-based version of the transcr ipt detection and segmentation (word segmentation)
1162-Building-with-Blocks
- 求和由于最后一扇门肯定在正方形的边界举,对每一个中点都求出要从给定点P到达此中点的所需要的最少的门的个数。-hello word wellcom to our school Delphi/java,CppBuilder visual c++Gone Fishing (1042) - ACM/ICPC Regional Contest East Central North America1999Descr iption Archeologists from the Ant
神经网络算法
- JAVA版的BP神经网络源代码,源代码是Word形式,若拷贝到项目中可直接运行。
SenseWordsFilter
- java 实现敏感词过滤,毕业设计做博客网站用到的。-java sensitive word filtering, graduation design blog site used.
nlu_project
- 采用机器学习的方法进行自然语言处理,对中文进行分词和词性标注。分词采用crf模型,词性标注用hmm模型,解码算法为Vertibi算法。本系统使用java语言编写-Using machine learning methods for natural language processing, carried out on the Chinese word segmentation and POS tagging. Segmentation using crf model, tagging with
java_stopword_code
- Java 数据挖掘进行文本预处理的一步,除去文本中包含的停用词-Java Data Mining for text preprocessing step to remove the text contains the word disabled
sckr2013_final
- 基于Word2vec的词语相似度计算,包含完整的java代码,但语料由Word2vec训练得到的模型太大,故不附带模型。-Word2vec-based word similarity computation, including complete java code, but by the Word2vec training corpus resulting model is too large, it is not included with model.
FCBF
- 这是FCBF的matlab程序。FCBF是比较实用的特征选择和降维的方法。在这个文件里面,有个word文件详细说明了FCBF的使用方法。本站只有FCBF的java程序。-This is a matlab program of FCBF. FCBF is a more practical method of feature selection and dimensionality reduction. Within the file, a word file detailed deccribed
IKAnalyzer
- JAVA实现简单客服的机器人系统,分词用系统用IK分词,机器人语言用AIML。程序已经实现java socket服务的建立。实现了中文分词,同义词输出,答案匹配。用到的库有IK、program-ab。搞了一个月的小成果,希望大家能用到。-JAVA simple customer service robotic systems, word by word IK systems, robot language with AIML. Procedures have been implemented t
IKAnalyzer
- IKAnalyzer是一个开源的,基于java语言开发的轻量级的中文分词工具包。从2006年12月推出1.0版开始,IKAnalyzer已经推出了3个大版本。最初,它是以开源项目Luence为应用主体的,结合词典分词和文法分析算法的中文分词组件。新版本的IKAnalyzer3.0则发展为面向Java的公用分词组件,独立于Lucene项目,同时提供了对Lucene的默认优化实现。 -IKAnalyzer is an open source, lightweight java-based de
HanLP-1.2.7
- HanLP是一个致力于向生产环境普及NLP技术的开源Java工具包,支持中文分词(N-最短路分词、CRF分词、索引分词、用户自定义词典、词性标注),命名实体识别(中国人名、音译人名、日本人名、地名、实体机构名识别),关键词提取,自动摘要,短语提取,拼音转换,简繁转换,文本推荐,依存句法分析(MaxEnt依存句法分析、神经网络依存句法分析)。-HanLP is a dedicated to popularize NLP technology to production environment of
BinarySearchTree
- java实现二叉树搜索树的查找查找关键字进行优化路径查找-Java implementation of the two binary tree search tree search to find the key word to find the path to optimize the path
Preprocessing-Java
- 自然语言处理,对于分词后的结果预处理,提取分词结果中的中心词和短语-Word and phrase extraction and segmentation results
extract
- java Tika POI 解析word文件-tika poi
NLPLibSVM
- libsvm分词训练集的java版本。包括libsvm.jar以及训练集样本-Libsvm version of the Java word segmentation training set. Including libsvm.jar and training set samples
shuzhi
- 本压缩包内包含用java编写的最佳均方逼近算法和三次样条插值算法的源代码,以及课程设计的word版-In this archive contains java prepared by the best mean square approximation algorithm and cubic spline interpolation algorithm source code, as well as word version Course Design
itext
- java实现txt、excel、word等文本转换为PDF格式文件-Java implements TXT, excel, word, and other text as a PDF file