我的开源项目汇总(机器&深度学习、NLP、网络IO、AIML、mysql协议、chatbot)

1,516 阅读1分钟
原文链接: blog.csdn.net

TextAnalyzer

github.com/sea-boat/Te…

A text analyzer which is based on machine learning,statistics and dictionaries that can analyze text. So far, it supports hot word extracting, text classification, part of speech tagging, named entity recognition, chinese word segment, extracting address, synonym, text clustering, word2vec model, edit distance, chinese word segment, sentence similarity,word sentiment tendency.

轻量reactor网络框架

github.com/sea-boat/ne…

image

深度学习实验室

github.com/sea-boat/De…

  • rnn char test.
  • seq2seq test.
  • word2vec test.
  • cross entropy test.
  • bidirectional rnn test.

机器学习实验室

github.com/sea-boat/Ma…

  • Least Square method.
  • Gradient Descent
  • Newton’s method
  • hierarchy cluster
  • knn
  • markov
  • adaboost
  • random number generation(all kinds of distributions)

mysql协议

github.com/sea-boat/my…

a lib for mysql protocol operation

how to use

i want an OK packet, just simple as below ,bytes is what you need.

OKPacket ok = new OKPacket();
ok.packetId = 2;
ok.affectedRows = 0;
ok.insertId = 0;
ok.serverStatus = 2;
ok.warningCount = 0;
ByteBuffer buffer = ByteBuffer.allocate(256);
ok.write(buffer);
buffer.flip();
byte[] bytes = new byte[buffer.remaining()];
buffer.get(bytes, 0, bytes.length);

seq2seq模型chatbot

github.com/sea-boat/se…

a chatbot which is implemented via seq2seq model.

how to train

  1. run data.py to produce some files we needed.
  2. run train.py to train the model.
  3. run test_model.py to predict.

requirements

  • python3.5
  • tensorflow1.3

AIML聊天机器人

github.com/sea-boat/su…

a robot based on Alice Bot, which can chat with people and handle the business.

Features

  • Alice Bot for chatting.
  • Customizing corpus by AIML1.0.
  • Learning chat online.
  • Ability Bot interface for implementing all kinds of abilities.
  • Service Bot for doing business.

TODO

  • Supports AIML2.0.

自然语言处理实验室

github.com/sea-boat/nl…

  • doc2txt
  • tf-idf
  • cnn text classify
  • hmm cws
  • crf ner

————-推荐阅读————

我的开源项目汇总(机器&深度学习、NLP、网络IO、AIML、mysql协议、chatbot)

我的2017文章汇总——机器学习篇

我的2017文章汇总——Java及中间件

我的2017文章汇总——深度学习篇

我的2017文章汇总——JDK源码篇

我的2017文章汇总——自然语言处理篇

我的2017文章汇总——Java并发篇


跟我交流,向我提问:

公众号的菜单已分为“读书总结”、“分布式”、“机器学习”、“深度学习”、“NLP”、“Java深度”、“Java并发核心”、“JDK源码”、“Tomcat内核”等,可能有一款适合你的胃口。

为什么写《Tomcat内核设计剖析》

欢迎关注: