我的开源项目汇总(机器&深度学习、NLP、网络IO、AIML、mysql协议、chatbot)
TextAnalyzer
https://github.com/sea-boat/TextAnalyzer
A text analyzer which is based on machine learning,statistics and dictionaries that can analyze text. So far, it supports hot word extracting, text classification, part of speech tagging, named entity recognition, chinese word segment, extracting address, synonym, text clustering, word2vec model, edit distance, chinese word segment, sentence similarity,word sentiment tendency.
轻量reactor网络框架
https://github.com/sea-boat/net-reactor
深度学习实验室
https://github.com/sea-boat/DeepLearning-Lab
- rnn char test.
- seq2seq test.
- word2vec test.
- cross entropy test.
- bidirectional rnn test.
机器学习实验室
https://github.com/sea-boat/MachineLearning_Lab
- Least Square method.
- Gradient Descent
- Newton’s method
- hierarchy cluster
- knn
- markov
- adaboost
- random number generation(all kinds of distributions)
mysql协议
https://github.com/sea-boat/mysql-protocol
a lib for mysql protocol operation
how to use
i want an OK packet, just simple as below ,bytes is what you need.
OKPacket ok = new OKPacket();
ok.packetId = 2;
ok.affectedRows = 0;
ok.insertId = 0;
ok.serverStatus = 2;
ok.warningCount = 0;
ByteBuffer buffer = ByteBuffer.allocate(256);
ok.write(buffer);
buffer.flip();
byte[] bytes = new byte[buffer.remaining()];
buffer.get(bytes, 0, bytes.length);
seq2seq模型chatbot
https://github.com/sea-boat/seq2seq_chatbot
a chatbot which is implemented via seq2seq model.
how to train
- run
data.py
to produce some files we needed. - run
train.py
to train the model. - run
test_model.py
to predict.
requirements
- python3.5
- tensorflow1.3
AIML聊天机器人
https://github.com/sea-boat/super-robot
a robot based on Alice Bot, which can chat with people and handle the business.
Features
- Alice Bot for chatting.
- Customizing corpus by AIML1.0.
- Learning chat online.
- Ability Bot interface for implementing all kinds of abilities.
- Service Bot for doing business.
TODO
- Supports AIML2.0.
自然语言处理实验室
https://github.com/sea-boat/nlp_lab
- doc2txt
- tf-idf
- cnn text classify
- hmm cws
- crf ner
————-推荐阅读————
我的开源项目汇总(机器&深度学习、NLP、网络IO、AIML、mysql协议、chatbot)
跟我交流,向我提问:
公众号的菜单已分为“读书总结”、“分布式”、“机器学习”、“深度学习”、“NLP”、“Java深度”、“Java并发核心”、“JDK源码”、“Tomcat内核”等,可能有一款适合你的胃口。
欢迎关注: