2020-10-11 22:18:09 +08:00
|
|
|
|
## Background
|
2020-10-11 22:33:18 +08:00
|
|
|
|
This project is a summary of the author's years of study and work practice.
|
|
|
|
|
Most of the code has been actually run to ensure accuracy.
|
2020-10-11 22:19:35 +08:00
|
|
|
|
|
2021-01-06 10:18:07 +08:00
|
|
|
|
本项目是作者多年学习与工作实践的总结,绝大部分代码都经过实际运行保证准确无误
|
2020-09-12 19:12:52 +08:00
|
|
|
|
|
2020-10-11 22:18:09 +08:00
|
|
|
|
## Structure
|
2020-09-12 19:12:52 +08:00
|
|
|
|
|
2020-10-11 22:18:09 +08:00
|
|
|
|
```
|
|
|
|
|
-- bigdata solution for bigdata,contains hive/hadoop/spark/hbase etc...
|
|
|
|
|
-- code-languages programming language,contains java/python/scala etc...
|
|
|
|
|
-- cs-other some interesting things in the cs field
|
|
|
|
|
-- deep-learning derivation of some algorithm principles of deep learning,tensorflow frame etc...
|
|
|
|
|
-- feature-engineering feature-engineering is very important for algorithm
|
|
|
|
|
-- math mathematical principles,contains matrix analysis, probability and statistics etc...
|
|
|
|
|
-- mathcasebycase some mathematical knowledge points that are relatively scattered and difficult to classify
|
|
|
|
|
-- recommend knowledge about recommendation system
|
|
|
|
|
-- service-enginnering service online,essential knowledge for algorithm online
|
|
|
|
|
-- tools various tools commonly used in practice,
|
|
|
|
|
including awk, grep, sed data processing three swordsmen,
|
|
|
|
|
git, maven and other common tools, intellij,
|
|
|
|
|
sublime, vim and other IDEs,
|
|
|
|
|
linux-shell common scripts
|
|
|
|
|
-- traditional-algorithm traditional machine learning algorithms that are different from deep learning,
|
|
|
|
|
including clustering algorithms/optimization methods/tree algorithms, etc.,
|
|
|
|
|
as well as a brief introduction to mllib.
|
2020-09-12 19:11:23 +08:00
|
|
|
|
|
2020-10-11 22:18:09 +08:00
|
|
|
|
```
|
2020-09-12 19:11:23 +08:00
|
|
|
|
|
2020-09-12 19:22:10 +08:00
|
|
|
|
```
|
2020-09-12 19:16:50 +08:00
|
|
|
|
-- bigdata 大数据处理方案,包括hive/hadoop/spark/hbase等
|
|
|
|
|
-- code-languages 编码语言,包括java/python/scala等
|
|
|
|
|
-- cs-other cs领域的一些有意思的事情
|
|
|
|
|
-- deep-learning 深度学习的一些算法原理推导,tensorflow等框架
|
|
|
|
|
-- feature-engineering 特征工程,做过算法的人都知道特征工程重要性
|
|
|
|
|
-- math 数学原理,包括矩阵分析,概率统计等算法中常用数学知识
|
|
|
|
|
-- mathcasebycase 一些比较分散不好归类的数学知识点
|
|
|
|
|
-- recommend 推荐系统一些相关知识,目前作者就从事推荐相关工作
|
|
|
|
|
-- service-enginnering 线上服务,算法上线必备知识
|
2020-09-12 19:23:41 +08:00
|
|
|
|
-- tools 实际中常用的各种工具,包括awk,grep,sed数据处理三剑客,git,maven等常见工具,
|
|
|
|
|
intellij,sublime,vim等IDE, linux-shell常见脚本
|
|
|
|
|
-- traditional-algorithm 区别于深度学习的传统机器学习算法,包括聚类算法/最优化方法/树类算法等,
|
|
|
|
|
还有mllib的简单介绍。
|
2020-09-12 19:22:10 +08:00
|
|
|
|
```
|
2020-09-12 19:11:23 +08:00
|
|
|
|
|
2020-10-11 22:18:09 +08:00
|
|
|
|
## Suitable for
|
2020-10-25 19:44:35 +08:00
|
|
|
|
### for computer science guys
|
2020-10-11 22:35:26 +08:00
|
|
|
|
The author come from a non-CS major,
|
2020-10-11 22:33:18 +08:00
|
|
|
|
and did not systematically study data structures, operating systems,
|
|
|
|
|
design patterns and other courses during school.
|
|
|
|
|
The relevant knowledge is to be studied systematically after work.
|
|
|
|
|
So this project is especially suitable for non-CS majors.
|
2020-10-11 22:19:35 +08:00
|
|
|
|
|
2020-10-11 22:33:18 +08:00
|
|
|
|
作者非CS专业科班出身,在校期间并未系统学习过数据结构,操作系统,设计模式等课程,
|
|
|
|
|
相关的知识都是工作以后再进行系统学习。
|
2020-10-11 22:18:09 +08:00
|
|
|
|
所以该项目特别适合非CS专业同学参考。
|
|
|
|
|
|
2020-10-25 19:44:35 +08:00
|
|
|
|
### for non-computer science guys
|
2020-10-11 22:33:18 +08:00
|
|
|
|
For CS majors, some of them have not studied mathematics courses systematically,
|
|
|
|
|
such as matrix analysis, probability statistics, optimization, etc.
|
|
|
|
|
The major of the author's master's degree is pattern recognition.
|
2020-10-11 22:35:26 +08:00
|
|
|
|
And in the future work, he roughly understand which mathematical knowledge
|
2020-10-11 22:33:18 +08:00
|
|
|
|
is the key and difficult point in algorithm learning and practice.
|
|
|
|
|
Therefore, this project is also suitable for students majoring in CS
|
|
|
|
|
|
|
|
|
|
CS专业的同学,有一部分没有系统学过数学方面的课程,比如矩阵分析,概率统计,最优化等。
|
|
|
|
|
作者硕士阶段所学专业为模式识别,在以后的工作中,
|
|
|
|
|
大致了解哪些数学知识是算法学习与实践中的重点与难点。
|
|
|
|
|
因此,该项目也特别适合CS专业的同学。
|
|
|
|
|
|
|
|
|
|
|
2020-10-25 19:44:35 +08:00
|
|
|
|
### for guys who need to put the algorithm online from 0 to 1
|
2020-10-11 22:33:18 +08:00
|
|
|
|
The algorithm is not just an offline train model,
|
|
|
|
|
it can even be said that the offline train model is only a small part of the work.
|
|
|
|
|
On the contrary, the corresponding engineering capabilities,
|
|
|
|
|
code capabilities, and data capabilities are very important.
|
|
|
|
|
Therefore, this project is especially suitable for guys
|
|
|
|
|
who need to put the algorithm online from 0 to 1.
|
2020-10-11 22:19:35 +08:00
|
|
|
|
|
2020-10-11 22:33:18 +08:00
|
|
|
|
算法并不只是离线train model,甚至可以说离线train model只是工作很小的一部分。
|
|
|
|
|
相反对应的工程能力,代码能力,数据能力非常重要。
|
|
|
|
|
因此该项目特别适合需要将算法从0到1怼上线的同学
|
2020-10-11 22:18:09 +08:00
|
|
|
|
|
2020-10-11 22:33:18 +08:00
|
|
|
|
|
2020-10-25 19:44:35 +08:00
|
|
|
|
### for guys who need to solve various practical problems in actual work
|
2020-10-11 22:33:18 +08:00
|
|
|
|
The project not only contains algorithm theory, algorithm derivation,
|
|
|
|
|
but also more engineering and data aspects.
|
|
|
|
|
Most of them are actual problems encountered in work,
|
|
|
|
|
which can provide you with reference ideas in practice.
|
|
|
|
|
Therefore, this project is especially suitable for guys
|
|
|
|
|
who need to solve various practical problems in actual combat.
|
2020-10-11 22:19:35 +08:00
|
|
|
|
|
2020-10-11 22:33:18 +08:00
|
|
|
|
该项目不仅有算法理论,算法推导,还有更多工程以及数据方面的内容,
|
|
|
|
|
大部分都是工作中遇到的实际问题,可以为大家实践中提供参考思路。
|
|
|
|
|
因此该项目特别适合实战中需要解决各种实际问题的同学
|
2020-10-11 22:18:09 +08:00
|
|
|
|
|
|
|
|
|
## CSDN address
|
|
|
|
|
https://blog.csdn.net/bitcarmanlee
|
|
|
|
|
|
|
|
|
|
## Update
|
2020-10-11 22:33:18 +08:00
|
|
|
|
The layout and stability of github is higher than that of csdn.
|
|
|
|
|
In the future, it will be prioritized to maintain the projects on github,
|
|
|
|
|
and csdn will also keep synchronized updates.
|
2020-10-11 22:19:35 +08:00
|
|
|
|
|
2020-10-11 22:33:18 +08:00
|
|
|
|
github的排版以及稳定性比csdn更高,以后优先维护github上的项目,CSDN也会保持同步更新。
|