Original poster: oliyiyi

Top 10 Machine Learning Projects on GitHub

The top 10 machine learning projects on GitHub include a number of libraries, frameworks, and education resources. Have a look at the tools others are using, and the resources they are learning from.

By Matthew Mayo, KDnuggets.

Open source software is an important piece of the data science puzzle.

According to the most recent KDnuggets data science software poll results, 73% of data scientists used free software in the previous 12 months. While there are many sources of such tools on the internet, GitHub has become a de facto clearinghouse for all types of open source software, including tools used in the data science community. The importance, and central position, of machine learning to the field of data science does not need to be pointed out.

The following is an overview of the top 10 machine learning projects on GitHub.*

1. Scikit-learn

Machine learning in Python.


★ 8641, 5125

The top project is, unsurprisingly, the go-to machine learning library for Pythonistas the world over, from industry to academia. Scikit-learn leverages the Python scientific computing stack, built on NumPy, SciPy, and matplotlib. As general purpose a toolkit as there could be, Scikit-learn contains classification, regression, and clustering algorithms, as well as data-preparation and model-evaluation tools.


2. Awesome Machine Learning

A curated list of awesome Machine Learning frameworks, libraries and software.


★ 8404, 1885

This is a curated list of machine learning libraries, frameworks, and software. The list is categorized by language, and further by machine learning category (general purpose, computer vision, natural language processing, etc.). It also includes data visualization tools, which opens it up as more of a generalized data science list in some sense... which is a good thing.


3. PredictionIO

PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.


★ 8145, 1002

PredictionIO is a general purpose framework. It includes several customizable template engines for well-known tasks, such as classification and recommendation, connects to existing applications with REST APIs or SDKs, and includes support for Spark MLlib. Since it is built on top of Spark and utilizes its ecosystem, it should come as no surprise that PredictionIO is developed mainly in Scala.


4. Dive Into Machine Learning

Dive into Machine Learning with Python Jupyter notebook and scikit-learn.


★ 4326, 342

This is a collection of IPython notebook tutorials for scikit-learn, as well as a number of links to related Python-specific and general machine learning topics, and more general data science information. The author isn't greedy either; they are quick to point out many other tutorials covering similar ground, in case this one doesn't tickle your fancy. The repo contains no software, but if you're new to Python machine learning, it may be worth checking out.



Reply #1 by oliyiyi, posted 2016-7-3 08:03:13

5. Pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.


★ 3799, 598

Pattern is a Python-based web mining toolkit coming out of the Computational Linguistics & Psycholinguistics (CLiPS) research center at the University of Antwerp. In this context, it has tools for the tasks of scraping, machine learning, natural language processing, network analysis, and visualization. Pattern can also easily mine data from several well-known web services. The project claims to be well-documented, and to include numerous examples and unit tests.

6. NuPIC (Numenta Platform for Intelligent Computing)

A brain-inspired machine intelligence platform, and biologically accurate neural network based on cortical learning algorithms.


★ 3647, 987

NuPIC implements the Hierarchical Temporal Memory (HTM) machine learning algorithms. HTM is an attempt to model the computation of the neocortex, and focuses on storing and recalling spatial and temporal patterns. NuPIC is ideally suited to pattern-related anomaly detection.

7. Vowpal Wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.


★ 2949, 827

Vowpal Wabbit aims for speedy modelling of massive datasets, and supports parallel learning. The project was started at Yahoo! and is currently developed at Microsoft Research. Vowpal Wabbit harnesses out-of-core learning, and has been used to learn a tera-feature dataset in an hour across 1000 compute nodes.
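
Two of the techniques named above, feature hashing and online (one-example-at-a-time) learning, can be sketched in a few lines of plain Python. This is only an illustration of the general ideas, not Vowpal Wabbit's code or API: the class name, the table size, the learning rate, and the toy data stream are all assumptions made for the example.

```python
import math
import zlib

NUM_BITS = 18                  # assumed table size: 2**18 weight slots
NUM_SLOTS = 1 << NUM_BITS


def hash_features(tokens):
    """Hashing trick: map string features to slots in a fixed-size weight table."""
    # zlib.crc32 is deterministic across runs, unlike Python's built-in hash() on str.
    return [zlib.crc32(tok.encode("utf-8")) % NUM_SLOTS for tok in tokens]


class OnlineLogisticRegression:
    """Hypothetical helper: logistic regression updated one example at a time (SGD)."""

    def __init__(self, learning_rate=0.5):
        self.learning_rate = learning_rate
        self.weights = [0.0] * NUM_SLOTS   # memory use is fixed, whatever the vocabulary

    def _probability(self, indices):
        score = sum(self.weights[i] for i in indices)
        score = max(-30.0, min(30.0, score))   # clamp so math.exp cannot overflow
        return 1.0 / (1.0 + math.exp(-score))

    def predict_proba(self, tokens):
        return self._probability(hash_features(tokens))

    def learn(self, tokens, label):
        """Update the weights from a single example; label is 0 or 1."""
        indices = hash_features(tokens)
        error = self._probability(indices) - label
        for i in indices:
            self.weights[i] -= self.learning_rate * error


# Toy stream of (tokens, label) pairs; in an out-of-core setting these would be
# read from disk one at a time instead of being held in memory.
stream = [
    (["cheap", "pills", "offer"], 1),
    (["meeting", "agenda", "notes"], 0),
    (["cheap", "offer", "now"], 1),
    (["project", "notes", "review"], 0),
] * 20

model = OnlineLogisticRegression()
for tokens, label in stream:
    model.learn(tokens, label)

spam_score = model.predict_proba(["cheap", "offer"])
ham_score = model.predict_proba(["meeting", "notes"])
```

Because each update touches only the slots of the current example, the model never needs the whole dataset in memory, which is the property out-of-core learning relies on.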


8. aerosolve

A machine learning package built for humans.


★ 2538, 245

aerosolve attempts to be different from other libraries, focusing on human-friendly debugging facilities, Scala code for training, an image content analysis engine for easy image ranking, and a feature transformation language giving users flexibility and control over features. aerosolve implements a Thrift-based feature representation, wherein features are logically grouped for the purposes of applying transformations to, or facilitating interactions between, entire feature groups at once.


9. GoLearn

Machine Learning for Go.


★ 2334, 215

GoLearn is an actively developed machine learning library for Go. Its goals are to provide a fully-featured, simple-to-use, customizable package for Go developers. GoLearn implements the familiar (to many) fit/predict interface of Scikit-learn, making it easy to swap out estimators, and implements "helper functions" like cross validation and train/test splitting.
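
The fit/predict convention mentioned above is what makes estimators interchangeable. The sketch below illustrates that idea in plain Python; it is not GoLearn's or Scikit-learn's actual API, and every class and function name in it (including the local `train_test_split`) is a hypothetical stand-in written for this example.

```python
import random


class MajorityClassifier:
    """Baseline estimator: always predicts the most common training label."""

    def fit(self, X, y):
        self.label_ = max(set(y), key=y.count)
        return self

    def predict(self, X):
        return [self.label_ for _ in X]


class NearestNeighborClassifier:
    """1-nearest-neighbour estimator using squared Euclidean distance."""

    def fit(self, X, y):
        self.X_, self.y_ = list(X), list(y)
        return self

    def predict(self, X):
        def closest_label(row):
            distances = [sum((a - b) ** 2 for a, b in zip(row, seen))
                         for seen in self.X_]
            return self.y_[distances.index(min(distances))]
        return [closest_label(row) for row in X]


def train_test_split(X, y, test_fraction=0.25, seed=0):
    """Hypothetical helper: shuffle row indices, then cut into train and test parts."""
    indices = list(range(len(X)))
    random.Random(seed).shuffle(indices)
    cut = int(len(indices) * (1 - test_fraction))
    train, test = indices[:cut], indices[cut:]
    return ([X[i] for i in train], [X[i] for i in test],
            [y[i] for i in train], [y[i] for i in test])


def accuracy(estimator, X_train, X_test, y_train, y_test):
    """Works with any object exposing fit(X, y) and predict(X)."""
    predictions = estimator.fit(X_train, y_train).predict(X_test)
    return sum(p == t for p, t in zip(predictions, y_test)) / len(y_test)


# Toy data: two well-separated clusters with labels 0 and 1.
X = [[0.1 * i, 0.0] for i in range(10)] + [[5.0 + 0.1 * i, 5.0] for i in range(10)]
y = [0] * 10 + [1] * 10

split = train_test_split(X, y)
scores = {type(est).__name__: accuracy(est, *split)
          for est in (MajorityClassifier(), NearestNeighborClassifier())}
```

The evaluation code never inspects which estimator it was given, so swapping models means changing one constructor call.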


10. Code for Machine Learning for Hackers

Code accompanying the book "Machine Learning for Hackers."


★ 2003, 1446

This repo contains the code from the O'Reilly book Machine Learning for Hackers. All repo code is in R and relies on numerous R packages. Topics covered include the all-too-common tasks of classification, ranking, and regression, as well as statistical procedures such as principal component analysis and multidimensional scaling.

* Determined by the top returned results to the query "machine learning" on GitHub search, sorted by most stars, as of December 10, 2015, 1:00PM EST.

