
Jason Brownlee: How to Study Machine Learning Algorithms


ReneeBK posted on 2015-2-12 09:18:38


Algorithms make up a big part of machine learning.

You select and apply machine learning algorithms to build a model from your data, select features, combine the predictions from multiple models and even evaluate the capabilities of a given model.

In this post you will review 5 different approaches that you can use to study machine learning algorithms.

1. List Machine Learning Algorithms

There are a lot of machine learning algorithms and it can feel overwhelming.

Even defining what a machine learning algorithm is can be tricky.

A great place to start out is to make your own lists of algorithms. Start a text file, word document or spreadsheet and list algorithm names. Also list the general category or categories to which each algorithm belongs.

This simple tactic can help you build familiarity with the different types and classes of algorithms available. Later, as you gain experience, lists like this can prompt you with ideas for different methods to spot check on your problem.
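As a rough illustration of such a list, here is a minimal Python sketch that stores algorithm names with their categories and then groups them by category. The entries in `ALGORITHMS` and the helper `group_by_category` are assumptions made up for this example, not a canonical taxonomy; a text file or spreadsheet works just as well.

```python
from collections import defaultdict

# Hypothetical starter list: algorithm name -> general categories.
# The names and groupings are illustrative only, not a canonical taxonomy.
ALGORITHMS = {
    "Linear Regression": ["regression"],
    "Logistic Regression": ["regression", "classification"],
    "k-Nearest Neighbors": ["instance-based"],
    "Decision Tree (CART)": ["decision tree"],
    "Random Forest": ["ensemble", "decision tree"],
    "Support Vector Machine": ["kernel method"],
    "k-Means": ["clustering"],
}


def group_by_category(algorithms):
    """Invert the name -> categories mapping into category -> sorted names."""
    groups = defaultdict(list)
    for name, categories in algorithms.items():
        for category in categories:
            groups[category].append(name)
    return {category: sorted(names) for category, names in sorted(groups.items())}


if __name__ == "__main__":
    for category, names in group_by_category(ALGORITHMS).items():
        print(f"{category}: {', '.join(names)}")
```

Because an algorithm can carry more than one category, the same name may appear in several groups, which is often what you want when looking for alternatives to spot check.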

Some examples of algorithm lists to get you started include:

2. Apply Machine Learning Algorithms

Machine learning algorithms do not exist in isolation; they are best understood when applied to a dataset.

Apply algorithms to problems to understand them. Practice applied machine learning. It sounds simple, but you will be amazed at the number of people too paralysed to take this small step from theory into action.

This may mean working on a problem that matters to you, a competition dataset or a classical machine learning dataset.

Use a machine learning platform like Weka, R or scikit-learn to get access to many machine learning algorithms.

Start to build up an intuition for different types of algorithms, such as decision trees and support vector machines. Think about their required preconditions and the effects the parameters have on results.

Build up confidence in applying different algorithms. You should be spot checking algorithms on your problems.
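Spot checking can be sketched as running several algorithms through the same evaluation and comparing the scores. The example below is a dependency-free sketch, not a substitute for a platform such as Weka, R or scikit-learn: the synthetic dataset from `make_blobs`, the two toy predictors and the fold scheme in `cross_val_accuracy` are all assumptions chosen to keep the example short.

```python
import random
from collections import Counter


def make_blobs(n_per_class=50, seed=0):
    """Synthetic two-class dataset: two Gaussian clusters in 2D (demo assumption)."""
    rng = random.Random(seed)
    data = []
    for label, center in ((0, (0.0, 0.0)), (1, (3.0, 3.0))):
        for _ in range(n_per_class):
            point = (rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0))
            data.append((point, label))
    rng.shuffle(data)
    return data


def majority_class(train, x):
    """Baseline: always predict the most common training label."""
    return Counter(label for _, label in train).most_common(1)[0][0]


def nearest_neighbor(train, x):
    """1-NN: predict the label of the closest training point."""
    def sq_dist(p):
        return sum((a - b) ** 2 for a, b in zip(p, x))
    return min(train, key=lambda row: sq_dist(row[0]))[1]


def cross_val_accuracy(predict, data, k=5):
    """Mean accuracy over k folds; fold i tests on every k-th row starting at i."""
    scores = []
    for i in range(k):
        test = data[i::k]
        train = [row for j, row in enumerate(data) if j % k != i]
        correct = sum(predict(train, x) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / k


if __name__ == "__main__":
    data = make_blobs()
    for name, predict in [("majority class", majority_class),
                          ("1-NN", nearest_neighbor)]:
        print(f"{name}: {cross_val_accuracy(predict, data):.3f}")
```

Including a trivial baseline alongside the real candidates makes it obvious whether an algorithm is learning anything from the data at all.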

3. Describe Machine Learning Algorithms

The next step in understanding a machine learning algorithm is to explore what is already understood about the algorithm.

This could be done before you apply the algorithm, but I think it is valuable to have a working intuition of the algorithm in action as context before diving into the algorithm description.

You can research an algorithm. This includes locating and reading the primary sources where the algorithm was first described as well as authoritative interpretations of the algorithm in textbooks and review papers.

Conference papers, competition results and even forums and Q&A websites can help you better understand the best practices and usage heuristics for an algorithm.

As you are researching an algorithm, build up a description. I like to use a well-defined algorithm description template.

You can continue to add to this template as you discover more about an algorithm. You can add references, list the pseudocode for the algorithm and list best practices and usage heuristics.

This is a valuable technique, and you can build up your own mini-encyclopedia of algorithm descriptions for your own reference (for example, see Clever Algorithms for 45 algorithm recipes).

For more information on the template that I use, check out the post “How to Learn a Machine Learning Algorithm”.
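One way to keep such descriptions consistent is to store them in a small structure that you fill in over time. The sketch below is only an assumption about what a template could look like: the class `AlgorithmDescription` and its field names are hypothetical and are not taken from the post referenced above.

```python
from dataclasses import dataclass, field


@dataclass
class AlgorithmDescription:
    """Hypothetical description template; the field names are illustrative."""
    name: str
    category: str = ""
    summary: str = ""
    pseudocode: list = field(default_factory=list)
    heuristics: list = field(default_factory=list)
    references: list = field(default_factory=list)

    def to_text(self):
        """Render the filled-in sections as plain text, skipping empty ones."""
        lines = [self.name, "=" * len(self.name)]
        if self.category:
            lines.append(f"Category: {self.category}")
        if self.summary:
            lines.append(f"Summary: {self.summary}")
        for title, items in (("Pseudocode", self.pseudocode),
                             ("Heuristics", self.heuristics),
                             ("References", self.references)):
            if items:
                lines.append(f"{title}:")
                lines.extend(f"  - {item}" for item in items)
        return "\n".join(lines)


if __name__ == "__main__":
    knn = AlgorithmDescription(
        name="k-Nearest Neighbors",
        category="instance-based",
        summary="Predict from the labels of the k closest training points.",
    )
    # Add to the description as you learn more about the algorithm.
    knn.heuristics.append(
        "Rescale features so no single feature dominates the distance."
    )
    print(knn.to_text())
```

Empty sections are skipped on output, so a description can start as a bare name and grow as your research progresses.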

4. Implement Machine Learning Algorithms

Implementing machine learning algorithms is an excellent way to get a concrete understanding of how an algorithm works.

There are many micro-decisions that have to be made when implementing an algorithm. Some of these decision points are exposed as algorithm configuration parameters, but many are not.

By implementing an algorithm yourself you will get a feeling for just how to customize the algorithm and choose what to expose and what decision points to fix in place.

Implementing algorithms from scratch will help you understand the mathematical descriptions and extensions of an algorithm. This may sound counter-intuitive. The mathematical descriptions are idealized and often provide a snapshot description of a given process within an algorithm. Once you translate them into code, the implications of those descriptions may be a lot more obvious.

You can leverage tutorials and open source implementations of algorithms to help you get through those difficult parts.

Note that your first implementation of an algorithm will likely be less scalable and more fragile than a production-grade implementation you may find in a machine learning tool or library.
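To make the idea of micro-decisions concrete, here is a minimal from-scratch sketch of k-nearest neighbors classification. The algorithm choice and the function `knn_predict` are assumptions picked for illustration; the docstring lists the decisions this version fixes in place (distance metric, tie handling) and the one it exposes as a parameter (`k`).

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    train: list of (features, label) pairs with equal-length feature tuples.
    Micro-decisions fixed here rather than exposed as parameters:
      - distance metric: Euclidean
      - equidistant points: keep training order (sorted() is stable)
      - tied votes: the nearest neighbour among the tied labels wins
    """
    if not 1 <= k <= len(train):
        raise ValueError("k must be between 1 and the number of training rows")
    neighbours = sorted(train, key=lambda row: math.dist(row[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    top = max(votes.values())
    for _, label in neighbours:  # iterates nearest first
        if votes[label] == top:
            return label


if __name__ == "__main__":
    train = [((0, 0), "a"), ((0, 1), "a"), ((5, 5), "b"), ((6, 5), "b")]
    print(knn_predict(train, (0.2, 0.1), k=3))
```

A library implementation would typically expose the metric and weighting as options and use a spatial index instead of sorting every training row, which is part of why a first implementation is slower and more fragile.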

5. Experiment On Machine Learning Algorithms

Experimenting on machine learning algorithms is the best way to understand them.

You need to act like a scientist and study a running machine learning algorithm as you would a complex system.

You need to control variables, use standardized datasets that are well understood and explore the cause and effect relationships of the parameters on the results.

Understanding the effects of the parameters will help you better configure the algorithm on problems in the future. Understanding the behavior of the algorithm under different circumstances will help you better scale and adapt the method to new and different problem domains in the future.

Many machine learning algorithms are stochastic in nature and resist more classical methods of algorithm analysis. They often require empirical investigation and probabilistic description to be understood.
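A controlled experiment of this kind can be sketched in a few lines. The example below is an assumption-laden illustration, not a prescribed protocol: it uses a synthetic dataset (`make_data`), a small stochastic gradient descent fit of a line (`sgd_mse`), and varies only the learning rate while repeating each setting over several random seeds so the result can be described by a mean and a standard deviation.

```python
import random
import statistics


def make_data(n=100, seed=42):
    """Fixed, well-understood dataset: y = 2x + 1 plus small Gaussian noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    return [(x, 2.0 * x + 1.0 + rng.gauss(0.0, 0.1)) for x in xs]


def sgd_mse(data, learning_rate, epochs, seed):
    """Fit y = w*x + b by stochastic gradient descent; return final training MSE.

    The seed controls the two sources of randomness: the initial weights
    and the order in which rows are visited in each epoch.
    """
    rng = random.Random(seed)
    w, b = rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)
    rows = list(data)
    for _ in range(epochs):
        rng.shuffle(rows)
        for x, y in rows:
            error = (w * x + b) - y
            w -= learning_rate * error * x
            b -= learning_rate * error
    return sum(((w * x + b) - y) ** 2 for x, y in data) / len(data)


def run_experiment(learning_rates, seeds, epochs=5):
    """Vary only the learning rate; dataset, epochs and seed list stay fixed."""
    data = make_data()
    results = {}
    for lr in learning_rates:
        scores = [sgd_mse(data, lr, epochs, seed) for seed in seeds]
        results[lr] = (statistics.mean(scores), statistics.stdev(scores))
    return results


if __name__ == "__main__":
    results = run_experiment([0.001, 0.01, 0.1], seeds=range(10))
    for lr, (mean, std) in results.items():
        print(f"learning_rate={lr}: MSE mean={mean:.4f} std={std:.4f}")
```

Reporting the spread across seeds alongside the mean is what separates an effect of the parameter from the run-to-run noise of a stochastic algorithm.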

