
Original poster

hanszhu posted on 2005-1-14 10:00:00


Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management by Michael J. A. Berry, Gordon S. Linoff (Paperback - April 5, 2004)

Editorial Reviews

Review: "The book thoroughly acquaints you with the new generation of data mining tools and techniques and shows you how to use them to make better business decisions. This guide describes techniques for detecting customer behavior patterns useful in formulating marketing, sales and customer support strategies. While database analysts will find more than enough technical information to satisfy their curiosity, technically savvy business and marketing managers will find this book accessible." (Fatbrain.com; Gantthead.com, 9/01)

Product Description: Who will remain a loyal customer and who won't? What kind of marketing approach is most likely to increase sales? What can customer buying patterns tell us about improving our inventory control? What type of credit approval process will work best for us and our customers? The answers to these and all your crucial business questions lie buried in your company's information systems. This book supplies you with powerful tools for mining them.

Data Mining Techniques thoroughly acquaints you with the new generation of data mining tools and techniques and shows you how to use them to make better business decisions. One of the first practical guides to mining business data, it describes techniques for detecting customer behavior patterns useful in formulating marketing, sales, and customer support strategies. While database analysts will find more than enough technical information to satisfy their curiosity, technically savvy business and marketing managers will find the coverage eminently accessible.

Here's your chance to learn all about:

* How leading companies across North America are using data mining to beat the competition
* How each tool works, and how to pick the right one for the job
* Seven powerful techniques: cluster detection, memory-based reasoning, market basket analysis, genetic algorithms, link analysis, decision trees, and neural nets
* How to prepare data sources for data mining, and how to evaluate and use the results you get

Data Mining Techniques shows you how to quickly and easily tap the gold mine of business solutions lying dormant in your information systems.

[This post was edited by the author on 2005-5-1 3:33:35]


Data Mining – Seven Years Later, Lessons Learned

by Michael J. A. Berry

The 21st Century approach to Data Mining and Survival Analysis

The opportunity to write a second edition of my book allowed me to reassess an old familiar topic. Gordon Linoff and I recently completed the new edition of Data Mining Techniques for Marketing, Sales, and Customer Relationship Management. The difference between the first and second editions says a lot about how the field has evolved. It says even more about how our own perspective has been changed by years of running a data mining consulting practice. (Even the title of the book has changed. When we wrote the first edition, we didn’t even know the term “customer relationship management,” so we didn’t use it in the title, even though it is an apt description of the applications of data mining we describe today.)

The first edition of Data Mining Techniques appeared in 1997. If you think back to that time, the dot-com bubble had barely started to inflate. U.S. cell phone calls cost 56 cents per minute, on average, and fewer than 25% of Americans owned a mobile phone. Data mining was a buzzword for many business people, but very little actual business data mining was occurring.

A lot has changed in seven years. Now, data mining and analytic CRM are considered mainstream. Data mining software has also matured. Instead of downloading source code you need to compile, you can buy data mining suites that come with full documentation and reasonable user interfaces.

But even if the technological and business worlds had remained the same, we would have wanted to update the book, because we learned so much in those intervening years. One of the joys of consulting is the constant exposure to new ideas, new problems, and new solutions. We may not be any smarter than when we wrote the first edition, but we do have more experience, which has changed the way we approach the material.

One thing that has been driven home to us over and over again during the past seven years is that data mining is almost all about process and only a little about clever algorithms. When the data mining process is not well understood, all the clever techniques and algorithms get applied to the wrong data, in the wrong ways, and yield wrong results. A corollary is that the skills of the human data miner, and that individual’s knowledge and intuition about how to coax meaning from recalcitrant data, are more important than tools and techniques.

The new book does cover a few more data mining techniques than the original. In addition to the seven techniques covered in the first edition (decision trees, neural networks, memory-based reasoning, association rules, cluster detection, link analysis, and genetic algorithms), there is now a chapter on data mining using standard statistical techniques. These familiar tools include cross tabs and histograms. There is also a new chapter on survival analysis, a technique that has been adapted from the small samples and continuous time measurements of the medical world to the large samples and discrete time measurements found in marketing data. It is used to study time-to-event problems, such as estimating the remaining lifetime of a customer relationship or the time to the next purchase. More importantly, the new edition is careful to show these techniques in their proper business context and to point out the ways they can be misused.
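To make the discrete-time flavor of survival analysis concrete, here is a minimal sketch of one common calculation: a hazard per tenure period (the share of customers still active who stop in that period) and the survival curve obtained by multiplying the complements of the hazards. The function name and the customer counts are invented for illustration; they are not taken from the book.

```python
def survival_curve(at_risk, stopped):
    """Compute discrete-time hazards and the cumulative survival curve.

    at_risk[t] is the number of customers still active at the start of
    tenure period t; stopped[t] is how many of them stop during period t.
    """
    hazards = []
    survival = []
    surviving = 1.0
    for n_at_risk, n_stopped in zip(at_risk, stopped):
        hazard = n_stopped / n_at_risk   # chance of stopping in this period
        surviving *= 1.0 - hazard        # chance of still being active after it
        hazards.append(hazard)
        survival.append(surviving)
    return hazards, survival


# Hypothetical counts for the first three months of customer tenure.
at_risk = [1000, 900, 855]
stopped = [100, 45, 171]

hazards, survival = survival_curve(at_risk, stopped)
for month, (h, s) in enumerate(zip(hazards, survival), start=1):
    print(f"month {month}: hazard={h:.3f} survival={s:.3f}")
```

Because time is measured in whole periods and the samples are large, the hazards can be estimated directly from counts like these, which is the adaptation to marketing data that the paragraph above describes.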

In our consulting practice, we have seen how often data mining is misused:

a) to learn things that aren’t true; or

b) to learn things that are true, but not useful.

For that reason, the new edition features a much-expanded discussion of the ways that data mining can provide unintended results and advises the reader of the data mining methodology and best practices that will help avoid these perils.

Learning things that are not true is more dangerous than learning things that are true but not useful, because important business decisions may be based on incorrect information. Data mining results often seem reliable because they are based on actual data and derived in a seemingly scientific manner. This appearance of reliability can be deceiving. The data itself may be incorrect or not relevant to the question at hand. The patterns discovered may reflect past business decisions or nothing at all. Data transformations within the system, such as summarization, may have destroyed or hidden important information. The rest of this article illustrates how these problems can arise.

It is often said that figures don’t lie, but liars can figure. When it comes to finding patterns in data, figures don’t have to actually lie in order to suggest results that aren’t true. There are so many ways to construct patterns that any random set of data points will reveal a pattern if examined long enough.
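The claim that random data will reveal a pattern if examined long enough can be checked with a small simulation. The sketch below is an illustration under assumed settings, not an analysis from the article: it generates a random target and many unrelated random candidate variables, then reports the strongest correlation found. All names, sizes, and the seed are arbitrary choices.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs) ** 0.5
    spread_y = sum((y - mean_y) ** 2 for y in ys) ** 0.5
    return cov / (spread_x * spread_y)


def best_spurious_correlation(n_points, n_candidates, seed=0):
    """Strongest |correlation| between a random target and random candidates.

    Every series is independent noise, so any correlation found is spurious.
    """
    rng = random.Random(seed)
    target = [rng.gauss(0, 1) for _ in range(n_points)]
    best = 0.0
    for _ in range(n_candidates):
        candidate = [rng.gauss(0, 1) for _ in range(n_points)]
        best = max(best, abs(pearson(candidate, target)))
    return best


few = best_spurious_correlation(n_points=12, n_candidates=1)
many = best_spurious_correlation(n_points=12, n_candidates=2000)
print(f"best of 1 candidate:     {few:.2f}")
print(f"best of 2000 candidates: {many:.2f}")
```

With only twelve data points and thousands of candidates to choose from, the best match looks impressive even though nothing real connects the variables. The more patterns are tried, the more convincing the best accidental one becomes.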

Human beings depend so heavily on patterns in their day-to-day lives that they tend to see patterns even when they don’t exist. If you look at the night-time sky, you probably do not see a random arrangement of stars, but rather, the Big Dipper, or the Southern Cross, or Orion’s Belt. Some of you even see astrological patterns and portents that can be used to predict the future. This was an early form of data mining! The widespread acceptance of outlandish conspiracy theories is further evidence of the human need to find patterns in data.

Presumably, the reason that humans have developed such an affinity for patterns is that patterns often do reflect some underlying truth about the way the world works. The phases of the moon, the progression of the seasons, the constant alternation of night and day, even the regular appearance of a favorite TV show at the same time on the same day of the week, are useful because they are stable and therefore predictive. One can use these patterns to decide when it is safe to plant tomatoes or how to program the VCR. Other patterns clearly do not have any predictive power. If a fair coin comes up heads five times in a row, there is still a 50-50 chance that it will come up tails on the sixth toss. The challenge for data miners is to figure out which patterns are predictive and which are not: to separate signal from noise.

In more than one industry, we have been told that usage often goes down in the month before a customer leaves. Upon closer examination, this turns out to be an example of learning something that is not true. The graph below appears to illustrate this putative discovery. It shows the monthly minutes of use for a cellular telephone subscriber. For seven months, the subscriber uses about 100 minutes per month. Then, in the 8th month, usage goes down to about half that. In the 9th month, there is no usage at all.

[Figure: monthly minutes of use for one subscriber. Does declining usage in month 8 predict cessation in month 9?]

This subscriber appears to fit the pattern of a month with decreased usage preceding abandonment of the service. But appearances are deceiving. Looking at minutes of use by day instead of by month would show that the customer continued to use the service at a constant rate until the middle of the eighth month and then stopped completely. One could presume this was because, on that day, the customer began using a competing service. The putative period of declining usage does not actually exist and certainly does not provide a window of opportunity during which the customer can be retained. What appears to be a leading indicator is actually a trailing one.
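The effect of summarization in this example is easy to reproduce. The sketch below builds a hypothetical daily usage series for such a subscriber and rolls it up by month. It assumes 30-day months and a perfectly constant daily rate, both simplifications made only for the illustration.

```python
DAYS_PER_MONTH = 30                        # simplifying assumption
DAILY_MINUTES = 100 / DAYS_PER_MONTH       # about 100 minutes per month
LAST_ACTIVE_DAY = 7 * DAYS_PER_MONTH + 15  # stops in the middle of month 8
TOTAL_DAYS = 9 * DAYS_PER_MONTH

# Constant usage every day up to the last active day, then nothing.
daily = [
    DAILY_MINUTES if day <= LAST_ACTIVE_DAY else 0.0
    for day in range(1, TOTAL_DAYS + 1)
]


def monthly_totals(daily_usage, days_per_month):
    """Summarize a daily series into consecutive fixed-length months."""
    return [
        sum(daily_usage[start:start + days_per_month])
        for start in range(0, len(daily_usage), days_per_month)
    ]


monthly = monthly_totals(daily, DAYS_PER_MONTH)
for month, minutes in enumerate(monthly, start=1):
    print(f"month {month}: {minutes:6.1f} minutes")
```

The daily series contains only two values, the constant rate and zero, yet the monthly roll-up shows month 8 at about half the usual level. The apparent decline is created entirely by the aggregation.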

Another common problem is finding patterns in one dataset that don’t generalize to others. The technical term for this is “overfitting.” It happens when the data miner spends too much effort trying to get the best possible results from the data that happens to be at hand and not enough effort making sure that the resulting model is stable. Model stability is the focus of our data mining methodology. A model that does a great job of explaining who placed an order from last month’s catalog, but fails to predict who will place an order from this month’s catalog, is not as useful as one that yields a predictable response rate month after month.
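A small simulation can show what overfitting looks like in the catalog setting. This is a sketch on made-up data, not the authors' methodology. Response is assumed to depend only on a hypothetical `recent_buyer` flag, while a hypothetical `zip_code` field is pure noise. One model estimates a response rate per flag value. The other memorizes a rate for every flag and zip code combination.

```python
import random
from collections import defaultdict


def make_customers(n, rng):
    """Simulate (recent_buyer, zip_code, responded) rows; zip_code is noise."""
    rows = []
    for _ in range(n):
        recent_buyer = rng.random() < 0.5
        zip_code = rng.randrange(500)
        p_response = 0.30 if recent_buyer else 0.10
        responded = 1 if rng.random() < p_response else 0
        rows.append((recent_buyer, zip_code, responded))
    return rows


def fit_rates(rows, key):
    """Model = observed response rate per cell; unseen cells get the overall rate."""
    totals = defaultdict(lambda: [0, 0])
    for row in rows:
        cell = totals[key(row)]
        cell[0] += row[2]
        cell[1] += 1
    overall = sum(row[2] for row in rows) / len(rows)
    rates = {k: hits / count for k, (hits, count) in totals.items()}
    return lambda row: rates.get(key(row), overall)


def mean_squared_error(model, rows):
    return sum((model(row) - row[2]) ** 2 for row in rows) / len(rows)


rng = random.Random(42)
last_month = make_customers(2000, rng)  # data the models are built on
this_month = make_customers(2000, rng)  # new data they must predict

stable = fit_rates(last_month, key=lambda r: r[0])
overfit = fit_rates(last_month, key=lambda r: (r[0], r[1]))

errors = {
    name: (mean_squared_error(m, last_month), mean_squared_error(m, this_month))
    for name, m in [("stable", stable), ("overfit", overfit)]
}
for name, (train_err, new_err) in errors.items():
    print(f"{name:8s} last month: {train_err:.3f}  this month: {new_err:.3f}")
```

The memorizing model explains last month's responses better, because its tiny cells fit the noise, but it does worse on this month's customers. Comparing error on the data used to build a model against error on data held out from it is one simple way to check stability.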

A third problem is discovering valid patterns and rules that can’t be applied in the intended way. For example, one way that data mining is used to find new prospects for a product is to profile the current customers and then look for people who match that profile. This is a powerful technique, but it runs into trouble when using the specific product changes the very variables used to build the profile. I once built a profile of certificate of deposit holders for a retail bank. One of the striking things the CD holders had in common was low balances in their savings accounts. Clearly, however, identifying all the people with nothing in their savings accounts and then trying to sell them CDs is highly unlikely to be a winning strategy! The point of this story is that, although a data mining tool can find the patterns, it still takes a human being to interpret them.

The good news is that once you understand these problems, they are relatively easy to avoid. Gordon and I had to learn this the hard way. Our hope is that others can now learn from our mistakes.

Floor 5

hanszhu posted on 2005-1-14 10:34:00

Michael J. A. Berry

Michael is a founder and principal of Data Miners, a highly regarded consultancy that provides data mining and predictive modeling services. He has more than a decade of experience applying data mining techniques to business problems in marketing and CRM. With his colleague, Gordon Linoff, Michael has authored three of the most widely read and respected books on data mining: Data Mining Techniques, Mastering Data Mining, and Mining the Web (all published by John Wiley & Sons). A revised edition of Data Mining Techniques has just been published. (See http://www.amazon.com/exec/obidos/ASIN/0471470643/thedataminers)

Floor 6

lyslz posted on 2005-1-14 10:45:00

Leaving a reply, haha

Floor 7

hyj980098 posted on 2005-1-14 11:13:00

Good stuff, supporting this

Floor 8

zwen posted on 2005-1-14 11:15:00

Quoting hyj980098's post of 2005-1-14 11:13:07: Good stuff, supporting this

Same here

Floor 9

hanszhu posted on 2005-1-14 11:25:00

Thank you, comrades, for the encouragement and support!!!

Floor 10

chengkaim posted on 2005-1-14 12:02:00

Good stuff! Thanks

