楼主: SPSSCHEN
4667 18

[下载][推荐][讨论]Data Mining [推广有奖]

  • 0关注
  • 0粉丝

博士生

22%

还不是VIP/贵宾

-

TA的文库  其他...

Voxco NewOccidental

Case Study NewOccidental

NoSQL NewOccidental

威望
0
论坛币
946 个
通用积分
0.6700
学术水平
7 点
热心指数
2 点
信用等级
0 点
经验
2052 点
帖子
306
精华
0
在线时间
42 小时
注册时间
2005-9-25
最后登录
2022-10-25

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

CRISP-DM 1.0 Step-by-Step Data Mining Guide.2000

Pete Chapman

45652.pdf (610.54 KB)

[此贴子已经被作者于2006-3-27 0:33:39编辑过]

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Data Mining ning Mini Data ING 下载 推荐 讨论 Mining Data

沙发
SPSSCHEN 发表于 2006-3-27 00:27:00 |只看作者 |坛友微信交流群

[PPT] CRISP-DM

Pete Chapman (NCR), Julian Clinton (SPSS), Randy Kerber (NCR), Thomas Khabaza (SPSS), Thomas Reinartz, (DaimlerChrysler), Colin Shearer (SPSS) and Rüdiger Wirth (DaimlerChrysler) “CRISP-DM 1.0 - Step-by-step data mining guide

使用道具

藤椅
SPSSCHEN 发表于 2006-3-27 00:28:00 |只看作者 |坛友微信交流群

[PPT] CRISP-DM: A Standard Process Model for Data Mining

45:8 see also http://www.dmg.org; Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C. and Wirth, R. (2000). CRISP-DM 1.0: Step-by-step data mining guide, CRISP-DM consortium, http://www.crisp-dm.org; Clifton

使用道具

板凳
SPSSCHEN 发表于 2006-3-27 00:32:00 |只看作者 |坛友微信交流群

使用道具

报纸
mnauce 发表于 2006-3-27 07:34:00 |只看作者 |坛友微信交流群

虽然conceptually,对于时序序列数据的研究,挖掘其中的pattern也算是data mining的任务,但传统上DM更热衷于作classification方面的研究,regression是最近十年来由于DM与statistics越走越近才开始系统研究的。此外,Machine Learning/Statistical Learning传统上对data有iid的要求(虽然进来有所relax)。

统计学中常用的非参回归方法(spline, local polynomial, wavelet-like methods)中local polynomial method已经被较多的用于time series/econometrics的研究。

代表作有我老师Jianqing Fan的Nonlinear Time Series : Nonparametric and Parametric Methods (Springer Series in Statistics)

这本书现在国内也出版了。当然要更好的理解本书,也许先看Local Polynomial Modelling and Its Applications -- by Jianqing Fan会有些帮助。

最来ML领域很热的kernel method/SVM也被用于time series,比如用SVR(support vector regression)来对time series data作regression就比传统的MLE要好(因为MLE的参数方法难免Model Bias)

另外DM经常被置于一个更广阔的数据库知识发现(KDD)的大背景下,
KDD主要包括选择目标数据,预处理数据(preprocessing),转化数据(transformation),进行数据挖掘,模式提取以及诠释。

DM一般的入门书有:

Principles of Data Mining by D.Hand,etal

这本概念框架写的不错,但是内容不够深入,比较单薄。

Elements of Statistical Learning by Hastie, Tibshirani, Friedman

stat@stanford的三个教授写的,这本稍稍深入一些。

Data Mining by Han and Kamber 这本偏data base。

Data Mining by Witten and Franke 这本偏machine learning。

还有就是专门讲SVM的An Introduction to SVM by Cristianini & Shawe-Taylor

更加理论一点的有Vapnik的经典之作:Statistical Learning Theory.


[此贴子已经被作者于2006-3-27 7:35:59编辑过]

使用道具

地板
SPSSCHEN 发表于 2006-3-27 07:57:00 |只看作者 |坛友微信交流群

[此贴子已经被作者于2006-3-27 7:59:55编辑过]

使用道具

7
SPSSCHEN 发表于 2006-3-27 08:02:00 |只看作者 |坛友微信交流群

使用道具

8
SPSSCHEN 发表于 2006-3-27 08:03:00 |只看作者 |坛友微信交流群

使用道具

9
SPSSCHEN 发表于 2006-3-27 08:03:00 |只看作者 |坛友微信交流群

Nonlinear Time Series : Nonparametric and Parametric Methods (Springer Series in Statistics) (Paperback)
by Jianqing Fan, Qiwei Yao

Customers who bought this also bought

Editorial Reviews
Book Description
This book presents the contemporary statistical methods and theory of nonlinear time series analysis. The principal focus is on nonparametric and semiparametric techniques developed in the last decade. It covers the techniques for modelling in state-space, in frequency-domain as well as in time-domain. To reflect the integration of parametric and nonparametric methods in analyzing time series data, the book also presents an up-to-date exposure of some parametric nonlinear models, including ARCH/GARCH models and threshold models. A compact view on linear ARMA models is also provided. Data arising in real applications are used throughout to show how nonparametric approaches may help to reveal local structure in high-dimensional data. Important technical tools are also introduced. The book will be useful for graduate students, application-oriented time series analysts, and new and experienced researchers. It will have the value both within the statistical community and across a broad spectrum of other fields such as econometrics, empirical finance, population biology and ecology. The prerequisites are basic courses in probability and statistics. Jianqing Fan, coauthor of the highly regarded book Local Polynomial Modeling, is Professor of Statistics at the University of North Carolina at Chapel Hill and the Chinese University of Hong Kong. His published work on nonparametric modeling, nonlinear time series, financial econometrics, analysis of longitudinal data, model selection, wavelets and other aspects of methodological and theoretical statistics has been recognized with the Presidents' Award from the Committee of Presidents of Statistical Societies, the Hettleman Prize for Artistic and Scholarly Achievement from the University of North Carolina, and by his election as a fellow of the American Statistical Association and the Institute of Mathematical Statistics. Qiwei Yao is Professor of Statistics at the London School of Economics and Political Science. He is an elected member of the International Statistical Institute, and has served on the editorial boards for the Journal of the Royal Statistical Society (Series B) and the Australian and New Zealand Journal of Statistics.
Product Details
  • Paperback: 552 pages
  • Publisher: Springer; 1 edition (August 4, 2005)
  • Language: English
  • ISBN: 0387261427

[此贴子已经被作者于2006-3-27 8:09:10编辑过]

使用道具

10
SPSSCHEN 发表于 2006-3-27 08:05:00 |只看作者 |坛友微信交流群

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-25 06:43