关于本站
人大经济论坛-经管之家:分享大学、考研、论文、会计、留学、数据、经济学、金融学、管理学、统计学、博弈论、统计年鉴、行业分析包括等相关资源。
经管之家是国内活跃的在线教育咨询平台!
经管之家新媒体交易平台
提供"微信号、微博、抖音、快手、头条、小红书、百家号、企鹅号、UC号、一点资讯"等虚拟账号交易,真正实现买卖双方的共赢。【请点击这里访问】
TOP热门关键词
http://dgdsbygo8mp3h.cloudfront.net/sites/default/files/imagecache/productview_larger/5467EN.jpgTableofContentsPrefaceChapter1:DataUnderstandingChapter2:DataPreparation–SelectChapter3:DataPreparation ...
免费学术公开课,扫码加入 |
Table of Contents
Preface
Chapter 1: Data Understanding
Chapter 2: Data Preparation – Select
Chapter 3: Data Preparation – Clean
Chapter 4: Data Preparation – Construct
Chapter 5: Data Preparation – Integrate and Format
Chapter 6: Selecting and Building a Model
Chapter 7: Modeling – Assessment, Evaluation, Deployment, and Monitoring
Chapter 8: CLEM Scripting
Appendix: Business Understanding
Index
- Preface
- Chapter 1: Data Understanding
- Introduction
- Using an empty aggregate to evaluate sample size
- Evaluating the need to sample from the initial data
- Using CHAID stumps when interviewing an SME
- Using a single cluster K-means as an alternative to anomaly detection
- Using an @NULL multiple Derive to explore missing data
- Creating an Outlier report to give to SMEs
- Detecting potential model instability early using the Partition node and Feature Selection node
- Chapter 2: Data Preparation – Select
- Introduction
- Using the Feature Selection node creatively to remove or decapitate perfect predictors
- Running a Statistics node on anti-join to evaluate the potential missing data
- Evaluating the use of sampling for speed
- Removing redundant variables using correlation matrices
- Selecting variables using the CHAID Modeling node
- Selecting variables using the Means node
- Selecting variables using single-antecedent Association Rules
- Chapter 3: Data Preparation – Clean
- Introduction
- Binning scale variables to address missing data
- Using a full data model/partial data model approach to address missing data
- Imputing in-stream mean or median
- Imputing missing values randomly from uniform or normal distributions
- Using random imputation to match a variable's distribution
- Searching for similar records using a Neural Network for inexact matching
- Using neuro-fuzzy searching to find similar names
- Producing longer Soundex codes
- Chapter 4: Data Preparation – Construct
- Introduction
- Building transformations with multiple Derive nodes
- Calculating and comparing conversion rates
- Grouping categorical values
- Transforming high skew and kurtosis variables with a multiple Derive node
- Creating flag variables for aggregation
- Using Association Rules for interaction detection/feature creation
- Creating time-aligned cohorts
- Chapter 5: Data Preparation – Integrate and Format
- Introduction
- Speeding up merge with caching and optimization settings
- Merging a lookup table
- Shuffle-down (nonstandard aggregation)
- Cartesian product merge using key-less merge by key
- Multiplying out using Cartesian product merge, user source, and derive dummy
- Changing large numbers of variable names without scripting
- Parsing nonstandard dates
- Parsing and performing a conversion on a complex stream
- Sequence processing
- Chapter 6: Selecting and Building a Model
- Introduction
- Evaluating balancing with Auto Classifier
- Building models with and without outliers
- Using Neural Network for Feature Selection
- Creating a bootstrap sample
- Creating bagged logistic regression models
- Using KNN to match similar cases
- Using Auto Classifier to tune models
- Next-Best-Offer for large datasets
- Chapter 7: Modeling – Assessment, Evaluation, Deployment, and Monitoring
- Introduction
- How (and why) to validate as well as test
- Using classification trees to explore the predictions of a Neural Network
- Correcting a confusion matrix for an imbalanced target variable by incorporating priors
- Using aggregate to write cluster centers to Excel for conditional formatting
- Creating a classification tree financial summary using aggregateand an Excel Export node
- Reformatting data for reporting with a Transpose node
- Changing formatting of fields in a Table node
- Combining generated filters
- Chapter 8: CLEM Scripting
- Introduction
- Building iterative Neural Network forecasts
- Quantifying variable importance with Monte Carlo simulation
- Implementing champion/challenger model management
- Detecting Outliers with the jackknife method
- Optimizing K-means cluster solutions
- Automating time series forecasts
- Automating HTML reports and graphs
- Rolling your own modeling algorithm – Weibull analysis
- Appendix: Business Understanding
- Introduction
- Define business objectives by Tom Khabaza
- Assessing the situation by Meta Brown
- Translating your business objective into a data mining objective by Dean Abbott
- Produce a project plan – ensuring a realistic timeline by Keith McCormick
「经管之家」APP:经管人学习、答疑、交友,就上经管之家!
免流量费下载资料----在经管之家app可以下载论坛上的所有资源,并且不额外收取下载高峰期的论坛币。
涵盖所有经管领域的优秀内容----覆盖经济、管理、金融投资、计量统计、数据分析、国贸、财会等专业的学习宝库,各类资料应有尽有。
来自五湖四海的经管达人----已经有上千万的经管人来到这里,你可以找到任何学科方向、有共同话题的朋友。
经管之家(原人大经济论坛),跨越高校的围墙,带你走进经管知识的新世界。
扫描下方二维码下载并注册APP
免流量费下载资料----在经管之家app可以下载论坛上的所有资源,并且不额外收取下载高峰期的论坛币。
涵盖所有经管领域的优秀内容----覆盖经济、管理、金融投资、计量统计、数据分析、国贸、财会等专业的学习宝库,各类资料应有尽有。
来自五湖四海的经管达人----已经有上千万的经管人来到这里,你可以找到任何学科方向、有共同话题的朋友。
经管之家(原人大经济论坛),跨越高校的围墙,带你走进经管知识的新世界。
扫描下方二维码下载并注册APP
您可能感兴趣的文章
人气文章
本文标题:[图书]SPSS Modeler Cookbook
本文链接网址:https://bbs.pinggu.org/jg/ruanjianpeixun_spssruanjianpeixun_2937377_1.html
2.转载的文章仅代表原创作者观点,与本站无关。其原创性以及文中陈述文字和内容未经本站证实,本站对该文以及其中全部或者部分内容、文字的真实性、完整性、及时性,不作出任何保证或承若;
3.如本站转载稿涉及版权等问题,请作者及时联系本站,我们会及时处理。