楼主: lanbing2011
6356 5

[问答] 请教高手一个删除异常值的办法 [推广有奖]

  • 0关注
  • 1粉丝

大专生

35%

还不是VIP/贵宾

-

威望
0
论坛币
11 个
通用积分
0
学术水平
0 点
热心指数
0 点
信用等级
0 点
经验
505 点
帖子
46
精华
0
在线时间
27 小时
注册时间
2011-7-4
最后登录
2013-7-28

楼主
lanbing2011 发表于 2013-4-23 19:31:11 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
在删除异常值时,评审专家给了个建议

In identifying outliers, in addition to the checks already performed by the authors, it is recommended that they also perform bivariate outlier analyses on the correlations among scores on  tasks (students t, Cook's D, leverage values). Participants with large values on these statistics should be removed

不知道用SPSS怎样进行双变量异常值分析,哪位大侠知道,能否告诉下,非常感谢!

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:请教高手 异常值 Participants correlations correlation 办法 异常

沙发
mssr 发表于 2013-4-24 03:41:00
    Exploratory Data Anaylsis

    1
    Click on "Analyze." Select "Descriptive Statistics" followed by "Explore."
   
    2
    Drag and drop the columns containing the dependent variable data into the box labeled "Dependent List." Click "OK."

    3
    Remove any outliers identified by SPSS in the stem-and-leaf plots or box plots by deleting the individual data points. Alternatively, you can set up a filter to exclude these data points.

    4
    Select "Data" and then "Select Cases" and click on a condition that has outliers you wish to exclude. Determine a value for this condition that excludes only the outliers and none of the non-outlying data points.

    5
    Choose "If Condition is Satisfied" in the "Select" box and then click the "If" button just below it. Enter the rule to exclude outliers that you determined in the previous step into the box at the upper right. For example, if you were excluding measurements above 74.5 inches from the condition "height," you would enter "height < = 74.5." Click "Continue" and "OK" to activate the filter.

   

    Regression Analysis
        
        6
        In the "Analyze" menu, select "Regression" and then "Linear." Select the dependent and independent variables you want to analyze.

        7
        Click "Save" and then select "Cook's Distance." The values calculated for Cook's distance will be saved in your data file as variables labeled "COO-1."

        8
        Run a boxplot by selecting "Graphs" followed by "Boxplot." Click on "Simple" and select "Summaries of Separate Variables." Enter "COO-1" into the box labeled "Boxes Represent," and then enter an ID or name by which to identify the cases in the "Label Cases By" box.

        9
        Enlarge the boxplot in the output file by double-clicking it. Make a note of cases that lie beyond the black lines---these are your outliers. You may choose to remove all of the outliers or only the extreme outliers, which are marked by a star (*).

        10
        Go back into the data file and locate the cases that need to be erased. Working from the bottom up, highlight the number at the extreme left, in the gray column, so the the entire row is selected. Click on "Edit" and select "Clear." Repeat this step for each outlier you have identified from the boxplot.



藤椅
lanbing2011 发表于 2013-4-24 19:23:33
mssr 发表于 2013-4-24 03:41
Exploratory Data Anaylsis

    1
非常感谢您
有点不是很明白,在我的分析中,只是想借助回归分析来删除异常值,并没有因变量和自变量之分,把哪个定义为因变量,有没有标准呢?

板凳
mssr 发表于 2013-4-24 21:18:27
The researcher that means you who will do regression analysis must know which is the dependent variable that means depend on independent (predictors) variables.

报纸
lanbing2011 发表于 2013-4-30 09:54:26
mssr 发表于 2013-4-24 21:18
The researcher that means you who will do regression analysis must know which is the dependent varia ...
我用验证性因素分析探讨的结构问题,只是用回归来删除各个变量中存在的异常值,不做回归分析。

地板
lanbing2011 发表于 2013-4-30 09:54:59
mssr 发表于 2013-4-24 21:18
The researcher that means you who will do regression analysis must know which is the dependent varia ...
非常感谢你

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2025-12-29 17:34