Original poster: oliyiyi

Guide to the Power Pose Controversy


This is the third and final post about the controversy over the statistical analysis used in peer-reviewed published scholarly research. Most of the new material is covered in post #2 (link). Today's post covers a statistical issue related to sample size; the issue itself is nothing new, but Amy Cuddy raised it in her response to her critics, so I discuss it here as well.

In post #2 (link), I offered the following mental picture of the two sides in the boxing ring:

  • the traditionalists (e.g. Susan Fiske, who wrote the scathing condemnation of the reformists) believe that there is nothing wrong with the long-accepted standards of publication - in their worldview, each new published research article showing p<0.05 experiments reinforces prior publications and strengthens the scientific basis of the research agenda;
  • the reformists (e.g. Andrew Gelman) believe that the standards of publication are broken - in their view, each additional published article with p<0.05 experiments creates a false sense of security, because consumers cannot see (1) the negative studies rejected by journal editors, or never written up because of anticipated rejection, and (2) the results of a universe of alternative experiments that the researchers could have run and that might have shown negative results.

For more on the concepts behind these arguments, with names such as the file drawer and the garden of forking paths, read my earlier post.

The full agenda is as follows:

Key Idea 1: Peer Review, Manuscripts, Pop Science and TED Talks (link)

Key Idea 2: P < 0.05, P-hacking, Replication Studies, Pre-registration (link)

Key Idea 3: Negative Studies, and the File Drawer (link)

Key Idea 4: Degrees of Freedom, and the Garden of Forking Paths (link)

Key Idea 5: Sample Size (Today)

***

Key Idea 5: Sample Size

In the previous post, I ended with a brief discussion of meta-analysis, or systematic review, which attempts to consolidate the evidence and present a summary of the state of the world.

One motivation for conducting a meta-analysis is to pool the data from numerous small studies. Many results in experimental psychology come from 30-50 subjects, typically students enrolled in college classes. By comparison, polls typically have thousands of respondents. In any experiment, the effect under study must be strong enough to be observable. Small samples carry more noise, which makes the effect harder to observe even when it exists. If the anticipated effect is small but positive, larger samples are recommended. When a statistician complains that a study is "under-powered," he or she is saying that its sample is too small to reliably detect the effect being studied.
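
To make the noise point concrete, here is a minimal simulation sketch (the true effect size of 0.3 and the group sizes are illustrative assumptions, not figures taken from any of the studies discussed). It estimates how often a two-sample t-test detects a small but real effect at two different sample sizes:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(42)

    def detection_rate(n_per_group, true_effect=0.3, n_sims=5000, alpha=0.05):
        """Fraction of simulated two-group experiments in which a
        two-sample t-test flags a true standardized effect as significant."""
        hits = 0
        for _ in range(n_sims):
            control = rng.normal(0.0, 1.0, n_per_group)
            treated = rng.normal(true_effect, 1.0, n_per_group)
            if stats.ttest_ind(treated, control).pvalue < alpha:
                hits += 1
        return hits / n_sims

    # A typical small psychology sample vs. a much larger one
    print(detection_rate(25))   # around 0.18: the effect is usually missed
    print(detection_rate(200))  # around 0.85: the effect is usually found

The true effect is identical in both cases; only the odds of seeing it change.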

The Ranehill et al. power pose replication study used 200 subjects, five times as many as the original study's 40. All else equal, the standard error shrinks in proportion to the square root of the sample size, so it drops by more than half. It should therefore be a lot easier to observe the power pose effect in the replication study than in the original study, assuming the effect runs in the same direction.
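
The arithmetic behind "more than half," as a quick check (this is just the textbook 1/sqrt(n) scaling of the standard error of a mean, not a full power calculation):

    import math

    # The standard error of a mean scales as 1 / sqrt(n), so going from
    # 40 subjects to 200 subjects shrinks it by a factor of sqrt(5).
    shrink_factor = math.sqrt(200 / 40)
    print(shrink_factor)      # 2.236..., the error falls to ~45% of its old size
    print(1 / shrink_factor)  # 0.447..., i.e. a drop of more than half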

The concern about sample size or power in a study is well-trodden territory, and not controversial. It is part of the standard conversation during the design phase of any experiment, psychological or otherwise. The current statistical critique of the power pose research has nothing to do with sample sizes.
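
That design-phase conversation usually takes the form of a power calculation. As an illustration (a sketch assuming a medium standardized effect of 0.5, a two-sided alpha of 0.05, and a target power of 0.8 - none of these figures come from the power pose studies), statsmodels can solve for the required group size:

    from statsmodels.stats.power import TTestIndPower

    # Solve for the per-group sample size needed to detect a medium
    # standardized effect (Cohen's d = 0.5) with 80% power at alpha = 0.05.
    n_required = TTestIndPower().solve_power(
        effect_size=0.5, alpha=0.05, power=0.8, alternative="two-sided"
    )
    print(n_required)  # about 64 subjects per group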

***

While this controversy is playing out in academia, it carries lessons for anyone working with "big data."

Nobody wants false positive results, which lead to wasted time, effort, and money. In this set of posts, I discuss a range of enablers of false positive results:

  • ignoring negative studies, or dismissing them as non-informative
  • testing too many response variables (see the sketch after this list)
  • testing too many sub-populations
  • tweaking the experiments too much
  • samples that are too small
  • running your study on one population and drawing conclusions about a different population
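
As a minimal sketch of the "testing too many response variables" item (the ten outcomes and group sizes are illustrative assumptions): with two identical groups measured on ten independent outcomes, the chance that at least one test crosses p < 0.05 is roughly 1 - 0.95^10, or about 40%.

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(7)

    def any_false_positive(n_outcomes=10, n_per_group=50, alpha=0.05):
        """One null experiment: two identical groups measured on many
        response variables. Returns True if ANY test looks 'significant'."""
        for _ in range(n_outcomes):
            a = rng.normal(0.0, 1.0, n_per_group)
            b = rng.normal(0.0, 1.0, n_per_group)  # no true effect anywhere
            if stats.ttest_ind(a, b).pvalue < alpha:
                return True
        return False

    n_sims = 2000
    rate = sum(any_false_positive() for _ in range(n_sims)) / n_sims
    print(rate)  # close to 1 - 0.95**10 = 0.401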

It's not that you shouldn't do any of these things - they are standard steps in a data analysis process. But if different settings/variables/sub-populations lead to different conclusions, you have to be careful in interpreting the findings. Most researchers do not intentionally produce false-positive results, yet we are all at risk of making these mistakes unintentionally. It's all too easy to come up with stories that justify the tweaking/testing/ignoring that we do.

