搜索
人大经济论坛 附件下载

附件下载

所在主题:
文件名:  A_Short_Note_on_P-Value_Hacking.pdf
资料下载链接地址: https://bbs.pinggu.org/a-3677918.html
附件大小:
386.17 KB   举报本内容
英文标题:
《A Short Note on P-Value Hacking》
---
作者:
Nassim Nicholas Taleb
---
最新提交年份:
2018
---
英文摘要:
We present the expected values from p-value hacking as a choice of the minimum p-value among $m$ independents tests, which can be considerably lower than the \"true\" p-value, even with a single trial, owing to the extreme skewness of the meta-distribution. We first present an exact probability distribution (meta-distribution) for p-values across ensembles of statistically identical phenomena. We derive the distribution for small samples $2<n \\leq n^*\\approx 30$ as well as the limiting one as the sample size $n$ becomes large. We also look at the properties of the \"power\" of a test through the distribution of its inverse for a given p-value and parametrization. The formulas allow the investigation of the stability of the reproduction of results and \"p-hacking\" and other aspects of meta-analysis. P-values are shown to be extremely skewed and volatile, regardless of the sample size $n$, and vary greatly across repetitions of exactly same protocols under identical stochastic copies of the phenomenon; such volatility makes the minimum $p$ value diverge significantly from the \"true\" one. Setting the power is shown to offer little remedy unless sample size is increased markedly or the p-value is lowered by at least one order of magnitude.
---
中文摘要:
我们将p值黑客攻击的预期值作为$m$独立测试中最小p值的选择,由于元分布的极端偏斜,该值可能会大大低于“真实”p值,即使是单次试验。我们首先给出了统计上相同现象集合中p值的精确概率分布(元分布)。我们推导了小样本$2<n\\leq n^*\\约30$的分布,以及样本量$n$变大时的极限分布。我们还通过给定p值的逆分布和参数化来研究测试的“幂”性质。这些公式允许调查结果复制的稳定性和“p-hacking”以及元分析的其他方面。结果表明,无论样本大小为$n$,P值都是极为偏斜和不稳定的,并且在相同的随机复制下,完全相同的协议重复之间差异很大;这种波动性使得美元兑便士的最低价值与“真实”价值存在显著差异。结果表明,除非样本量显著增加或p值降低至少一个数量级,否则设置功率几乎不能提供补救措施。
---
分类信息:

一级分类:Statistics 统计学
二级分类:Applications 应用程序
分类描述:Biology, Education, Epidemiology, Engineering, Environmental Sciences, Medical, Physical Sciences, Quality Control, Social Sciences
生物学,教育学,流行病学,工程学,环境科学,医学,物理科学,质量控制,社会科学
--
一级分类:Quantitative Finance 数量金融学
二级分类:Statistical Finance 统计金融
分类描述:Statistical, econometric and econophysics analyses with applications to financial markets and economic data
统计、计量经济学和经济物理学分析及其在金融市场和经济数据中的应用
--

---
PDF下载:
-->


    熟悉论坛请点击新手指南
下载说明
1、论坛支持迅雷和网际快车等p2p多线程软件下载,请在上面选择下载通道单击右健下载即可。
2、论坛会定期自动批量更新下载地址,所以请不要浪费时间盗链论坛资源,盗链地址会很快失效。
3、本站为非盈利性质的学术交流网站,鼓励和保护原创作品,拒绝未经版权人许可的上传行为。本站如接到版权人发出的合格侵权通知,将积极的采取必要措施;同时,本站也将在技术手段和能力范围内,履行版权保护的注意义务。
(如有侵权,欢迎举报)
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

GMT+8, 2025-12-31 05:46