楼主: 阿袋
1308 0

[其他] Power Calculations for Regression Discontinuity Evaluations: Part 3 [推广有奖]

贵宾

已卖:5773份资源

院士

16%

还不是VIP/贵宾

-

TA的文库  其他...

各科好书新书

投资人生

论文写作投稿实战

威望
0
论坛币
569387 个
通用积分
254.7643
学术水平
304 点
热心指数
347 点
信用等级
246 点
经验
88890 点
帖子
1681
精华
5
在线时间
2905 小时
注册时间
2007-6-10
最后登录
2025-9-8

楼主
阿袋 发表于 2016-10-15 11:34:13 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Power Calculations for Regression Discontinuity Evaluations: Part 3
SUBMITTED BY DAVID MCKENZIE        ON MON, 09/12/2016



This is my third, and final, in a series of posts on doing power calculations for regression discontinuity (see part 1 and part 2).
Scenario 3 (SCORE DATA AVAILABLE, AT LEAST PRELIMINARY OUTCOME DATA AVAILABLE; OR SIMULATED DATA USED): The context of data being available seems less usual to me in the planning stages of an impact evaluation, but could be possible in some settings (e.g. you have the score data and administrative data on a few outcomes, and then are deciding whether to collect survey data on other outcomes). But more generally, you will be in this stage once you have collected all your data. Moreover, the methods discussed here can be used with simulated data in cases where you don’t have data.

There is then a new Stata package rdpower written by Matias Cattaneo and co-authors that can be really helpful in this scenario (thanks also to him for answering several questions I had on its use). It calculates power and sample sizes, assuming you are then going to be using the rdrobust command to analyze the data. There are two related commands here:
  • rdpower: this calculates the power, given your data and sample size for a range of different effect sizes
  • rdsampsi: this calculates the sample size you need to get a given power, given your data and that you will be analyzing it with rdrobust.

Since this uses a lot more inputs than the cases in my first two posts, it gives you more precise output, and deals with all the key factors going into the design effect: the correlation between the score and treatment assignment, the reduction in sample that comes from choosing the optimal bandwidth, and adjustments from bias-correction procedures in choosing the bandwidth. If you have this in the planning stages, you can therefore use this to help choose what sample size to survey and to check you will have enough power, as discussed in the previous posts.

Another use is once people have data, to help understand the power consequences of choosing different bandwidths for the RD estimation. For example, using the Senate elections data they provide as demonstration data with the package:

  • rdpower demvoteshfor2 demmv, tau(5)  shows that one has 81.8% power to detect a 5 percentage point jump in vote share (this is what tau gives) at the cutoff, with the optimal bandwidth chosen (which is 17.7 here). I can then see what power would be if I take a smaller bandwidth, say of 10: rdpower demvoteshfor2 demmv, tau(5) h(10) – this tells me I would only have 45.2% power with the smaller bandwidth.
Note that the “optimal” bandwidth chosen is chosen to minimize mean-squared error (or another alternative), and doesn’t take power into account at all. So depending on your tolerance for type I vs type II error and on the shape of the data, you may decide, after using this software, to choose a larger bandwidth than is optimal to minimize MSE in order to be doing an estimation that has power to detect the size effect you are aiming to be powered against.
Notes: Matias adds that you may want to use the option scaleregul(0) when using this command, which is not the default, but avoids regularization choosing quite small bandwidths.





二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Calculations Calculation Evaluations Evaluation Continuity Power

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
jg-xs1
拉您进交流群
GMT+8, 2025-12-25 07:57