1) The logistic regression used for estimation of propensity score is "different" (has special characteristics) from ordinary logistic regression:
a. Don’t care about the predictive ability of model
b. Don’t care about collinearity of covariates
c. Theoretically, your should select all variables related to your treatment assignment (in your cases, policy) and your outcome
c. Just care about whether it results in balanced matched samples
In your case, at this time, I would include all your 9 covariates (if you believe all of them are related to treatment assignment and especially your outcome) in the logistic model, then test the matched sample balance
2)By the way, if you use psmatch2, you don't have to use a separate logistic regression model to get the propensity score (although you can). You can integrate propensity score production and ATT estimation into psmatch2 like:
psmatch2 policy a b c d e f g h i, outcome(pollution) logit //assume pollution is your outcome variable
3) Use pstest to check your matched sample balance after pamatch2 like:
pstest, both
Good luck
|