Regression With Missing X's: A Review |
|||||||
| 文献名称 | Regression With Missing X's: A Review | ||||||
| 文献作者 | Roderick J. A. Little | ||||||
| 作者所在单位 | Professor, Department of Biomathematics, UCLA School of Medicine, Los Angeles | ||||||
| 文献分类 | 已发表文献 | ||||||
| 学科一级分类 | 统计 | ||||||
| 学科二级分类 | 统计学 | ||||||
| 文献摘要 |
The literature of regression analysis with missing values of the independent variables is reviewed. Six classes of procedures are distinguished: complete case analysis, available case methods, least squares on imputed data, maximum likelihood, Bayesian methods, and multiple imputation. Methods are compared and illustrated when missing data are confined to one independent variable, and extensions to more general patterns are indicated. Attention is paid to the performance of methods when the missing data are not missing completely at random. Least squares methods that fill in missing X's using only data on the X's are contrasted with likelihood- based methods that use data on the X's and Y. The latter approach is preferred and provides methods for elaboration of the basic normal linear regression model. It is suggested that more widely distributed software is needed that advances beyond complete-case analysis, available-case analysis, and naiveimputation methods. Bayesian simulation methods and multiple imputation are reviewed; these provide fruitful avenues for future research. |
||||||
| 参考文献 |
Afifi, A. A., and Clark, V. (1984), Computer-Aided Multivariate Analysis: A Computer Oriented Approach, 2nd ed., New York: Academic Press. Afifi, A. A., and Elashoff, R. M. (1966), "Missing Observations in Multi- variate Statistics: I: Review of the Literature," Journal of the American Statistical Association, 61, 595-605. (1967), "Missing Observations in Multivariate Statistics: II: Point Estimation in Simple Linear Regression," Journal of the American Sta- tistical Association, 62, 10-29. (1969a), "Missing Observations in Multivariate Statistics: III: Large Sample Analysis of Simple Linear Regression," Journal of the American Statistical Association, 64, 337-358. (1969b), "Missing Observations in Multivariate Statistics: IV: A Note on Simple Linear Regression," Journal of the American Statistical Association, 64, 358-365. Amemiya, T. (1984), "Tobit Models: A Survey," Journal of Econometrics, 24, 3-61. Anderson, A. B., Basilevsky, A., and Hum, D. P. J. (1983), "Missing Data: A Review of the Literature," in Handbook of Survey Research, eds. P. H. Rossi, J. D. Wright, and A. Anderson, New York: Academic Press, pp. 415-492. Anderson, T. W. (1957), "Maximum Likelihood Estimation for the Mul- tivariate Normal Distribution When Some Observations are Missing," Journal of the American Statistical Association, 52, 200-203. Aptech Systems (1988), GAUSS Programming Language, Kent, WA: Au- thor. Azen, S. P., Van Guilder, M., and Hill, M. A. (1989), "Estimation of Pa- rameters and Missing Values Under a Regression Model With Non-Nor- mally Distributed and Non-Randomly Incomplete Data," Statistics in Medicine, 8, 217-228. Beale, E. M. L., and Little, R. J. A. (1975), "Missing Values in Multivariate Analysis," Journal of Royal Statistical Society, Ser. B, 37, 129-145. Brown, C. H. (1990), "Protecting Against Nonrandomly Missing Data in Longitudinal Studies," Biometrics, 33, pp. 143-156. Buck, S. F. (1960), "A Method of Estimation of Missing Values in Multi- variate Data Suitable for Use with an Electronic Computer," Journal of the Royal Statistical Society, Ser. B, 22, 302-306. Chen, C.-F. (1986), "A Bayesian Approach to Nested Missing-Data Prob- lems," in Bayesian Inference and Decision Techniques, eds. P. Goel and A. Zellner, New York: Elsevier Press, pp. 355-361. Conniffe, D. (1983a), "Comments on the Weighted Regression Approach to Missing Values," Economic and Social Review, 14, 259-272. (1983b), "Small-Sample Properties of Estimators of Regression Coefficients Given a Common Pattern of Missing Data," Review of Eco- nomic Studies, L, 11 1-120. Dagenais, M. G. (1973), "The Use of Incomplete Observations in Multiple Regression Analysis: A Generalized Least Squares Approach," Journal of Econometrics, 1, 317-328. Dear, R. E. (1959), "A Principal Components Missing Data Method for Multiple Regression Models," SP-86, Santa Monica, CA: Systems De- velopment Corp. Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977), "Maximum Like- lihood from Incomplete Data via the EM Algorithm," Journal of the Royal Statistical Society, Ser. B, 39, 1-38. Dixon, W. J. (ed.) (1988), BMDP Statistical Software Manual, (Vol. 2), Berkeley, CA: University of California Press. Donner, A., and Rosner, B. (1982), "Missing Value Problems in Multiple Linear Regression with Two Independent Variables," Communications in Statistics, Part A- Theory and Methods, 11, 127-140. Dorey, F., Little, R. J. A., and Schenker, N. (in press), "Multiple Imputation of Interval- Censored Data with a Threshold Response," Statistical Med- icine, 12. Edgett, G. L. (1956), "Multiple Regression with Missing Observations Among the Independent Variables," Journal of the American Statistical Associ- ation, 51, 122-13 1. Gelfand, A. E., and Smith, A. F. M. (1990), "Sampling-Based Approaches to Calculating Marginal Densities," Journal of the American Statistical Association, 85, 398-409. Glasser, M. Swamy, P. A. V. B., and Mehta, J. S. (1975), "On Bayesian Estimation of Seemingly Unrelated Regressions When Some Observations are Missing," Journal of Econometrics, 3, 157-169. Tanner, M. A. (1991), Tools for Statistical Inference. Observed Data and Data Augmentation Methods, New York: Springer-Verlag. Tanner, M., and Wong, W. (1987), "The Calculation of Posterior Distri- butions by Data Augmentation" (with discussion), Journal ofthe American Statistical Association, 82, 528-550. Timm, N. H. (1970), "The Estimation of Variance- Covariance and Cor- relation Matrices from Incomplete Data," Psychometrika, 35, 417-437. Trawinski, I. M., and Bargmann, R. W. (1964), "Maximum Likelihood with Incomplete Normal Data," Annals of Mathematical Statistics, 35, 647-657. Van Praag, B. M. S., Dijkstra, T. K., and Van Velzen, J. (1985), "Least Squares Theory Based on Distributional Assumptions With an Application to the Incomplete Observations Problem," Psychometrika, 50, 25-36. Wilks, S. S. (1932), "Moments and Distributions of Estimates of Population Parameters From Fragmentary Samples," Annals of Mathematical Sta- tistics, 3, 163-195. Yates, F. (1933), "The Analysis of Replicated Experiments When the Field Results Are Incomplete," The Empire Journal of Experimental Agriculture, 1, 129-142. |
||||||
| 关键字 | Bayesian inference; Imputation; Incomplete data; Multiple imputation | ||||||
| 发表所在刊物(或来源) | Journal of the American Statistical Association, Vol. 87, No. 420 (Dec., 1992), pp. 1227-1237 | ||||||
| 发表时间 | Dec., 1992 | ||||||
| 适用研究领域 | |||||||
| 评论 | |||||||
| 上传时间 | 2011-1-20 13:38 | ||||||
| 下载文献 |
s A Review.pdf[2.02 MB]
注:下载文献会消耗您一个“当日剩余下载次数” |
||||||
会员评论 |
|||||||
油皮皮 |
油皮皮发表于:2011-4-22 23:42 写的非常好。下来看看。 |
||||||
京ICP备16021002号-2 京B2-20170662号
京公网安备 11010802022788号
论坛法律顾问:王进律师
知识产权保护声明
免责及隐私声明