The Analysis of Social Science Data with Missing Values |
|||||||
文献名称 | The Analysis of Social Science Data with Missing Values | ||||||
文献作者 | RODERICK J. A. LITTLE;DONALD B. RUBIN | ||||||
作者所在单位 | University of California at Los Angeles;Harvard University | ||||||
文献分类 | 已发表文献 | ||||||
学科一级分类 | 统计 | ||||||
学科二级分类 | 统计学 | ||||||
文献摘要 |
Methods for handling missing data in social science data sets are reviewed. Limitations of common practical approaches, including complete-case analysis, available-case analysis and imputation, are illustrated on a simple missing-data problem with one complete and one incomplete variable. Two more principled approaches, namely maximum likelihood under a model for the data and missing data mechanism and multiple imputation, are applied to the bivariate problem. General properties of these methods are outlined, and applications to more complex missing-data problems are discussed. The EM algorithm, a convenient method for computing maximum likelihood estimates in missing-data problems, is described and applied to two common models, the multivariate normal model for continuous data and the multinomial model for discrete data. Multiple imputation under explicit or implicit models is recommended as a method that retains the advantages of imputation and overcomes its limitations. |
||||||
参考文献 |
ALLISON, P. D. (1987) "Estimation of linear models with incomplete data," pp. 71-103 in Sociological Methodology (annual). AMEMIYA, T. (1984) "Tobit models: a survey." J. of Econometrics 24: 3-61. ANDERSON, T. W. (1957) "Maximum likelihood estimation for the multivariate normal distribution when some observations are missing." J. of Amer. Star. Assn. 52: 200-203. BAKER, S. G., and N. M. LAIRD (1988) "Regression analysis for categorical variables with outcome subject to nonignorable nonresponse." J. of Amer. Star. Assn. 81: 29-41. BISHOP, Y.M.M., S. E. FIENBERG, and P. W. HOLLAND (1975) Discrete Multivariate Analysis: Theory and Practice. Cambridge: MIT Press. CHEN, T., and S. E. FIENBERG (1974) "Two-dimensional contingency tables with both completely and partially classified data." Biometrics 30: 629-642. DAVID, M., R.J.A. LITTLE, M. E. SAMUHEL, and R. K. TRIEST (1986) "Alternative methods of CPS income imputation." J. of Amer. Stat. Assn. 81: 29-41. DEMPSTER, A. P., N. M. LAIRD, and D. B. RUBIN (1977) "Maximum likelihood from incomplete data via the EM algorithm." J. of Royal Stat. Society B 39: 1-38. DIXON, W. J. led.] (1988) BMDP Statistical Software. Los Angeles: University of California Press. FUCHS, C. (1982) "Maximum likelihood estimation and model selection in contingency tables with missing data." J. of Amer. Star. Assn, 77: 270-278. GLYNN, R., N. M. LAIRD, and D. B. RUBIN (1986)"Selection modeling versus mixture modeling with nonignorable nonresponse," pp. 119-146 in H. Wainer (ed.) Drawing Inferences from Self-Selected Samples. New York: Springer-Verlag. GREENLEES, J. S., W. S. REECE, and K. O. ZIESCHANG (1982) "Imputation of missing values when the probability of response depends on the variable being imputed." J. of Amer. Star. Assn. 77: 251-261. HARTLEY, H. O., and R. R. HOCKING (1971) "The analysis of incomplete data." Biometrics 14: 174-194. HECKMAN, J. (1976) "The common structure of statistical models of truncation, sample selection, and limited dependent variables and a simple estimator for such models." Annals of Economic and Social Measurement 5: 475-492. HEITJAN, D. F., and D. B. RUBIN (1986) "Inference from coarse data using multiple imputation," pp. 138-143 in T. J. Boardman (ed.) Computer Science and Statistics: Proceedings of the 18th Symposium on the Interface. Arlington, VA: Amer. Stat. Assn. HERZOG, T. N., and D. B. RUBIN (1983) "Using multiple imputations to handle nonresponse in sample surveys," pp. 210-248 in W. G. Madow, I. Olkin, and D. B. Rubin (eds.) Incomplete Data in Sample Surveys. Vol. 2: Theory and Bibliographies. New York: Academic Press. JENNRICH, R. I., and M. D. SCHLUCHTER (1986) "Imputing for missing survey responses," pp. 22-31 in Proceedings of the Survey Research Methods Section, Amer. Stat. Assn. KALTON, G., and D. KASPRZYK (1982) "Imputing for missing survey responses," pp. 22-31 in Proceedings of the Survey Research Methods Section, Amer. Stat. Assn. LI, K. H., X. L. MENG, T. E. RAGHUNATHAN, and D. B. RUBIN (1989) "Significance levels from repeated p-values with multiply imputed data." Department of Statistics, Harvard University. (Research Report) L1, K. H., T. E. RAGHUNATHAN, and D. B. RUBIN (1988) "Large sample significance levels from multiply imputed data using moment-based statistics and an F reference distribution." Department of Statistics, Harvard University. (Research Report) LILLARD, L., J. P, SMITH, and F. WELCH (1982) "What do we really know about wages: the importance of nonreporting and census imputation." J. of Pol. Economy 94: 489-506. LITTLE, R.J.A. (1988a) "Missing data adjustments in large surveys." J. of Business and Econ. Statistics 6: 1-15. LITTLE, R.LA. (1988b)"Robust estimation of the mean and covariance matrix from data with missing values." Applied Statistics 37: 23-38. LITTLE, R.LA. (1988c) "Incomplete data in event history analysis." Presented at 1USSP Working Group on Event History Analysis, March 1988, Paris. LITTLE, R.J.A., and D. B. RUBIN (1987) Statistical Analysis with Missing Data. New York: John Wiley & Sons. LITTLE, R.J.A., and H. L. SU (1989) "Item nonresponse in panel surveys," pp. 400-425 in D. Kasprzyk, G. Duncan, and M. P. Singh (eds.) Panel Surveys. New York: John Wiley & Sons. MADOW, W. G., H. N1SSELSON, I. OLKIN, and D. B. RUBIN [eds.] (1983) Incomplete Data in Sample Surveys, Vols. 1-3. New York: Academic Press. McKENDRICK, A. G. (1926) "Applications of mathematics to medical problems." Proceedings of the Edinburgh Mathematics Society 44: 98-130. MUTHEN, B., D. KAPLAN, and M. HOLL1S (1987) "On structural equation modeling with data that are not missing completely at random." Psyehometrika 52: 431-462. OH, H. L., and F. E. SCHEUREN (1980) "Estimating the variance impact of missing CPS income data," pp. 408-415 in Proceedings of the Survey Research Methods Section, Amer. Slat. Assn. RUBIN, D. B. (1974) "Characterizing the estimation of parameters in incomplete data problems." ,J. of Amer. Star. Assn. 69: 467-474. RUB1N, D. B. (1976) "Inference and missing data." Biometrika 63: 581-592. RUBIN, D. B. (1977) "Formalizing subjective notions about the effect of nonrespondents in sample surveys." ,J. of Amer. Stat. Assn. 72: 538-543. RUBIN, D. B. (1978) "Multiple imputations in sample surveys-a phenomenological Bayesian approach to nonresponse," pp. 20-34 in Proceedings of the Survey Research Methods Section, Amer. Star. Assn. RUBIN, D. B. (1983) "lteratlvely reweighted least squares." Encyclopedia of the Stat. Sciences 4: 272-275. RUB1N, D. B. (1986) "Statistical matching and file concatenation with adjusted weights and multiple imputations." J. of Business and Econ. Statistics 4: 87-94. RUBIN, D. B. (1987) Multiple Imputation for Nonresponse in Surveys. New York: John Wiley & Sons. RUB1N, D. B., J. L. SCHAFER, and N. SCHENKER (1988) "Imputation strategies for estimating the undereount," pp. 151-159 in Bureau of the Census Fourth Annual Research Conference. Washington, DC: Department of Commerce. RUBIN, D. B., and N. SCHENKER (1986) "Multiple imputation for interval estimation from simple random samples with ignorable nonresponse." J. of Amer. Stat. Assn. 81: 366-374. SCHOENBERG, R. S. (1988) "MISS: a program for missing data." GAUSS Programming Language. Aptech Systems Inc. [P.O. Box 6487, Kent WA 98064] |
||||||
关键字 | Social Science Data ; Missing Values | ||||||
发表所在刊物(或来源) | SOCIOLOGICAL METHODS AND RESEARCH, Vol. 18, Nos. 2 & 3, November 1989/February 1990 292-326; ~ 1989 Sage Publications, Inc. | ||||||
发表时间 | November 1989/February 1990 | ||||||
适用研究领域 | |||||||
评论 | |||||||
上传时间 | 2011-1-22 23:49 | ||||||
下载文献 |
Sociological_Methods_&_Research-1989-LITTLE-292-326.pdf[2.85 MB]
注:下载文献会消耗您一个“当日剩余下载次数” |
||||||
会员评论 |
京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明 免责及隐私声明