
The Analysis of Social Science Data with Missing Values

Title: The Analysis of Social Science Data with Missing Values
Author affiliation: University of California at Los Angeles; Harvard University
Document type: Published paper
Primary discipline: Statistics
Secondary discipline: Statistics
Abstract: Methods for handling missing data in social science data sets are reviewed.
Limitations of common practical approaches, including complete-case analysis,
available-case analysis and imputation, are illustrated on a simple missing-data
problem with one complete and one incomplete variable. Two more principled
approaches, namely maximum likelihood under a model for the data and missing
data mechanism and multiple imputation, are applied to the bivariate problem.
General properties of these methods are outlined, and applications to more
complex missing-data problems are discussed. The EM algorithm, a convenient
method for computing maximum likelihood estimates in missing-data problems, is
described and applied to two common models, the multivariate normal model for
continuous data and the multinomial model for discrete data. Multiple imputation
under explicit or implicit models is recommended as a method that retains the
advantages of imputation and overcomes its limitations.
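
As a rough illustration of the bivariate problem the abstract describes (one complete and one incomplete variable), the following is a minimal sketch of the EM algorithm for a bivariate normal model. It is not taken from the paper: the function name `em_bivariate_normal`, the use of `None` to mark a missing value, the fixed iteration count, and the toy data are all assumptions made for this example, and the missing-data mechanism is assumed ignorable.

```python
def em_bivariate_normal(y1, y2, n_iter=500):
    """EM sketch: y1 fully observed, y2 has None where missing (hypothetical API)."""
    n = len(y1)
    observed = [b for b in y2 if b is not None]

    # Parameters of the complete variable need no iteration.
    mu1 = sum(y1) / n
    s11 = sum((a - mu1) ** 2 for a in y1) / n

    # Start the incomplete-variable parameters from complete-case values.
    mu2 = sum(observed) / len(observed)
    s22 = sum((b - mu2) ** 2 for b in observed) / len(observed)
    s12 = 0.0

    for _ in range(n_iter):
        beta = s12 / s11                   # current slope of y2 on y1
        resid_var = s22 - s12 ** 2 / s11   # current residual variance

        # E-step: expected sufficient statistics given current parameters.
        t2 = t22 = t12 = 0.0
        for a, b in zip(y1, y2):
            if b is None:
                e = mu2 + beta * (a - mu1)   # E[y2 | y1]
                e_sq = e * e + resid_var     # E[y2^2 | y1]
            else:
                e, e_sq = b, b * b
            t2 += e
            t22 += e_sq
            t12 += a * e

        # M-step: maximum likelihood updates (divisor n).
        mu2 = t2 / n
        s22 = t22 / n - mu2 ** 2
        s12 = t12 / n - mu1 * mu2

    return {"mu1": mu1, "mu2": mu2, "s11": s11, "s22": s22, "s12": s12}


if __name__ == "__main__":
    # Toy data (made up): y2 is missing for the three largest y1 values.
    y1 = [1, 2, 3, 4, 5, 6, 7, 8]
    y2 = [2.1, 3.9, 6.2, 8.1, 9.8, None, None, None]
    complete_case_mean = sum(b for b in y2 if b is not None) / 5
    estimates = em_bivariate_normal(y1, y2)
    print("complete-case mean of y2:", complete_case_mean)
    print("EM estimate of mean of y2:", estimates["mu2"])
```

In this monotone pattern the EM iterations should settle on the regression-type estimate (the complete-case mean of the incomplete variable, adjusted by the complete-case slope times the difference between the full-sample and complete-case means of the complete variable), so the loop is shown only to make the E and M steps concrete.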
Keywords: Social Science Data; Missing Values
Published in (or source): Sociological Methods and Research, Vol. 18, Nos. 2 & 3, November 1989/February 1990, pp. 292-326; © 1989 Sage Publications, Inc.
Publication date: November 1989/February 1990
Uploaded: 2011-1-22 23:49
Download: Sociological_Methods_&_Research-1989-LITTLE-292-326.pdf [2.85 MB]


