Katana VentraIP

Missing data

In statistics, missing data, or missing values, occur when no data value is stored for the variable in an observation. Missing data are a common occurrence and can have a significant effect on the conclusions that can be drawn from the data.

Missing data can occur because of nonresponse: no information is provided for one or more items or for a whole unit ("subject"). Some items are more likely to generate a nonresponse than others: for example items about private subjects such as income. Attrition is a type of missingness that can occur in longitudinal studies—for instance studying development where a measurement is repeated after a certain period of time. Missingness occurs when participants drop out before the test ends and one or more measurements are missing.


Data often are missing in research in economics, sociology, and political science because governments or private entities choose not to, or fail to, report critical statistics,[1] or because the information is not available. Sometimes missing values are caused by the researcher—for example, when data collection is done improperly or mistakes are made in data entry.[2]


These forms of missingness take different types, with different impacts on the validity of conclusions from research: Missing completely at random, missing at random, and missing not at random. Missing data can be handled similarly as censored data.

/casewise deletion

Listwise deletion

Pairwise deletion

Acock AC (2005), "Working with missing values", , 67 (4): 1012–28, doi:10.1111/j.1741-3737.2005.00191.x

Journal of Marriage and Family

Allison, Paul D. (2001), Missing Data,

SAGE Publishing

Bouza-Herrera, Carlos N. (2013), Handling Missing Data in Ranked Set Sampling,

Springer

Enders, Craig K. (2010), Applied Missing Data Analysis,

Guilford Press

Graham, John W. (2012), Missing Data,

Springer

Molenberghs, Geert; Fitzmaurice, Garrett; Kenward, Michael G.; Tsiatis, Anastasios; Verbeke, Geert, eds. (2015), Handbook of Missing Data Methodology,

Chapman & Hall

Raghunathan, Trivellore (2016), Missing Data Analysis in Practice,

Chapman & Hall

Little, Roderick J. A.; (2002), Statistical Analysis with Missing Data (2nd ed.), Wiley

Rubin, Donald B.

Tsiatis, Anastasios A. (2006), Semiparametric Theory and Missing Data, Springer

Van den Broeck J, Cunningham SA, Eeckels R, Herbst K (2005), "Data cleaning: detecting, diagnosing, and editing data abnormalities", , 2 (10): e267, doi:10.1371/journal.pmed.0020267, PMC 1198040, PMID 16138788, S2CID 5667073

PLOS Medicine

Zarate LE, Nogueira BM, Santos TR, Song MA (2006). "Techniques for Missing Value Recovering in Imbalanced Databases: Application in a marketing database with massive missing data". IEEE International Conference on Systems, Man and Cybernetics, 2006. SMC '06. Vol. 3. pp. 2658–2664. :10.1109/ICSMC.2006.385265.

doi

Department of Medical Statistics, London School of Hygiene & Tropical Medicine

Missing Data

.

Spatial and temporal Trend Analysis of Long Term rainfall records in data-poor catchments with missing data, a case study of Lower Shire floodplain in Malawi for the period 1953–2010

A unified platform for missing values methods and workflows.

R-miss-tastic

Missing values-envision