CitedEvidence
User Settings
Open AccessArticle

Sequential Bayesian Regression for Multiple Imputation and Conditional Editing

Robin Jeffries-2013-01-01-eScholarship (California Digital Library)

TL;DRAbstract

Analysts faced with errors in data apply editing rules to fix erroneous data. These edits are deterministically assigned and edits may not be correct in all cases. This dissertation presents a unified method to multiply impute missing data and multiply edit erroneous data using a sequence of Bayesian regression models. The techniques used to multiply edit erroneous data are an exact parallel for multiple imputation used to correct missing data. The models presented allow for different data types subject to several error mechanisms. This method is called Sequential Bayesian Regression for Multiple Imputation and Conditional Editing (SyBRMICE) and creates multiple fully imputed and edited data sets. Desired analyses are performed on each complete and consistently edited and imputed data set individually. Results from these analyses are combined using the same combining rules used in multiple imputation. The resulting parameter estimates and intervals will then correctly account for the e

Chat with Paper

AI Agents for this Paper

Analysts faced with errors in data apply editing rules to fix erroneous data. These edits are deterministically assigned and edits may not be correct in all cases. This dissertation presents a unified method to multiply impute missing data and multiply edit erroneous data using a sequence of Bayesian regression models. The techniques used to multiply edit erroneous data are an exact parallel for multiple imputation used to correct missing data. The models presented allow for different data types subject to several error mechanisms. This method is called Sequential Bayesian Regression for Multiple Imputation and Conditional Editing (SyBRMICE) and creates multiple fully imputed and edited data sets. Desired analyses are performed on each complete and consistently edited and imputed data set individually. Results from these analyses are combined using the same combining rules used in multiple imputation. The resulting parameter estimates and intervals will then correctly account for the e

Keywords

Imputation (statistics)Missing dataComputer scienceBayesian probabilityRegressionData collectionData miningStatistics

Chat

Click to start Chat