Electronic Thesis and Dissertation Repository

Frailty Model with Missing Covariates in Family-Based Study

Jiaqi Bi, Western University

Abstract

Multiple imputation (MI) is a widely adopted approach for handling missing data and has proven to be a robust tool, particularly when dealing with large sets of missing covariates. When data are missing at random, multiple imputation generally outperforms complete case analysis. However, in the analysis of clustered survival data arising from family-based studies with missing covariates, current MI methods do not handle the familial structure of the data, as well as the ascertainment of families. Our study proposes to integrate the kinship matrix into the multiple imputation process by calculating the conditional means and variances of the individual’s missing data given the observations of other family members, thereby explicitly incorporating family structure information. We compare the performance of our proposed methods, commonly used multiple imputation methods that do not consider the kinship matrix, and complete case analysis. Our findings indicate that failing to account for familial correlation when imputing genetically associated variables results in slightly higher biases, and liberal variance estimations. The proposed MI method is applied to the breast cancer families recruited from the Breast Cancer Family Registries to evaluate the effects of the polygenic risk score (PRS) and mutation status where the PRS is subject to missing.