ASN's Mission

To create a world without kidney diseases, the ASN Alliance for Kidney Health elevates care by educating and informing, driving breakthroughs and innovation, and advocating for policies that create transformative changes in kidney medicine throughout the world.

learn more

Contact ASN

The Latest on X

Kidney Week

ASN / Education & Meetings / Kidney Week /

Please note that you are viewing an archived section from 2020 and some content may be unavailable. To unlock all content for 2020, please visit the archives.

Abstract: PO0526

Using Autoencoders for Imputing Missing Data in eGFR Decline Trajectories of Patients with CKD

Session Information

CKD Health Services Research
October 22, 2020 | Location: On-Demand
Abstract Time: 10:00 AM - 12:00 PM

Category: CKD (Non-Dialysis)

2101 CKD (Non-Dialysis): Epidemiology, Risk Factors, and Prevention

Authors

Zamanzadeh, Davina J., University of California Los Angeles, Los Angeles, California, United States

Petousis, Panayiotis, University of California Los Angeles, Los Angeles, California, United States

Davis, Tyler Austin, University of California Los Angeles, Los Angeles, California, United States

Garlid, Anders Olav, University of California Los Angeles, Los Angeles, California, United States

Wang, Xiaoyan, University of California Los Angeles, Los Angeles, California, United States

Norris, Keith C., University of California Los Angeles, Los Angeles, California, United States

Duru, Obidiugwu, University of California Los Angeles, Los Angeles, California, United States

Tuttle, Katherine R., Providence St Joseph Health, Spokane, Washington, United States

Bui, Alex, University of California Los Angeles, Los Angeles, California, United States

Nicholas, Susanne B., University of California Los Angeles, Los Angeles, California, United States

Group or Team Name

CURE-CKD Registry Study Team

Background

Using machine learning (ML) approaches to impute missing data has not been explored in CKD progression. We investigated the utility of a data-driven imputation to improve downstream classifier prediction of rapid eGFR decline in the CURE-CKD registry.

Methods

We analyzed CKD patients at UCLA (N=13,206) over a 2-year period. We used: 1) the dataset with missing data; and 2) a censored subset with no missing data. We introduced 33% and 66% missingness by removing values by removing values either missing completely at random (MCAR); missing at random (MAR); or missing not at random (MNAR). We included: eGFR, hemoglobin (HbA1c), systolic blood pressure (SBP), number of ambulatory and inpatient visits, age, sex, ethnicity, rurality status, diagnosis of hypertension, diabetes mellitus (DM), pre-DM, and use of renin angiotensin aldosterone system inhibitors. We introduced missingness on SBP and HbA1c to mirror the original dataset. We imputed missing values using an autoencoder ML model. To predict a 40% eGFR decline over 2 years, we developed random forest models using the full and resultant imputed datasets.

Results

On the full subset, the MNAR imputation method achieved a root mean squared error (RMSE) of 0. The MAR method achieved RMSE of 3.8 at 33% missingness and 5.4 at 66%. MCAR achieved RMSE of 38.5 at 33% missingness and 56.4 at 66%. Using the random forest model to predict rapid decline on the fully observed subset without removing and imputing data achieved a receiver operating characteristic (ROC) area under the curve (AUC) mean of 80.8%±1.1 and precision/recall (PR)-AUC mean of 23.9%±1.5; the same as our methodology on MNAR, which is explained by the RMSE of 0, shown in Table 1.

Conclusion

Our method accurately imputes clinical data values while accounting for uncertainty caused by missing values.

Method	Mean ROC-AUC Missingness 66% / 33%	Mean PR-AUC Missingness 66% / 33%
MNAR	80.8±1.1 / 80.8±1.1	23.9±1.5 / 23.9±1.5
MAR	69.1±1.3 / 74.2±0.9	22.3±1.5 / 23.3±1.8
MCAR	70.6±0.9 / 69.5±0.9	9.3±0.6 / 10.9±0.6

Funding

Other NIH Support

ASN's Mission

Contact ASN

The Latest on X

Using Autoencoders for Imputing Missing Data in eGFR Decline Trajectories of Patients with CKD

Abstract: PO0526

Using Autoencoders for Imputing Missing Data in eGFR Decline Trajectories of Patients with CKD

Session Information

Category: CKD (Non-Dialysis)

Authors

Davina J. Zamanzadeh,

Panayiotis Petousis,

Tyler Austin Davis,

Anders Olav Garlid,

Xiaoyan Wang,

Keith C. Norris,

Obidiugwu Duru,

Katherine R. Tuttle,

Alex Bui,

Susanne B. Nicholas,

Group or Team Name

Background

Methods

Results

Conclusion

Funding