I. Introduction
Chronic kidney disease (CKD), also known as chronic nephropathy, ranks as the 10th leading cause of death globally and continues to represent a major public health burden [
1]. In South Korea, approximately 4.6 million individuals are estimated to have CKD, accounting for one in nine adults [
2]. Because early symptoms are often minimal, CKD frequently remains undiagnosed until an advanced stage, resulting in a high mortality risk [
3]. As diabetes and hypertension are major risk factors, CKD is closely tied to aging and other societal challenges [
4]. In 2021, chronic diseases accounted for 7.1% of national healthcare expenditures, underscoring their increasing economic impact [
5].
To effectively address CKD-related challenges—such as reducing disease incidence and associated social costs—it is crucial to identify the major factors influencing death among CKD patients. Mortality in CKD is primarily affected by clinical factors such as creatinine levels, hypoalbuminemia, nutritional status, proteinuria, and glomerular filtration rate [
6]. These clinical variables are recognized as essential predictors of disease outcomes, particularly in the era of precision medicine and electronic medical record-based healthcare environments [
7,
8]. Such data provide valuable insights into a patient’s physiological status and disease progression. However, clinical factors are often complex and not readily available in all care settings, limiting the implementation of data-driven decision support systems in primary and secondary healthcare facilities.
To overcome these limitations, it is necessary to focus on general characteristics that are more easily accessible and applicable in routine clinical practice. General characteristics allow for quick patient assessment without extensive testing, thereby enhancing accessibility. Dialysis patients, in particular, depend heavily on healthcare professionals, emphasizing the importance of the clinical decision-making abilities of medical teams. In this context, general characteristics serve as the most immediate and fundamental parameters for clinical decision-making [
9]. Notably, CKD clinical guidelines specifically list factors such as age (≥65 years), family history, and other general attributes as risk indicators [
10]. General characteristics—including age, comorbidities, and the use of dialysis equipment—have been identified as significant predictors of mortality [
6]. This highlights the importance of incorporating general characteristics when evaluating patient prognosis.
Mortality prediction in CKD patients has traditionally relied on probability-based statistical methods, which have long served as the primary analytical approach in this field [
11]. More recently, AI models that rely heavily on clinical test results or laboratory data have been developed. Although these methods can yield high accuracy, they often require detailed medical information that may be difficult to obtain or interpret in many healthcare environments [
12,
13]. In contrast, this study focuses on developing a deep learning model using only general patient characteristics—such as demographics, hospital type, and healthcare utilization patterns—without depending on complex clinical inputs. By applying machine learning-based variable selection before model construction, we sought to improve both model simplicity and transparency. This streamlined approach supports practical application in primary and secondary care settings, where detailed clinical data may be limited. Ultimately, the goal is to develop a simplified yet high-performing prognostic tool that enables timely intervention and improved outcomes in CKD management.
IV. Discussion
This study aimed to identify the factors influencing death in CKD patients and to predict prognosis based on general characteristics using data from the KNHDIS, collected by the Korea Disease Control and Prevention Agency from 2016 to 2021. The key factors influencing death included prolonged hospitalization, emergency admissions, older age, higher CCI scores, specific regions (other area and Gyeonggi Province), hospitals with 100–299 beds, and health insurance coverage. The results of this study raise several important points for discussion.
First, emergency hospitalizations and prolonged hospital stays were strongly associated with a higher risk of death in CKD patients. This association can be attributed to the progressive nature of CKD, wherein many patients remain unaware of their deteriorating renal function until an acute event necessitates emergency hospitalization and urgent dialysis [
19]. Emergency dialysis is frequently associated with severe complications, including infections and electrolyte imbalances, which significantly increase the risk of death within 3 to 12 months [
15]. Previous studies have reported that CKD patients requiring emergency hospitalization have higher one-year mortality rates than those receiving scheduled treatment [
3].
Moreover, patients requiring emergency care typically experience longer hospital stays, often exceeding twice the duration of those undergoing planned dialysis sessions [
20]. Additionally, patients who die from CKD tend to have hospitalization durations more than four times longer than those who survive [
6]. These findings suggest that prolonged hospital stays not only reflect disease severity but also exacerbate poor prognosis.
Reducing hospital length of stay is therefore a critical factor in decreasing death risk among CKD patients. To mitigate the negative effects of emergency admissions and prolonged inpatient stays, regular nephrology consultations and structured monitoring programs are essential. Ensuring timely initiation of renal replacement therapies—including hemodialysis, peritoneal dialysis, or kidney transplantation—can prevent acute deterioration, reduce the need for emergency interventions, and ultimately improve long-term survival in CKD patients.
Second, age was a significant predictor of death in CKD patients. According to the Korea Centers for Disease Control and Prevention, CKD prevalence increases with age, with individuals aged ≥60 years accounting for nearly 30% of all cases [
21]. The incidence of CKD rises by 1.22 times per year of age, and the risk of death increases by 1.05 times [
3,
4]. Given that clinical guidelines identify age ≥65 years as a risk factor for CKD [
11], tailored interventions and refined clinical strategies for older patients are essential. Furthermore, because awareness of chronic diseases is generally low [
22], some older adults may already have CKD without realizing it. Considering that the mean age of subjects in this study was 63.3 years, it is important to recognize a broader age range (including those over 65 years) as a major factor influencing mortality in CKD. As the global population of older adults continues to rise, the development of age-stratified and evidence-based clinical guidelines will be increasingly important for effective CKD management.
Third, hospital size was identified as a crucial factor influencing mortality among CKD patients. This study found that patients admitted to hospitals with 100–299 beds had a higher mortality risk compared to those in larger hospitals. This hospital category includes facilities referred to as “comprehensive hospitals” under South Korean medical law. CKD patients undergoing renal replacement therapy such as hemodialysis receive care in artificial kidney units, and their prognosis is influenced by the clinical competence of the medical staff in these units [
23]. Typically, nurses in artificial kidney units reach optimal proficiency after approximately 3 years of work experience [
24]. However, as of 2018, comprehensive hospitals had the lowest proportion of nurses with more than 2 years of experience [
25]. Because hospitals with fewer than 300 beds play a central role in managing basic diagnoses and providing chronic rehabilitation care, targeted strategies are needed to enhance clinical competency and improve CKD patient outcomes within these institutions.
Fourth, hospital location—particularly facilities located in “other areas” and Gyeonggi Province—was also a significant determinant of mortality among CKD patients. The term “other area” refers to regions excluding Seoul, Gyeonggi Province, and other metropolitan cities, including Gangwon, Chungbuk, Chungnam, Jeonbuk, Jeonnam, Gyeongbuk, Gyeongnam, and Jeju Provinces. These areas generally have fewer general hospitals and nephrologists, resulting in reduced healthcare accessibility and a higher risk of death [
26,
28]. The disparity in healthcare infrastructure between metropolitan and non-metropolitan areas may contribute to delays in CKD diagnosis and treatment, thereby worsening patient outcomes.
The Gyeonggi region, meanwhile, exhibited a lower standardized death rate but had the highest absolute number of deaths, likely reflecting its large population of older adults and high healthcare demand [
27,
28]. As of 2020, Gyeonggi Province had 521 nephrologists and the second-highest number of healthcare facilities nationwide, including tertiary hospitals, general hospitals, clinics, and other institutions, totaling 21,151 facilities, second only to Seoul [
27]. This robust medical infrastructure may attract CKD patients seeking specialized care, which could partially explain the observed regional differences in death rates.
Fifth, sex emerged as a factor influencing death in CKD patients, with men exhibiting a significantly higher risk of death than women. It is well established that men have a higher overall mortality risk and a slower rate of decline in kidney function than women [
29]. Additionally, smoking is frequently used as a proxy variable for men and is recognized as a contributor to renal fibrosis—the ultimate pathological manifestation of CKD [
30]—making it a notable risk factor for CKD-related mortality.
Finally, the strength of this study lies in its integration of machine learning and deep learning methodologies to improve mortality prediction in CKD patients. Unlike traditional statistical models that depend on hypothesis testing and assumptions about data distribution, this study employed accuracy-based evaluation metrics to enhance predictive performance. Moreover, performing variable selection before implementing deep learning helped address one of the common limitations of deep learning models—namely, the difficulty in interpreting the relative importance of specific factors.
This study has several limitations. First, the model was developed using data from the Korean healthcare system, which may limit its generalizability to other populations. External validation using internationally diverse datasets is needed to ensure broader applicability. Second, although our two-step modeling approach—LASSO feature selection followed by multilayer perceptron (MLP) classification—was designed to improve interpretability, we did not compare it with alternative architectures such as TabNet or Transformers. Future research should evaluate more complex models and examine their relative advantages depending on data structure and clinical context. Third, although the model achieved satisfactory predictive performance, it was not optimized for class imbalance. Additional performance metrics, including precision, recall, and AUC-PR, could further clarify model behavior. Furthermore, explainability tools such as SHAP or LIME were not utilized in this study, but their application could enhance clinicians’ understanding of individual risk factors and increase trust in model-driven decisions. Given the model’s simplicity and reliance on general characteristics, it holds strong potential for early triage and decision support, particularly in resource-limited healthcare environments. Interestingly, the model demonstrated a slightly higher training loss than validation loss, likely due to the use of dropout regularization and early stopping, which helped mitigate overfitting during training.
Despite these limitations, this study demonstrates that general patient characteristics can serve as valuable prognostic indicators, enabling the development of a high-performance predictive model for CKD mortality. Future studies should focus on refining predictive algorithms, incorporating additional clinical variables, and validating these findings across broader and more diverse populations to further enhance clinical applicability and support evidence-based decision-making in CKD management.