I. Introduction
Digital healthcare (DHC) interventions have the potential to improve disease control and management, population health outcomes, and healthcare quality [
1–
3]. Current DHC solutions include telehealth, digital and virtual disease management platforms, modifiable risk factor technologies, dietary counselling, psychological assistance, and personalized short messaging. DHC tools are inexpensive, convenient, easy to navigate, provide accessible/concise information and secure data management leading to higher acceptability [
4].
A combination of medical (point-of-care) data from clinical sources (e.g., electronic medical records, registries, insurance claims) and social determinants of health (SDOH, between-care) data from devices (e.g., smartphones and apps) can provide insights into patients’ behaviors, medication responses, lifestyle choices, and a holistic view of their healthcare journey [
5]. These combined data can be used to train a machine learning (ML) model to predict responses to interventions.
DHC solutions can be safe alternatives to conventional healthcare services to manage patients with cardiovascular conditions [
6]. Results from a randomized trial [
7] on 765 heart failure (HF) patients suggested that remote patient management may reduce unplanned hospitalizations, morbidity, and mortality [
8]. Data mining techniques can be used to discover patterns and associations in medical data to uncover solutions to existing gaps and has been used in HF studies [
9]. Thus, in context of HF, this study aimed to identify the right interventions for the right patient in the real-world setting using data mining by (1) describing DHC/non-DHC interventions used by HF patients; (2) identifying HF patient archetypes according to socio-demographic, clinical characteristics, procedures, laboratory tests, patient-reported outcomes (PROs), and comorbidities; and (3) describing hospitalizations/re-hospitalizations, mortality, costs, and use of DHC/non-DHC interventions in all HF patients and archetypes.
IV. Discussion
The key challenge in using retrospective databases to explore the potential of DHC interventions in the real world is the poor availability of between-care and point-of-care data that co-exist at an individual-level longitudinal detail. Unlike most studies that only included data on clinical, laboratory, and healthcare use (except one that included QoL variables) [
16], our study also considered social determinants of health as they can impact chronic disease management and enable a more targeted intervention. This combined dataset provides a more holistic view of the patient and clues to their behaviors between points of care, which is necessary to identify the patient profile most likely to benefit from chronic care interventions.
Our study population included 353 relatively young HF patients (63.5 years), with a higher proportion of women compared to the general HF population. Horiuchi et al. [
17] estimated an average age of 73 years and a proportion of 65% men in their cohort of HF patients. A systematic review of methods to identify HF patients in general practice reported a weighted mean age of 75 years [
18]. Another systematic review of cost-of-illness studies in adults with HF in the United States reported mean age 59–84 years, with most studies estimating an average age of ≥70 years [
19]. The inclusion of relatively young patients was likely due to lower participation of older patients in the NHWS.
The SF-36v2 scores at baseline suggested moderate deterioration in the physical health status and mild impairment in the mental HRQoL. The EQ-5D instrument and mean utility index value in the total population corresponded to mild HF severity at baseline [
20]. Further analysis showed that the mild HRQoL deterioration was driven by challenges in mobility and usual activities. Most patients had co-existing obesity, systemic hypertension, DM, CHD, COPD, sleep apnea, anemia, iron deficiency, atrial fibrillation and flutter, CKF, valvular heart disease, thyroid disorders, anxiety, and depression, which is consistent with the most frequent HF-associated comorbidities [
21–
23].
According to a recent report of the United States Department of Health and Human Services, the COVID-19 pandemic led to increased DHC utilisation [
24], also observed in our study (pre- vs. post-index date, 4.0% vs. 9.1%). This increase was expected in the index period, as patients were more likely to utilise DHC following disease progression or increased severity or on development of comorbidities, complications, or mobility problems. DHC interventions were associated with lower HF-related hospitalizations than in the total population and other subgroups, but with higher all-cause hospitalizations than with non-DHC interventions, possibly due to prevalence of non-cardiovascular comorbidities. The higher rate should be interpreted with caution and is likely overestimated as patients can have multiple types of interventions.
The total healthcare costs increased significantly from baseline ($22,240) to follow-up ($45,702), mainly driven by inpatient costs. Overall, 45.0% patients had all-cause hospitalizations; and 17.0%, HF-related hospitalizations. In another analysis, the high costs (mean total costs, $62,615; HF-related costs, $35,329) in the year following HF worsening in patients with HF with reduced ejection fraction (HFrEF) were attributed to inpatient encounters [
25]. Similarly, another study reported a significant (
p < 0.001) change in the mean costs/person in the year after HF diagnosis ($34,372) than in the year before diagnosis ($8,219) [
26]. However, HF-associated costs in similar published studies have varied widely [
19]. In a systematic review of HF-associated costs in the US, HF-specific hospitalizations (median cost/patient, $15,879) accounted for the increase in annual median total costs for HF care ($24,383). Costs were largely driven by the length of stay and varied based on patient characteristics (e.g., comorbidities) [
19]. A review synthesizing international cost estimates of cardiovascular events also reported lower costs [
27]. The average cost of HF hospitalization across studies was $11,686 (median, $10,291). Costs from US claims analysis were high ($27,006); and follow-up costs through 1 year, $12,931 (median, $15,238) [
27].
ML tools enable the use and analysis of large datasets to examine multiple clinical features and identify trends in disease progression and prognosis within a patient population [
7]. Several clustering analyses have been conducted to identify HF phenotypes/subclasses and comorbidity patterns in HF patients [
16,
28]. The K-means clustering method is one of the most adopted methods of clustering in real-world evidence studies due to its simplicity and performance [
29].
Although two archetypes were identified, there was a low separation between them due to a homogenous study population and relatively small sample size. Archetype 1 may comprise HFrEF patients as indicated by greater disease severity, more comorbidities, and significantly higher ARB prescription at baseline and follow up versus archetype 2. Furthermore, archetype 1 had slightly higher HF-related hospitalizations and lower all-cause hospitalizations. Better lifestyle and higher rate of heart disease prevention practices in archetype 1 may have contributed to patients’ general wellbeing, causing lower all-cause hospitalizations. Costs associated with archetype 1 were significantly higher than for archetype 2, possibly due to older age and higher comorbidities, use of laboratory tests, and medications. High variability between archetypes in terms of costs could be due to different follow-up durations, disease severity, or comorbidities. A more in-depth analysis of comorbidities, medication, New York Heart Association classes, and study outcomes stratified by HF subtypes is needed to investigate these hypotheses.
Similar studies have examined relationships between patient profiles in different HF archetypes and outcomes. In a Swedish registry study, of the four distinct HF patient clusters differing significantly in outcomes and therapeutic response, the two clusters with the lowest 1-year survival rates were characterized by older age, low body mass index, high blood pressure, prior strokes/transient ischemic attacks, more comorbidities, low β-blocker, angiotensin-converting-enzyme inhibitors and implanted devices, and high diuretics, nitrates, and digoxin uptake; patients were least likely to have a university degree and had the lowest income [
30]. As methods to diagnose/identify HF patients may differ across countries, our results may not be generalizable. Additionally, HF management choices are influenced by local guidelines, therefore treatment patterns, interventions, and costs may also differ.
Our work may set a new framework for generating data driven ML approaches that link point-of-care data with SDOH data. This deterministic linking of datasets can help to obtain insights from a more complete historical data of the patient. Furthermore, the availability of such linked datasets in the future can be improved with big data technologies, which can help to obtain better insights from patient populations.
Our study had several limitations. First, the linkage of claims data with online survey data required active patient participation, which could have been challenging for severely ill/elderly patients. Therefore, the study population (younger, healthier, fewer comorbidities) may not be representative of the general US HF population. Second, US claims database enables inclusion of many patients, especially for a medical condition like HF, but sample size was reduced due to linkage with NHWS data. However, the linkage was necessary to obtain variables not available in the claims database for better patient characterization. Third, patients were selected based on any HF diagnosis, which may have led to inclusion of those with other conditions/comorbidities, impacting patients’ outcomes, including disease severity, HRQoL, healthcare resource utilization, and costs. To reduce the likelihood of including patients without a primary HF diagnosis, those with HF during the baseline period were excluded.
More clinically meaningful archetypes could have been identified using a larger sample. However, the nature of the first HF diagnosis (inpatient/non-inpatient) was considered as a proxy for disease severity in the analyses. Additionally, mortality data were available only for approximately 85% patients in the Komodo database, potentially leading to underestimation of the number of deaths. As for all claims data analysis, medication use was assessed based on prescriptions, but information on medication adherence by patient was unavailable. Finally, missing values for clinical measures in the NHWS data could be a limitation.
In conclusion, HF is associated with substantial clinical and financial burden and impacts patients’ QoL. Efforts to integrate DHC interventions as complementary to traditional face-to-face health services may improve patient outcomes, efficiency of healthcare delivery, and cost savings. Despite certain limitations, identification of two archetypes with distinct patient profiles and outcomes using K-means clustering algorithm can help to better understand underlying disease subtypes, predict clinical outcomes, and define the right intervention for the right patient. Future studies with a larger, more enriched database are warranted to generate further clinical insights using advanced analytics.