Potential of machine learning methods to identify patients with nonvalvular atrial fibrillation
Abstract
Aim: Nonvalvular atrial fibrillation (NVAF) is associated with an increased risk of stroke however many patients are diagnosed after onset. This study assessed the potential of machine-learning algorithms to detect NVAF. Materials & methods: A retrospective database study using a Japanese claims database. Patients with and without NVAF were selected. 41 variables were included in different classification algorithms. Results: Machine learning algorithms identified NVAF with an area under the curve of >0.86; corresponding sensitivity/specificity was also high. The stacking model which combined multiple algorithms outperformed single-model approaches (area under the curve ≥0.90, sensitivity/specificity ≥0.80/0.82), although differences were small. Conclusion: Machine-learning based algorithms can detect atrial fibrillation with accuracy. Although additional validation is needed, this methodology could encourage a new approach to detect NVAF.
Atrial fibrillation (AF) is the most common form of arrhythmia, particularly in the elderly [1,2]. AF increases the risk of ischemic stroke, in particular embolic stroke, which is in general the most severe form of stroke with the poorest outcomes [3,4]. For prevention of cardioembolic stroke, anticoagulation therapy as well as appropriate AF management is essential. However, there are still many patients who are not diagnosed with AF before having a cardioembolic stroke [5,6]. Unfortunately, paroxysmal AF, reportedly constituting 50% or more of any-cause AF [7], is often asymptomatic and difficult to detect in general clinical practice [8], with electrocardiography is the only definite method for diagnosis. Shorter duration 12-lead electrocardiogram (generally for several minutes) is not sufficient to find paroxysmal AF and longer monitoring such as event recorder electrocardiography is generally required. However, wearing a device for a long period of time can be burdensome and is not always practical. Taking into consideration these limitations, a two-step approach might be useful, whereby patients at high risk for AF are first identified and then screening by electrocardiography is undertaken among these high-risk patients to diagnose AF.
Although classical cardiovascular risk factors for AF have been investigated in previous studies, the overall identification ability of these factors may not be sufficient to justify screening large populations who present with such risk factors [9–12]. It is possible that a machine-learning approach could be used to develop a better identification model for AF by allowing the consideration of a larger number of variables, although to the best of our knowledge there are currently no reports of machine leaning-based AF detection in the literature. The objective of this study was therefore to assess the identification ability of a machine learning-based algorithm for the detection of nonvalvular AF (NVAF) using data obtained from a Japanese health claims database.
Materials & methods
Data source
This study used de-identified health claims and Diagnosis Procedure Combination (the flat-free payment system) data from 314 acute care hospitals across Japan, obtained from the Medical Data Vision (MDV) Co, Ltd (Tokyo, Japan). The details of this database have been described elsewhere [13,14]. In brief, the database includes administrative data from 15 million patients in Japan. The distribution of demographic characteristics including the age and gender distributions of patients included in the database is similar to national statistics in Japan. The database includes inpatient and outpatient administrative data from participating hospitals. Although the data contain clinical and demographic characteristics, medical procedures and medications prescribed for all patients, laboratory data (clinical laboratory test results) and other clinical tests (such as blood pressure values) are not always available, with availability varying by hospital. Diseases were identified using International Classification of Disease, 10th Revision (ICD-10) codes and/or the local standard disease codes to supplement ICD-10 codes. The codes used in the current study are shown in Supplementary Table 1. Medical procedures were defined with the procedure codes shown in Supplementary Table 2.
Patient inclusion
Criteria for selection of NVAF and non-AF cohorts are shown in Figure 1. NVAF patients were defined as patients with at least two diagnosis codes for AF (I48), but not with valvular AF (standard disease code 8846941), postoperative AF (8847772), mechanical-valvular AF (T820) or chronic rheumatic heart diseases (I05-I09) between April 2008 and August 2017. Patients with a history of procedures involving prosthetic heart valves were also excluded. According to these criteria, 33,885 patients were selected into the NVAF cohort. The date of AF diagnosis (that is, first appearance of the I48 code during a whole available period in the database) was defined as the index date. The final NVAF cohort included, patients who had data for all essential variables and at least 60% of ‘optional variables’ within a period of ‘index date–6 months’ were selected (Table 1). Patients who had never been diagnosed with AF (no ICD code of I48 but having at least two visits and at least 1 year of available data) within the study period were defined as non-AF patients. Patients diagnosed with valvular AF, postoperative AF, mechanical-valvular AF or chronic rheumatic heart diseases and those with a history of valve replacement procedures were also excluded from this cohort. The index date for each non-AF patient was defined the date of last visit where laboratory data were available. Patients who had data for all essential variables and for at least 60% of ‘optional variables’ within a period of ‘index date–6 months’ were selected (Table 1). In total, 27,592 patients were included in the NVAF cohort and 463,750 patients in the non-AF cohort.
Characteristics | NVAF (n = 27,592) | Non-AF (n = 463,750) | All (n = 491,342) | Required or Optional | p-value† (NVAF vs non-AF) | |||
---|---|---|---|---|---|---|---|---|
Mean | SD | Mean | SD | Mean | SD | |||
Demographic | ||||||||
Age (years) | 76.5 | 11.0 | 66.0 | 17.0 | 66.6 | 16.8 | Required | <0.001 |
Male | 15,845 | 57.4% | 231,192 | 47.1% | 247,037 | 50.3% | Required | <0.001 |
Height (cm) | 157.1 | 10.5 | 158.1 | 10.4 | 158.1 | 10.4 | Optional | <0.001 |
Weight (kg) | 55.9 | 13.8 | 56.0 | 13.8 | 56.0 | 13.8 | Optional | 0.242 |
Diseases | ||||||||
Hypertension | 17,062 | 61.8% | 173,103 | 37.3% | 190,165 | 38.7% | Required | <0.001 |
Diabetes | 13,235 | 48.0% | 120,927 | 26.1% | 134,162 | 27.3% | Required | <0.001 |
Ischemic Stroke | 7294 | 26.4% | 38,330 | 8.3% | 45,624 | 9.3% | Required | <0.001 |
Heart Failure | 15,339 | 55.6% | 47,644 | 10.3% | 62,983 | 12.8% | Required | <0.001 |
Anemia | 8344 | 30.2% | 81,953 | 17.7% | 90,297 | 18.4% | Required | <0.001 |
Hyperthyroidism | 4203 | 15.2% | 33,793 | 7.3% | 37,996 | 7.7% | Required | <0.001 |
CAD | 8472 | 30.7% | 47,605 | 10.3% | 56,077 | 11.4% | Required | <0.001 |
PAD | 2899 | 10.5% | 17,204 | 3.7% | 20,103 | 4.1% | Required | <0.001 |
COPD | 980 | 3.6% | 9396 | 2.0% | 10,376 | 2.1% | Required | <0.001 |
VPC | 671 | 2.4% | 4919 | 1.1% | 5590 | 1.1% | Required | <0.001 |
Laboratory Value | ||||||||
RBC counts (x104/L) | 411.2 | 73.9 | 411.8 | 73.5 | 411.7 | 73.6 | Optional | 0.188 |
Hemoglobin (g/dL) | 12.7 | 2.4 | 12.5 | 2.2 | 12.5 | 2.2 | Optional | <0.001 |
Hematocrit (%) | 38.0 | 6.6 | 37.7 | 6.2 | 37.7 | 6.2 | Optional | <0.001 |
MCV (fL) | 92.7 | 6.8 | 92.3 | 107.9 | 92.3 | 104.8 | Optional | 0.538 |
MCH (pg) | 33.3 | 3.8 | 33.3 | 17.0 | 33.3 | 16.5 | Optional | 1.000 |
MCHC (%) | 31.6 | 93.8 | 30.6 | 27.3 | 30.7 | 34.6 | Optional | <0.001 |
WBC counts (/L) | 7639 | 4643 | 6915 | 6453 | 6956 | 6,367 | Optional | <0.001 |
Platelet count (x104/L) | 20.1 | 8.9 | 22.7 | 8.9 | 22.5 | 8.9 | Optional | <0.001 |
TP (g/dL) | 6.7 | 0.8 | 6.7 | 0.8 | 6.7 | 0.8 | Optional | 1.000 |
GOT (IU/L) | 41.9 | 146.4 | 37.0 | 204.4 | 37.2 | 201.6 | Optional | <0.001 |
GPT (IU/L) | 32.3 | 98.2 | 29.5 | 97.6 | 29.7 | 97.6 | Optional | <0.001 |
ALP (IU/mL) | 283.7 | 230.6 | 295.0 | 373.1 | 294.4 | 366.6 | Optional | <0.001 |
γ-GTP (IU/L) | 57.5 | 108.4 | 54.2 | 115.5 | 54.4 | 115.2 | Optional | <0.001 |
LDH (U/L) | 253.1 | 263.8 | 237.9 | 493.8 | 238.7 | 431.7 | Optional | <0.001 |
UN (mg/dL) | 22.2 | 15.1 | 18.3 | 15.3 | 18.5 | 15.3 | Optional | <0.001 |
Serum creatinine (mg/dL) | 1.1 | 1.3 | 1.0 | 1.2 | 1.0 | 1.2 | Optional | <0.001 |
UA (mg/dL) | 5.9 | 2.1 | 5.3 | 1.9 | 5.3 | 1.9 | Optional | <0.001 |
Total cholesterol (mg/dL) | 171.2 | 42.5 | 185.4 | 43.5 | 184.6 | 43.6 | Optional | <0.001 |
TG (mg/dL) | 110.8 | 86.0 | 126.1 | 93.7 | 125.1 | 93.3 | Optional | <0.001 |
HbA1c (%) | 6.2 | 1.1 | 6.2 | 1.1 | 6.2 | 1.1 | Optional | 1.000 |
CRP (mg/L) | 3.1 | 5.8 | 2.0 | 4.4 | 2.1 | 4.5 | Optional | <0.001 |
K+ (mEq/L) | 4.2 | 0.6 | 4.2 | 0.6 | 4.2 | 0.6 | Optional | 1.000 |
Cl− (mEq/L) | 103.7 | 4.9 | 104.3 | 4.5 | 104.3 | 4.5 | Optional | <0.001 |
Na+ (mEq/L) | 139.6 | 4.3 | 140.0 | 4.1 | 140.0 | 4.1 | Optional | <0.001 |
Fe2+ (g/dL) | 59.4 | 44.8 | 70.4 | 47.5 | 69.7 | 47.4 | Optional | <0.001 |
BNP (pg/mL) | 351.3 | 560.9 | 197.7 | 587.4 | 223.7 | 585.8 | Optional | <0.001 |
GFR (ml/min/1.73m2) | 60.1 | 28.0 | 73.6 | 32.7 | 72.8 | 32.6 | Optional | <0.001 |
Variable selection
Variables were selected based on known associated with NVAF, and included demographic characteristics (age, sex, height and weight) [12], comorbidities (hypertension, diabetes [15], ischemic stroke [16], heart failure [16], anemia [17], hyperthyroidism [18], chronic obstructive pulmonary disease [19] and ventricular premature contraction [20], coronary artery disease [21] and peripheral artery disease [16]) and laboratory data related to renal function (creatinine and glomerular filtration rate), metabolic disease (cholesterol and triglyceride) [12] and others (hemoglobin A1c [15], brain natriuretic protein [22] and uric acid [23]) were selected. Additional laboratory test values (hematocrit, hemoglobin, mean corpuscular volume, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, white blood cell count, red blood cell count, platelet count, serum total protein, GOT, GPT, gamma-GTP, ALP, LDH, CRP, urinary nitrogen, K+, Cl−, Na+ and Fe2+) were selected as these tests are commonly used in general clinical practice.
Statistical analysis
Datasets
The raw data were processed to create datasets for analysis. First, missing data were imputed by multivariate imputation by chained equations. The proportion of missing values for each variable is shown in Supplementary Table 3. The dataset was then randomly divided in three at a 40:40:20 ratio, a training dataset 1, training dataset 2 and a validation dataset, respectively. And the ratio of NVAF/non-AF was kept consistent among three datasets. These datasets were used for development of both model 1 and model 2. Overall steps are shown in Figure 2. For model 2, to alleviate imbalance in the dataset (the number of non-AF patients was much larger than NVAF patients), random undersampling of the majority class was applied to the dataset so that the ratio of NVAF to non-AF was 1:1. After undersampling, the final training dataset contained 11,036 patients with NVAF and 11,036 non-AF.
Model-1
For Model 1, a ‘deep learning’ (DL) method was used to create multiple features to predict NVAF in Step 1 using the training dataset 1. DL-based features created in the Step 1 were then applied to Step 2, where Lasso regression (LR) was used to develop a classifier using the training dataset 2. In Step 3, the developed algorithm was validated using the validation dataset.
Model-2
For Model 2, stacking of multiple single classifiers was used, which included LR, ridge regression, support vector machine, random forest, DL, AdaBoost and gradient boosting. These multiple models were trained using the training dataset 1. Each model provided probability of AF as an outcome which were then cast into a second level model (LR model), where the outcomes obtained in the first step became features in the second step. The developed algorithm was validated using validation dataset in model validation step.
Variable-feature relation analysis
We investigated the relationship between features developed by stacking model 1 or 2 and each of the variables included in the analytical dataset. The relationships were described using a Pearson correlation coefficient.
Receiver operating characteristic (ROC) analysis was conducted to estimate area under the ROC curve (AUC), sensitivity and specificity, for each of the developed algorithms. All of the above analyses were performed using Python version 3.6.
Results
Patient characteristics are shown in Table 1. As expected, there were marked statistically significant differences between patients in the NVAF and non-AF cohorts in terms of key clinical and demographic characteristics. In general, patients in the NVAF cohort tended to be older with a higher comorbidity burden.
The ROC curve suggested a small difference in performance between the stacking model and other methods (Figure 3). AUC, sensitivity and specificity are shown in Table 2. Stacking models 1 and 2 had an AUC of 0.90, specificity and sensitivity were also well-balanced (Table 2). The AUC for each single model was >0.8, and the sensitivity of single models varied between 0.09 and 0.84, and specificity between 0.79 and 0.99.
Model | AUC | Sensitivity | Specificity | Sampling |
---|---|---|---|---|
Lasso regression | 0.86 | 0.09 | 0.99 | |
Ridge regression | 0.86 | 0.78 | 0.81 | undersampling |
Support vector machine | 0.86 | 0.78 | 0.79 | undersampling |
Random forest | 0.87 | 0.80 | 0.79 | undersampling |
Deep learning | 0.90 | 0.23 | 0.99 | |
AdaBoost | 0.90 | 0.84 | 0.81 | undersampling |
Gradient boosting | 0.90 | 0.82 | 0.82 | |
Stacking Model-1 | 0.90 | 0.82 | 0.82 | |
Stacking Model-2 | 0.90 | 0.80 | 0.84 |
We visualized the correlation between study variables and each of the features engineered by the DL algorithm for stacking model 1 (Supplementary Figure 1) and each of the single model algorithms developed for stacking model 2 (Supplementary Figure 2). For the deep-learning developed features used in stacking model 1, almost all of the developed features were associated to used variables, although associations were strongest with arterial disease and COPD (Supplementary Figure 1). Importantly though, some of the developed features had no correlation with used variables (for example, features 10 and 15 in Supplementary Figure 1). Age and select comorbidities including heart failure, hypertension, diabetes and coronary arterial disease were most strongly correlated with each of the single model algorithms (Supplementary Figure 2). Lab tests tended to be weekly correlated, although among the lab tests considered brain natriuretic protein, urinary nitrogen, uric acid showed the strongest associations.
Discussion
In this Japanese study, we found that machine learning methods can be used to develop an algorithm which detects patients with NVAF from patients without NVAF with a level high accuracy. All machine learning algorithms developed in this study had at least an 80% chance of correctly identifying NVAF patients. These results should be considered exploratory and require further validation, but encouragingly demonstrate the potential utility of machine-learning approaches to help detect patients with NVAF in the general population.
AF risk scores have been reported from both Western countries and Japan. The CHARGE-AF study has proposed a validated score for AF by using 5 US and European cohorts and its c-statistic was about 0.7 [10]. A Japanese risk score to predict a 10-year risk of AF proposed by the Suita study also had a c-statistic of 0.7 s [24]. Both scores have been prepared for the purpose of being used to effectively screen for AF in general outpatient clinics without 12-lead ECG measurement, and the c-statistics could be large enough for this purpose.
In the CHARGE-AF and the Suita study risk scores, variables that are established cardiovascular risk factors, or are considered as factors associated with AF, were selected and used for preparation of the scores. However, in the present study, factors available from the database irrespective of their potential association with AF, were used for development of the algorithm. As a result, our algorithm contained more information on comorbid disease and laboratory tests, however, it is uncertain how these differences affected our model performance. Unfortunately, due to the limited availability of variables, we did not perform cross-validation between our algorithm and these previously reported risk scored. Performance of the algorithm should be examined by the cross-validation in the future.
In this study, we used an administrative claims database, which includes real-world data related to patient health status and provision of clinical care collected in general clinical practice. AF is difficult to be diagnosed in clinical practice and there might be a considerable number of patients with AF who have not yet diagnosed. These patients with undiagnosed AF were included in the non-AF cohort in the present study and might have an impact on the result. This is a critical limitation of this study. A prospective, probably ‘interventional’, study may be necessary to actively diagnose NVAF to address this issue.
In the present study, we did not show which factors/features contributed the most to identification of NVAF. Some variables that are not directly significantly associated with the outcome (NVAF in this study) are sometimes considerably contributed to the outcome when a machine learning-based approach, particularly DL or stacking, is applied to development of algorithms and vice versa. This is a critical difference between conventional statistics-based approach (such as risk scores based on risk factors) and our DL-based approach.
There are limitations to our study that should be considered. First, as already mentioned above, our model was prepared and validated only by using claims data from the MDV database. This dataset might contain patients with misdiagnosed AF and this could affect model validity. Second, the MDV dataset is limited to patients treated within an acute care hospital setting who may be sicker and have more comorbid disease than would otherwise be observed in the broader Japanese population. Therefore, inferences gleaned from this study may not be applicable to patients solely seen in primary care or people who have had no recent healthcare interaction, and any comparisons of the algorithm performance seen in this study with those of CHARGE-AF and the Suita study should be viewed in this context. The algorithm developed in this study distinguished patients with diagnosed NVAF from those without a diagnosis, however it remains possible that patients with NVAF who are currently undiagnosed are distinct to those known. This possibility as well as algorithms to identify such patients should be explored in future studies.
We selected variables based on known risk factors and data clinically available in general hospital practice. However, requiring patients to have at least 60% of test data available may have introduced bias if there are systematic differences between patients with a lab test recorded and those without, and therefore patients with >60% of data available may not be representative of a general hospital-admitted population. Additionally, there is very limited information on potential confounding factors such as socioeconomic status, smoking [25] or alcohol intake [26] which might improve or maximize the machine learning based algorithm to avoid missing latent AF patients. As we comment above, our results should be regarded as speculative rather than definitive: they represent results from what can be done using very large-scale, routinely collected administrative data and require further work, in different study designs, to confirm or refute the findings.
Conclusion
Machine-learning based algorithms can detect NVAF with accuracy. Although additional validation is needed, this methodology could encourage a new approach to improve the detection of patients with NVAF in the general population in an aging society like Japan.
Future perspective
Earlier detection of NVAF is key to prevent stroke and improve outcomes of patients, particularly in increasingly aging societies like Japan. Efforts to improve screening and earlier detection of at-risk patients will continue to evolve in coming years. As health-related data proliferate and advanced analytical techniques to handle such data continue to evolve alongside such efforts, the ability of these innovate approaches to be applied in everyday clinical settings to support decision-making for the detection of the NVAF may become realized. Indeed, machine learning is playing an increasing role within many areas of healthcare, including but not limited to earlier diagnosis and has the potential to transform clinical care.
Nonvalvular atrial fibrillation (NVAF) is associated with an increased risk of stroke, however many patients are only diagnosed after onset.
Efforts to support earlier diagnosis of NVAF are needed and machine-learning algorithms which are able to handle large amounts of complex data offer a potentially exciting approach to improve identification of and potentially earlier diagnosis of NVAF.
The objective of this study was to assess the potential of machine-learning based algorithms to detect NVAF in a Japanese patient population using data obtained from a national administrative claims database.
Compared with the non-AF cohort, patients in the NVAF cohort were older, more were male, and tended to have a higher comorbidity burden.
All machine learning algorithms identified NVAF with a high level of accuracy; the stacking model which combined multiple algorithms outperformed single-model approaches, although differences between model performance were small.
This study looked factors that distinguish patients with known NVAF from those without a diagnosis, but it remains possible that patients with undiagnosed NVAF are distinct to this patient group, and this possibility should be explored in future studies.
Misdiagnosed NVAF patients could have been included in non-AF cohort, which could impact model validity.
As the algorithms were created using NVAF diagnoses recorded in an administrative database in Japan, validation using other data of different sources such as electrical medical record-based data or physical examination data is needed.
Supplementary data
To view the supplementary data that accompany this paper please visit the journal website at: http://www.futuremedicine.com/doi/suppl/10.2217/fca-2019-0056
Author contributions
R Suzuki contributed to the whole study management, development of the final protocol and the manuscript, and interpretation of the results. J Katada contributed to the final study design, development of the protocol and the analytical plan and preparation of the manuscript. S Ramagopalan and L McDonald contributed to study design, interpretation of the results and development of the algorithm. All authors critically revised successive drafts the draft manuscript and approved the final one.
Acknowledgments
Cancer Scan Co. Ltd. contributed to data cleaning and data analysis for this study.
Financial & competing interests disclosure
This study was supported by Bristol-Myers Squibb K.K and Pfizer Japan Inc. R Suzuki was a full-time employee of BMS K.K. J Katada was a full-time employee of Pfizer Japan Inc. S Ramagopalan and L McDonald were full-time employee of BMS UK. The authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.
No writing assistance was utilized in the production of this manuscript.
Ethical conduct of research
The authors state that they have followed the principles outlined in the Declaration of Helsinki for all human or animal experimental investigations. This study was conducted by using a structured, de-identified database which does not require patient consent according to the local regulations.
Open access
This work is licensed under the Attribution-NonCommercial-NoDerivatives 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/
Data sharing statement
Data may be obtained from a third party and are not publicly available. Raw data will be able to be purchased from Medical Data Vision Co. Ltd. (Tokyo Japan).
Papers of special note have been highlighted as: • of interest; •• of considerable interest
References
- 1. Prevalence of diagnosed atrial fibrillation in adults: national implications for rhythm management and stroke prevention: the AnTicoagulation and risk factors in atrial fibrillation (ATRIA) study. JAMA 285(18), 2370–2375 (2001). • Epidemiology of atrial fibrillation.
- 2. Worldwide epidemiology of atrial fibrillation: a Global Burden of Disease 2010 Study. Circulation 129(8), 837–847 (2014).
- 3. The impact of atrial fibrillation on the cost of stroke: the berlin acute stroke study. Value Health 10(2), 137–143 (2007).
- 4. Evaluation of costs and outcome in cardioembolic stroke or TIA. J. Neurol. 256(6), 954–963 (2009).
- 5. . Optimal timing and duration of continuous electrocardiographic monitoring for detecting atrial fibrillation in stroke patients. J. Stroke Cerebrovasc. Dis. 22(7), 991–995 (2013).
- 6. Estimated prevalence of undiagnosed atrial fibrillation in the United States. PLoS ONE 13(4), e0195088 (2018).
- 7. . Prolonged rhythm monitoring for the detection of occult paroxysmal atrial fibrillation in ischemic stroke of unknown cause. Circulation 124(4), 477–486 (2011).
- 8. Prevalence of unknown atrial fibrillation in patients with risk factors. Europace 15(5), 657–662 (2013).
- 9. Development of a risk score for atrial fibrillation (Framingham Heart Study): a community-based cohort study. Lancet 373(9665), 739–745 (2009). •• A risk score for de novo development of atrial fibrillation in the Framingham Heart Study.
- 10. Simple risk model predicts incidence of atrial fibrillation in a racially and geographically diverse population: the CHARGE-AF consortium. J. Am. Heart Assoc. 2(2), e000102 (2013). •• A risk model to predict incidence of atrial fibrillation in general US and EU cohorts.
- 11. . Prevalence, incidence, and lifetime risk of atrial fibrillation in China: new insights into the global burden of atrial fibrillation. Chest 147(1), 109–119 (2015).
- 12. . Development of a basic risk score for incident atrial fibrillation in a Japanese general population – The Suita study. Circ. J. 81(11), 1580–1588 (2017). •• Risk score for de novo development of atrial fibrillation in Japanese general population.
- 13. . Safety and effectiveness of apixaban in comparison to warfarin in patients with nonvalvular atrial fibrillation: a propensity-matched analysis from Japanese administrative claims data. Curr. Med. Res. Opin. 34(9), 1627–1634 (2018).
- 14. . Bleeding risk of apixaban, dabigatran, and low-dose rivaroxaban compared with warfarin in Japanese patients with non-valvular atrial fibrillation: a propensity matched analysis of administrative claims data. Curr. Med. Res. Opin. 33(11), 1955–1963 (2017).
- 15. HbA1c and atrial fibrillation: a cross-sectional study in Japan. Int. J. Cardiol. 156(2), 156–159 (2012).
- 16. Current status of clinical background of patients with atrial fibrillation in a community-based survey: the Fushimi AF Registry. J. Cardiol. 61(4), 260–266 (2013).
- 17. Anemia and reduced kidney function as risk factors for new onset of atrial fibrillation (from the Ibaraki prefectural health study). Am. J. Cardiol. 115(3), 328–333 (2015).
- 18. . Arrhythmia and thyroid dysfunction. Herz 40(Suppl. 2), 101–109 (2015).
- 19. Airflow obstruction, lung function, and incidence of atrial fibrillation: the Atherosclerosis Risk in Communities (ARIC) study. Circulation 129(9), 971–980 (2014).
- 20. Frequent premature atrial complexes predict new occurrence of atrial fibrillation and adverse cardiovascular events. Europace 14(7), 942–947 (2012).
- 21. Atrial ectopy as a predictor of incident atrial fibrillation: a cohort study. Ann. Intern. Med. 159(11), 721–728 (2013).
- 22. Natriuretic peptides for the detection of paroxysmal atrial fibrillation in patients with cerebral ischemia--the Find-AF study. PLoS ONE 7(4), e34351 (2012).
- 23. Hyperuricemia is an independent competing risk factor for atrial fibrillation. Int. J. Cardiol. 231, 137–142 (2017).
- 24. Interaction of blood pressure and body mass index with risk of incident atrial fibrillation in a Japanese urban cohort: the Suita study. Am. J. Hypertens. 28(11), 1355–1361 (2015).
- 25. . Tobacco smoking and the risk of atrial fibrillation: a systematic review and meta-analysis of prospective studies. Eur. J. Prev. Cardiol. 25(13), 1437–1451 (2018).
- 26. Heavy alcohol consumption and risk of atrial fibrillation. The Circulatory Risk in Communities Study (CIRCS). Circ. J. 78(4), 955–961 (2014).