We use cookies to improve your experience. By continuing to browse this site, you accept our cookie policy.×
Skip main navigation
Aging Health
Bioelectronics in Medicine
Biomarkers in Medicine
Breast Cancer Management
CNS Oncology
Colorectal Cancer
Concussion
Epigenomics
Future Cardiology
Future Medicine AI
Future Microbiology
Future Neurology
Future Oncology
Future Rare Diseases
Future Virology
Hepatic Oncology
HIV Therapy
Immunotherapy
International Journal of Endocrine Oncology
International Journal of Hematologic Oncology
Journal of 3D Printing in Medicine
Lung Cancer Management
Melanoma Management
Nanomedicine
Neurodegenerative Disease Management
Pain Management
Pediatric Health
Personalized Medicine
Pharmacogenomics
Regenerative Medicine
Short CommunicationFree Access

HLA and red blood cell antigen genotyping in SARS-CoV-2 convalescent plasma donors

    William Lemieux

    Héma-Québec, Medical Affairs & Innovation, Québec City & Montréal, Québec, G1V 5G3, Canada

    ,
    Josée Perreault

    Héma-Québec, Medical Affairs & Innovation, Québec City & Montréal, Québec, G1V 5G3, Canada

    ,
    Gabriel André Leiva-Torres

    Héma-Québec, Transfusion Medicine, Québec City & Montréal, Québec, H4R 2W7, Canada

    ,
    Nadia Baillargeon

    Héma-Québec, Transfusion Medicine, Québec City & Montréal, Québec, H4R 2W7, Canada

    ,
    Jessica Constanzo Yanez

    Héma-Québec, Transfusion Medicine, Québec City & Montréal, Québec, H4R 2W7, Canada

    ,
    Marie-Claire Chevrier

    Héma-Québec, Transfusion Medicine, Québec City & Montréal, Québec, H4R 2W7, Canada

    ,
    Lucie Richard

    Héma-Québec, Transfusion Medicine, Québec City & Montréal, Québec, H4R 2W7, Canada

    ,
    Antoine Lewin

    Héma-Québec, Medical Affairs & Innovation, Québec City & Montréal, Québec, G1V 5G3, Canada

    &
    Patrick Trépanier

    *Author for correspondence:

    E-mail Address: patrick.trepanier@hema-quebec.qc.ca

    Héma-Québec, Medical Affairs & Innovation, Québec City & Montréal, Québec, G1V 5G3, Canada

    Published Online:https://doi.org/10.2217/fvl-2022-0058

    Abstract

    Aim: More data is required regarding the association between HLA allele and red blood cell (RBC) antigen expression in regard to SARS-CoV-2 infection and COVID-19 susceptibility. Methods: ABO, RhD, 37 other RBC antigens and HLA-A, B, C, DRB1, DQB1 and DPB1 were determined using high throughput platforms in 90 Caucasian convalescent plasma donors. Results: The AB group was significantly increased (1.5×, p = 0.018) and some HLA alleles were found to be significantly overrepresented (HLA-B*44:02, C*05:01, DPB1*04:01, DRB1*04:01 and DRB1*07:01) or underrepresented (A*01:01, B51:01 and DPB1*04:02) in convalescent individuals compared with the local bone marrow registry population. Conclusion: Our study of infection-susceptible but non-hospitalized Caucasian COVID-19 patients contributes to the global understanding of host genetic factors associated with SARS-CoV-2 infection and severity.

    The COVID-19 pandemic, caused by the SARS-CoV-2, has caused over 6.6 million deaths worldwide as of December 2022 (https://covid19.who.int/). Intense work has been done since the beginning of the pandemic to protect the most at-risk populations. Being male, of older age, obese, of a certain ethnic origin, having diabetes, asthma, or many other medical conditions, have all been associated with an increased risk of COVID-19-related complications or death [1–5]. Genetic factors, such as the expression of angiotensin-converting enzyme (ACE)-related genes, may also play a role in disease severity and could serve as a predictive marker for at-risk populations [6]. Efforts in getting a better understanding of the molecular gateways for viral entry and of the host's response have been relentless, focusing on ACE2 expression, HLA, cytokine storms and TLRs [7]. Such research could lead to an improved understanding of the disease's susceptibility and severity, the capacity for virus clearance, long term symptoms [8] and mortality consequent to infection in humans.

    The relationship between ABO blood group expression and susceptibility to infection by SARS-CoV-2 have been explored by a few groups and recently reviewed [9,10]. Group O individuals were identified as having a decreased risk of infection compared with other ABO groups, although no differences were observed regarding hospitalization and death rates caused by COVID-19 [11,12]. Several hypotheses to explain this observed link between ABO type and risk of infection have been suggested, such as the presence of anti-A and -B antibodies in O individuals [13] and the binding of the receptor-binding domain (RBD) of the viral Spike protein to group A antigens [14]. Given the important immunological deregulation and potential cytokine storm associated with mortality in COVID-19 patients [15] and the interesting report from an Italian laboratory of higher rates of direct antiglobulin test (DAT) reactivity in COVID-19 patients [16], RBC antigens and related antibodies could be involved in COVID-19 and its clinical manifestations.

    Of additional importance is the potential contribution of HLAs, given their central role in regulating immunity against viruses [17]. Considerable efforts have been made in trying to identify protective or susceptibility enhancing HLA alleles [18,19]. Binding assays have shown that HLA-A*02:01 and B*40:01 can preferentially associate with SARS-CoV-2 epitopes [20]. Using bioinformatic prediction, HLA-A*02:03 and A*31:01 were identified as protective, while A*03:02 was identified as a risk allele [21]. In silico binding affinity studies have shown that HLA-B*46:01 could increase susceptibility to disease, whereas HLA-B*15:03 is associated with protective immunity. and the same studies found that HLA-A*02:02, B*15:03, and C*12:03 were the most frequently encountered haplotypes associated with the presentation of viral epitopes [18]. Other retrospective studies aimed at identifying potentially protective or risk-increasing alleles [22–28] have been published. Such an effort requires several geographically diverse laboratories to analyze and share available data in order to generate a comprehensive overview.

    In the present study, we are also advancing the hypothesis that associations might exist between certain HLA alleles or blood groups and SARS-CoV-2 susceptibility, as well as the ability to overcome infection without hospitalization. We therefore sought to analyze and determine the existence of any trends in the expression of an extended panel of RBC antigens (ABO, RhD and 37 other antigens), and of HLA-A, B, C, DRB1, DQB1 and DPB1 alleles within a Caucasian convalescent plasma donor cohort, in comparison to different reference frequencies (textbook, local and international databases, and literature). The identification of differential patterns of RBC antigen or HLA expression in convalescent individuals, who were infected but were not hospitalized, could contribute to a better understanding of SARS-CoV-2 susceptibility and COVID-19 severity.

    Materials & methods

    Samples

    We analysed a cohort of 90 Caucasian convalescent plasma donors. Donors were randomly chosen from adult participants of the Québec cohort in the CONCOR-1 clinical trial (no. NCT04348656). All subjects received an official diagnosis of COVID-19 by the Québec Provincial Health Authority after epidemiologic investigation or after confirmation by PCR test. All subjects were COVID-19 symptomatic during infection, cleared the infection without hospitalization, and were free of symptoms for at least two weeks before donating. Since COVID-19 diagnostic was given before spring 2021, subjects were presumably infected with the Alpha (B.1.1.7) or Beta (B.1.351) variant, since no case of the Delta (B.1.617.2) variant had yet been reported in the province of Quebec [29]. All subjects were self-identified Caucasians. Our convalescent plasma donor cohort had an average age of 40.4 ± 15.0 years and consisted of 68% males. Donors were not selected with respect to their ABO group. All donors gave consent to participate in this research project, which was approved by the Héma-Québec Research Ethics Committee. Control populations were from the National Marrow Donor Program (NMDP) registry and from Héma-Québec's bone marrow donor registry. For the Héma-Québec control cohort, 1370 registered individuals typed in high-resolution, from the Montréal and Montérégie regions and of self-reported Caucasian ethnicity, were selected to match with the characteristics of the studied convalescent cohort. No variable other than geographical region and ethnicity was matched, and all donors were eligible to donate plasma in Québec.

    Phenotyping & genotyping

    ABO and RhD phenotype testing was done by serologic detection using the PK7300 from Beckman Coulter, as per the manufacturer's protocol. DNA used for genotyping was extracted from the buffy coat of whole blood samples collected in ethylenediaminetetraacetic acid (EDTA) tubes, using QIAamp Blood Mini kit (Qiagen, Hilden, Germany). RBC genotyping was performed using the Luminex xMAP® technology with the ID CORE XT platform (Progenika Biopharma-Grifols, Bizkaia, Spain), as per the manufacturer's protocol, for the following blood group antigens: Rh (C, c, E, e, Cw, hrS, hrB, V, VS), Kell (K, k, Kpa, Kpb, Jsa, Jsb), Kidd (Jka, Jkb), Duffy (Fya, Fyb), MNS (M, N, S, s, U, Mia), Diego (Dia, Dib), Dombrock (Doa, Dob, Hy, Joa), Colton (Coa, Cob), Yt (Yta, Ytb) and Lutheran (Lua, Lub). HLA genotyping was done by next-generation sequencing (NGS) on a MiSeqDx (Illumina, CA, USA), using NGSgo®-Ampx v2 kits, and interpreted with NGSengine® v2.21 (both from GenDX, Utrecht, The Netherlands).

    Statistical analyses

    For RBC antigen and HLA allele comparisons in Tables 1–3, population-wide proportions were assumed to correspond to the Caucasian prevalence estimates (from the Blood Group Antigen FactsBook [30] and from the National Marrow Donor Program database [31]), while the HLA G-group allele frequencies of the 90 participants (180 individual HLA alleles) were estimated along with the Clopper-Pearson 95% confidence interval. Z-tests for two proportions were used to test for statistical significance between populational and observed antigen prevalence. A p value inferior to the Bonferroni correction for multiple comparison per antigen group was considered significant.

    Table 1. RBC antigen frequencies for the Caucasian convalescent plasma donors compared with the Factsbook reference frequencies.
    Blood group systemPredicted phenotypeObserved prevalenceCI 95%FactsBook prevalencep-valuePost-test p-value cutoff
    RhD- C-c+E-e+0.156[0.095–0.244]0.1510.8970.0045
    D- C-c+E+e+0.022[0.006–0.077]0.0090.194
    D+ C-c+E+e+0.100[0.054–0.179]0.1180.596
    D+ C+c-E-e+0.233[0.158–0.331]0.1850.242
    D+ C+c+E-e+0.322[0.235–0.424]0.3490.589
    D+ C+c+E+e+0.167[0.104–0.257]0.1330.342
    Cw0.011[0.002–0.060]0.0200.478
    hrS1.000[0.959–1.000]0.9800.174
    hrB1.000[0.959–1.000]0.9800.174
    V0.000[0.000–0.041]0.0100.342
    VS0.000[0.000–0.041]0.0001.000
    KellK+k-0.000[0.000–0.041]0.0020.675 0.006
    K-k+0.944[0.876–0.976]0.9100.259
    K+k+0.056[0.024–0.124]0.0880.286
    Kp(a+b-)0.000[0.000–0.041]0.0001.000
    Kp(a-b+)0.956[0.891–0.983]0.9770.184
    Kp(a+b+)0.044[0.017–0.109]0.0230.184
    Js(a+b-)0.000[0.000–0.041]0.0001.000
    Js(a-b+)1.000[0.959–1.000]1.0001.000
    Js(a+b+)0.000[0.000–0.041]0.0001.000
    MNSM+N+S+s+0.267[0.186–0.366]0.2400.549 0.005
    M+N+S-s+0.144[0.086–0.232]0.2200.082
    M-N+S-s+0.111[0.061–0.193]0.1500.298
    M+N-S+s+0.178[0.112–0.269]0.1400.298
    M+N-S-s+0.122[0.070–0.206]0.0800.142
    M-N+S+s+0.078[0.038–0.152]0.0600.472
    M+N-S+s-0.033[0.011–0.093]0.0600.280
    M+N+S+s-0.067[0.031–0.138]0.0400.190
    U1.000[0.959–1.000]0.9990.764
    Mia0.000[0.000–0.041]0.0001.000
    DuffyFy(a+b-)0.256[0.177–0.354]0.1700.030 0.017
    Fy(a-b+)0.344[0.254–0.447]0.3400.936
    Fy(a+b+)0.400[0.305–0.503]0.4900.087
    Fy(a-b-)0.000[0.000–0.041]Very rareN/A
    KiddJk(a+b-)0.344[0.254–0.447]0.2630.082 0.017
    Jk(a-b+)0.222[0.149–0.318]0.2340.787
    Jk(a+b+)0.433[0.336–0.536]0.5030.184
    Jk(a-b-)0.000[0.000–0.041]RareN/A
    DiegoDi(a+b-)0.000[0.000–0.041]0.0001.000 0.017
    Di(a-b+)1.000[0.959–1.000]0.9990.764
    Di(a+b+)0.000[0.000–0.041]0.0010.764
    DombrockDo(a+b-)0.133[0.078–0.219]0.1800.246 0.010
    Do(a+b+)0.500[0.399–0.601]0.4900.849
    Do(a-b+)0.367[0.274–0.470]0.3300.453
    Hy-0.000[0.000–0.041]0.0001.000
    Jo(a-)0.000[0.000–0.041]0.0001.000
    ColtonCo(a+b-)0.900[0.824–0.946]0.9001.000 0.013
    Co(a-b+)0.000[0.000–0.041]0.0050.503
    Co(a+b+)0.100[0.054–0.179]0.0950.873
    Co(a-b-)0.000[0.000–0.041]0.0001.000
    YtYt(a+b-)0.889[0.807–0.939]0.9190.298 0.017
    Yt(a+b+)0.111[0.061–0.193]0.0780.242
    Yt(a-b+)0.000[0.000–0.041]0.0030.603
    LutheranLu(a+b-)0.000[0.000–0.041]0.0020.675 0.0125
    Lu(a-b+)0.922[0.848–0.962]0.9240.944
    Lu(a+b+)0.078[0.038–0.152]0.0740.889
    Lu(a-b-)0.000[0.000–0.041]0.0001.000
    Table 2. ABO and RhD blood group distributions for the Caucasian convalescent plasma donors compared with Factsbook reference frequencies.
    GroupAntigenObserved prevalenceCI 95%FactsBook prevalencep-value
    All donors (n = 90)A0.456[0.357–0.558]0.4300.569
    AB0.089[0.046–0.166]0.0400.018
    B0.100[0.054–0.179]0.0900.741
    O0.356[0.264–0.458]0.4400.110
    D-0.178[0.112–0.269]0.1500.459
    D+0.822[0.731–0.888]0.8500.459
    FY*A/*A individuals (n = 23)A0.435[0.256–0.632]0.4300.960
    AB0.130[0.045–0.321]0.0400.028
    B0.130[0.045–0.321]0.0900.503
    O0.304[0.156–0.509]0.4400.190

    †Bold p values represents a significant p value of <0.05.

    Table 3. Caucasian convalescent donor HLA allele frequency comparison with the NMDP database for HLA*A, B, C, DQB1 and DRB1. The g notation specifies G-groups corresponding to those presented in Gragert, Loren, Abeer Madbouly, John Freeman, and Martin Maiers. Six-locus high resolution HLA haplotype frequencies derived from mixed-resolution DNA typing for the entire US donor registry. Human Immunology 74, no. 10.
    LociAllelesObserved frequency (2n = 180)CI 95%NMDP frequencyAdjusted p-value
    HLA-A02:01G0,267[0.204–0.338]0,2721,000
    01:01G0,122[0.078–0.179]0,1591,000
    03:01G0,117[0.074–0.173]0,1401,000
    24:02G0,072[0.039–0.120]0,0891,000
    29:02G0,061[0.031–0.107]0,0310,427
    11:01G0,061[0.031–0.107]0,0591,000
    HLA-B44:02G0,128[0.083–0.186]0,0871,000
    07:02G0,089[0.052–0.140]0,1251,000
    08:01G0,083[0.047–0.134]0,1061,000
    44:030,072[0.039–0.120]0,0441,000
    40:01G0,067[0.035–0.114]0,0501,000
    18:01G0,061[0.031–0.107]0,0471,000
    35:01G0,061[0.031–0.107]0,0581,000
    27:05G0,056[0.027–0.100]0,0351,000
    HLA-C05:01G0,1389[0.092–0.198]0,0830,109
    04:01G0,1222[0.078–0.179]0,1151,000
    07:01G0,1167[0.078–0.179]0,1551,000
    06:02G0,0944[0.056–0.147]0,0951,000
    07:02G0,0944[0.056–0.147]0,1371,000
    02:02G0,0889[0.052–0.140]0,0470,138
    12:03G0,0722[0.039–0.120]0,0561,000
    03:04G0,0667[0.035–0.114]0,0741,000
    HLA-DRB107:010,178[0.125–0.242]0,1301,000
    01:010,100[0.060–0.153]0,0851,000
    03:010,089[0.052–0.140]0,1141,000
    04:010,089[0.052–0.140]0,0801,000
    13:01g0,083[0.047–0.134]0,0000,000
    15:010,078[0.043–0.127]0,1290,985
    11:01g0,056[0.027–0.100]0,0631,000
    14:01g0,056[0.027–0.100]0,0270,497
    HLA-DQB102:01g0,211[0.154–0.278]0,2121,000
    03:01g0,200[0.144–0.266]0,1961,000
    05:010,133[0.087–0.192]0,1161,000
    06:03g0,089[0.052–0.140]0,0651,000
    06:020,078[0.043–0.127]0,1280,621
    03:02g0,067[0.035–0.114]0,1031,000
    03:03g0,056[0.027–0.100]0,0431,000
    05:03g0,056[0.027–0.100]0,0290,407

    Alleles with less than 10 data points are not shown.

    NDMP: National Marrow Donor Program.

    Allele frequencies were calculated using the GENE[RATE] tool for HLA-A, B, C, DRB1, DQB1 and DPB1 [32]. HLA allelic frequencies were used with a modification of the hierfstat package to calculate the genetic distance (latter) globally and for each locus between the subjects and the reference population [33,34]. Standardized residuals of the cohort subjects were calculated to identify alleles with significant differences between the subjects and the registry. Residuals were calculated by considering the subjects' allele frequencies as the independent variable and the registry frequency as the dependant variable for each locus independently. Frequencies were deemed different at or above a difference of 3 absolute standardized residual. Statistical analyses were performed using R [35].

    Results

    Red blood cell genotype frequencies

    Genotype frequencies for Rh, Kell, MNS, Duffy, Kidd, Diego, Dombrock, Colton, Yt and Lutheran blood groups were determined in each individual, and the resulting predicted phenotypes were compared with the expected Caucasian reference frequencies. The FY*A/*A genotype (Fy[a+b-] predicted phenotype) appears to be overrepresented (nonsignificant, p = 0.030) in our convalescent cohort compared with expected frequencies (0.256 vs 0.170, respectively), as presented in Table 1. Incidentally, FY*A/*B (Fy[a+b+] predicted phenotype) individuals appear to be trending toward a decreased frequency of 0.400 compared with the expected 0.490 (Table 1), although the trend is not significant (p = 0.087). Overall, no antigen group combinations deviated significantly from the expected frequencies.

    ABO & RhD phenotyping

    ABO and RhD phenotypes were determined for each convalescent individual and compared with the expected Caucasian reference frequencies for the entire cohort (n = 90) and within FY*A/A individuals (n = 23), given its trend toward overrepresentation (Table 1). Table 2 presents the ABO phenotyping analysis for the cohort, which allowed for the identification of a significant (p = 0.0178) 2.2× increase for the AB group compared with reference frequencies (0.089 vs 0.040, respectively). While nonsignificant (p = 0.110), the O group trends toward a 0.8× underrepresentation within convalescent individuals compared with expected frequencies (0.356 vs 0.440, respectively). Interestingly, a significant (p = 0.028) 3.3× AB group overrepresentation was also found within FY*A individuals (Table 2, FY*A/*A). An apparent but nonsignificant decrease in O group individuals can be observed within the FY*A/*A individuals versus the reference frequency (0.304 vs 0.440). No other trend or significant observation was made regarding A and B groups, or RhD expression.

    HLA allele frequency comparisons

    HLA typing was done by NGS in the Caucasian convalescent donors' cohort and individual allele frequencies were determined. The convalescent cohort allele frequencies (2n = 180) were first compared with the NMDP registry frequencies for HLA-A, B, C, DRB1 and DQB1 for the most frequent alleles (Table 3), and no significant differences were identified in the most common alleles in the cohort. The convalescent cohort frequencies for HLA-A, B, C, DRB1, DQB1 and DPB1 were then compared with the Héma-Québec Stem Cell Donor Registry frequencies for Caucasians in the same geographical region (2n = 2740). The genetic distance between the convalescent donors' cohort and the Héma-Québec Registry was calculated for all loci together and on a per-locus basis for HLA-A, B, C, DRB1, DQB1 and DPB1 (Table 4). There was no comparative measurement available for the genetic distance, however the distance is low in all loci compared. Standardized residual analysis was conducted (Table 5 & Figure 1) and led to the identification of alleles that were significantly different between the convalescent cohort (n = 90 individuals) and the Héma-Québec Registry (n = 1370 individuals). HLA-B*44:02, C*05:01, DPB1*04:01, DRB1*04:01 and DRB1*07:01 were significantly overrepresented within our cohort, while A*01:01, B51:01 and DPB1*04:02 were significantly underrepresented. Finally, the alleles identified as significantly different were used to search previously published data for suspected associations with the SARS-CoV-2 virus infection and COVID-19 disease characteristics; these results are presented in Table 5. Of note, for two of the eight alleles that we have identified (DPB1*04:02 and DRB1*07:01), there was no available data in the literature for comparison.

    Figure 1. HLA allelic frequencies in the convalescent cohort and the Héma-Québec bone marrow donor registry.

    For each locus, the cohort allele frequencies (horizontal axis) are plotted against the registry frequencies (vertical axis). The dashed line represents identical expression in the cohort and registry. The dotted lines are the thresholds of 3 |standardised residuals| away from the identity line.

    Table 4. Genetic distance comparison between Caucasian convalescent donor HLA allele frequencies (2n = 180) and Héma-Québec bone marrow donor registry (2n = 2740) for HLA*A, B, C, DPB1, DQB1 and DRB1, per loci, and globally.
    LocusABCDPB1DQB1DRB1Global
    Genetic Distance0,00170,00500,00450,00180,00210,00400.0033
    Table 5. HLA alleles identified as significantly over- or underrepresented from pairwise comparisons between the convalescent cohort (2n = 180) and Héma-Québec bone marrow donor registry (2n = 2740), and their potential clinical significance.
    AllelesFrequencyStandardised residualsSignificance and referencesRef.
    CohortRegistry
    A*01:010,12220,1456-3,3570539Associated with high risk, mortality[36,37]
    B*44:020,12780,07725,15830039B*44 More susceptible to infection[26]
    B*51:010,02220,0640-3,6358415B*51 nonsignificantly correlates with infection susceptibility[26]
    C*05:010,13890,08484,11569996Risk of death[27]
    DPB1*04:010,41670,40483,39285576Direct association with severity[38]
    DPB1*04:020,08330,1128-3,7313073No data reported 
    DRB1*04:010,08890,04883,26507272Increased in severe vs asymptomatic[28]
    DRB1*07:010,17780,13973,85103473No data reported 

    †Standardised residuals ≥3 or ≤3 are considered significant.

    Discussion

    Our study took a deeper look into the RBC and HLA characteristics of a COVID-19 recovered and non-hospitalized cohort enrolled in the CONCOR-1 convalescent plasma study. We used ABO and RhD automated blood donor testing, and RBC and HLA high throughput genotyping platforms to determine the existence of potential trends regarding the frequencies of ABO, RhD, and 37 other RBC antigens and HLA genotypes within the cohort, compared with reference Caucasian populations from textbooks, public databases, and our local bone marrow donor registry. A significant AB blood group overrepresentation was identified, as well as a nonsignificant trend in FY*A/A individuals. These results suggest a possible involvement of ABO and Duffy red blood cell antigens in SARS-CoV-2 susceptibility and COVID-19 severity, as all these individuals contracted the virus, yet only had mild symptoms, and cleared the infection without needing hospitalization.

    The case for HLA association with disease susceptibility and severity is more complex. Overall, the genetic distance calculated from HLA allele frequencies is low (below 0.01) and suggests the cohort is similar to the reference population chosen. When looking at allele-level frequencies, eight HLA alleles were identified by the standardized residuals analysis as potential markers between the convalescent cohort and a stem cell registry from the same geographical region.

    One of the limitations of this study concerns the lack of a more diverse stratification of disease severity and the limited sample size for COVID-19-affected individuals. Indeed, our study lacks blood group and HLA data from hospitalized, deceased and asymptomatic COVID-19 patients, which would be of interest given that there could be a significant link between ABO and severity [39]. Additionally, while the historical ABO frequencies of Quebec Caucasian blood donors (internal data) matched that of FactsBook, such unbiased information about the frequencies of other RBC antigens is not currently available, hence the use of FactsBook Caucasian reference frequencies. Our study also does not directly address the major RBC and HLA antigen frequency differences between ethnicities. While our convalescent plasma donation program reflected our donor pool [40], the low number of non-Caucasian individuals was insufficient to conduct statistical analysis, which is unfortunate given the importance of understanding the disproportionate impact of COVID-19 on minorities [5,41]. Nonetheless, the identification of a potential overrepresentation of FY*A/*A within Caucasian COVID-19 convalescent individuals and the potential implication of the Duffy blood group could have an impact on future research directions. The trend toward overrepresentation of the Fy(a+b-) predicted phenotype among our COVID-19 convalescent cohort could be explained by the absence of the Fyb antigen, since no significant difference was observed when comparing Fy(a+b+) and Fy(a-b+). In individuals of African descent, the Fy(a-b-) phenotype is caused by a GATA box mutation upstream of the FY gene silencing Fyb expression in RBCs [42]. Given that 67% of African Americans (AA) are Duffy null [43], and that Duffy null patients have an increased mortality rate from acute lung injury [4], some groups have already hypothesized a role for Duffy in COVID-19 AA individuals [43]. We therefore suggest that Duffy allele identification might be used to select individuals at-risk for COVID-19 complications for further research on associations between COVID-19 and RBC antigens. The involvement of ABO blood groups in COVID-19 has previously been described [13,44]. The mechanism underlying the association remains elusive, but could be related to circulating natural anti-A and anti-B antibodies, or a low-efficiency furin cleavage in O-group individuals [9]. A significant overrepresentation of the AB group, and the nonsignificant trend toward underrepresentation of the O group within our cohort, appear to be in agreement with other groups' suggestion that O individuals could be less susceptible to SARS-CoV-2 infection [11,12]. Our sample size does not allow us to draw conclusions as to whether individuals from the AB group are more susceptible to infection and more efficient at clearing infection without hospitalization, or whether this bias is a consequence of the trend toward underrepresentation of O individuals, who are less susceptible to infection. It would be interesting to extend our observations to larger cohort that include severely affected patients.

    The involvement of HLA alleles in the disease outcome of COVID-19 patients is getting more evident [45,46]. Various HLA alleles have shown high binding affinity to SARS-CoV-2 peptides [18,47], and trends have been observed in many populations [22,23]. While our sample size is limited, eight alleles were identified as significantly different between the studied cohort and matched individuals from the Héma-Québec Stem Cell Donor Registry, for which we already had high resolution HLA information. Two of these eight HLA alleles have not previously been identified for their association with COVID-19: DPB1*04:02 and DRB1*07:01. The underrepresentation of HLA-A*01:01 is in agreement with its suggested association with high risk and mortality in COVID-19 individuals [36,37], and the overrepresentation of HLA-B*44:02 is compatible with an increased susceptibility to infection [26]. Interestingly, A*01:01, B*44:02 and B*51:01 were predicted as weak binder of SARS-CoV-2 peptides, and none of the other alleles we identified were found to be strong peptide binders [48]. Given that we found A*01:01 and B51:01 to be underrepresented and B*44:02 overrepresented in our cohort, it is difficult to establish a relationship between these data without considering complete haplotypes or other disease severity groups. The other associations are inconclusive, but should be considered in larger cohorts. Overall, more data is required from more diverse populations in order to develop a comprehensive view and a better understanding to manage emerging variants and infection waves.

    Altogether, we provide additional information regarding the role of RBC antigens and HLA in SARS-CoV-2 susceptibility, and consequential COVID-19 susceptibility, severity, resolution and long-term clinical consequences. More research needs to be done to get a better understanding of potentially at-risk populations, and for the identification of molecular pathways of this virus.

    Conclusion

    This study provides insights on the importance of considering the RBC and HLA antigens in regard to the susceptibility, severity, resolution and long-term clinical consequences of COVID-19. Genetic markers such as HLA could help focus our prevention efforts onto potentially more at-risk populations, and help organize vaccination strategies.

    Summary points
    • ABO, RhD, 37 other RBC antigens and HLA-A, B, C, DRB1, DQB1 and DPB1 were determined using high throughput platforms in 90 Caucasian convalescent plasma donors.

    • The AB group was significantly increased in convalescent individuals and the FY*A/*A genotype was trending as overrepresented compared with the expected frequencies.

    • HLA typing was performed on the convalescent donors and their allele frequencies were compared with the NMDP registry and the Héma-Québec Stem Cell Donor Registry. Standardized residual analysis identified eight alleles that were significantly different between the convalescent cohort and the Héma-Québec Registry.

    • Previous literature was searched for suspected associations between the identified HLA alleles and SARS-CoV-2 virus infection and COVID-19 disease characteristics.

    • Further research is needed to confirm the trends and associations observed in this study.

    Author contributions

    P Trépanier and W Lemieux designed the research and wrote the manuscript. P Trépanier, L Richard and M-C Chevrier supervised the research. J Perreault, L Richard, W Lemieux and JC Yanez acquired the data. P Trépanier, J Perreault, L Richard, W Lemieux, N Baillargeon, A Lewin and JC Yanez analyzed the data. P Trépanier, L Richard, W Lemieux, GA Leiva-Torres, N Baillargeon., M-C Chevrier and JC Yanez reviewed and edited the manuscript.

    Acknowledgments

    The authors are grateful to the convalescent plasma donors who participated in this study and the Héma-Québec team involved in convalescent donor recruitment, sample collection, as well as the technologists who performed the phenotyping and genotyping workup. The authors thank Jean-Francois Leblanc and Marie-Eve Rhéaume for revising the manuscript. The authors are also thankful for healthcare workers' dedication worldwide.

    Financial & competing interests disclosure

    The authors have no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

    No writing assistance was utilized in the production of this manuscript.

    Ethical conduct of research

    The authors state that they have obtained appropriate institutional review board approval or have followed the principles outlined in the Declaration of Helsinki for all human or animal experimental investigations. In addition, for investigations involving human subjects, informed consent has been obtained from the participants involved.

    Papers of special note have been highlighted as: • of interest

    References

    • 1. Williamson EJ, Walker AJ, Bhaskaran K et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature 584(7821), 430–436 (2020).
    • 2. Guo W, Li M, Dong Y et al. Diabetes is a risk factor for the progression and prognosis of COVID-19. Diabetes Metab. Res. Rev. 36(7), (2020).
    • 3. Lighter J, Phillips M, Hochman S et al. Obesity in patients younger than 60 years is a risk factor for COVID-19 hospital admission. Clin. Infect. Dis. 71(15), 896–897 (2020).
    • 4. Kangelaris KN, Sapru A, Calfee CS et al. The association between a darc gene polymorphism and clinical outcomes in African American patients with acute lung injury. Chest 141(5), 1160–1169 (2012).
    • 5. Raisi-Estabragh Z, McCracken C, Bethell MS et al. Greater risk of severe COVID-19 in Black, Asian and Minority Ethnic populations is not explained by cardiometabolic, socioeconomic or behavioural factors, or by 25(OH)-vitamin D status: study of 1326 cases from the UK Biobank. J. Public Health 42(3), 451–460 (2020).
    • 6. Yamamoto N, Ariumi Y, Nishida N et al. SARS-CoV-2 infections and COVID-19 mortalities strongly correlate with ACE1 I/D genotype. Gene 758, 144944 (2020).
    • 7. Debnath M, Banerjee M, Berk M. Genetic gateways to COVID-19 infection: implications for risk, severity, and outcomes. FASEB J. 34(7), 8787–8795 (2020). • Reviews links between genetic factors and outcomes from COVID-19.
    • 8. Callard F, Perego E. How and why patients made Long Covid. Soc. Sci. Med. 268, 113426 (2021). • Describes the important long Covid situation.
    • 9. Pendu JL, Breiman A, Rocher J, Dion M, Ruvoën-Clouet N. ABO blood types and COVID-19: spurious, anecdotal, or truly important relationships? A reasoned review of available data. Viruses 13(2), 160 (2021). • Highlights the relavance of ABO to COVID-19.
    • 10. Gutiérrez-Valencia M, Leache L, Librero J, Jericó C, Enguita Germán M, García-Erce JA. ABO blood group and risk of COVID-19 infection and complications: a systematic review and meta-analysis. Transfusion (Paris) 62(2), 493–505 (2022).
    • 11. Barnkob MB, Pottegård A, Støvring H et al. Reduced prevalence of SARS-CoV-2 infection in ABO blood group O. Blood Adv. 4(20), 4990–4993 (2020).
    • 12. Wu Y, Feng Z, Li P, Yu Q. Relationship between ABO blood group distribution and clinical characteristics in patients with COVID-19. Clin. Chim. Acta 509, 220–223 (2020).
    • 13. Goel R, Bloch EM, Pirenne F et al. ABO blood group and COVID-19: a review on behalf of the ISBT COVID-19 working group. Vox Sang. 116(8), 849–861 (2021).
    • 14. Wu S-C, Arthur CM, Wang J et al. The SARS-CoV-2 receptor-binding domain preferentially recognizes blood group A. Blood Adv. 5(5), 1305–1309 (2021).
    • 15. Hu B, Huang S, Yin L. The cytokine storm and COVID-19. J. Med. Virol. 93(1), 250–256 (2021).
    • 16. Berzuini A, Bianco C, Paccapelo C et al. Red cell-bound antibodies and transfusion requirements in hospitalized patients with COVID-19. Blood 136(6), 766–768 (2020).
    • 17. Sanchez-Mazas A. HLA studies in the context of coronavirus outbreaks. Swiss Med. Wkly 150, w20248 (2020).
    • 18. Nguyen A, David JK, Maden SK et al. Human leukocyte antigen susceptibility map for severe acute respiratory syndrome coronavirus 2. J. Virol. 94(13), e00510-20 (2020).
    • 19. Saulle I, Vicentini C, Clerici M, Biasin M. Antigen presentation in SARS-CoV-2 infection: the role of class I HLA and ERAP polymorphisms. Hum. Immunol. 82(8), 551–560 (2021).
    • 20. Ahmed SF, Quadeer AA, McKay MR. Preliminary identification of potential vaccine targets for the COVID-19 coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies. Viruses 12(3), 254 (2020). • Both highlight important HLA-related informations regarding COVID-19.
    • 21. Romero-López JP, Carnalla-Cortés M, Pacheco-Olvera DL et al. Prediction of SARS-CoV2 spike protein epitopes reveals HLA-associated susceptibility. Res. Prepr. [Internet] (2020). Available from: www.researchsquare.com/article/rs-25844/v1
    • 22. Wang W, Zhang W, Zhang J, He J, Zhu F. Distribution of HLA allele frequencies in 82 Chinese individuals with coronavirus disease-2019 (COVID-19). HLA 96(2), 194–196 (2020).
    • 23. Novelli A, Andreani M, Biancolella M et al. HLA allele frequencies and susceptibility to COVID-19 in a group of 99 Italian patients. HLA 96(5), 610–614 (2020).
    • 24. Lorente L, Martín MM, Franco A et al. HLA genetic polymorphisms and prognosis of patients with COVID-19. Med. Intensiva. 45(2), 96–103 (2020).
    • 25. Warren RL, Birol I. HLA predictions from the bronchoalveolar lavage fluid and blood samples of eight COVID-19 patients at the pandemic onset. Bioinformatics 36(21), 5271–5273 (2020).
    • 26. Correale P, Mutti L, Pentimalli F et al. HLA-B*44 and C*01 Prevalence Correlates with Covid19 Spreading across Italy. Int. J. Mol. Sci. 21(15), 5205 (2020).
    • 27. Sakuraba A, Haider H, Sato T. Population difference in allele frequency of HLA-C*05 and its correlation with COVID-19 mortality. Viruses 12(11), 1333 (2020).
    • 28. Langton DJ, Bourke SC, Lie BA et al. The influence of HLA genotype on the severity of COVID-19 infection. HLA 98(1), 14–22 (2021). • Both highlight important HLA-related informations regarding COVID-19.
    • 29. Données sur les variants du SRAS-CoV-2 au Québec [Internet]. INSPQ. Available from: www.inspq.qc.ca/covid-19/donnees/variants
    • 30. Reid ME, Lomas-Francis C, Olsson ML. The Blood Group Antigen FactsBook. Academic Press (2012).
    • 31. Gragert L, Madbouly A, Freeman J, Maiers M. Six-locus high resolution HLA haplotype frequencies derived from mixed-resolution DNA typing for the entire US donor registry. Hum. Immunol. 74(10), 1313–1320 (2013).
    • 32. Nunes JM, Buhler S, Roessli D, Sanchez-Mazas A. the HLA-net 2013 collaboration. The HLA-net GENE[RATE] pipeline for effective HLA data analysis and its application to 145 population samples from Europe and neighbouring areas. Tissue Antigens 83(5), 307–323 (2014).
    • 33. Weir BS, Goudet J. A unified characterization of population structure and relatedness. Genetics 206(4), 2085 (2017).
    • 34. Takezaki N, Nei M. Genetic distances and reconstruction of phylogenetic trees from microsatellite DNA. Genetics 144(1), 389 (1996).
    • 35. R Core Team. R: a language and environment for statistical computing. In: R Foundation for Statistical Computing. Vienna, Austria (2013).
    • 36. Shkurnikov M, Nersisyan S, Jankevic T et al. Association of HLA class I genotypes with severity of coronavirus disease-19. Front. Immunol. 12, 423 (2021).
    • 37. Pisanti S, Deelen J, Gallina AM et al. Correlation of the two most frequent HLA haplotypes in the Italian population to the differential regional incidence of Covid-19. J. Transl. Med. 18(1), 352 (2020).
    • 38. Vishnubhotla R, Sasikala M, Ketavarapu V, Reddy DN. High-resolution HLA genotyping identifies alleles associated with severe COVID-19: a preliminary study from India. Immun. Inflamm. Dis. 9(4), 1781–1785 (2021).
    • 39. Al-Youha SA, Alduaij W, Al-Serri A et al. The impact of ABO blood groups on clinical outcomes and susceptibility to COVID-19: a retrospective study in an unselected population. Transfusion (Paris) 61(5), 1631–1641 (2021).
    • 40. Trépanier P, Chevrier M, Constanzo Yanez J, Baillargeon N, St-Pierre C, Perreault J. Adapting to supply-and-demand emerging trends for antigen-negative red blood cell units. Transfusion (Paris) 61(5), 1489–1494 (2021).
    • 41. Tai DBG, Shah A, Doubeni CA, Sia IG, Wieland ML. The disproportionate impact of COVID-19 on racial and ethnic minorities in the United States. Clin. Infect. Dis. 72(4), 703–706 (2021). • Highlights the race concerns.
    • 42. Tournamille C, Colin Y, Cartron JP, Le Van Kim C. Disruption of a GATA motif in the Duffy gene promoter abolishes erythroid gene expression in Duffy-negative individuals. Nat. Genet. 10(2), 224–228 (1995).
    • 43. Hebbel RP, Vercellotti GM. SARS-CoV-2 severity in African Americans – a role for Duffy Null? Haematologica 105(12), 2892 (2020).
    • 44. Szymanski J, Mohrmann L, Carter J et al. ABO blood type association with SARS-CoV-2 infection mortality: a single-center population in New York City. Transfusion (Paris) 61(4), 1064–1070 (2021).
    • 45. Migliorini F, Torsiello E, Spiezia F, Oliva F, Tingart M, Maffulli N. Association between HLA genotypes and COVID-19 susceptibility, severity and progression: a comprehensive review of the literature. Eur. J. Med. Res. 26(1), 84 (2021).
    • 46. Tavasolian F, Rashidi M, Hatam GR et al. HLA, immune response, and susceptibility to COVID-19. Front. Immunol. 11, 3581 (2021).
    • 47. Tomita Y, Ikeda T, Sato R, Sakagami T. Association between HLA gene polymorphisms and mortality of COVID-19: an in silico analysis. Immun. Inflamm. Dis. 8(4), 684–694 (2020).
    • 48. Barquera R, Collen E, Di D et al. Binding affinities of 438 HLA proteins to complete proteomes of seven pandemic viruses and distributions of strongest and weakest HLA peptide binders in populations worldwide. HLA 96(3), 277–298 (2020).