Genetic diversity of the toll-like receptor 2 (TLR2) in hare (<i>Lepus capensis</i>) populations from Tunisia

Asma Awadi; Hichem Ben Slimen; Steve Smith; Jonas Kahlen; Mohamed Makni; Franz Suchentrunk

doi:10.1016/j.crvi.2018.06.005

Molecular biology and genetics

Genetic diversity of the toll-like receptor 2 (TLR2) in hare (Lepus capensis) populations from Tunisia

Asma Awadi ¹ ; Hichem Ben Slimen ^1,² ; Steve Smith ³ ; Jonas Kahlen ³ ; Mohamed Makni ¹ ; Franz Suchentrunk ³

¹ UR Génomique des insectes ravageurs des cultures d’intérêt agronomique (GIRC), Université de Tunis El-Manar, 2092 El Manar, Tunis, Tunisia
² Institut supérieur de biotechnologie de Béja, Beja 9000, University of Jendouba, Tunisia
³ Research Institute of Wildlife Ecology, University of Veterinary Medicine Vienna, Savoyenstrasse 1, 1160 Vienna, Austria

Comptes Rendus. Biologies, Volume 341 (2018) no. 6, pp. 315-324.

Résumé

Toll-like receptors (TLRs) are a major group of proteins that recognize molecular components of infectious agents, known as pathogen associated molecular patterns (PAMPs). The structure of these genes is similar and characterized by the presence of an ectodomain, a signal transmembrane segment and a highly conserved cytoplasmic domain. The latter domain is homologous to the human interleukin-1 receptor (IL1R) and human IL-18 receptor (IL-18R) and designated TIR domain. The latter domain of the TLR genes was suggested to be very conservative and its evolution is driven by purifying selection. Variability and evolution of the TIR sequences of TLR2 gene were studied in three hare populations from Tunisia with different ecological characteristics (NT–North Tunisia with Mediterranean, CT–Central Tunisia with semi-arid, and ST–South Tunisia with arid climate). Sequencing of a 372 bp fragment of TIR2 revealed 25 alleles among 110 hares. Twenty variable nucleotide positions were detected, of which 7 were non-synonymous. The highest variability was observed in CT, with 16 polymorphic positions. In ST, only 4 polymorphic nucleotide positions were detected with all diversity values lower than those recorded for the other two populations. By using several approaches, no positive selection was detected. However, evidence of purifying selection was found at two positions. The logistic models of the most common TIR2 protein variant that we run to examine whether its occurrence was affected by climatic variation independent of the geographic sample location suggested only a longitudinal effect. Finally, the mapping of the non-synonymous mutations to the inferred tertiary protein structure showed that they were all localized in the different loop regions. Among all non-synonymous substitutions, three were suggested to be deleterious as evidenced by PROVEAN analysis. The observed patterns of variability characterized by low genetic diversity in ST might suggest that the TIR region was more affected, than other markers, by genetic drift or/and that these patterns were shaped by different selective pressures under different ecological conditions. Notably, this low diversity was not detected by other (putatively neutral) microsatellite markers analysed in the course of other studies. But low diversity was also found for two MHC class II adaptive immune genes. As expected from functionally important regions, the evolution of the TIR2 domain is mainly driven by purifying selection. However, the occurrence of deleterious non-synonymous substitutions might highlight the flexible evolution of the TIR genes and/or their interactions with other proteins.

Métadonnées

Reçu le : 2018-04-09
Accepté le : 2018-06-15
Publié le : 2018-07-01

PMID

DOI : 10.1016/j.crvi.2018.06.005

Mots-clés : Toll-like receptor 2 (TLR2), TIR domain, Lepus capensis, Genetic diversity, Natural selection, Functional variation

Affiliations des auteurs :

Asma Awadi ¹ ; Hichem Ben Slimen ^{1, 2} ; Steve Smith ³ ; Jonas Kahlen ³ ; Mohamed Makni ¹ ; Franz Suchentrunk ³

¹ UR Génomique des insectes ravageurs des cultures d’intérêt agronomique (GIRC), Université de Tunis El-Manar, 2092 El Manar, Tunis, Tunisia
² Institut supérieur de biotechnologie de Béja, Beja 9000, University of Jendouba, Tunisia
³ Research Institute of Wildlife Ecology, University of Veterinary Medicine Vienna, Savoyenstrasse 1, 1160 Vienna, Austria

@article{CRBIOL_2018__341_6_315_0,
     author = {Asma Awadi and Hichem Ben Slimen and Steve Smith and Jonas Kahlen and Mohamed Makni and Franz Suchentrunk},
     title = {Genetic diversity of the toll-like receptor 2 {(TLR2)} in hare {(\protect\emph{Lepus} capensis}) populations from {Tunisia}},
     journal = {Comptes Rendus. Biologies},
     pages = {315--324},
     publisher = {Elsevier},
     volume = {341},
     number = {6},
     year = {2018},
     doi = {10.1016/j.crvi.2018.06.005},
     language = {en},
}

TY  - JOUR
AU  - Asma Awadi
AU  - Hichem Ben Slimen
AU  - Steve Smith
AU  - Jonas Kahlen
AU  - Mohamed Makni
AU  - Franz Suchentrunk
TI  - Genetic diversity of the toll-like receptor 2 (TLR2) in hare (Lepus capensis) populations from Tunisia
JO  - Comptes Rendus. Biologies
PY  - 2018
SP  - 315
EP  - 324
VL  - 341
IS  - 6
PB  - Elsevier
DO  - 10.1016/j.crvi.2018.06.005
LA  - en
ID  - CRBIOL_2018__341_6_315_0
ER  -

%0 Journal Article
%A Asma Awadi
%A Hichem Ben Slimen
%A Steve Smith
%A Jonas Kahlen
%A Mohamed Makni
%A Franz Suchentrunk
%T Genetic diversity of the toll-like receptor 2 (TLR2) in hare (Lepus capensis) populations from Tunisia
%J Comptes Rendus. Biologies
%D 2018
%P 315-324
%V 341
%N 6
%I Elsevier
%R 10.1016/j.crvi.2018.06.005
%G en
%F CRBIOL_2018__341_6_315_0

Asma Awadi; Hichem Ben Slimen; Steve Smith; Jonas Kahlen; Mohamed Makni; Franz Suchentrunk. Genetic diversity of the toll-like receptor 2 (TLR2) in hare (Lepus capensis) populations from Tunisia. Comptes Rendus. Biologies, Volume 341 (2018) no. 6, pp. 315-324. doi : 10.1016/j.crvi.2018.06.005. https://comptes-rendus.academie-sciences.fr/biologies/articles/10.1016/j.crvi.2018.06.005/

Version originale du texte intégral

Le texte intégral ci-dessous peut contenir quelques erreurs de conversion par rapport à la version officielle de l'article publié.

1 Introduction

The adaptive and the innate immunity are the two parts of the mammalian immune system. The adaptive immune system is based on molecules corresponding to MHC antigens, T-cell receptors, B-cell receptors and antibodies. However, the innate immunity provides the first line of defence against infection [1] and constitutes a set of disease-resistance mechanisms that are not specific to a particular pathogen but that include cellular and molecular components that recognize classes of molecules peculiar to frequently encountered pathogens [2]. Among these components, Toll-like receptors (TLRs) are a major group of proteins that recognize molecular components of infectious agents, known as pathogen associated molecular patterns (PAMPs) [2]. To date, 13 TLRs have been described in mammals, although it has been shown that not all species contain this full component of receptors [3]. The structure of these genes is similar and characterized by the presence of an ectodomain, a signal transmembrane segment and a highly conserved cytoplasmic domain homologous to the human interleukin-1 receptor (IL1R) and human IL-18 receptor (IL-18R) and designated TIR domain [4,5]. In mammals, TIR domains are involved in mediating interactions in the Toll-like receptor and interleukin-1 signalling pathways [6]. Among TLR genes, TLR2 is located on the outer membrane and forms a dimer complex with TLR1 or TLR6 to recognize peptidoglycans, lipoproteins or lipoteichoic acid of Gram-positive bacteria [7,8]. This gene is widely expressed across species and recognises the greatest number of PAMPs, detecting components from bacteria, viruses and fungi [9,10]. TLR2 was suggested to exhibit high levels of polymorphism in several mammal species [11–13].

Evolutionary patterns of genes of the innate immune system are still under intense debate. The classical view considers this polymorphism of the evolutionary ancient TLR genes to be strongly optimized by natural selection and, therefore, should evolve under purifying selection [14]. Indeed, several point mutations affecting TLR genes were suggested to alter the immune response [15] or to increase susceptibility to infection in sheep [16] and in humans [17,18]. However, recent studies [19,20] have suggested that TLR genes involved in pathogen recognition are evolving in direct response to pathogen-mediated selective pressures. Evidences of adaptive substitutions were observed in bovine TLR2 and TLR5 [12,21], in TLR4 in primates and birds [22,23], and in TLR2 in birds [19] and in sheep [13].

Hares from Tunisia are found across the whole country along a steep ecological gradient ranging from a Mediterranean humid climate in the north down to a Saharan climate in the south. Population genetic data on these hares that were based on nuclear and mitochondrial DNA markers (microsatellites, transferrin intron sequences, mtCR1 sequences) indicated relatively high levels of gene flow and high genetic diversity [24,25]. However, variability of the adaptive immunity MHC class II genes showed more spatial partitioning than the supposedly neutral microsatellite markers, parallel to strong positive selection on these immune genes [26]. Moreover, the observed pattern of positive selection followed climatic variation across the country suggesting occurrence of different pathogen pressures in the different ecoregions. In this study, we examined the level of genetic diversity of the TIR domain of the TLR2 gene of the innate immune system in hares from three regions in Tunisia with two very different climates (NT–North Tunisia, with Mediterranean climate, and ST–South Tunisia, with arid Saharan climate) and one region between these two regions (CT–Central Tunisia, a transition zone with semi-arid climate). We aimed to investigate the level of genetic diversity within and among populations and to compare diversity patterns of TIR2 sequences to the earlier results from other markers [24–26]. In addition, we intended to test whether the observed pattern of diversity has been shaped by neutral or selective processes and if climatic differences may affect the occurrence of protein variants. We looked also for evidence of positive and purifying selection at single codons of the analysed sequences. Finally, the tertiary structure of TIR2 encoded proteins was predicted using computational program and homology modelling methods.

2 Material and methods

2.1 Samples

A total of 110 hares were collected by hunters at fifteen locations in Tunisia across a distance of less than 500 km between the northern Mediterranean seaboard with Mediterranean climate and high annual rainfall (ca. 916 mm) and the arid northern parts of the Saharan desert with less than 100 mm annual rainfall. Localities and sample size of these specimens are shown in Fig. 1 along with assignment of localities to the three regions NT–North Tunisia, CT–Central Tunisia, ST–South Tunisia. Those three regions were operationally considered three populations.

Fig. 1
Sampling regions of hares from North, Central and South Tunisia. Sample sizes appear in parentheses. Hares were grouped into three populations according to climatic, geographic and phenotypic data. The northern population with samples from six regions (FER: Fernana; JEN: Jendouba; TAB: Tabarka; BEJ: Béja, STH: Sidi Thabet and KLB: Kélibia); the central population with samples from six regions (NAD: Nadhour; WES: Weslatia; KAL: Kalâa; BKL: Bekalta; CHE: Cherarda; SND: Sned); and the southern population with samples from three regions (DOU: Douz; TAT: Tataouine, and BGD: Ben Guerdène). Masquer
Sampling regions of hares from North, Central and South Tunisia. Sample sizes appear in parentheses. Hares were grouped into three populations according to climatic, geographic and phenotypic data. The northern population with samples from six regions (FER: Fernana; JEN: Jendouba; ... Lire la suite

2.2 DNA amplification and typing via next-generation sequencing

Protocols used for DNA extraction are described in previous publications [25,27,28]. We targeted a 372 bp fragment of Toll like receptor 2 corresponding to the Toll-interleukin-1 receptor domain protein (TIR2) in a total of 110 hare specimens. Briefly, library preparation was performed by firstly amplifying each sample using the primer pair 5′-ATGCGTTCGTGTCCTACAGC-3′ and TLR-R 5′-CTCAAGTTCCCCCAGAACC-3′. A second round of PCRs was carried out to attach unique DNA barcodes to all samples and achieve compatibility with Illumina's MiSeq flow cell. PCR products were then purified, and after the quality and quantity of PCR products were estimated, all samples were pooled and sent to the Microsynth (AG) for sequencing on an Illumina Miseq using 2 × 250 bp chemistry.

Initial TIR2 sequence data processing was achieved as outlined in Biedrzycka et al. [29] and Sebastien et al. [30] using the different amplicon sequencing analysis tools available at: http://www.evobiolab.biol.amu.edu.pl/amplisat/.

2.3 Analysis of polymorphism and genetic differentiation

DNA polymorphism within populations (haplotype diversity h, nucleotide diversity π, and mean number of pairwise differences k) was estimated using DNASP v. 5 [31]. The Tajima's D [32], also implemented in the same program, was used to test the hypothesis that sequence variation of the TIR2 domain does not differ from neutral expectations. A test of deviation from Hardy–Weinberg equilibrium was calculated using GENEPOP 4.0 [33] separately for each population. The GENETIX program v. 4.05 [34] was used to calculate allele frequencies, observed (H_O) and expected heterozygosity (H_E). The FSTAT program, version 2.9.3 [35] was used to calculate population-specific values of allelic richness (Rs) based on a rarefaction approach to account for different sample sizes.

Population differentiation was determined by calculation of standardized pairwise F_ST (10,000 permutations) in GENETIX.

A Median-Joining network [36] was constructed in order to model the phylogenetic relationships among the observed haplotypes and their distribution among the three populations. The network was rooted by one outgroup haplotype of European rabbits (Oryctolagus cuniculus) to obtain an indication of its evolutionary direction.

2.4 Selection analyses

Different approaches were used to test whether positive selection historically operated on the TIR2 sequences. First, we used CODEML [37] to test for site-specific positive selection. Different codon-based models of selection exist in PAML, which generally produce equivalent results although some tests are suggested to be more conservative than others [38]. To increase the likelihood of detection of positive selection, we used the less conservative M7/M8 test to examine the extent of selection acting on the TIR2 domain. The null model M7 (beta) was compared to M8 (beta plus omega). The comparison was performed using the likelihood ratio test (LRT): twice the log-likelihood difference was compared with a χ² distribution with degrees of freedom equal to the difference in the number of parameters between both models. Significant amino acids sites under positive selection were considered using the Bayes Empirical Bayes (BEB) approach with posterior probability at 95% cut-off [39].

In a second approach, the OmegaMap program [40] based on a Bayesian population genetics approximation to the coalescent theory, was used as proposed by Smith et al. [41]. It generates means and credible intervals for the selection parameter (dN/dS = ω) and recombination rate (ρ = 4 N r) for each codon, respectively (N and r represent the effective population size and the per codon rate of recombination). Two Markov chain Monte Carlo runs of 250,000 iterations (25,000 iteration burn-in) on population allele frequencies at each locus were compared for convergence. Codons are considered as positively selected with posterior probabilities greater than 95%.

Finally, we tested for codon-specific signatures of positive and negative selection across the TIR2 sequences using the DATAMONKEY webserver (http://www.datamonkey.org/; last accessed 15th August 2017) [42]. We first identified recombination break points with genetic algorithm recombination detection (GARD) [43]. The output was then used to run four different maximum likelihood methods for detection of selection: SLAC (single likelihood ancestral counting), FEL (fixed effects likelihood), REL (random effects likelihood), and Mixed Effects Model of Evolution (MEME) [44]. Significance levels of P < 0.25 in SLAC and FEL and P < 0.05 in MEME and Bayes factors > 50 in REL were considered as indicating positively selected sites. We considered a codon to be positively selected only if it was identified by at least two of the methods [42].

In addition to the applied tests, and in order to infer molecular signatures of contemporary selection, we used the model-based approach of Beaumont and Nichols [45], based on TIR2 genotypes, implemented in Lositan [46] to compare the observed F_ST values estimated at each locus to a null distribution of F_ST conditional on heterozygosity. The TIR2 sequences as well as the genotypes of fourteen microsatellite loci studied earlier in these hares in the same laboratory [24] were tested for neutrality under 50,000 simulations, estimated neutral mean F_ST, an infinite alleles mutation model, a 99% confidence interval and a false discovery rate of 0.1%.

2.5 Testing for climate effects on occurrence of TIR2 protein variants

Given that TIR2 protein variant A was the most frequent in all three populations and all other variants were by far less frequent, we focused on examining whether the presence of this variant either in a homozygous or heterozygous genotype may be affected by climate independent of geographic position of the collected hares. We used the WORLDCLIM data set for 2.5-min intervals (Version 1.4, http://www.worldclim.org/bioclim.htm) to extract mean climate data for all sampling locations and used the following climate variables Bio 1, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 to run a principal-component analysis (PCA). The non-rotated PCA was based on a variance-covariance matrix of ln-transformed original values with eigen values greater than 1.0 times the mean eigen value. The individual scores of each of the two extracted factors (i.e. climate factor 1 and climate factor 2; see § “Results”) were then used in our logistic model as independent variables together with geographical latitude and longitude. The major problems for the modelling, however, were that latitude and individual scores for climate factor 1were highly correlated (r = +0.938), meaning that one variable could be almost fully substituted by the other. Therefore, we run two separate models, one with latitude, longitude, and climate factor 2 scores, and one with climate factor 1 scores, longitude, and climate factor 2 scores. According to information theory [47], the model with the lowest AICc value should be chosen for argumentation and statistical conclusion.

The models were run in R [48] and their syntaxes were as follows:

Gm = glm(tlrprotA ∼ long + lat + Fac2, data = dat, family = binomial(link = “logit”))

Gm = glm(tlrprotA ∼ long + Fac1 + Fac2, data = dat, family = binomial(link = “logit”))where “tlrprotA” is the protein variant A being present either in a homozygous or heterozygous genotype, or absent, “long” is geographical longitude, “lat” is geographical latitude, “Fac1”, and “Fac2” are the climate factors 1 and 2 as obtained from the PCA.

2.6 Protein structure analysis and in silico prediction of mutation effects

Fold recognition of TIR2 was performed with HHPRED (https://www.toolkit.tuebingen.mpg.de/#/tools/hhpred, last accessed 15 August 2017) [49]. The crystal structure of the human Toll like Receptor 2 TIR signalling domain (PDB: 1FYX_A; 2.8 Å resolution) was identified as the best template for residues 14–134 (HHPred E-value = 2.6 × 10⁻¹⁷). The tertiary structure prediction was generated using MODELLER (https://www.toolkit.tuebingen.mpg.de/#/tools/modeller, last accessed 15 August 2017) [50] based on the above-mentioned template. Amino acid residues were mapped onto this structure, and a PyMOL script file was generated for visualization using PyMOL (version 1.8) (http://www.pymol.org) [51].

Finally, to assess the functional effect of all amino acid substitutions, the Protein Variation Effect Analyser (PROVEAN) [52] was used. The default confidence threshold of −2.5 was used to determine if an amino acid replacement is likely to influence the protein function. The most frequent allele sequence was used as a template, and every fixed amino acid replacement per sequence was used as a query.

3 Results and discussion

This is the first study presenting a partial sequence analysis of TLR2 gene of a hare species (Lepus capensis). Several studies addressing genetic variability of TLRs of mammals have shown different levels of diversity [13,19,38,53]. Moreover, genetic diversity was clearly different between the LRR molecules, the transmembrane, and the TIR domains with the last region being very conservative. The analysis of genetic variability of TLR3 in 80 wild and domestic rabbit lineages [53] revealed 41 single nucleotide polymorphisms (SNPs) of which fourteen were non-synonymous. Among these non-synonymous mutations, eleven were detected in the LRR molecules and only 1 in the TIR region. Smith et al. [13] have detected 18 polymorphic sites in ten sheep breeds with two of them localized in the TIR domain. Here, we analysed genetic diversity and selection of the TIR domain corresponding to positions 1999 to 2370 of the total TLR2 gene of the rabbit O. cuniculus (GenBank accession No.: NM_001082781). We detected 25 alleles (GenBank accession numbers MH493864-MH493888) based on 20 variable sites of which seven were non-synonymous in the 372 TIR2 sequence fragments studied (Fig. 2). The observed diversity is summarized through the population genetic parameters reported in Table 1. Overall haplotype diversity was 0.733, whereas overall nucleotide diversity was 0.00399 and mean number of pairwise differences was 1.483. The values calculated separately for the three populations indicated that genetic diversity was lower in the southern population. These results were further confirmed by the geographical distribution of the 25 detected alleles. The numbers of alleles per population were 19, 14, and 4 in NT, CT and ST, respectively (Appendix A). In general, in all three populations, allele 2 was highly frequent, whereas most other alleles were detected at very low frequencies (Appendix A). A geographical comparison of the obtained genotypes revealed many alleles specific to single populations (“private alleles”). Only four alleles were shared between the three populations (Appendix A), whereas six alleles were specific to NT, twelve alleles to CT and no allele to ST. Finally, allelic richness (Rs), which measure the number of alleles independent of sample size, was high and similar in NT (12.13) and CT (12.47) and low in ST (4.00) (Appendix A).

Fig. 2
Amino acid sequence alignment of TIR2 haplotypes. A to J are the names of the different proteins detected in the TIR2 gene. Positively and negatively selected codons identified by each method are indicated with “*” and “–”, respectively. Numbering of amino acid positions is based on the full gene sequence in O. cuniculus (NM_001082781).

Table 1

	N	S	H	π	Hd	k	D	Rs	H _E	H _O
NT	50	12	14	0.00603	0.695	2.24163	−0.48291	12.126	0.6808	0.6400
CT	130	16	19	0.00714	0.789	2.65760	−0.40866	12.471	0.7833	0.7846
ST	40	4	4	0.00356	0.547	1.32436	0.97479	4.000	0.5338	0.6000
ALL	220	20	25	0.00399	0.733	1.48281	−1.54535	–	–	–

The high genetic diversity of the presently studied gene segment is congruent with results of other genetic markers [24,25] confirming the general tendency of hare populations in Tunisia to display high values of genetic diversity. This might indicate a population growth from bottlenecked ancestral populations [24,25] or/and ancient and recent gene flow from neighbouring regions not sampled in the studies carried out until now [24,27]. In addition, the high diversity of the immune genes studied including the current marker might be influenced by the strong climate and habitat variation. However, in line with the studied MHC loci that were found to be under strong positive selection, but unlike the neutral microsatellite markers, the genetic diversity (i.e. allelic richness) of the currently studied TLR2 locus sequences was relatively low in the ST with its arid climate. Notably, the currently found genetic diversity values across the three populations were similar between the three regions as calculated from mtCR1 [24] and transferrin [25] sequences. The currently reduced diversity in the ST population might suggest that the diversity of the functionally important TIR2 domain is more affected than other markers studied until now by genetic drift. Indeed, Knaffer et al. [54] showed that TLR diversity was affected to a greater extent by contemporary bottlenecks than MHC and microsatellite loci in saddleback (Philesturnus carunculatus) populations. The currently observed loss of genetic diversity towards the southern arid region might be also explained by natural selection. As observed for MHC genes [26], rare allele advantage–as a form of balancing selection–might be proposed to be an important determinant of TIR2 variability in the studied hare populations. Actually, we observed a high number of rare and private alleles; more specifically, 17 rare alleles (68%) were detected and were found solely as heterozygous alleles, as expected in the context of balancing selection [55]. However, the Tajima's D test did not reject the null hypothesis that all studied populations are evolving under neutrality (Appendix A). Moreover, only the NT population deviated from Hardy Weinberg expectations (Appendix A).

Pairwise F_ST values ranged between −0.001 (NT-ST) and 0.037 (CT-ST) (Appendix A). This low differentiation was also confirmed by the median network of the obtained haplotypes (Fig. 3) that indicated little phylogenetic divergence and an absence of geographically meaningful phylogroups. This finding is in accordance with earlier results based on partial transferrin sequences [25] and genotypes of fourteen microsatellite loci [27]. As most common alleles are generally common across the studied regions, we do not see any profound reorganization of regional TLR2 gene pools despite the occurrence of a quite remarkable overall number of private alleles at low frequencies. The presence of a high number of unique haplotypes might suggest that evolution in TLR genes are mainly driven by point mutations rather than recombination and gene conversion. Indeed, no recombination was obtained in the currently studied sequences when using GARD.

Fig. 3
Median-joining network showing the phylogenetic relationships among TIR2 haplotypes. Relative haplotype (= allele) frequencies correspond to haplotype circle size (see Appendix A). Mutation steps are indicated by numbers on lines connecting haplotypes if higher than one. Small grey square indicates inferred haplotype. Green circle/pie segment: northern population, red circle/pie segment: central population, and yellow circle/pie segment: southern population.

The PCA of the chosen climate factors yielded two principal components (factors): the first factor that explained 87.4% of the variance of the climate variables; according to the loadings of the bioclimatic variables into this factor (i.e. the correlation of the individual scores with the transformed values of the climate variables used for the PCA), it could be interpreted as a general precipitation factor. The second factor could be interpreted as a factor of ambient temperature during cold and wet periods of the year. The two logistic models of the most common TIR2 protein variant A that we run to examine whether its occurrence was affected by climatic variation independent of the geographic sample location returned very similar AICc values (model with latitude in: AICc = 46.023 and model with climate factor 1 in: AICc = 47.552). This did not allow preferring one over the other model [47]. For the model with latitude instead of climate factor 1 the values of relative variable importance (RVI) amounted to 0.71 for longitude, 0.63 for latitude, and 0.44 for climate factor 2. Hence, there was only a longitudinal effect, when accepting the RVI threshold value of 0.70 for statistically important variables [47]. For the model with climate factor 1 instead of latitude in the RVI values amounted to 0.71 for longitude, 0.61 for climate factor 2, and 0.30 for climate factor 1. Obviously, whereas there was a significant longitude effect, neither latitude nor any climate factor had a significant effect on the occurrence of the most common TIR2 protein A across the three climatic zones in Tunisia. However, the high explanatory power of latitude in terms of the climate factor 1 did not allow us to investigate the role of the currently used climate variable on the distribution of the TIR2 protein by fully accounting for the pure geographical sample distribution, i.e. potential neutral population genetic causes. A wider geographic sample arrangement, e.g., across larger parts of North Africa, may, however, allow running such a model. It might also confirm the currently identified TIR2 protein variant “A” as a general key protein independent of climatic variation, or contrary yield a significant climatic effect on its distribution. It might also help explain the meaning of the high number of protein variants that we have currently specifically detected in the more humid and less hot climate zone of Tunisia.

Immune genes are expected to be under strong natural selection due to their essential roles in recognizing and eliminating infectious agents. Indeed, a signature of positive selection has been reported for several TLRs genes including TLR2 [1,19,38,56]. However, most of the positions suggested under positive selection were reported for outside of the TIR domains. Areal et al. [19] found that positively selected sites are mainly localized in the LRR domain whereas the TIR domains of several TLR genes contained only few sites or none under positive selection in several mammalian species. Similarly, Smith et al. [13] found nine positively-selected sites that were all positioned within the extracellular domain of the ovine TLR2. According to Ishengoma and Agaba [38] the mapping of positively-selected sites to the three major TLR domains revealed that 92 to 100% sites were located in the extracellular domain of the studied TLR genes. In our study we applied several tests to detect positive and negative selection, but no sites were found to have evolved under positive selection according to PAML, DATAMONKEY, and OmegaMap. For PAML, the null model in CODEML was preferred over the alternative model (P (ΔLRT) > 0.05) in the model comparison between M7 (ln L = −707.836516) vs M8 (ln L = −706.062386) suggesting that the analysed sites were evolving under strong purifying selection or neutrally. In DATAMONKEY, two positions (84 and 96; Fig. 2) were suggested to be under positive selection by REL. However, those signals for the latter two positions were not confirmed by any of the other DATAMONKEY tests (Fig. 2). Therefore, we considered them as false positive signals [42]. Accordingly, the outlier test revealed that the studied locus was evolving neutrally (Fig. 4). However, negative selection was suggested in a total of ten amino acid positions among which only two (51 and 82) were confirmed by more than one of the different DATAMONKEY tests (Fig. 2).

Fig. 4
Plot of the outlier tests based on TLR2 genotypes and fourteen microsatellite loci for the three populations. The upper and lower 99% confidence interval boundaries are indicated by solid line.

The TIR domain of the TLRs is characterized by three Box regions (Boxes 1, 2, and 3) which are important in signal transduction; they are highly conserved and should consequently be under strong purifying selection [57,58]. As expected, due to their functional constraints, none of the seven non-synonymous sites were located in these boxes. The mapping of these non-synonymous sites to the protein structure of the TIR domain (Fig. 5) showed that they were distributed along the different loop regions (14 in AA loop; 62 in BC loop; 94, 96, and 97 in CD loop; 109 in DD loop; 131 in EE loop). The functional effects of the loop regions are better known for the BB loop; it has been suggested that non-synonymous mutations in this region inhibit the response to lipopolysaccharide (LPS) as observed in mouse TLR4 [59,60] or to affect signal transduction as observed in sheep TLR2. This functional importance of the loop regions is confirmed in the current study by our PROVEAN analysis that indicated that three substitutions (E13K, A94T and A97T) among the non-synonymous positions have deleterious effects. However, comparisons of TIR domains between different species revealed the occurrence of large insertions or deletions in several loop regions of these domains leading to size variation of the TIR regions among the available sequences [57]. Two hypotheses might be suggested to explain the observed diversity profiles: first, Barreiro et al. [61] suggested that non-viral TLRs have a more flexible evolution and therefore tolerate non-synonymous mutations which, in some circumstances, can be subject to positive selection and become fixed in some populations. This higher tolerance is because the function of non-viral TLRs is more redundant than of viral TLRs. Indeed, several surface TLRs are able to recognize the same bacteria and fungi components. Therefore, a non-synonymous mutation in one TLR does not necessarily mean the extinction of the function and does not compromise immunity [57]; second, TIR domains function through self-association and interaction with other TIR domains (homotypic interactions) to create scaffolds that facilitate the formation of protein complexes. TIR domains have also been shown to interact with proteins that do not contain TIR domains (heterotypic interactions). Such interactions could be an indication of co-evolution and positive selection or fixation of deleterious mutations in either genome can lead to subsequent selection acting on the other.

Fig. 5
Structural modelling of the TIR domain of TLR2. In our homology model are located the non-synonymous mutations.

4 Conclusions

The currently observed pattern of genetic diversity and genetic differentiation in TIR2 sequences in hare populations from Tunisia might result from a combination of various neutral population genetic processes, such as historical reduction of effective population size, subdivision, and migration, as well as natural selection. However, we are currently unable to disentangle these different effects in the studied populations. Moreover, a combination of such potential evolutionary factors might blur their detection with the different statistical methods. Evidence of purifying selection was found in the analysed sequences conforming to the general evolution pattern of these genes. However, climatic factors as potential surrogates of varying pathogenic landscapes may also affect the distribution of protein variants, but to prove such effects, a wider geographical sampling would be necessary.

Disclosure of interest

The authors declare that they have no competing interest.

Acknowledgements

Laboratory work was done at the Genetics laboratory of the Research Institute of Wildlife Ecology (Vienna, Austria). We wish to express our thanks to Mrs. Anita Haiden (Vienna) for valuable help in the laboratory. Partial financial support was provided by “Wildlife Research–Franz Suchentrunk”.

Appendix A Allele frequencies and HWE test P-value (P) for the three Tunisian hare populations. The sample size is indicated between parentheses for each population. The corresponding protein for each allele is also given. Ten protein variants were detected and were named from A to J (see Fig. 2).

Allele	Protein	NT (25)	CT (65)	ST (20)
1	A		0.0077
2	A	0.5400	0.4308	0.6500
3	A		0.1077
4	B	0.0200	0.0154
5	C	0.0200	0.0077
6	D	0.0200	0.0538	0.0750
7	A	0.0200	0.0308
8	A	0.0800	0.0692	0.1500
9	A	0.0200	0.0615
10	E		0.0385
11	A	0.1200	0.0538	0.1250
12	A		0.0308
13	F		0.0231
14	G		0.0154
15	A		0.0154
16	D		0.0154
17	H	0.0200
18	A		0.0077
19	A		0.0077
20	D		0.0077
21	A	0.0200
22	A	0.0400
23	I	0.0400
24	I	0.0200
25	J	0.0200
P		0.0123	0.0519	0.1849
F _ST		NT–CT 0.012	NT–ST –0.001	CT–ST 0.037**

Bibliographie

[1] Y. Huang; N.D. Temperley; L. Ren; J. Smith; N. Li; N.W. Burt Molecular evolution of the vertebrate TLR1 gene family — a complex history of gene duplication, gene conversion, positive selection and co-evolution, BMC Evol. Biol., Volume 11 (2011), p. 149

[2] R. Medzhitov; C.A. Janeway Innate immunity: the virtues of a nonclonal system of recognition, Cell, Volume 91 (1997), pp. 295-298

[3] M. Hans; V.M. Hans Toll-like receptors and their dual role in periodontitis: a review, Int. J. Oral Sci., Volume 53 (2011), pp. 263-271

[4] J.K. Bell; G.E. Mullen; C.A. Leifer; A. Mazzoni; D.R. Davies; D.M. Segal Leucine-rich repeats and pathogen recognition in Toll-like receptors, Trends Immunol., Volume 24 (2003), pp. 528-533

[5] B. Beutler; Z. Jiang; P. Georgel; K. Crozat; B. Croker; S. Rutschmann; X. Du; K. Hoebe Genetic analysis of host resistance: Toll-like receptor signaling and immunity at large, Annu. Rev. Immunol., Volume 24 (2006), pp. 353-389

[6] Z.L. Chang Important aspects of Toll-like receptors, ligands and their signaling pathways, Inflamm. Res., Volume 59 (2010), pp. 791-808

[7] O. Takeuchi; S. Sato; T. Horiuchi; K. Hoshino; K. Takeda; Z. Dong; R.L. Modlin; S. Akira Cutting edge: role of toll-like receptor 1 in mediating immune response to microbial lipoproteins, J. Immunol., Volume 169 (2002), pp. 1-6

[8] U. Buwitt-Beckmann; H. Heine; K. Wiesmüller; G. Jung; R. Brock; S. Akira; A.J. Ulmer Toll-like receptor 6-independent signaling by diacylated lipopeptides, Eur. J. Immunol. (2005), pp. 282-289

[9] R. Barbalat; L. Lau; R.M. Locksley; G.M. Barton Toll-like receptor 2 on inflammatory monocytes induces type I interferon in response to viral but not bacterial ligands, Nat. Immunol., Volume 10 (2009), pp. 1200-1207

[10] N.J. Gay; M. Gangloff Structure and function of Toll receptors and their ligands, Annu. Rev. Biochem., Volume 76 (2007), pp. 141-165

[11] M. Vinkler; T. Albrecht The question waiting to be asked: innate immunity receptors in the perspective of zoological research, Folia Zool., Volume 58 (2009), pp. 15-28

[12] S.A. Smith; O.C. Jann; D. Haig; G.C. Russell; D. Werling; E.J. Glass; R.D. Emes Adaptive evolution of Toll-like receptor 5 in domesticated mammals, BMC Evol. Biol., Volume 12 (2012), p. 122

[13] S.A. Smith; D. Haig; R.D. Emes Novel ovine polymorphisms and adaptive evolution in mammalian TLR2 suggest existence of multiple pathogen binding regions, Gene, Volume 540 (2014), pp. 217-225

[14] P. Parham Innate immunity: the unsung heroes, Nature, Volume 423 (2003), p. 20

[15] S. Merx; M. Neumaier; H. Wagner; C.J. Kirschning; P. Ahmad-Nejad Characterization and investigation of single nucleotide polymorphisms and a novel TLR2 mutation in the human TLR2 gene, Hum. Mol. Genet., Volume 16 (2007), pp. 1225-1232

[16] M.R. Bhide; R. Mucha; I. Mikula; L. Kisova; R. Skrabana; M. Novak; I. Mikola Novel mutations in TLR genes cause hyporesponsiveness to Mycobacterium avium subsp. paratuberculosis infection, BMC Genet. (2009), pp. 10-21

[17] M. Ben-Ali; M.R. Barbouche; S. Bousnina; A. Chabbou; K. Dellagi Toll-like receptor2 Arg677Trp polymorphism is associated with susceptibility to tuberculosis in Tunisian patients, Clin. Diagn. Lab. Immunol., Volume 11 (2004), pp. 625-626

[18] D. O’Connell Host response: genital herpes takes its toll, Nat. Rev. Microbiol., Volume 5 (2007), pp. 746-747

[19] H. Areal; J. Abrantes; P.J. Esteves Signatures of positive selection in Toll like receptor (TLR) genes in mammals, BMC Evol. Biol., Volume 11 (2011), p. 368

[20] B. Tschirren; L. Raberg; H. Westerdahl Signatures of selection acting on the innate immunity gene Toll-like receptor 2 (TLR2) during the evolutionary history of rodents, J. Evol. Biol., Volume 24 (2011), pp. 1232-1240

[21] O.C. Jann; D. Werling; J. Chang; D. Haig; E.J. Glass Molecular evolution of bovine Toll-like receptor 2 suggests substitutions of functional relevance, BMC Evol. Biol., Volume 8 (2008), p. 288

[22] T. Nakajima; H. Ohtani; Y. Satta; Y. Uno; H. Akari; T. Ishida; A. Kimura Natural selection in the TLR-related genes in the course of primate evolution, Immunogenetics, Volume 60 (2008), pp. 727-735

[23] G. Wlasiuk; M.W. Nachman Adaptation and constraint at toll-like receptors in primates, Mol. Biol. Evol., Volume 27 (2010), pp. 2172-2186

[24] H. Ben Slimen Phylogénie morphologique et moléculaire des lièvres d’Afrique du Nord du genre Lepus, Faculty of Sciences of Tunis, 2008 (PhD thesis) (356 p.)

[25] A. Awadi; F. Suchentrunk; M. Makni; H. Ben Slimen Phylogenetic relationships and genetic diversity of Tunisian hares (Lepus sp. or spp., Lagomorpha) based on partial nuclear gene transferrin sequences, Genetica, Volume 144 (2016), pp. 497-512

[26] A. Awadi; H. Ben Slimen; S. Smith; F. Knauer; M. Makni; F. Suchentrunk Positive selection and climatic effects on MHC class II gene diversity in hares (Lepus capensis) from a steep ecological gradient in North Africa, Sci. Rep-UK (2018) (In press)

[27] H. Ben Slimen; H. Schaschl; F. Knauer; F. Suchentrunk Selection on the mitochondrial ATP synthase 6 and the NADH dehydrogenase 2 genes in hares (Lepus capensis L., 1758) from a steep ecological gradient in North Africa, BMC Evol. Biol., Volume 17 (2017), p. 46

[28] H. Ben Slimen; F. Suchentrunk; A.B. Shahin; A. Ben Ammar Elgaaied Phylogenetic analysis of mtCR-1 sequences of Tunisian and Egyptian hares (Lepus sp. or spp., Lagomorpha) with different coat colours, Mamm. Biol., Volume 72 (2007), pp. 224-239

[29] A. Biedrzycka; A. Sebastian; M. Migalska; H. Westerdahl; J. Radwan Testing genotyping strategies for ultra-deep sequencing of a co-amplifying gene family: MHC class I in a passerine bird, Mol. Ecol. Resour., Volume 17 (2017), pp. 642-655

[30] A. Sebastian; M. Herdegen; M. Migalska; J. Radwan AMPLISAS: a web server for multilocus genotyping using next-generation amplicon sequencing data, Mol. Ecol. Resour., Volume 16 (2016), pp. 498-510

[31] P. Librado; J. Rozas DnaSP v5: A software for comprehensive analysis of DNA polymorphism data, Bioinformatics, Volume 25 (2009), pp. 1451-1452

[32] F. Tajima Statistical method for testing the neutral mutationhypothesis by DNA polymorphism, Genetics, Volume 123 (1989), pp. 585-595

[33] F. Rousset GENEPOP’007: A complete re-implementation of the GENEPOP software for Windows and Linux, Mol. Ecol. Resour., Volume 8 (2008), pp. 103-106

[34] J. Goudet FSTAT, version 2.9.3. A program to estimate and test gene diversities and fixation indices, Lausanne University, Lausanne, Switzerland, 2001

[35] K. Belkhir; P. Borsa; L. Chikhi; N. Raufaste; F. Bonhomme GENETIX 4.05. logiciel sous Windows TM pour la génétique des populations, Laboratoire « Génome, populations, interactions », CNRS UMR 5171, Université Montpellier-2, Montpellier, France, 1996–2004

[36] H.-J. Bandelt; P. Forster; A. Rohl Median-joining networks for inferring intraspecific phylogenies, Mol. Biol. Evol., Volume 16 (1999), pp. 37-48

[37] Z. Yang PAML 4, phylogenetic analysis by maximum likelihood, Mol. Biol. Evol., Volume 24 (2007), pp. 1586-1591

[38] E. Ishengoma; M. Agaba Evolution of toll-like receptors in the context of terrestrial ungulates and cetaceans diversification, BMC Evol. Biol., Volume 17 (2017), p. 54

[39] Z. Yang; R. Nielsen; N. Goldman; A.M. Pedersen Codon-substitution models for heterogeneous selection pressure at amino acid sites, Genetics, Volume 155 (2000), pp. 431-449

[40] D.J. Wilson; G. McVean Estimating diversifying selection and functional constraint in the presence of recombination, Genetics, Volume 172 (2006), pp. 1411-1425

[41] S. Smith; J. Goüy de Bellocq; F. Suchentrunk; H. Schaschl Evolutionary genetics of MHC class II beta genes in the brown hare, Lepus europaeus, Immunogenetics, Volume 63 (2011), pp. 743-751

[42] S.L.K. Pond; S.D.W. Frost DATAMONKEY: rapid detection of selective pressure on individual sites of codon alignments, Bioinformatics, Volume 21 (2005), pp. 2531-2533

[43] S.L.K. Pond; D. Posada; M.B. Gravenor; C.H. Woelk; S.D.W. Frost Automated phylogenetic detection of recombination using a genetic algorithm, Mol. Biol. Evol., Volume 23 (2006), pp. 1891-1901

[44] B. Murrell; J.O. Wertheim; S. Moola; T. Weighill; K. Scheffer; P.S.L. Kosakovsky Detecting individual sites subject to episodic diversifying selection, PLoS Genet., Volume 8 (2012), p. e1002764

[45] M.A. Beaumont; R.A. Nichols Evaluating loci for the use in the genetic analysis of population structure, Proc. R. Soc. Lond. B., Volume 263 (1996), pp. 1619-1636

[46] T. Antao; A. Lopes; R.J. Lopes; A. Beja-Pereira; G. Luikart LOSITAN: a workbench to detect molecular adaptation based on a F_ST-outlier method, BMC Bioinformatics, Volume 9 (2008), p. 323

[47] K.P. Burnham; D.R. Anderson Model Selection and Multimodel Inference: A Practical Information-Theoretical Approach, Springer-Verlag, New York, 2002

[48] R Core Team R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, 2016 https://www.R-project.org/

[49] A. Meier; J. Söding Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling, PLoS Comput. Biol., Volume 11 (2015), p. e1004343

[50] V. Alva; S.Z. Nam; J. Söding; A.N. Lupas The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis, Nucleic Acids Res., Volume 44 (2016), p. W410-W415

[51] PyMOL, The PyMOL Molecular Graphics System, Version 1.8 Schrödinger, LLC.

[52] Y. Choi; G.E. Sims; S. Murphy; J.R. Miller; A.P. Chan Predicting the Functional Effect of Amino Acid Substitutions and Indels, PLoS One, Volume 7 (2012), p. e46688

[53] J. Abrantes; H. Areal; P.J. Estevas Insights into the European rabbit (Oryctolagus cuniculus) innate immune system: genetic diversity of the toll-like receptor 3 (TLR3) in wild populations and domestic breeds, BMC Genet., Volume 14 (2013), p. 73

[54] G.J. Knafler; C.E. Grueber; J.T. Sutton; I.G. Jamieson Differential patterns of diversity at microsatellite, MHC, and TLR loci in bottlenecked South Island saddleback populations, New Zeal. J. Ecol., Volume 41 (2017), pp. 98-106

[55] N. Takahata; M. Nei Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci, Genetics, Volume 124 (1990), pp. 967-978

[56] E. Quéméré; M. Galan; J.-F. Cosson; F. Klein; S. Aulagnier; E. Gilot-Fromont; J. Merlet; M. Bonhomme; A.J. Hewison; N. Charbonnel Immunogenetic heterogeneity in a widespread ungulate: the European roe deer (Capreolus capreolus), Mol. Ecol., Volume 24 (2015), pp. 3873-3887

[57] Y. Xu; X. Tao; B. Shen; T. Horng; R. Medzhitov; J.L. Manley; L. Tong Structural basis for signal transduction by the Toll/interleukin-1 receptor domains, Nature, Volume 408 (2000), pp. 111-115

[58] J.L. Slack; K. Schooley; T.P. Bonnert; J.L. Mitcham; E.E. Qwarnstrom; J.E. Sims; S.K. Dower Identification of two major sites in the type I interleukin-1 receptor cytoplasmic region responsible for coupling to proinflammatory signaling pathways, J. Biol. Chem., Volume 275 (2000), pp. 4670-4678

[59] Z. Jiang; P. Georgel; C. Li; J. Choe; K. Crozat; S. Rutschmann; X. Du; T. Bigby; S. Mudd; S. Sovath; I.A. Wilson; A. Olson; B. Beutler Details of Toll-like receptor: adapter interaction revealed by germ-line mutagenesis, Proc. Natl. Acad. Sci. USA, Volume 103 (2006), pp. 10961-10966

[60] A. Poltorak; X. He; I. Smirnova; M.Y. Liu; C. Van Huffel; X. Du; D. Birdwell; E. Alejos; M. Silva; C. Galanos; M. Freudenberg; P. Ricciardi-Castagnoli; B. Layton; B. Beutler Defective LPS signaling in C3H/HeJ and C57BL/10ScCr mice: mutations in Tlr4 gene, Science, Volume 282 (1998), pp. 2085-2088

[61] L.B. Barreiro; M. Ben-Ali; H. Quach; G. Laval; E. Patin; J.K. Pickrell; C. Bouchier; M. Tichit; O. Neyrolles; B. Gicquel; J.R. Kidd; K.K. Kidd; A. Alcaïs; J. Ragimbeau; S. Pellegrini; L. Abel; J.L. Casanova; L. Quintana-Murci Evolutionary dynamics of human Toll-like receptors and their different contributions to host defence, PLoS Genet., Volume 5 (2009), p. e1000562

Cité par

Milomir Stefanović; Mihajla Djan; Nevena Veličković; Yasin Demirbaş; Ladislav Paule; Csongor István Gedeon; Annika Posautz; Christoph Beiglböck; Anna Kübber-Heiss; Franz Suchentrunk Purifying selection shaping the evolution of the Toll-like receptor 2 TIR domain in brown hares (Lepus europaeus) from Europe and the Middle East, Molecular Biology Reports, Volume 47 (2020) no. 4, p. 2975 | DOI:10.1007/s11033-020-05382-x
José G. Ham-Dueñas; Ricardo Canales-del-Castillo; Gary Voelker; Irene Ruvalcaba-Ortega; Carlos E. Aguirre-Calderón; José I. González-Rojas; Arnar Palsson Adaptive genetic diversity and evidence of population genetic structure in the endangered Sierra Madre Sparrow (Xenospiza baileyi), PLOS ONE, Volume 15 (2020) no. 4, p. e0232282 | DOI:10.1371/journal.pone.0232282
Fabiana Neves; Ana Águeda-Pinto; Ana Pinheiro; Joana Abrantes; Pedro J. Esteves Strong selection of the TLR2 coding region among the Lagomorpha suggests an evolutionary history that differs from other mammals, Immunogenetics, Volume 71 (2019) no. 5-6, p. 437 | DOI:10.1007/s00251-019-01110-3

Cité par 3 documents. Sources : Crossref

Commentaires - Politique