Rift Valley Fever (RVF) virus (Family Bunyaviridae) is an arthropod-borne RNA virus that infects primarily domestic ruminants and occasionally humans. RVF epizootics are characterized by numerous abortions and mortality among young animals. In humans, the illness is usually characterized by a mild self-limited febrile illness, which could progress to more serious complications. RVF virus is widespread and endemic in many regions of Africa. In Western Africa, several outbreaks have been reported since 1987 when the first major one occurred at the frontier of Senegal and Mauritania. Aiming to evaluate the spreading and molecular epidemiology in these countries, RVFV isolates from 1944 to 2008 obtained from 18 localities in Senegal and Mauritania and 15 other countries were investigated. Our results suggest that a more intense viral activity possibly took place during the last century compared to the recent past and that at least 5 introductions of RVFV took place in Senegal and Mauritania from distant African regions. Moreover, Barkedji in Senegal was possibly a hub associated with the three distinct entries of RVFV in West Africa.
Citation: Soumaré POL, Freire CCM, Faye O, Diallo M, Oliveira JVCd, et al. (2012) Phylogeography of Rift Valley Fever Virus in Africa Reveals Multiple Introductions in Senegal and Mauritania. PLoS ONE 7(4): e35216. doi:10.1371/journal.pone.0035216
Editor: Tetsuro Ikegami, The University of Texas Medical Branch, United States of America
Received: December 16, 2011; Accepted: March 13, 2012; Published: April 23, 2012
Copyright: © 2012 Soumaré et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was funded by Institute Pasteur of Dakar in Senegal and FAPESP (Fundação de Amparo a Pesquisa do Estado de Sao Paulo, Brazil) project #00/04205-6 (VGDN program). CCMF has CAPES scholarship and PMAZ holds a CNPq-PQ scholarship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Rift Valley fever (RVF) is a mosquito borne anthropozoonosis affecting livestock and also humans throughout Africa and the Arabian Peninsula. Animal infections lead to high mortality among young ruminants and abortions of pregnant females while human infections are associated with a wide array of syndromes ranging from influenza-like illness to severe symptoms including hemorrhages, encephalitis, hepatitis, ocular complications and fatal outcomes . Animal and human mortality and morbidity caused by RVF often result in high public health burden and substantial economic losses in endemic countries . RVF is caused by the RVF virus (RVFV), a phlebovirus (Family Bunyaviridae) with three single-stranded RNA segments genome: L (large), M (medium) and S (small). The L segment encodes the RNA-dependent RNA polymerase , the M segment envelope glycoproteins G1 and G2 and two non-structural proteins of 14 and 78 kDa molecular weight, respectively . The S segment has an ambisense coding strategy, since it encodes for the nucleocapsid protein (N) on the 5′-3′, positive sense and a non-structural protein (NSs) in the 5′-3′, negative sense . The latter protein plays a major role in innate immunity by blocking interferon gene expression .
Although RVF was discovered in 1930 , , the disease remained a veterinary concern until a major outbreak occurred in Egypt in 1977 , , where RVF caused about one million human infections, 600 deaths and severe widespread epizootics . Subsequently, more studies on RVFV transmission characterized additional major outbreaks derived from endemic sub-Saharan African countries –. The exchange from the enzootic cycle (i.e., among wild animals) to peridomestic transmission (i.e., among animals such as cattle, lambs and goats) is thought to cause the occasional spillover outbreaks into humans . The most recent epidemics occurred in Kenya, Somalia, Tanzania in 2007 , South Africa in 2008  and again in 2010 , Sudan in 2008 where more than 200 deaths were reported  and Mauritania in 2010 (Faye, unpublished data). A second introduction occurred when RVF, coming from East Africa, moved into Saudi Arabia and Yemen in the Arabian Peninsula . It went for the first time outside of Africa –where it had been confined so far– becoming a threat to the Middle East. In the light of its natural history, RVFV appears to be a great example of an emerging pathogen with great potential of dispersion and impact on human and animal health and, a good model for an integrated approach to human and animal health (i.e., the ‘one health’ concept). Several authors using molecular phylogeny methods attempted to unravel mechanisms underlying its distribution and dispersal in Africa , –. These studies revealed that: (i) RVFV genomes carry low genetic diversity ranging from 1 to 5% sequence divergence, (ii) change at a mean evolutionary rates ranging from 2.8 to 3.9E-4 substitution/site/year , (iii) can be grouped into several lineages , , (iv) can be grouped by geographic origin , (v) undergo reassortment in nature  and, (vi) the extant genetic diversity coalesces about 120–130 years ago . However, except for Kenya , limited information is available on the evolution of RVFV in a country or regional scale. This lack of information poses the necessity to investigate the determinants of RVFV circulation in West Africa. In order to fill this gap, we sequenced 48 RVFV isolates from Senegal and Mauritania and along with sequences from Guinea and Burkina Faso, to allow inferences on the dispersal patterns of RVFV in West Africa. We also added to the study samples from East Africa to establish migratory patterns in Africa at a coarse grain.
Results and Discussion
Forty-eight RVFV isolates collected over a period of 20 years (1983 to 2003) from different areas of Senegal and Mauritania were included in the study (Table S1). For each strain NSs, G2, and polymerase genes were partially sequenced. Overall levels of sequence diversity and polymorphisms were within ranges previously observed and no deletions or insertions were found , . Our analyses of recombination, using several different methods, showed the absence of intra gene recombination on the regions we studied and low levels of genetic diversity, in agreement with what was previously observed by , –. There are many plausible explanations for the low rates of genetic change estimated for RVFV, which could entail factors involved in genetic variability reduction, ranging from speed of replication, high rate of vertical transmission to reduced viremia. Moreover, the alternation between vertebrates and invertebrate hosts during the RVFV life cycle could hinder viral variation by imposing strong selective advantage in keeping highly adapted phenotypes . Likewise, selection analyzes of the three genomic segments uncovered several sites under strong negative selection, indicated by ω<0, (Table 1) shows purging of deleterious polymorphisms in functionally important genes. On the other hand, a lack of positively selected sites indicated by ω>0, shows a lack directional change, typical of highly adapted genotypes. These findings also agree with the fact that the natural cycle of RVFV imposes several transmission bottlenecks due to alternation between a diverse set of arthropods vectors and mammal hosts that keep selective pressure towards genomic stability, such as the preservation of the S segment in vivo . Moreover, the genomic stability we observed is consistent with the enzootic status of RVFV in several places in Africa.
Table 1. Negatively selected codon sites detected in RVFV.doi:10.1371/journal.pone.0035216.t001
Phylodynamics of RVFV
We first investigated the phylogenetic signal content in our data by reconstructing 50 thousand quartets for each gene segment using the likelihood mapping method (see methods section). Our results (Table S2) indicated that the M segment had the higher phylogenetic signal content given its lower percentage of unresolved quartets, followed by the L and S segments. Maximum clade credibility (MCC) trees obtained during Bayesian inference for the S (Figure 1), M (Figure 2) and L (Figure 3) segments and trees obtained by maximum likelihood (data not shown), grouped samples from Senegal and Mauritania as sister taxa, as indicated by the preferential clustering of yellow and green dots in the tips of distinct clades in the trees. The occurrence of green and yellow dots in five distinct groups of taxa in Figure 1, 2 and 3 is evidence for independent shared introductions in these countries. These findings were well supported by significant posterior probabilities and by high bootstrap values for maximum likelihood trees, inferred with GARLI (data not shown). Furthermore, the phylogenetic adjacency of lineages, indicated by clustering of green and yellow dots in Figure 1, 2 and 3, indicated a recurrent exchange between the two neighbor countries during the last century. We then did phylogeographic reconstructions based on the S segment alone, since it had the highest posterior probabilities during state reconstructions. This allowed a coarse-grained view of the spatio-temporal patterns of RVFV spread in the African continent in general and, the pattern of movement into Mauritania and Senegal in particular. Importantly, the time to the most recent common ancestor (TMRCA) parameter values that we estimated from sequences from each genetic segment were different. This could be explained to some extent by the large and distinct number of negatively selected sites among segments, which could lead to an underestimation of the age of nodes near to the root of tree . In addition, albeit obtaining high posterior values in MCC trees, we used different-sized sequences during our reconstructions of viral genealogies, which is acceptable within the likelihood framework, but could nonetheless, lead to underestimation of branch lengths for taxa with shorter sequences . On the other hand, segmented genome allows reassortment events  that even at low frequency , this could cause differences in TMRCA estimates for different segments.
Figure 1. Maximum clade credibility (MCC) tree for the short (S) segment.
Relationship among 167 strains of RVFV isolated from different localities and countries. Samples from Mauritania and Senegal are shown by green and yellow dots respectively. The recurrent independent clustering of green and yellow dots suggests multiple introductions of RVFV into these countries throughout the 20th century. Posterior probability values greater than 90% are shown near tree nodes.doi:10.1371/journal.pone.0035216.g001
Figure 2. Maximum clade credibility (MCC) tree for the medium (M) segment.
Relationship among 128 strains of RVFV isolated from different localities and countries. Samples from Mauritania and Senegal are shown by green and yellow dots respectively. The recurrent independent clustering of green and yellow dots suggests multiple introductions of RVFV into these countries throughout the 20th century. Posterior probability values greater than 90% are shown near tree nodes.doi:10.1371/journal.pone.0035216.g002
Figure 3. Maximum clade credibility (MCC) tree for the long (L) segment.
Relationship among 126 strains of RVFV isolated from different localities and countries. Samples from Mauritania and Senegal are shown by green and yellow dots respectively. The recurrent independent clustering of green and yellow dots suggests multiple introductions of RVFV into these countries throughout the 20th century. Posterior probability values greater than 90% are shown near tree nodes.doi:10.1371/journal.pone.0035216.g003
Based on the S segment of the RVFV that better resolved phylogeographic patterns, we observed that our trees had temporal coherence, since the age of key nodes of the S MCC tree agreed with known events, such as the first report of RVFV in Kenya in 1930 ,  and the estimated arrival in this country from Zimbabwe. A root for the tree was dated between 1909 and 1920 and placed somewhere near the Southern tip of the Rift Valley. We assume that variation in genetic diversity in time (measured as effective population size times the generation time, Ne.g) estimated with the Monte Carlo Markov Chain (MCMC) method, correlates with RVFV demography and is proportional to the number of infections in time. The Bayesian skyride plots (BSP) for all genomic segments of RVFV had an increase in Ne.g in the first half of last century followed by a continuous decrease in the last half (Figure 4). Moreover, the BSPs agreed with a recent origin of RVFV and intense viral activity in the first half of 20th century followed by dispersal in the African Continent. While the reduction in viral population size or infection events (Ne.g) are consistent with enzootic activity near the present. This finding agrees with the RVFV spread among large populations of susceptible hosts following to the introduction of livestock in East Africa namely Kenya in the first half of the 20th century to improve husbandry , . In essence, the higher levels of Ne.g in the past (Figure 4) agree with the fact that RVFV has been active over the last century causing major outbreaks and suggest a recent trend of enzootism.
Figure 4. Bayesian skyride plot (BSP) for the three genomic segments of RVFV.
The effective population size times the generation time (Ne.g) parameter approximates the number of infections in time. The plots overlay indicated complex oscillations with higher viral activity starting at around 1930, culminating sometime between the 40's and 60's, followed by a steady decrease from the 70's to the present. The stabilization of near the present is consistent with enzooticism, since the virus is not kept in human populations.doi:10.1371/journal.pone.0035216.g004
Phylogeography of RVFV in the African continent
Phylogeographic reconstructions summarized in Figure S1 unveiled a complex pattern of viral movement across long distances in the African continent that we represented in maps (Figures 5 and 6). The dates in the map arrows in Figures 5 and 6 indicate the reconstructed TMRCA for a lineage found at the locality shown by the arrowhead and is assumed as the oldest possible year of introduction of that lineage at a given locality. We can observe a stepwise spread from a locality to the next, by focusing on East Africa, were denser sampling allows greater detail (Figure 6). Our inferences unveiled five distinct introductions in Senegal and Mauritania during the past century (Figure 5 and KML file in Dataset S1). The first arrival in West Africa (blue line in Figure 5) was related to strains from Zimbabwe, South Africa and Uganda possibly from around 80 years ago, spreading to several locations in Senegal (SN) and Mauritania (MR). These inferences are consistent with serological surveys done by the Pasteur Institute in Dakar that suggest that the coastal zone of West Africa may have had RVFV during the first half of the last century (data not published). The second introduction to West Africa (black line in Figure 5 and Figure 6) was associated with strains from South Africa or Zimbabwe sharing a common ancestor around 72 years ago. This introduction was associated with samples from Diawara (SN) and Hodh El Garbi (MR), and an outbreak in 1998 in Diawara . The third introduction in West Africa (shown as a green line in Figure 5 and Figure 6) consisted of strains that originated in Zimbabwe in the beginning of last century, then moved to Central African Republic around 70 years ago and, from there reached Senegal and Mauritania around 47 years ago and Guinea 38 years ago. Samples from this introduction were isolated in Barkedji (SN) in 1993 and Rosso (MR) in 1987 and in Guinea in 1981 and 1984. The fourth introduction (shown as solid black line in Figure 5 and Figure 6) was evidenced by a single strain isolated in Kedougou in 1983 associated with an old South African lineage that moved to Egypt samples around 47 years ago and Madagascar around 40 years ago. The most recent introduction in west Africa (fifth introduction, red line, in Figure 5 and Figure 6) was due to a wave of RVFV that spread to distant places in Africa caused by a lineage from the Southern part of Africa that moved to Zimbabwe and Kenya 47 years ago and then reached Madagascar, Somalia, Tanzania, Angola, Senegal, Mauritania and Saudi Arabia during the last 28 years. The most interesting feature of this RVFV wave was that it recurrently reemerged in Kenya from where it was broadcasted to other localities.
Figure 5. Large-scale geographic spread of RVFV in Africa and Saudi Arabia based on the S segment.
The directed lines connect the sources and target localities (shown by arrows) of viral lineages. The distinct introductions into Senegal and Mauritania were represented by different colors. The estimated time to the most recent common ancestor of strains from different countries are shown within rectangles and could be interpreted as the oldest possible year of introduction of that lineage at that locality.doi:10.1371/journal.pone.0035216.g005
Figure 6. Geographic spread of RVFV in Mauritania and Senegal.
The directed lines connect the sources and target localities (shown by arrows) of viral lineages. The distinct introductions into Senegal and Mauritania were represented by different colors. The estimated time to the most recent common ancestor of strains from different countries are shown within rectangles and could be interpreted as the oldest possible year of introduction of that lineage at that locality.doi:10.1371/journal.pone.0035216.g006
Phylogeography of RVFV in West Africa
The history of introductions, exchanges and spread of RVFV among Senegalese and Mauritanian territories is shown in greater detail in Figure 6. The map in Figure 6 summarizes independent phylogeographic reconstructions, one for each introduction. The MCC trees (available from authors upon request) agreed well with global MCC tree Figure S1. According to our reconstructions, the first introduction possibly originating from South Africa in the 30's appears to have taken place in Barkedji (SN). Subsequently, it moved to Rosso (MR) around 1970 and Kolda (SN) around 1980. The Rosso lineage was then broadcasted to the provinces of Garack, Nkick and Keur Macene in Mauritania around 1980. Later in 1985, a lineage from Nkick moved into Terg in the North of Mauritania. On the other hand, a lineage from Kolda moved into east Mauritania to Ayoun El Atrouss around the end of the 80's, and from there, to Guimi around 2000. The second introduction, possibly from central Africa, reached Diawara in Senegal around 1940. From there it moved to Hodh el Garbi in Mauritania around 1970. As for the first and third introductions, the fifth (most recent introduction, indicated by a red line) also appears to have entered Senegal in Barkedji and from there, reached the Mauritanian territories of Lazaret and Hseytine. They then spread from Hseytine to Kiffa (MR) and from Lazaret to Matar Lajam and Dar Naim and from the last to Tijiga during the last decade.
Biologic correlates of complex RVFV spread patterns
By contrasting different scales of RVFV spread, fine-grain in West Africa and coarse-grained elsewhere in the continent, we unveiled an intricate dispersal pattern that shows: (i) viral dispersal across long distances, some times at fast rates, as indicated by the large distances among places of isolation of closely-related strains , (ii) persistence in different places for more than 70 years, like that observed for old lineages from South Africa, Zimbabwe, Central African Republic, Kenya and Senegal and, (iii) a gradual spread at nearby localities that we show in greater detail in Senegal and Mauritania. The capacity of RVFV to cross large geographical extensions, could be assisted by the movement of infected mammals and mosquitoes . Likewise, animal herds, acting as reservoirs, could warrant the maintenance of RVFV enzootic circulation at a regional level . The spread of RVF from endemic regions to areas without disease through animal migration routes has been postulated for previous outbreaks. For example, RVFV has been exported to Egypt in 1977, probably by lambs from Sudan  and from Eastern Africa to Madagascar in 1991  and then to the Arabian Peninsula in 2000 . Nevertheless, some evidence for enzootic activity has also been put forward. For example, it has been proposed that there is persistent enzootic activity of RVFV in Senegal based on seroprevalence data, which could help explain outbreaks in Mauritania and Egypt without heavy rain falls . Moreover, an enzootic cycle has been described in Barkedji, which could be maintained by vertical transmission among Aedes mosquitoes .
Crucially, our results also support the notion that Barkedji functions as a hub, broadcasting RVFV to other localities in West Africa. At a coarse grain, we observed that the Zimbabwe spread viruses during the first half of last century, while Kenya experienced a more intense activity as a hub during the second half. The notion that Barkedji may be an important gateway to RVFV in Senegal and Mauritania was previously suggested by serologic and entomologic surveys , . An important role of Barkedji appears to be independently supported by the fact that it is a known crossroad of migration movements of herds between the southern and northern regions of Senegal and, to a larger extent, to southern Mauritania. Moreover, the distinct introductions in Barkedji may help explain the maintenance of the endemic cycle at the regional scale, by feeding the zoonotic pool required for its persistence . Because mosquitoes cannot fly more than a few hundred meters during their lifetime, their role in long-range dissemination may be limited . This important role however, could be fulfilled by infected members of roaming herds that could help the introduction and dispersion of RVFV strains across Africa . Perhaps the lack of well-characterized reservoirs in both sides of the Senegalese Valley could be explained by a constant replenishment of potential hosts in the area. Our data agrees with the notion that infected hosts may regularly introduce and reintroduce viruses through places such as Barkedji, possibly through migratory routes. Nevertheless, any additional understanding of the putative role of Barkedji as a RVFV gateway to Senegal and Mauritania requires investigating a multitude of factors and processes that ultimately would help maintain and replenish the enzootic pool.
Materials and Methods
Ethics Statements and Clinical samples
The study was submitted and approved by the Ethics Committee of Institute Pasteur of Dakar in Senegal. Samples from Senegal and Mauritania are part of the Institute Pasteur in Dakar CRORA collection (Centre collaborateur OMS de référence et de recherche pour les arbovirus et virus de fièvres hémorragiques). Animal and human samples were sampled from1983 to 2003 at outbreak sites. Human samples used to isolate the virus were sent by physicians as part of routine diagnostics procedures and were verbally authorized (when possible) by patients that were kept anonymous. On the other hand, samples obtained from vertebrates and insects, were provided by veterinarians and entomologists from national health authorities during RVFV outbreaks. No ethical statement for animal experiments was demanded, since no animal experiments were conducted, other than sampling near human cases. Nevertheless, all animal blood sampling was performed while minimizing suffering. In addition, all samples of the study were collected in the course of routine surveillance by national health authorities.
The strains used were obtained from mosquitoes, humans and animals in Senegal and Mauritania. Date of isolation of strains and animal source are listed in Table S1, geographical origins are shown in Figure S2. RVFV samples were obtained from the virus collection of the WHO reference center for arboviruses research in the Institute Pasteur of Dakar. They have been propagated in Vero E6 or AP61 cells cultured in Leibovitz medium L15 supplemented with 5% fetal bovine serum, antibiotics, Fungizone and Tryptose (for AP61 cells). Viruses were harvested after infected cells presented a cytopathic effect. Additional reference sequences from other localities in Africa were obtained from GenBank (http://www.ncbi.nlm.nih.gov/genbank/) and they were listed in Dataset S2.
Viral RNA extraction and amplification
Viral RNA was extracted from cells culture supernatant by using the QIAgen Viral RNA mini kit (Qiagen, Valencia, CA) according to the manufacturer's instructions and collected in 50 µl of elution buffer. Partial regions of the G2 (718 bp), NSs (601 bp) and L (129 bp) genes were amplified using the sets of primers MRV1a-MRV2g, NS3a-NS2g, and Wag-Xg respectively (Table S3). Reverse transcription-PCR reactions for the cDNA synthesis were performed for NSs and G2 using the AMV Reverse Transcriptase (M5101). Taq DNA polymerase (M1865) of Promega Corporation (Winsconsin, USA) was used for the PCR amplification of both regions. Primers sequences and protocols were previously described . Whereas for the L region the RT-PCR was amplified with the superscript II (SII) (Invitrogen, USA) and Takara taq DNA polymerase (Takara, USA) for the synthesis of the cDNA and polymerase amplification respectively. First strand cDNA synthesis was performed in a final volume of 20 µl by incubating first 10 µl of eluted RNA, 1 µl dNTP (10 mM- Amersham), 1 µl of reverse primer (100 µg/µl) and 1 µl H2O at 65°C for 5 min and then rapidly chilled on ice. The following mixture: 4 µl 5xbuffer, 1 µl DTT, 1 µl RNasin Ribonuclease Inhibitor 2500 U (Promega) and 1 µl SII transcriptase (18080–044) was added before an incubation at 50°C for 60 min followed by a heat inactivation at 70°C for 15 min. The amplification reaction was initiated with 5 µl of the resulting cDNA as template. Thermocycler program was: one cycle of 95°C for 2 min 30 sec, 45 cycles each one with 95°C for 45 sec, 45°C for 30 sec and 72°C for 45 sec, and one final extension cycle at 72°C for 5 min. RT-PCR amplification from field sample isolates was done in the Institute Pasteur in Dakar, Senegal.
RT-PCR products were separated by electrophoresis on a 1% agarose gel. Bands of the appropriate molecular size were excised and DNA was recovered using the QiaQuick Gel Extraction Kit (Qiagen) as specified by the manufacturer. Both strands sequencing was performed using the same reverse and forward primers as for the amplification. Sequencing reactions were purified by precipitation: (Hi-Di from Applied Biosystems Foster City, CA) and finally run on an ABI Prism 3100 Genetic Analyzer (Applied Biosystems). DNA sequencing was done at the Microbiology Department of the Biomedical Sciences Institute at the University of São Paulo, Brazil. GenBank accession numbers for S segment sequences are from JN995251 to JN995298 and for M sequences are from JN995299 to JN9954345, while sequences from the L segment portion of the polymerase gene are available from the authors upon request because GenBank requires sequences longer than 200 nucleotides.
Sequences were inspected and assembled with Geneious Pro3.5.4 program (http://www.geneious.com/) and aligned using the multiple sequence alignment algorithm Clustal W  and Se-Al v 2.0 (http://tree.bio.ed.ac.uk/software/seal/). To prevent potential biases during phylogenetic inference due to recombination, we first analyzed all sequences with RDP4beta 4.8 program that incorporates RDP, GENECONV, Chimaera, Maxchi, Bootscan, SiScan and 3Seq  to uncover evidence for recombination events. Only events with p-values≤0.05 that were detected by three or more methods were considered using the Bonferroni correction to prevent false positive results. In addition, intending to infer the selection pressures acting on each genomic segment of RVFV, we estimated the difference between the non-synonymous (dN) and synonymous (dS) rates per codon site using the Single Likelihood Ancestor Counting (SLAC) algorithm available in HyPhy v0.99 , assuming a significance level of 1% (α = 0.01). In the HyPhy output, values of ω are expressed as ω = dN-dS. Therefore, ω greater than zero are indicative of directional, positive selection (ω>0), while values below zero (ω<0) indicate purifying, negative selection.
Prior to the analyses, the phylogenetic signal content of the sequence datasets to phylogenetic reconstruction was investigated by Likelihood mapping  with TREE-PUZZLE . Phylogenetic trees were generated by Maximum Likelihood (ML) criterion using GARLI v0.96  that uses a stochastic genetic algorithm to estimate simultaneously the best topology, branch lengths and substitution model parameters that maximize the log-Likelihood (lnL). The confidence of ML trees was accessed by the convergence of lnL scores from ten independent replicates. Likewise previous RVFV reports , GTR model with Gamma-distributed rate variation (γ) and a proportion of invariable sites (I) substitution model was used. Since we had dates of isolation, we estimated substitution rates per site per year (μ) with R8s v1.71  using the Penalized Likelihood method  that employs a semi-parametric approach, using different substitution rates on every branch with a nonparametric roughness penalty, which impose costs according to the model once rates change too quickly from branch to branch. In addition, Maximum Clade Credibility (MCC) trees were inferred using a Markov Chain Monte Carlo (MCMC) Bayesian approach under GTR+γ+I and a relaxed (uncorrelated lognormal) molecular clock  with the μ previously estimated on the program BEAST v1.6.1 . Moreover, we investigated the variation in effective population size times the generation time (Ne.g) to infer viral demography by Bayesian Skyride Plots . MCMC convergence was obtained during four independent runs with 50 million of generations, which were sufficient to obtain a proper sample from the posterior at MCMC stationarity. The stationarity of parameters was also assessed by allowing the effective sample sizes (ESS) to reach values above 200 as inspected using Tracer v1.5 (http://tree.bio.ed.ac.uk/software/tracer/). Furthermore, to infer the history of geographical dispersion of RVFV strains, we used a discrete model attributing state characters representing isolation locality of each of the strains with the Bayesian Stochastic Search Variable (BSSVS) algorithm  implemented in BEAST v1.6.1. This method estimates the most probable state at each node in the MCC trees, allowing us to reconstruct ancestral positions for ancestral viral lineages along the tree. For phylogeographic reconstructions, each locality was coded as a discrete trait. For Senegal and Mauritania we coded for each locality, down to the city level, while for other African countries we coded localities at national level (coarse grain). We argue that this approach was justified since it allow to focus on the movement within and between Senegal and Mauritania at a finer grain, while also showing its relationship with other African countries at a coarse grain. Likewise, to map routes of spread we resolved cities in Senegal and Mauritania and pinned incoming and outgoing routes at the center of the other African countries.
Maximum clade credibility (MCC) tree summarizing geographical states reconstructions along a time-scaled tree. Ancestral states reconstructions indicating the most probable location of a lineage in time along colored branches and dates of nodes are shown. Countries are ISO coded as follows: Burkina Faso (BF), Central African Republic (CF), Egypt (EG), Guinea (GN), Kenya (KE), Madagascar (MG), Mauritania (MR), Namibia (NA), Saudi Arabia (SA), Senegal (SN), South Africa (ZA), Tanzania (TZ), Uganda (UG), Zimbabwe (ZW), Angola (AO), Zambia (ZM), Somalia (SO).
Map from Senegalese and Mauritanian territories. Circles represent the locations from RVFV isolates.
Source, geographical origin and date of isolations of RVFV strains used in this study. *Patient died.
Likelihood mapping of the three RVFV genomic segments. *The percentage of the unresolved quartets is an indicator of phylogenetic suitability from data under analysis. If the percentage is higher, the suitability is lower. Our results suggested that Medium segment is the best for phylogenetic inference due to the lower associated uncertainty.
Description of PCR primers used to amplify Senegal and Mauritania samples. *Relative position to strain MP12 whose GenBank accession numbers from Small, Medium and Large segments are respectively X53771, M11157 and X56464.
Spread of RVFV strains in Africa. A kml file to picture the history of RVFV migration in Africa along the time, executable in Google earth (http://earth.google.com).
Additional Reference Sequences of RVFV. An xls file showing the sequences recovered from GenBank (http://www.ncbi.nlm.nih.gov/genbank/).
We would like to thank the Editor and two anonymous referees for their thorough contribution on all relevant aspects of the manuscript.
Conceived and designed the experiments: POLS OF MD PMAZ AAS. Performed the experiments: POLS OF MD JVCO. Analyzed the data: POLS CCMF AAS PMAZ. Contributed reagents/materials/analysis tools: AAS PMAZ. Wrote the paper: POLS CCMF PMAZ AAS. First co-authors: POLS CCMF.
- 1. Laughlin LW, Meegan JM, Strausbaugh LJ, Morens DM, Watten RH (1979) Epidemic Rift Valley fever in Egypt: observations of the spectrum of human illness. Transactions of the Royal Society of Tropical Medicine and Hygiene 73: 630–633.
- 2. Rich KM, Wanyoike F (2010) An assessment of the regional and national socio-economic impacts of the 2007 Rift Valley fever outbreak in Kenya. The American journal of tropical medicine and hygiene 83: 52–57. doi:10.4269/ajtmh.2010.09-0291.
- 3. Müller R, Poch O, Delarue M, Bishop DH, Bouloy M (1994) Rift Valley fever virus L segment: correction of the sequence and possible functional role of newly identified regions conserved in RNA-dependent polymerases. The Journal of general virology 75(Pt 6): 1345–1352.
- 4. Collett MS, Purchio AF, Keegan K, Frazier S, Hays W, et al. (1985) Complete nucleotide sequence of the M RNA segment of Rift Valley fever virus. Virology 144: 228–245.
- 5. Giorgi C, Accardi L, Nicoletti L, Gro MC, Takehara K, et al. (1991) Sequences and coding strategies of the S RNAs of Toscana and Rift Valley fever viruses compared to those of Punta Toro, Sicilian Sandfly fever, and Uukuniemi viruses. Virology 180: 738–753.
- 6. Billecocq A, Spiegel M, Vialat P, Kohl A, Weber F, et al. (2004) NSs protein of Rift Valley fever virus blocks interferon production by inhibiting host gene transcription. Journal of virology 78: 9798–9806. doi:10.1128/JVI.78.18.9798-9806.2004.
- 7. Findlay GM, Daubney R (1931) The Virus of Rift Valley Fever or Enzoötic Hepatitis. The Lancet 24: 1350–1351.
- 8. Daubney R, Hudson JR (1932) Rift Valley Fever. The Lancet 6: 611–612. doi:10.1017/S0031182000003036.
- 9. Meegan JM (1979) The Rift Valley fever epizootic in Egypt 1977–78. 1. Description of the epizzotic and virological studies. Transactions of the Royal Society of Tropical Medicine and Hygiene 73: 618–623.
- 10. Martin V, Chevalier V, Ceccato P, Anyamba A, De Simone L, et al. (2008) The impact of climate change on the epidemiology and control of Rift Valley fever. Revue Scientifique Et Technique (International Office of Epizootics) 27: 413–426.
- 11. Chevalier V, Thiongane Y, Lancelot R (2009) Endemic transmission of Rift Valley Fever in Senegal. Transboundary and emerging diseases 56: 372–374. doi:10.1111/j.1865-1682.2009.01083.x.
- 12. Faye O, Diallo M, Diop D, Bezeid OE, Bâ H, et al. (2007) Rift Valley fever outbreak with East-Central African virus lineage in Mauritania, 2003. Emerging infectious diseases 13: 1016–1023.
- 13. Zeller HG, Fontenille D, Traore-Lamizana M, Thiongane Y, Digoutte JP (1997) Enzootic activity of Rift Valley fever virus in Senegal. The American Journal of Tropical Medicine and Hygiene 56: 265–272.
- 14. Traoré-Lamizana M, Fontenille D, Diallo M, Bâ Y, Zeller HG, et al. (2001) Arbovirus surveillance from 1990 to 1995 in the Barkedji area (Ferlo) of Senegal, a possible natural focus of Rift Valley fever virus. Journal of Medical Entomology 38: 480–492.
- 15. Marrama L, Spiegel A, Ndiaye K, Sall AA, Gomes E, et al. (2005) Domestic transmission of Rift Valley Fever virus in Diawara (Senegal) in 1998. The Southeast Asian Journal of Tropical Medicine and Public Health 36: 1487–1495.
- 16. Sall AA, Zanotto PM, Vialat P, Sène OK, Bouloy M (1998) Molecular epidemiology and emergence of Rift Valley fever. Memórias do Instituto Oswaldo Cruz 93: 609–614.
- 17. Weaver SC (2005) Host range, amplification and arboviral disease emergence. Archives of virology Supplementum 33–44.
- 18. World Health Organization (2007) WHO | Rift Valley Fever in Kenya, Somalia and the United Republic of Tanzania. Available:http://www.who.int/csr/don/2007_05_09/en/. Accessed 11 August 2011.
- 19. Archer BN, Weyer J, Paweska J, Nkosi D, Leman P, et al. (2011) Outbreak of Rift Valley fever affecting veterinarians and farmers in South Africa, 2008. South African Medical Journal = Suid-Afrikaanse Tydskrif Vir Geneeskunde 101: 263–266.
- 20. World Health Organization (2010) WHO | Rift Valley fever in South Africa. Available:http://www.who.int/csr/don/2010_03_30a/en/. Accessed 10 August 2011.
- 21. World Health Organization (2008) WHO | Rift Valley Fever in Sudan - update 5. Available:http://www.who.int/csr/don/2008_01_22/en/index.html. Accessed 11 August 2011.
- 22. World Health Organization (2000) WHO | Saudi Arabia and Yemen: First cases of Rift Valley fever reported outside Africa, 2000. Available:http://www.who.int/csr/outbreaknetwork/saudiarabia/en/. Accessed 11 August 2011.
- 23. Bird BH, Khristova ML, Rollin PE, Ksiazek TG, Nichol ST (2007) Complete genome analysis of 33 ecologically and biologically diverse Rift Valley fever virus strains reveals widespread virus movement and low genetic diversity due to recent common ancestry. Journal of virology 81: 2805–2816. doi:10.1128/JVI.02095-06.
- 24. Bird BH, Githinji JWK, Macharia JM, Kasiiti JL, Muriithi RM, et al. (2008) Multiple virus lineages sharing recent common ancestry were associated with a Large Rift Valley fever outbreak among livestock in Kenya during 2006–2007. Journal of virology 82: 11152–11166. doi:10.1128/JVI.01519-08.
- 25. Grobbelaar Aa, Weyer J, Leman Pa, Kemp A, Paweska JT, et al. (2011) Molecular epidemiology of Rift Valley fever virus. Emerging infectious diseases 17: 2270–2276. doi:10.3201/eid1712.111035.
- 26. Sall AA, de A Zanotto PM, Vialat P, Sene OK, Bouloy M (1998) Origin of 1997–98 Rift Valley fever outbreak in East Africa. Lancet 352: 1596–1597.
- 27. Pepin M, Bouloy M, Bird BH, Kemp A, Paweska J (2010) Rift Valley fever virus(Bunyaviridae: Phlebovirus): an update on pathogenesis, molecular epidemiology, vectors, diagnostics and prevention. Veterinary research 41: 61.
- 28. Sall AA, Zanotto PM, Sene OK, Zeller HG, Digoutte JP, et al. (1999) Genetic reassortment of Rift Valley fever virus in nature. Journal of virology 73: 8196–8200.
- 29. Sall AA, de A Zanotto PM, Zeller HG, Digoutte JP, Thiongane Y, et al. (1997) Variability of the NS(S) protein among Rift Valley fever virus isolates. The Journal of general virology 78(Pt 11): 2853–2858.
- 30. Moutailler S, Roche B, Thiberge J-M, Caro V, Rougeon F, et al. (2011) Host alternation is necessary to maintain the genome stability of rift valley fever virus. PLoS neglected tropical diseases 5: e1156. doi:10.1371/journal.pntd.0001156.
- 31. Wertheim JO, Kosakovsky Pond SL (2011) Purifying selection can obscure the ancient age of viral lineages. Molecular biology and evolution. doi:10.1093/molbev/msr170.
- 32. Wiens JJ (2003) Missing data, incomplete taxa, and phylogenetic accuracy. Systematic Biology 52: 528–538.
- 33. Johnson K (1993) Emerging Viruses in Context: An Overview of Viral Hemorragic Fevers. In: Morse S, editor. Emerging Viruses. New York: Oxford University Press. 317 p.
- 34. Pfeffer M, Dobler G (2010) Emergence of zoonotic arboviruses by animal trade and migration. Parasites & Vectors 3: 35. doi:10.1186/1756-3305-3-35.
- 35. Favier C, Chalvet-Monfray K, Sabatier P, Lancelot R, Fontenille D, et al. (2006) Rift Valley fever in West Africa: the role of space in endemicity. Tropical Medicine & International Health: TM & IH 11: 1878–1888. doi:10.1111/j.1365-3156.2006.01746.x.
- 36. Gad AM, Feinsod FM, Allam IH, Eisa M, Hassan AN, et al. (1986) A possible route for the introduction of Rift Valley fever virus into Egypt during 1977. The Journal of Tropical Medicine and Hygiene 89: 233–236.
- 37. Miller BR, Godsey MS, Crabtree MB, Savage HM, Al-Mazrao Y, et al. (2002) Isolation and genetic characterization of Rift Valley fever virus from Aedes vexans arabiensis, Kingdom of Saudi Arabia. Emerging Infectious Diseases 8: 1492–1494.
- 38. Yamar BA, Diallo D, Kebe CMF, Dia I, Diallo M (2005) Aspects of bioecology of two Rift Valley Fever Virus vectors in Senegal (West Africa): Aedes vexans and Culex poicilipes (Diptera: Culicidae). Journal of Medical Entomology 42: 739–750.
- 39. LaBeaud AD, Cross PC, Getz WM, Glinka A, King CH (2011) Rift Valley fever virus infection in African buffalo (Syncerus caffer) herds in rural South Africa: evidence of interepidemic transmission. The American Journal of Tropical Medicine and Hygiene 84: 641–646. doi:10.4269/ajtmh.2011.10-0187.
- 40. Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic acids research 22: 4673–4680.
- 41. Martin DP, Lemey P, Lott M, Moulton V, Posada D, et al. (2010) RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics (Oxford, England) 26: 2462–2463. doi:10.1093/bioinformatics/btq467.
- 42. Pond SLK, Frost SDW, Muse SV (2005) HyPhy: hypothesis testing using phylogenies. Bioinformatics (Oxford, England) 21: 676–679. doi:10.1093/bioinformatics/bti079.
- 43. Strimmer K, von Haeseler A (1997) Likelihood-mapping: a simple method to visualize phylogenetic content of a sequence alignment. Proceedings of the National Academy of Sciences of the United States of America 94: 6815–6819.
- 44. Schmidt Ha, Strimmer K, Vingron M, von Haeseler A (2002) TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics (Oxford, England) 18: 502–504.
- 45. Zwickl D (2006) (2006) Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion.
- 46. Sanderson MJ (2003) r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics (Oxford, England) 19: 301–302.
- 47. Sanderson MJ (2002) Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach. Molecular biology and evolution 19: 101–109.
- 48. Drummond AJ, Ho SYW, Phillips MJ, Rambaut A (2006) Relaxed phylogenetics and dating with confidence. PLoS biology 4: e88. doi:10.1371/journal.pbio.0040088.
- 49. Drummond AJ, Rambaut A (2007) BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology 7: 214. doi:10.1186/1471-2148-7-214.
- 50. Minin VN, Bloomquist EW, Suchard MA (2008) Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics. Molecular biology and evolution 25: 1459–1471. doi:10.1093/molbev/msn090.
- 51. Lemey P, Rambaut A, Drummond AJ, Suchard MA (2009) Bayesian phylogeography finds its roots. PLoS computational biology 5: e1000520. doi:10.1371/journal.pcbi.1000520.