The timing of the origin and diversification of rodents remains controversial, due to conflicting results from molecular clocks and paleontological data. The fossil record tends to support an early Cenozoic origin of crown-group rodents. In contrast, most molecular studies place the origin and initial diversification of crown-Rodentia deep in the Cretaceous, although some molecular analyses have recovered estimated divergence times that are more compatible with the fossil record. Here we attempt to resolve this conflict by carrying out a molecular clock investigation based on a nine-gene sequence dataset and a novel set of seven fossil constraints, including two new rodent records (the earliest known representatives of Cardiocraniinae and Dipodinae). Our results indicate that rodents originated around 61.7–62.4 Ma, shortly after the Cretaceous/Paleogene (K/Pg) boundary, and diversified at the intraordinal level around 57.7–58.9 Ma. These estimates are broadly consistent with the paleontological record, but challenge previous molecular studies that place the origin and early diversification of rodents in the Cretaceous. This study demonstrates that, with reliable fossil constraints, the incompatibility between paleontological and molecular estimates of rodent divergence times can be eliminated using currently available tools and genetic markers. Similar conflicts between molecular and paleontological evidence bedevil attempts to establish the origination times of other placental groups. The example of the present study suggests that more reliable fossil calibration points may represent the key to resolving these controversies.
Citation: Wu S, Wu W, Zhang F, Ye J, Ni X, et al. (2012) Molecular and Paleontological Evidence for a Post-Cretaceous Origin of Rodents. PLoS ONE 7(10): e46445. doi:10.1371/journal.pone.0046445
Editor: Alistair Robert Evans, Monash University, Australia
Received: May 2, 2012; Accepted: August 31, 2012; Published: October 5, 2012
Copyright: © Wu et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Author SW was supported by the Putnam Expedition Grants and the Robert G. Goelet Research Fund from the Museum of Comparative Zoology of Harvard University. Authors WW, JY, XN, and JM were supported by the National Natural Science Foundation of China (408720320). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Molecular clocks and the fossil record are the two major approaches to dating evolutionary divergences, and reliable dates are crucial for using the Tree of Life to understand evolutionary processes and mechanisms. In the case of major divergences among groups of placental mammals, the general tendency has been for paleontological studies to suggest that these events took place in the Paleocene, while molecular ones place them deep in the Cretaceous, although some molecular studies have recovered estimated divergence times that are more compatible with the fossil record. The general pattern of disagreement has confounded our ability to discern the influence of the K/Pg extinction event on the radiation of extant mammals. This problem is particularly evident in rodents, a group that accounts for approximately 42% of extant mammalian diversity. The oldest known fossil that can be clearly identified as a member of the rodent lineage, the fragmentary possible crown-rodent Acritoparamys, has a Late Paleocene age of about 57 million years (Ma). However, early molecular studies almost unanimously supported a Cretaceous radiation of rodents, although Douzery et al. and Kitazoe et al. obtained molecular results that placed the earliest divergences within crown-Rodentia in the Paleocene and were therefore compatible with the fossil record. Despite these exceptions, a strong discrepancy still persists between the fossil record and the preponderance of results from molecular clock studies. Resolving this discrepancy is therefore critical not only for understanding the evolutionary history and dynamics of rodents, but also for assessing how reliably molecular clocks and fossils can estimate divergence times.
Molecular clocks attempt to pinpoint divergence events, whereas the fossil record alone can yield only minimum estimates, given by the first known fossil occurrence of a given group. The problem of molecular rate heterogeneity, a major factor undermining the accuracy of molecular clock estimates, has been addressed by applying relaxed molecular clocks across sequences. The availability of reliable fossils that can be used as calibration points, therefore, may hold the key to obtaining an accurate time estimate of the origin and radiation of rodents. In this study, we employed five rodent calibrations based on recent fossil discoveries, including two new fossils that are chronologically constrained with palaeomagnetic chrons. The earliest known fossil of Dipodinae (three-toed jerboas) is dated to 10.5 Ma, from the beginning of the Late Miocene of China (Fig. S1 and Fig. S2), providing an upper (more recent) bound for the divergence time between Dipodinae and Allactaginae (five-toed jerboas). The earliest known fossils of the two extant genera of Cardiocraniinae (dwarf jerboas), Cardiocranius and Salpingotus, are dated to 9 Ma. The lower (more ancient) bound of each of these divergence events is 13 Ma, in the middle part of the Middle Miocene, based on recent biostratigraphic and paleomagnetic data. The earliest known myodont, Erlianomys, was discovered in the Early Eocene (54 Ma), providing a reliable lower (more ancient) bound for the divergence time between the two primary myodont groups, Dipodoidea and Muroidea (Fig. S3). The upper bound of this divergence can be constrained to 43 Ma, based on the earliest known dipodoids and muroids from China. We also use the split between mice and rats, as well as that between octodontids and erethizontids, as calibration points, because of their improved fossil record and stratigraphic data.
In order to achieve a balanced distribution of calibration points within the phylogeny, we also applied two well-defined non-rodent calibrations in successive sister lineages to rodents: the split between marsupials and placentals, and the split between feliforms and caniforms.
We use the fossils noted above to create seven fossil calibrations and combine them with a nine-gene sequence dataset to re-evaluate the timing of rodent origin and diversification. For taxon sampling, we included major lineages across rodents, and sampled comprehensively within Dipodoidea to include all six subfamilies, an approach we refer to as “bottom-up” taxon sampling (i.e. building up an analytic model from a foundation of many individual data samples, versus the “top-down” approach of inferring an analytic model from relatively few data points). This sampling approach allowed us to accurately incorporate the new calibration points based on Chinese dipodoid fossils that are comparatively recent in geological time. Our analysis implements a relaxed molecular clock model using Bayesian and maximum likelihood approaches. Our results suggest that rodents originated and diversified after the K/Pg boundary at the beginning of the Cenozoic, a finding consistent with patterns found in the fossil record.
Test of molecular rate heterogeneity
The BEAST analysis shows that substantial rate variation across the data set was found only in the CNR1 locus, with a ucld.stdev parameter of 1.788 (95% confidence interval (CI) 1.35–2.19). The ucld.stdev values of all other loci lie between 0 and 1, indicating that these loci have a moderate level of rate variation (see Table S1). The ucld.stdev parameter of the concatenated partitioned data set is 0.57 (95% CI 0.45–0.71), which is lower than that of all other individual genes except IRBP. These results demonstrate the necessity of applying a relaxed molecular clock model to the DNA dataset for the molecular dating estimates.
Molecular dating with complete fossil constraints
We inferred a time-calibrated phylogenetic tree for 41 mammal species, focusing on the superfamily Dipodoidea (jerboas and relatives), for which we sampled 18 species representing all six subfamilies. Nine unlinked nuclear genes were used to construct the tree using Bayesian and likelihood criteria. Topologically, the Bayesian and maximum likelihood approaches gave identical, highly resolved phylogenies, supporting Hystricomorpha as the most basal rodent clade (bootstrap support = 71, posterior probability = 0.98, Fig. 1). For time-calibrated trees, we estimated divergence times using Bayesian (Fig. 2) and likelihood approaches, which returned similar results (Table 1). Our estimated divergence times are much younger than those estimated in previous molecular analyses, and are more congruent with the fossil record.
Figure 1. Phylogenetic relationships for rodents and outgroups estimated using Bayesian and maximum likelihood algorithms.
The Bayesian posterior probability and the maximum likelihood bootstrap value for each node are given to the left and right of the slash, respectively. Support scores are not shown for nodes that receive full support from both posterior probability and bootstrap value. doi:10.1371/journal.pone.0046445.g001
Figure 2. Molecular time scale for the orders of Rodentia, Lagomorpha, Primates, Carnivora and Perissodactyla obtained from the Bayesian estimates based on seven fossil calibration points and relaxed molecular clock model.
Fossil constraints are indicated by circles A to G on the corresponding nodes: A. 160–190 Ma for the split between Placentalia and Marsupialia; B. 38–61.7 Ma for the split between Caniformia and Feliformia; C. 28.5–37 Ma for the split between Octodontomys and Erethizon; D. 43–54 Ma for the basal split of Myodonta; E. 7.3–12.2 Ma for the split between mice and rats; F. 9–13 Ma for the split between Cardiocranius and Salpingotus; G. 10.5–13 Ma for the split between Dipodinae and Allactaginae. doi:10.1371/journal.pone.0046445.g002
Table 1. Support values of divergence dates for Bayesian and penalized likelihood estimates using seven fossil calibration points. doi:10.1371/journal.pone.0046445.t001
We estimate that the divergence between rodents and lagomorphs occurred about 61.7 Ma (Bayesian, Bayesian credibility interval (BCI) 52.8–71) or 62.4 Ma (likelihood). The basal divergence of rodents was estimated to be 57.7 Ma (Bayesian, BCI 50.1–66) or 58.9 Ma (likelihood). These estimates are roughly consistent with recent interpretations of the early gliran fossil record. The oldest known Glires, Heomys and Mimotona, may both fall on the lagomorph stem , although Meng et al.  recovered Heomys as a stem-rodent rather than a stem-lagomorph. However, both analyses agree that Heomys and Mimotona belong to the gliran lineage. Both genera are placed in the early Late Paleocene , , an age close to the rodent-lagomorph divergence time estimated by our study. The appearance of sciuromorphs was estimated at 55.6 Ma (Bayesian, BCI 48.4–63.2) or 57 Ma (likelihood). The divergence between castorimorphs and myodonts was estimated at 52.9 Ma (Bayesian, BCI 46.5–59.9) or 54.6 Ma (likelihood). In addition, we estimate the basal divergence of hystricomorphs to be 50.2 Ma (Bayesian, BCI 41.6–58.7) or 52.3 Ma (likelihood), a result consistent with the oldest hystricomorph fossils, which date to ~50 Ma . We estimate the origin of crown Hystricognathi to be 36.9 Ma (Bayesian, BCI 31.6–43) or 35.2 Ma (likelihood), compatible with the occurrence of the oldest hystricid fossils at ~34 Ma .
We tested for the Node Density Effect (NDE) ,  using the online utility available at (http://www.evolution.reading.ac.uk/pe/index.html), which is based on the method outlined by Venditti et al. (2006) . The results of this test indicate that the NDE is present in our tree (β significantly greater than zero). By successively pruning clades in the tree, we find that the NDE is present because of the correlation between nodes and branch lengths associated with the increasing sampling of Dipodinae and Allactaginae in relation to the rest of the tree. Reducing the numbers of taxa within Dipodinae and Allactaginae or pruning out Dipodinae removes the NDE. The NDE may have the effect of causing our estimates of divergence time to appear more recent, because unsampled lineages could downwardly bias the molecular divergences. To test this, we re-analyzed our data set using BEAST with reduced taxon sampling in the subfamilies Dipodinae and Allactaginae. For these two subfamilies, we sampled two taxa for each, including Dipus sagitta and Jaculus blanfordi for Dipodinae, and Allactaga elater and Allactodipus bobrinskii for Allactaginae, because such taxon sampling removes the NDE from our tree. Our results show that the BEAST analysis based on the reduced tree produced divergence times for major nodes that are similar to those for the full taxa (Table S6), and statistically we found no significant difference between these two estimates (t-test, p-value = 0.954). These results suggest that the impact of the NDE on the estimated divergence times for the rest of the tree is limited.
Although the confidence intervals attached to our Bayesian estimates are relatively wide, as is often the case for studies like this one that employ a limited number of genes, the fact that our Bayesian and maximum likelihood estimates generally agree with one another reinforces the results of both analyses and suggests that the dates are reliable.
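The agreement between the two methods can be checked directly from the point estimates quoted above. The following is a minimal sketch, not part of the original analysis: the node labels and the helper name `compare_estimates` are illustrative, and only the six nodes for which both estimates are given in the text are included.

```python
# Ages in Ma as quoted in the text above.
# Tuple layout: (Bayesian estimate, BCI lower, BCI upper, likelihood estimate).
NODE_ESTIMATES = {
    "Rodent-lagomorph divergence": (61.7, 52.8, 71.0, 62.4),
    "Basal divergence of rodents": (57.7, 50.1, 66.0, 58.9),
    "Appearance of sciuromorphs": (55.6, 48.4, 63.2, 57.0),
    "Castorimorph-myodont divergence": (52.9, 46.5, 59.9, 54.6),
    "Basal divergence of hystricomorphs": (50.2, 41.6, 58.7, 52.3),
    "Origin of crown Hystricognathi": (36.9, 31.6, 43.0, 35.2),
}


def compare_estimates(estimates):
    """Return (node, likelihood minus Bayesian in Myr, likelihood inside BCI?) rows."""
    rows = []
    for node, (bayes, bci_low, bci_high, likelihood) in estimates.items():
        difference = round(likelihood - bayes, 1)
        inside_bci = bci_low <= likelihood <= bci_high
        rows.append((node, difference, inside_bci))
    return rows


rows = compare_estimates(NODE_ESTIMATES)
mean_abs_diff = sum(abs(diff) for _, diff, _ in rows) / len(rows)

for node, diff, inside in rows:
    print(f"{node}: likelihood - Bayesian = {diff:+.1f} Myr, inside BCI: {inside}")
print(f"Mean absolute difference: {mean_abs_diff:.2f} Myr")
```

For these six nodes every likelihood estimate falls inside the corresponding Bayesian credibility interval, which is the sense in which the two sets of dates "generally agree".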
Sensitivity test of fossil age constraints
We tested the sensitivity of applying different fossil constraints for their impact on the estimated times of divergence within rodents.
Compared with the estimated dates using all constraints, omitting constraint D for the divergence between Dipodoidea and Muroidea has limited impact on the estimated times for all nodes. The estimated date of 44 Ma (BCI 33.9–54.9) for the Dipodoidea-Muroidea divergence is close to the 45.4 Ma (BCI 43–49.3) obtained when this fossil constraint was employed (Table 2). When the constraints on the tip rodent nodes (calibration points E, F and G) were relaxed, the estimates for most nodes increased markedly, pushing the basal divergence of Glires into the Cretaceous at 68.4 Ma (BCI 55.6–82.6) (Table 2). When we used only constraint E, the most commonly used rodent calibration (the mouse-rat divergence), together with the two non-rodent constraints A and B, the estimates for all nodes in Euarchontoglires increased dramatically, as expected: 84.3 Ma (BCI 57.6–116.5) for the base of Boreoeutheria, 79 Ma (BCI 53.2–108.6) for the base of Euarchontoglires, 66.4 Ma (BCI 41–95.7) for the base of Primates, 74.7 Ma (BCI 50.6–103) for the base of Glires, and 69.7 Ma (BCI 46.5–96.1) for the base of Rodentia (Table 2). In addition, we tested an alternative constraint, with a minimum age of 124.6 Ma and a maximum age of 138.4 Ma, on the split between placentals and marsupials to assess the sensitivity of descendant nodes to this date. The minimum age was based on the Early Cretaceous Eomaia, which was regarded as the oldest known eutherian before the recent discovery of Juramaia sinensis from the Late Jurassic. The maximum age was based on the basal therian Vincelestes. The analysis shows that changing this age resulted in slightly younger estimates of divergence times for deep nodes, but had little impact on recent nodes (Table 2). One major uncertainty in the rodent phylogeny is the position of the root of Rodentia. Based on dental morphology, Marivaux et al. placed Hystricomorpha as the basalmost clade of rodents.
However, molecular phylogenetic studies support either Sciuromorpha, Hystricomorpha or the clade formed by Sciuromorpha and Hystricomorpha as the most basal clade of rodents, and all of these placements have received relatively weak statistical support. To test whether acceptance of the alternative topologies would strongly affect our dating estimates, we conducted two alternative BEAST analyses by changing the root of Rodentia from Hystricomorpha to (1) Sciuromorpha and (2) the clade formed by Sciuromorpha and Hystricomorpha. Neither permutation had an important impact on the results, so our major conclusions remain unchanged (Figs. S4, S5).
Table 2. Summary of results of sensitivity test of fossil age constraints. doi:10.1371/journal.pone.0046445.t002
The above analyses demonstrate that changes of the age of non-rodent constraints have limited impact on the estimated divergence times of nodes in the rodent tree. By contrast, removal of all three recent rodent constraints results in dramatic increase of ages for other nodes, reverting to the older divergence times estimated by earlier studies. Moreover, these results indicate that the use of a single mouse-rat constraint for the divergence time estimates for rodents can result in overestimates for all nodes in Glires.
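The size of the overestimate can be illustrated with the Bayesian point estimates quoted in the text. This is a sketch under two assumptions: the rodent-lagomorph divergence (61.7 Ma) is treated as the same node as the "base of Glires" in the sensitivity test, and the helper name `age_shift` is purely illustrative.

```python
# Bayesian point estimates in Ma, as quoted in the text.
ALL_SEVEN_CONSTRAINTS = {"Base of Glires": 61.7, "Base of Rodentia": 57.7}
# Estimates when only constraint E (mouse-rat) plus non-rodent constraints A and B are used.
MOUSE_RAT_ONLY = {"Base of Glires": 74.7, "Base of Rodentia": 69.7}


def age_shift(full, reduced):
    """Return {node: (absolute shift in Myr, shift as % of the full-constraint age)}."""
    shifts = {}
    for node, full_age in full.items():
        delta = reduced[node] - full_age
        shifts[node] = (round(delta, 1), round(100 * delta / full_age, 1))
    return shifts


shifts = age_shift(ALL_SEVEN_CONSTRAINTS, MOUSE_RAT_ONLY)
for node, (delta_myr, percent) in shifts.items():
    print(f"{node}: +{delta_myr} Myr ({percent}% older)")
```

On these figures, relying on the single mouse-rat calibration makes both nodes roughly a fifth older than the estimates obtained with all seven constraints.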
In the penalized likelihood analyses run in r8s, when constraints on the tip rodent nodes E, F and G were relaxed, the estimated dates for all nodes changed little (Table 2). This is not surprising, since age constraints on the root and deep nodes have a much bigger influence on the divergence estimates of other nodes in the program r8s. Setting the age of the root to 180 Ma caused only a slight increase in the date estimates, including 64.8 Ma for the basal split of Glires and 61.1 Ma for the basal diversification of Rodentia (Table 2). These analyses show that the influence of the root's age on the estimated dates of major rodent lineages is limited.
Three hypotheses have been proposed to characterize the evolutionary radiation of placental mammals: the Explosive Model puts the origin of placental orders and their intraordinal diversification shortly after the K/Pg boundary, whereas the Short Fuse Model places the origin of placental orders and intraordinal diversification in the Cretaceous, and the Long Fuse Model posits Cretaceous origins of placental orders but intraordinal diversification after the K/Pg boundary. Paleontological evidence favors the Explosive Model, suggesting that the origin and diversification of placental mammals occurred following the K/Pg extinction event that wiped out the non-avian dinosaurs and opened up many ecological niches. By contrast, recent molecular studies support either the Short Fuse or the Long Fuse Model, which suggest that continental breakup in the Late Cretaceous, rather than the opening of ecological niches by differential extinction among groups, contributed to the origin and/or diversification of placental mammals. For rodents, most previous molecular studies consistently support a Short Fuse Model, making rodents one of the oldest placental orders, with both origin and diversification in the Cretaceous. Because rodents lack a Cretaceous fossil record, however, there is no evidence to indicate whether their postulated diversification in the Cretaceous would have been driven by tectonic events or by other factors.
Recent phylogenetic studies, based on extensive sampling of fossil mammals, have placed all Cretaceous eutherians outside the placental crown groups. Our estimated divergence times are consistent with a rapid radiation of major rodent lineages during the Paleocene. Therefore, our results agree with those of a few other recent molecular studies in supporting the hypothesis that the origin and intraordinal diversification of rodents occurred after the K/Pg boundary about 65 Ma, following the extinction of non-avian dinosaurs. These diversification patterns are consistent with the Explosive Model as applied to mammalian orders generally, rather than with the Short Fuse Model for radiation within the order Rodentia.
Our study is methodologically similar to others that employed the relaxed molecular clock and multiple fossil calibration points (Fig. 3a). However, the divergence times we obtained, particularly those for rodents, are significantly younger than those in some recent studies, which considerably predate the K/Pg boundary (Fig. 3b). Compared to the results of previous studies, our younger estimates are probably attributable to the fact that we employed multiple internal rodent fossil constraints, which are well documented stratigraphically in a continuous sequence dated with convincing paleomagnetic chrons (see Figs. S2, S3). The study of Springer et al. (2003) applied one rodent constraint, the split of mouse and rat, with a minimum age of 12 Ma (Fig. 3b). But recent increased resolution of the fossil record has decreased the minimum age constraint for the mouse-rat split to around 7.3 Ma. Additionally, our sensitivity test shows that the use of a single rodent calibration point can result in overestimates for all nodes in rodents. Several recent studies employed multiple rodent fossil constraints. However, their estimated divergence times for rodents are still similar to those of Springer et al. (2003), supporting the Short Fuse Model of rodent diversification (Fig. 3b).
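The three models differ only in where the origin of an order and its intraordinal diversification fall relative to the K/Pg boundary, so the distinction can be written as a small decision rule. The sketch below is illustrative only: the function name is hypothetical, the boundary age of 65 Ma is the value quoted in the text, and the rule uses point estimates and ignores credibility intervals.

```python
KPG_BOUNDARY_MA = 65.0  # boundary age as quoted in the text


def classify_radiation_model(origin_ma, diversification_ma, boundary_ma=KPG_BOUNDARY_MA):
    """Classify a pair of point estimates (in Ma) as Explosive, Short Fuse or Long Fuse.

    Ages are in millions of years before present, so larger values are older.
    """
    if diversification_ma > origin_ma:
        raise ValueError("intraordinal diversification cannot predate the origin of the order")
    origin_in_cretaceous = origin_ma > boundary_ma
    diversification_in_cretaceous = diversification_ma > boundary_ma
    if not origin_in_cretaceous:
        return "Explosive"   # origin and diversification both after the boundary
    if diversification_in_cretaceous:
        return "Short Fuse"  # origin and diversification both in the Cretaceous
    return "Long Fuse"       # Cretaceous origin, post-boundary diversification


# Bayesian estimates from this study: rodent origin 61.7 Ma, basal diversification 57.7 Ma.
print(classify_radiation_model(61.7, 57.7))
# Estimates obtained with only the mouse-rat rodent calibration: 74.7 Ma and 69.7 Ma.
print(classify_radiation_model(74.7, 69.7))
```

With the full set of constraints the rodent estimates fall under the Explosive Model, whereas the single-calibration estimates would be read as Short Fuse.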
Figure 3. Comparisons among rodent fossil constraints used and divergence times estimated.
A. The temporal distribution of rodent fossil constraints used. The circles and squares represent the minimum and maximum age constraints for each of the fossil calibration points, respectively. The bar connecting the circle and square shows the range between the minimum and maximum age constraints. Note that most calibration points used by Huchon et al. 2007 and Meredith et al. 2011 have much larger gaps between the minimum and maximum age constraints than those used in this study. B. Divergence time estimates obtained for major boreoeutherian lineages. The estimates of Springer et al. 2003, Huchon et al. 2007 and Meredith et al. 2011 are close to each other, and predate the K/Pg boundary. However, the divergence estimates obtained by the present study are much younger and very close to the K/Pg boundary. Numbers from 1 to 5 indicate: 1. Base of Rodentia; 2. Base of Glires (rodents and lagomorphs); 3. Base of Primates; 4. Base of Euarchontoglires; 5. Base of Boreoeutheria. doi:10.1371/journal.pone.0046445.g003
The major difference between the fossil constraints used in this study and those used by Huchon et al. (2007) and Meredith et al. (2011) is that the minimum and maximum age constraints for many rodent calibration points used in the latter two studies are farther apart than the constraints for points used in our study (Fig. 3a), because of uncertainty in the paleontological and stratigraphic data associated with the fossils in question. For example, the youngest rodent fossil constraint used by Meredith et al. (2011), the split between Ctenomyidae and Octodontidae, has a minimum age of 9.07 Ma and a maximum age of 34 Ma, a range of 24.93 million years. Our fossil calibration points are well constrained, with smaller amounts of time between the minimum and maximum ages (Fig. 3a). By contrast, the Zhang et al. (2012) study was similar to ours in that dipodoids were densely sampled, but differed from ours in that sampling of major rodent lineages outside Dipodoidea was very limited. This lack of extensive sampling of rodent lineages may account for the fact that Zhang et al. (2012) recovered a comparatively early divergence time for the rodent-lagomorph node, consistent with earlier studies rather than with our results. Conversely, dos Reis et al. (2012) carried out a study with extensive genomic sampling and broad taxonomic sampling across Placentalia, but sampled relatively few rodents of any kind. dos Reis et al. (2012) estimated the rodent-lagomorph divergence to have occurred at 70.8 Ma (Confidence Interval: 69.9–71.8), considerably earlier than the estimate recovered by our study, and the discrepancy may arise from either the lack of dense sampling within Rodentia by dos Reis et al. (2012) or the lack of extensive genomic sampling and/or broad sampling outside Rodentia in our analysis. The strengths and weaknesses of these different sampling approaches remain to be fully explored.
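The contrast in constraint width can be made concrete with the bounds listed in the Figure 2 caption and the Meredith et al. (2011) example quoted above. This is a minimal sketch; the dictionary keys and the helper name `interval_width` are illustrative.

```python
# Rodent constraints from this study as (minimum, maximum) ages in Ma (Figure 2 caption).
RODENT_CONSTRAINTS_MA = {
    "C: Octodontomys-Erethizon": (28.5, 37.0),
    "D: basal split of Myodonta": (43.0, 54.0),
    "E: mouse-rat": (7.3, 12.2),
    "F: Cardiocranius-Salpingotus": (9.0, 13.0),
    "G: Dipodinae-Allactaginae": (10.5, 13.0),
}
# Ctenomyidae-Octodontidae constraint of Meredith et al. (2011), as quoted in the text.
MEREDITH_EXAMPLE_MA = (9.07, 34.0)


def interval_width(bounds):
    """Width of a (minimum, maximum) age constraint in million years."""
    min_age, max_age = bounds
    if max_age < min_age:
        raise ValueError("maximum age must not be younger than minimum age")
    return round(max_age - min_age, 2)


widths = {name: interval_width(bounds) for name, bounds in RODENT_CONSTRAINTS_MA.items()}
widest_here = max(widths.values())
meredith_width = interval_width(MEREDITH_EXAMPLE_MA)

for name, width in widths.items():
    print(f"{name}: {width} Myr")
print(f"Widest rodent constraint in this study: {widest_here} Myr")
print(f"Meredith et al. (2011) example: {meredith_width} Myr")
```

Even the widest rodent constraint used here (constraint D, 11 million years) is less than half as wide as the 24.93-million-year example.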
Our results show that reconciliation between estimates of divergence times based on molecular clock and paleontological data is possible with standard tools and genetic markers, at least for the Rodentia, the most speciose of all mammalian orders. Achieving this consistency requires a reasonable number of reliable fossil calibration points supported by a well-constrained paleontological and stratigraphic record. The consistency between our results and the paleontological record suggests that similar controversies regarding the origin and diversification of other major biological groups, including the post-K/Pg diversification of various orders of modern mammals and birds  and the “Cambrian Explosion” of animal phyla  are potentially resolvable given adequate and reliable fossil calibration points.
Materials and Methods
Taxon and genomic sampling
This study includes 33 rodent species across major rodent lineages and eight outgroup taxa. The taxa examined and their classification are provided in Table S2. Portions of nine unlinked nuclear genes were sampled: alpha 2B adrenergic receptor (A2AB), cannabinoid receptor 1 (CNR1), growth hormone receptor (GHR), interphotoreceptor retinoid binding protein (IRBP), breast cancer susceptibility (BRCA1), von Willebrand factor (vWF), ATPase, Cu++ transporting, alpha polypeptide (ATP7A), the 3′-UTR region of cAMP responsive element modulator (Crem), and recombination activating gene 2 (RAG2). Detailed information on these loci is provided in Table S3.
Genomic DNA was prepared from either muscle or liver tissue samples using the DNeasy Tissue Kit (Qiagen, Inc.). PCR reactions were undertaken in 25-µL volumes with the following conditions: 94°C (5–10 min); 35 cycles of 94°C (45 s), 55°C (45 s) and 72°C (40–60 s); and 72°C (5–10 min). Sequence data were collected in both directions on ABI 3730 DNA Analyzers using Big Dye chemistry. The primers for both PCR and sequencing reactions are identified in Table S4.
DNA sequences were aligned using Kalign as implemented in the program eBioX (http://www.ebioinformatics.org) under default conditions, and refined manually using MacClade 4.06 . Ambiguous sites including potentially heterozygous sites were encoded based on IUPAC Ambiguity protocol. The GenBank accession numbers for each of the sequence data are provided in Table S5.
Phylogenetic trees were constructed using maximum likelihood and Bayesian criteria. The Akaike information criterion was used to determine the best substitution models of sequence evolution based on the results from MODELTEST 3.07  (Table S3). The maximum likelihood estimates for the best tree were performed with the program PhyML 3.0  for the concatenated dataset with 100 bootstrap replicates.
Bayesian inference of phylogeny was performed with the program MrBayes 3.1.2 . We performed four independent runs under identical conditions with partitions defined for each of the nine loci evolving with independent model parameters. For each analysis, one run was performed with four chains, and was sampled every 2000 generations for fifty million generations after a burn-in cycle of 5000 trees. The convergence of each run was examined with the program Tracer 1.5 .
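As a quick check of the sampling scheme, the number of trees saved per run follows from the chain length and the sampling interval. The sketch below assumes that the burn-in of 5000 trees is discarded from the sampled trees of each run; the text could also be read as the burn-in preceding the fifty million generations, in which case all sampled trees would be retained. The function name is illustrative.

```python
def sampled_states(total_generations, sample_interval):
    """Number of states saved when a chain is sampled every `sample_interval` generations."""
    if total_generations <= 0 or sample_interval <= 0:
        raise ValueError("chain length and sampling interval must be positive")
    return total_generations // sample_interval


# MrBayes settings quoted in the text: fifty million generations, sampled every 2000.
trees_per_run = sampled_states(50_000_000, 2000)

# Assumption: the 5000 burn-in trees are removed from the sampled trees of each run.
BURN_IN_TREES = 5000
retained_per_run = trees_per_run - BURN_IN_TREES

print(f"Trees sampled per run: {trees_per_run}")
print(f"Trees retained per run after burn-in: {retained_per_run}")
```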
Test of molecular rate heterogeneity
The levels of molecular rate heterogeneity for the concatenated dataset and for each of the loci were examined in the program BEAST 1.5.4. When running under the uncorrelated relaxed lognormal clock model, BEAST estimates the ucld.stdev parameter, which indicates how clock-like the DNA dataset is. The dataset is strictly clock-like if the ucld.stdev parameter is 0, and the dataset has substantial rate heterogeneity among lineages if the parameter is greater than 1. The dataset has a moderate level of heterogeneity if the ucld.stdev parameter lies between 0 and 1. For each individual locus, the test incorporated in BEAST was performed using five million generations sampled every 500 generations. For the concatenated dataset, the BEAST run was performed using 40 million generations, sampled every 500 generations, for two independent runs.
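The interpretation rule described above can be written as a short function. This is a sketch of the thresholds as stated in the text, not of anything BEAST itself computes; the function name is illustrative, and a value of exactly 1 is treated as moderate because the text reserves "substantial" for values greater than 1.

```python
def classify_rate_heterogeneity(ucld_stdev):
    """Interpret a ucld.stdev estimate using the thresholds given in the text."""
    if ucld_stdev < 0:
        raise ValueError("ucld.stdev is a standard deviation and cannot be negative")
    if ucld_stdev == 0:
        return "strictly clock-like"
    if ucld_stdev > 1:
        return "substantial rate heterogeneity"
    return "moderate rate heterogeneity"


# Values reported in the Results: CNR1 locus and the concatenated dataset.
print("CNR1:", classify_rate_heterogeneity(1.788))
print("Concatenated:", classify_rate_heterogeneity(0.57))
```

A point estimate alone does not capture uncertainty, so in practice the interval around ucld.stdev (for example 0.45–0.71 for the concatenated dataset) should be read against the same thresholds.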
Molecular estimates of divergence times
We employed two approaches to estimate the divergence time of each node: a Bayesian method as implemented in the program BEAST 1.5.4, and a penalized likelihood method in the program r8s 1.71.
Bayesian divergence times were estimated for the combined dataset, with a separate substitution model for each gene partition. The relaxed molecular clock model was chosen for all BEAST analyses, since the estimated value of the ucld.stdev parameter is 0.57 for the concatenated DNA dataset. Each BEAST analysis comprised two independent runs of forty million generations, sampled every 500 or every 1000 generations. The output files of the two independent runs were combined using LogCombiner 1.5.4 to produce the final results. Each run was examined with the program Tracer 1.5 for convergence.
The program r8s was used to compare maximum likelihood results with those obtained from BEAST. The r8s analyses were performed using the best maximum likelihood tree calculated in PhyML. We used the penalized likelihood model, the log penalty function and the truncated Newton algorithm, based on the recommendation of the developer. The optimal smoothing parameter was determined by a cross-validation run with r8s. For the nodes used as fossil age constraints, the check-gradient function was used to determine whether the estimated divergence times of these nodes fall beyond the age constraints. The date of the root (marsupial-placental split) was fixed at 160 and at 180 Ma in order to assess the sensitivity of descendant nodes to the age of the root. Since the use of the two different root ages does not have a significant impact on the estimated ages of other nodes (Table 2), this study used 160 Ma as the age of the root for all r8s analyses.
We applied seven fossil age constraints for the molecular dating analyses in this study. Minimum age constraints were based on the earliest known fossil record of a member of one of the divergent lineages. Where possible, the maximum age constraints are based on the age of the youngest well-sampled horizon that does not contain any members of the divergent lineages, in a stratigraphic sequence in which members of these lineages subsequently appear. When a stratigraphic sequence suitable for setting a particular upper bound by this method was not available, the age of the oldest member of the stem lineage leading up to the divergence was used as the upper bound. For the program BEAST, these calibration points were set as soft constraints with upper and lower bounds that allow for a 2.5% chance of lying beyond each user-input bound. The r8s program only allows the fossil calibrations to be set as a hard bound. Fossil constraints are as follows (Fig. 2):
- We assigned a minimum age of 160 Ma and a maximum age of 190 Ma for the divergence between marsupials and placentals, based on the earliest known placental mammal Juramaia sinensis and the basal mammal Hadrocodium.
- We assigned a minimum age of 38 Ma and a maximum age of 61.7 Ma for the divergence between caniforms and feliforms, based on the oldest known crown carnivorans Hesperocyon and Daphoenus from the Late Eocene and the oldest stem carnivore Protictis schaffi from the early Paleocene, respectively.
- We assigned a minimum age of 28.5 Ma and a maximum age of 37 Ma for the divergence between Octodontomys and the clade formed by Cavia and Erethizon, based on the oldest fossil record of Caviomorpha from the Late Eocene  and the oldest fossil erethizontid (Steiromys sp.) from the mid-Oligocene .
- The earliest muroid is Pappocricetodon, and the earliest dipodoids are Primisminthus and Banyuesminthus. All of these taxa first appear in the middle part of the Eocene of China. Erlianomys, which is from the lower part of the Arshanto Formation in Nuhetingboerhe, Inner Mongolia, China, represents the earliest fossil record of myodonts. Based on recent magneto-stratigraphic analyses, the Nuhetingboerhe section was dated to the early part of the Early Eocene (Fig. S3). Consistent with these fossil and stratigraphic results, we assigned a minimum age of 43 Ma and a maximum age of 54 Ma for the divergence between muroids and dipodoids.
- We assigned a minimum age of 7.3 Ma and a maximum age of 12.2 Ma for the divergence between mice and rats, based on the occurrence of the earliest known mouse Mus sp. and the earliest known Progonomys from the late Miocene of Pakistan.
- We assigned a minimum age of 9 Ma for the divergence between the two cardiocraniine genera Cardiocranius and Salpingotus, based on the occurrence of the earliest known Cardiocranius (C. pusillus) and the earliest known Salpingotus (S. primitivus) from the Late Miocene of China.
- We assigned a minimum age of 10.5 Ma for the divergence between the two dipodoid subfamilies Dipodinae and Allactaginae, based on the occurrence of the earliest known dental fossils of Dipodinae from the middle bed of the Dingshanyanchi Formation, Xinjiang, China (Fig. S1). The cheek teeth of the Dingshanyanchi species lack the mesoloph and mesocone on the upper molars, and have no mesolophid or mesoconid on the lower molars. Cusps on the labial and lingual sides of each molar show an alternating, rather than opposite, arrangement. The anteroloph of M2 and the ectolophid of the lower molars are oriented obliquely to the longitudinal axis. These dental features represent synapomorphies of dipodine molars. According to stratigraphic and palaeomagnetic results, the middle bed of the Dingshanyanchi Formation falls toward the base of the long normal magnetic chron C5n.2n, and thus dates to the earliest part of the Late Miocene (Fig. S2).
We set a maximum age of 13 Ma for both the divergence between Dipodinae and Allactaginae and the divergence between Salpingotus and Cardiocranius. Middle Miocene deposits are well exposed and have been extensively sampled in northern China and surrounding areas. The only dipodid known from the Middle Miocene is Protalactaga, a primitive genus. These deposits have produced no species of the extant dipodid subfamilies (Dipodinae, Allactaginae, Euchoreutinae and Cardiocraniinae), not even in the northern Junggar Basin of Xinjiang, which has a continuous geological sequence ranging from the Late Oligocene to the Late Miocene and where screen washing has been applied for decades. On this basis, the maximum bound for these two divergence events was placed in the middle part of the Middle Miocene.
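The calibration scheme above can be summarized as a small table of bounds. The sketch below is an illustration under stated assumptions, not the authors' BEAST or r8s configuration: the dictionary keys and function names are invented for this example, and the soft-bound prior is assumed to be a normal distribution whose 2.5% and 97.5% quantiles coincide with the minimum and maximum ages. The text states only that 2.5% of the probability lies beyond each bound, not which distribution was used.

```python
from statistics import NormalDist

# (minimum age, maximum age) in Ma, taken from the constraints listed above.
CONSTRAINTS = {
    "marsupials/placentals": (160.0, 190.0),
    "caniforms/feliforms": (38.0, 61.7),
    "Octodontomys/(Cavia+Erethizon)": (28.5, 37.0),
    "muroids/dipodoids": (43.0, 54.0),
    "mice/rats": (7.3, 12.2),
    "Cardiocranius/Salpingotus": (9.0, 13.0),
    "Dipodinae/Allactaginae": (10.5, 13.0),
}

Z_975 = NormalDist().inv_cdf(0.975)  # standard normal 97.5% quantile


def soft_bound_prior(min_age, max_age):
    """Normal prior with 2.5% of its mass below min_age and 2.5% above max_age.

    Assumption: the normal shape is chosen purely for illustration.
    """
    mean = (min_age + max_age) / 2.0
    sd = (max_age - min_age) / (2.0 * Z_975)
    return NormalDist(mean, sd)


def within_hard_bounds(age, min_age, max_age):
    """Hard-bound check: True only if the estimate lies inside [min_age, max_age]."""
    return min_age <= age <= max_age


for name, (lo, hi) in CONSTRAINTS.items():
    prior = soft_bound_prior(lo, hi)
    print(f"{name}: mean {prior.mean:.2f} Ma, sd {prior.stdev:.2f} Ma")
```

Under a soft bound, an estimate slightly older than the maximum age remains possible but improbable, whereas the hard-bound check rejects it outright. This is the practical difference between the BEAST and r8s treatments of the same fossil constraints.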
Occlusal view of molars of the earliest fossil representative of Dipodinae from the Dingshanyanchi Formation. A. right m1 (IVPP V 16905.2). B. right m2 (IVPP V 16905.3). C. right M2 (IVPP V 16905.1). D. left m3 (IVPP V 16905.5). E. right m3 (IVPP V 16905.4).
Magneto-stratigraphic sequence of the Dingshanyanchi Formation, Xinjiang, China. This figure is based on and modified from Sun et al. 2010. The blue arrow indicates the layer that produced the earliest fossil representative of Dipodinae shown in Figure S1.
Magneto-stratigraphic sequence of the Arshanto Formation. This figure is based on and modified from Sun et al. 2009. The blue arrow indicates the layer that produced the earliest known myodont fossil, Erlianomys.
Divergence times obtained from Bayesian estimates based on the alternative topology with Sciuromorpha as the basal clade of Rodentia. Note that the estimated divergence times support a post-Cretaceous origin and diversification of Rodentia.
Divergence times obtained from Bayesian estimates based on the alternative topology with the clade of Sciuromorpha and Hystricomorpha as the basal clade of Rodentia. Note that the estimated divergence times support a post-Cretaceous origin and diversification of Rodentia.
Results of the test of molecular rate heterogeneity. The ucld.stdev parameters for each locus and the concatenated, partitioned data-set estimated by BEAST. Abbreviations: 95% C. I. = 95% Confidence Interval; ESS = Effective Sample Size.
List of taxon sampling for this study. Abbreviations: MVZ = Museum of Vertebrate Zoology, University of California, Berkeley; MCZ = Museum of Comparative Zoology, Harvard University; AMNH = American Museum of Natural History; NMNH = National Museum of Natural History, Smithsonian Institution.
Characteristics of the genes included in this study, showing the AIC weights supporting the best model for each entry.
List of GenBank accession numbers.
Comparison of divergence times for major nodes estimated using BEAST with full taxon sampling and with a tree that is free of the node-density effect (NDE), obtained by reducing taxon sampling in the subfamilies Dipodinae and Allactaginae. Values in parentheses are the 95% Bayesian credibility intervals. Note that these two analyses produced similar estimates of divergence times for major nodes. A statistical test shows that there is no significant difference between the two sets of time estimates (t-test, p-value = 0.954).
We are grateful to D. Kramerov of the Russian Academy of Sciences, and A. Shahin of the Minia University of Egypt for donating DNA and tissue samples. We would like to thank F. Jenkins, L. Flynn and C. Sullivan for comments. We thank the Museum of Comparative Zoology (MCZ), Harvard University, the Museum of Vertebrate Zoology, University of California at Berkeley, the American Museum of Natural History, New York, and the National Museum of Natural History, Smithsonian Institution for the loan of tissues.
Conceived and designed the experiments: SW SE JM. Performed the experiments: SW. Analyzed the data: SW JM CO. Contributed reagents/materials/analysis tools: WW FZ JY XN JS. Wrote the paper: SW CO.
- 1. Bromham L, Penny D (2003) The modern molecular clock. Nat Rev Genet 4: 216–224. doi: 10.1038/nrg1020
- 2. Smith AB, Peterson KL (2002) Dating the time of origin of major clades: Molecular clocks and the fossil record. Ann Rev Earth Planet Sci 30: 65–88.
- 3. Douzery EJP, Delsuc F, Stanhope MJ, Huchon D (2003) Local molecular clocks in three nuclear genes: divergence times for rodents and other mammals and incompatibility among fossil calibrations. J Mol Evol 57: S201–S213. doi: 10.1007/s00239-003-0028-x
- 4. Kitazoe Y, Kishino H, Waddell PJ, Nakajima N, Okabayashi T, et al. (2007) Robust time estimation reconciles views of the antiquity of placental mammals. PLoS ONE 2: e384. doi: 10.1371/journal.pone.0000384
- 5. Hallstrom BM, Janke A (2010) Mammalian evolution may not be strictly bifurcating. Mol Biol Evol 27: 2804–2816. doi: 10.1093/molbev/msq166
- 6. dos Reis M, Inoue J, Hasegawa M, Asher RJ, Donoghue MJ, et al. (2012) Phylogenomic datasets provide both precision and accuracy in estimating the timescale of placental mammal phylogeny. Proc R Soc B doi:10.1098/rspb.2012.0683.
- 7. Wilson DE, Reeder DM (2005) Mammal Species of the World: A Taxonomic and Geographic Reference. Baltimore, MD: Johns Hopkins University Press.
- 8. Dawson MR, Beard CK (1996) New Late Paleocene rodents (Mammalia) from Big Multi Quarry, Washakie Basin, Wyoming. Palaeovertebrata 25: 301–321. doi: 10.2992/007.080.0204
- 9. Meng J, Hu Y-M, Li C-K (2003) The osteology of Rhombomylus (Mammalia, Glires): Implications for phylogeny and evolution of Glires. Bull Am Mus Nat Hist 275: 1–247. doi: 10.1206/0003-0090(2003)275<0001:toormg>2.0.co;2
- 10. Huchon D, Catzeflis FM, Douzery EJP (2000) Variance of molecular datings, evolution of rodents and the phylogenetic affinities between Ctenodactylidae and Hystricognathi. Proc R Soc Lond B 267: 393–402. doi: 10.1098/rspb.2000.1014
- 11. Huchon D, Chevret P, Jordan U, Kilpatrick CW, Ranwez V, et al. (2007) Multiple molecular evidences for a living mammalian fossil. Proc Natl Acad Sci 104: 7495–7499. doi: 10.1073/pnas.0701289104
- 12. Springer MS, Murphy WJ, Eizirik E, O'Brien SJ (2003) Placental mammal diversification and the Cretaceous-Tertiary boundary. Proc Natl Acad Sci 100: 1056–1061. doi: 10.1073/pnas.0334222100
- 13. Kumar S, Hedges SB (1998) A molecular timescale for vertebrate evolution. Nature 392: 917–920. doi: 10.1038/31927
- 14. Bininda-Emonds ORP, Cardillo M, Jones KE, MacPhee RDE, Beck RMD, et al. (2007) The delayed rise of present-day mammals. Nature 446: 507–512. doi: 10.1038/nature05634
- 15. Meredith RW, Janecka JE, Gatesy J, Ryder OA, Fisher CA, et al. (2011) Impacts of the Cretaceous terrestrial revolution and KPg extinction on mammal diversification. Science 334: 521–524. doi: 10.1126/science.1211028
- 16. Adkins RM, Gelke EL, Rowe D, Honeycutt RL (2001) Molecular phylogeny and divergence time estimates for major rodent groups: Evidence from multiple genes. Mol Biol Evol 18: 777–791. doi: 10.1093/oxfordjournals.molbev.a003860
- 17. Rutschmann F (2006) Molecular dating of phylogenetic trees: a brief review of current methods that estimate divergence times. Divers Distrib 12: 35–48. doi: 10.1111/j.1366-9516.2006.00210.x
- 18. Li Q, Zheng S-H (2005) Note on four species of dipodids (Dipodidae, Rodentia) from the Late Miocene Bahe Formation, Lantian, Shaanxi. Vert PalAsiat 43: 283–296.
- 19. Li Q, Meng J (2010) Erlianomys combinatus, a primitive myodont rodent from the Eocene Arshanto Formation, Nuhetingboerhe, Nei Mongol, China. Vert PalAsiat 48: 133–144.
- 20. Zhang Q, Xia L, Kimura Y, Shenbrot G, Zhang Z-Q, et al. (2012) Tracing the origin and diversification of Dipodoidea (Order: Rodentia): Evidence from fossil record and molecular phylogeny. Evol Biol doi: 10.1007/s11692-012-9167-6.
- 21. Tong Y-S (1997) Middle Eocene small mammals from Liguanqiao Basin of Henan Province and Yuanqu Basin of Shanxi Province, central China. Paleont Sinica, N S C 26.
- 22. Tong Y-S (1992) Pappocricetodon, a pre-Oligocene cricetid genus (Rodentia) from central China. Vert PalAsiat 30: 1–16.
- 23. Jacobs LJ, Flynn LJ (2005) Of mice again: the Siwalik rodent record, murine distribution, and molecular clocks. In: Lieberman DE, Smith RJ, Kelley SJ, editors. Interpreting the past: Essays on human, primate, and mammal evolution in honor of David Pilbeam. Boston, MA: Brill Academic Publishers. pp. 63–80.
- 24. Walton AH (1997) Rodents. In: Kay RF, Madden RH, Cifelli RL, Flynn JJ, editors. Vertebrate paleontology in the Neotropics: The Miocene Fauna of La Venta, Colombia. Washington, DC: Smithsonian Institution Press. pp. 392–409.
- 25. Wyss AR, Flynn JJ, Norell MA, Swisher CC, Charrier R, et al. (1993) South America's earliest rodent and recognition of a new interval of mammalian evolution. Nature 365: 434–437. doi: 10.1038/365434a0
- 26. Luo Z-X, Crompton AW, Sun A-L (2001) A New Mammaliaform from the Early Jurassic and Evolution of Mammalian Characteristics. Science 292: 1535–1540. doi: 10.1126/science.1058476
- 27. Luo Z-X, Yuan C-X, Meng Q-J, Ji Q (2011) A Jurassic eutherian mammal and the divergence of marsupials and placentals. Nature 476: 442–445. doi: 10.1038/nature10291
- 28. Tomiya S (2011) A new basal Caniform (Mammalia: Carnivora) from the middle Eocene of North America and remarks on the phylogeny of early carnivorans. PLoS ONE 6: e24146. doi: 10.1371/journal.pone.0024146
- 29. Drummond AJ, Rambaut A (2007) BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol 7: 214. doi: 10.1186/1471-2148-7-214
- 30. Ronquist F, Huelsenbeck JP (2003) Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572–1574. doi: 10.1093/bioinformatics/btg180
- 31. Guindon S, Gascuel O (2003) A simple, fast and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52: 696–704.
- 32. Sanderson MJ (2003) r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics 19: 301–302. doi: 10.1093/bioinformatics/19.2.301
- 33. Asher RJ, Meng J, Wible JR, McKenna MC, Rougier GW, et al. (2005) Stem Lagomorpha and the Antiquity of Glires. Science 307: 1091–1094. doi: 10.1126/science.1107808
- 34. Li C-K, Chow M-C (1994) The origin of rodents. In: Tomida Y, editor. Rodent and lagomorph families of Asian origins and diversification. Tokyo, Japan: National Science Museum.
- 35. Hartenberger JL (1982) A review of the Eocene rodents of Pakistan. Univ Mich Mus Paleo Contr 26: 19–35.
- 36. Sallam HM, Seiffert ER, Steiper ME, Simons EL (2009) Fossil and molecular evidence constrain scenarios for the early evolutionary and biogeographic history of hystricognathous rodents. Proc Natl Acad Sci 106: 16722–16727. doi: 10.1073/pnas.0908702106
- 37. Fitch WM, Beintema JJ (1990) Correcting parsimonious trees for unseen nucleotide substitutions: The effect of dense branching as exemplified by ribonuclease. Mol Biol Evol 7: 438–443.
- 38. Fitch WM, Bruschi M (1987) The evolution of prokaryotic ferredoxins-with a general method correcting for unobserved substitutions in less branched lineages. Mol Biol Evol 4: 381–394.
- 39. Venditti C, Meade A, Pagel M (2006) Detecting the node-density artifact in phylogeny reconstruction. Syst Biol 55: 637–643.
- 40. Ji Q, Luo ZX, Yuan CX, Wible JR, Zhang JP, et al. (2002) The earliest known eutherian mammal. Nature 416: 816–822. doi: 10.1038/416816a
- 41. Kielan-Jaworowska Z, Cifelli RL, Luo ZX (2004) Mammals from the age of dinosaurs: origin, evolution, and structure. New York: Columbia University Press.
- 42. Marivaux L, Vianey-Liaud M, Jaeger J (2004) High-level phylogeny of early Tertiary rodents: dental evidence. Zool J Linnean Soc 142: 105–134. doi: 10.1111/j.1096-3642.2004.00131.x
- 43. Huchon D, Madsen O, Sibbald MJJB, Ament K, Stanhope MJ, et al. (2002) Rodent phylogeny and a timescale for the evolution of Glires: Evidence from an extensive taxon sampling using three nuclear genes. Mol Biol Evol 19: 1053–1065. doi: 10.1093/oxfordjournals.molbev.a004164
- 44. Blanga-Kanfi S, Miranda H, Penn O, Pupko T, DeBry RW, et al. (2009) Rodent phylogeny revised: analysis of six nuclear genes from all major rodent clades. BMC Evol Biol 9: 71. doi: 10.1186/1471-2148-9-71
- 45. Wible JR, Rougier GW, Novacek MJ, Asher RJ (2007) Cretaceous eutherians and Laurasian origin for placental mammals near the K/T boundary. Nature 447: 1003–1006. doi: 10.1038/nature05854
- 46. Cartwright P, Collins A (2007) Fossil and phylogenies: integrating multiple lines of evidence to investigate the origin of early major metazoan lineages. Integr Comp Biol 47: 744–751. doi: 10.1093/icb/icm071
- 47. Maddison DR, Maddison WP (2003) MacClade 4: analysis of phylogeny and character evolution, version 4.06. Sunderland, MA: Sinauer Associates.
- 48. Posada D, Crandall KA (1998) MODELTEST: testing the model of DNA substitution. Bioinformatics 14: 817. doi: 10.1093/bioinformatics/14.9.817
- 49. Ronquist F, Huelsenbeck JP (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572. doi: 10.1093/bioinformatics/btg180
- 50. Drummond AJ, Ho SYW, Phillips MJ, Rambaut A (2006) Relaxed phylogenetics and dating with confidence. PLoS Biol 4: e88. doi: 10.1371/journal.pbio.0040088
- 51. Sun B, Yue L-P, Wang Y-Q, Meng J, Wang J-Q, et al. (2009) Magnetostratigraphy of the Early Paleogene in the Erlian Basin. J Stratigraphy 33: 62–68.
- 52. Sun J-M, Ye J, Wu W-Y, Ni X-J, Bi S-D, et al. (2010) Late Oligocene-Miocene mid-latitude aridification and wind patterns in Asian interior. Geology 38: 515–518. doi: 10.1130/g30776.1
- 53. Meng J, Ye J, Wu W-Y, Ni X-J, Bi S-D (2008) The Neogene Dingshanyanchi Formation in northern Junggar basin of Xinjiang and its stratigraphic implications. Vertebrata PalAsiat 46: 90–110.
- 54. Qiu Z-D (1996) Middle Miocene micromammalian fauna from the Tunggur, Nei Mongol. Beijing: Science Press.
- 55. Wu W-Y, Meng J, Ye J, Ni X-J, Bi S-D, et al. (2009) The Miocene mammals from Dingshanyanchi Formation of northern Junggar Basin, Xinjiang. Vertebrata PalAsiat 47: 208.
- 56. Zazhigin VS, Lopatin AV (2000) The history of the Dipodoidea (Rodentia, Mammalia) in the Miocene of Asia: 3. Allactaginae. Paleont J 34: 553–565.