Whole genome-wide chromosome fusion and new gene birth in the Monopterus albus genome
Cell & Bioscience volume 10, Article number: 67 (2020)
Teleost fishes account for over half of extant vertebrate species. A core question in biology is how genomic changes drive phenotypic diversity that relates to the origin of teleost fishes.
Here, we used comparative genomic analyses with chromosome assemblies of diverse lineages of vertebrates and reconstructed an ancestral vertebrate genome, which revealed phylogenomic trajectories in vertebrates. We found that the whole-genome-wide chromosome fission/fusions took place in the Monopterus albus lineage after the 3-round whole-genome duplication. Four times of genomic fission/fusions events resulted in the whole genome-wide chromosome fusions in the genomic history of the lineage. In addition, abundant recently evolved new genes for reproduction emerged in the Monopterus albus after separated from medaka. Notably, we described evolutionary trajectories of conserved blocks related to sex determination genes in teleosts.
These data pave the way for a better understanding of genomic evolution in extant teleosts.
Fish is an enormously species-rich group in vertebrates. Total number of fish species is over 35,200, and approximately 2000 new fish species have been named in the past 5 years . On the earth, fishes account for more than half of extant vertebrate species, and show a rich evolutionary diversity . Compared to land vertebrates, fishes display remarkable variations in morphological and physiological adaptations, which is one of successful group of vertebrates evolutionarily. Fishery is an important part in the economy of many nations. Fish not only provides food for people, but also has immense values to humans, such as in recreations and sports. With in-deep study, some have also become model species in biology, ecology, medicine, and fishery, such as zebrafish and medaka [3, 4].
Although belonging to different orders, zebrafish (Cypriniformes) and medaka (Beloniformes) show their respective advantages as vertebrate model organisms for biology research. Several other fish species have distinct features in evolution. For example, pufferfish Takifugu rubripes (Tetraodontiformes) is a model organism owing to a particularly small and compact genome of 400 Mb , and stickleback fish (Perciformes) has been widely used to study adaptive evolution, because of its plasticity to new niche by adaptive radiation . The teleost Monopterus albus is a new model species for evolution, genetics and development . The Monopterus albus has an unusual reproductive strategy, known as protogynous hermaphroditism; it begins life as a female, but transform to a male through an intersex stage naturally. This reproductive advantage ensures successful establishment of new colonies when isolated small populations appear with extremely biased sex ratios during natural selection. Natural sex reversal in Monopterus albus was first reported in 1944 by Liu . Three years later, Bullough discussed it in Nature . Aerial respiration was a crucial step in the origin of tetrapods. As an air-breather, Monopterus albus is an ideal species for deepening our understanding of vertebrate evolution from survival in the sea to survival on land. Monopterus albus can breathe air, and it is capable of surviving for a long period without water. Its physiological features, including amphibious ability, make the species a successful invader around the globe. It is distributed mainly in China, Japan, Korea, Thailand, Lao, Indonesia, Malaysia, Philippines, India and Australia and United States . In fact, it has the potential for disrupting currently threatened ecosystems . Furthermore, Monopterus albus has the smallest haploid number (2n = 24) of chromosomes among those of most freshwater fishes (2n = 24–446) and the fewest chromosome arms (all telocentric) with no heteromorphic sex chromosome , which is an ideal material for chromosomal evolution studies.
Rapid establishment of sex determination after whole-genome duplication (WGD) is essential for survival of species. Understanding how and why the two sexes arise has been a topic of great interest since Darwin’s time, garnering both theoretical and observational efforts [11, 12]. The sex chromosomes XX/XY in mammals and ZZ/ZW in birds evidently evolved from different pairs of autosomes independently within the last 360 million years [13,14,15,16]. The W chromosome evolved in parallel with the Y chromosome, preserving ancestral genes through purifying selection . Recombination suppression on the chromosomes X/Y and Z/W has resulted in degradation and differentiated sex chromosomes [18,19,20]. For example, the human Y chromosome has retained a few dozen genes with male-specific functions. Notably, several mammals have completely lost their Y chromosome, resulting in XO sex determination systems [21, 22]. The endpoint and fate of Y degradation have sparked intense interest in recent years [20, 23]. These ancient sex chromosomes can provide information about the evolutionary fates of sex chromosomes, but they shed little light on the early stages of sex chromosome evolution in vertebrates . In many vertebrate species, the sex chromosomes are morphologically undifferentiated and remain largely identical [25, 26]. For example, a single missense single nucleotide polymorphism in amhr2 on XY, which emerged at approximately 40 MYA, can determine sex in pufferfish (Takifugu rubripes) . However, the chromosome evolutionary mechanisms underlying sex determination remain poorly understood, particularly in teleosts, which represent half of all living vertebrate species.
Recently, we sequenced and assembled the whole genome of the teleost Monopterus albus at chromosome level . Given the smallest chromosome number among teleost fishes, it is interesting to see how third whole-genome duplication (the 3R WGD) occurred in the Monopterus albus lineage, because 3R WGD occurred in the other teleost lineages after divergence from the Holostei [4, 29,30,31,32]. Taking advantage of comparative genomics and the unique genome structure of Monopterus albus, we described the phylogenomic events and evolutionary history of Monopterus albus, focusing on the genomic scenario behind the 3R WGD. By reconstructing an ancestral vertebrate genome using available chromosomal assemblies of related vertebrates along with the assembly from the Monopterus albus chromosomes, we systematically analyzed genomic events from 2R to 3R to post-WGD, and provided genomic history of teleost fishes based on striking genomic features. These analyses revealed the whole-genome-wide chromosome fission/fusion events and abundant recently evolved new genes for testis development in the Monopterus albus lineage after separated from medaka ~ 70 MYA. In addition, we described evolutionary trajectories of conserved blocks related to sex determination genes in teleosts and three independent origins of corresponding loci/chromosomes in vertebrates.
Materials and methods
Analysis of gene duplication in the Monopterus albus genome
Duplicated genes in Monopterus albus were analyzed by alignment of the protein sequences within the genome by Blastp (E value < 10−7). The Monopterus albus whole-genome data were uploaded from GenBank under the accession AONE00000000 . Protein sequence pairs with identity and coverage of over 60% were selected as candidate gene pairs. To confirm real gene duplications, the paired sequences were blasted against the NCBI non-redundant protein database to confirm that they were the same genes. The filtered gene pairs were mapped onto the genome with the program Circos (version 0.69) .
Ancestral genome reconstruction and chromosome evolution
Models of vertebrate genome evolution were deduced from the phylogenetic tree [31, 34]. Datasets of gene annotations and protein sequences of each species were obtained from Ensembl (release 95), including those of medaka, Tetraodon, stickleback, zebrafish, tongue sole, spotted gar, Xenopus, lizard, snake, chicken and human. All the protein sequences were aligned to the protein databases of Monopterus albus and other genomes using Blastp with E value < 10−7, and lower quality datasets (identity < 30%) were filtered. The filtered genes were mapped onto the genome with the program Circos (version 0.69). MCScanX  was used to find conserved syntenic blocks (≥ 5 genes, E value < 10−5) between species. Insertions and deletions of other genes (1/25, 1 insertion/deletion in a block of 25 genes) were allowed within the blocks, based on adequate information for comparison and block conservation among species. Chromosome synteny analysis using the data from Ensembl was performed using the JCVI package (https://doi.org/10.5281/zenodo.31631), with a parameter “–minspan = 5”. A chromosome model of the vertebrate ancestor was constructed with the conserved syntenic blocks shared by all genomes. The numbers of genes in the blocks between Monopterus albus and pre-3R WGD species were 3983 (chicken), 2448 (human) and 5680 (spotted gar), which were approximately half those of the 3R WGD fishes (8256 in zebrafish, 7791 in Tetraodon, 9575 in medaka and 10,297 in stickleback). The final ancestral genome model was inferred from all the species genomes by the maximum parsimony method based on both duplicated genes and conserved syntenic blocks. The evolutionary relationship of the chromosomal blocks from the ancestral genome to the different species genomes was analyzed and displayed by a Perl script.
Ka and Ks value calculations
Nonsynonymous substitution (Ka) and synonymous substitution (Ks) values were calculated based on alignments of the coding regions of paired genes between two species. CDS sequences without untranslated regions of two genes were extracted from the Ensembl and the Monopterus albus CDS databases and then aligned by ClustalX 2.0  with the default parameters. The alignments were sent to MEGA 6, and Ka and Ks values were calculated using the Nei-Gojobori method with Kimura’s two-parameter model .
Gene coverage and expression levels
Gene expression levels were calculated by the ‘reads per kb per million reads’ (RPKM) method  to eliminate the influence of sequencing discrepancies and differences in gene length. Therefore, the gene expression levels were directly comparable among different tissue samples. When a gene had more than one transcript, the longest transcript was used to calculate its coverage and expression level. Total transcriptome reads, which were transformed into expression levels, were mapped onto the contig assembly using the program TopHat  (version 1.3.3). The Monopterus albus transcriptome data were uploaded from Gene Expression Omnibus GSE43649 .
Analysis of new genes
The pipeline for the identification of new genes was divided into two parts (Additional file 1: Fig. S1), based on the methods described previously . First, we used 20,456 Monopterus albus protein sequences (predicted by FGENESH and GENSCAN)  to search against vertebrate protein sequences (1,562,657) and invertebrate protein sequences (3,715,843) by Blastp, and 4,221 genes were identified that did not have any homologous protein sequences in any organism. After we excluding the genes that were either too short (180 bp) or lacked start and stop codons, 2888 genes were retained as candidate orphan genes in Monopterus albus based on definition of no homologues in closely related lineages. RNA-seq datasets were used to confirm that 1533 genes were new protein-coding orphan genes. Second, 24,056 protein sequences in Monopterus albus are self-searched by Blastp, and candidate pairs were identified. These genes are compared with coding genes of the related fish species (Tilapia, Medaka, Stickleback, Tetraodon and Zebrafish), and then those genes with no homologue among these species were defined as duplication new genes. By analysis of differences in exon number between parental genes (multiple exons) and candidate new genes (1 exon), the corresponding genes with one exon were defined as new genes arising from retroposition. Relative expression levels of the new genes of Monopterus albus were calculated using log2(RPKM +1). The statistical hypothesis was tested using the Mann–Whitney U test and Kruskal–Wallis test in the R language. The chromosome distributions of new genes, orphan genes and other genes were statistically tested by χ2 tests.
Evolutionary trajectory of the chromosomes in fishes
To investigate the evolutionary trajectory of the chromosomes in fishes with addition of the Monopterus albus chromosome data based on phylogenetic relationship, we first confirmed 3R WGD occurred in the Monopterus albus lineage. Circos mapping showed that many duplicated genes existed in pairs among chromosomes (Additional file 1: Fig. S2), suggesting that the 3R WGD had occurred in the Monopterus albus genome as the other teleost fishes did. Comparative mapping of conserved syntenic blocks (CSB) among diverse vertebrate genomes was used to trace the origin and evolution of the chromosomes (Additional file 1: Fig. S3). The evolutionary distribution of the CSBs among teleosts (Monopterus albus, medaka, stickleback, Tetraodon and zebrafish), Holostei (spotted gar), birds (chicken) and mammals (human) was determined according to phylogenetic tree reconstruction, and we identified the evolutionary relationship of these diverse vertebrate genomes based on the 12-chromosome model using a common-ancestor gene set of the ancestral vertebrate genome (proto-chromosome A to L) [31, 34]. Among the 2R species without the 3R WGD, the tetrapod (chicken and human) and Holostei (spotted gar) genomes have undergone extensive chromosomal fission and block recombination from the ancestral vertebrate genome (Fig. 1). Furthermore, these 2R vertebrates retained smaller CSBs (≤ 14 genes/block) than 3R teleosts (Fig. 1; Additional file 1: Fig. S4). The 3R WGD took place in the teleost lineage after divergence from the Holostei, approximately 298.2 MYA, and caused the number of chromosomes to double in the teleost ancestor. After the 3R WGD, the teleosts underwent broadly genomic events, including extensive gene/region loss, block recombination and chromosomal fission/fusion (Fig. 1). For example, the chromosomal number in Monopterus albus decreased from n = 24 to n = 12. In addition, zebrafish diverged from other fishes ~ 160 MYA and had a relatively small CSB ranging from 10 to 26 genes/block, whereas the other teleosts with a short history (< 90 MYA) have retained a large CSB with > 65 genes/block (Additional file 1: Fig. S4), indicating that early speciation accumulated genomic variations during evolution, while these CSBs have homoplasic characters in the lately diverged fish clades in particular.
From partial to whole genome-wide chromosome fusion in fishes
Chromosome number has halved during the evolution of Monopterus albus, which is attributable to either chromosome loss or fusion. Comparative mapping of the CSBs clearly indicated that chromosomal fission/fusions mainly occurred in the fish lineages after separated from zebrafish (Fig. 1). Major events of fission/fusions were involved in large chromosomal regions, even in whole chromosomes from ancestors, while minor events took place, often in fusion of small chromosomal fragments. Notably, the Monopterus albus genome, compared to those of its relatives, underwent whole-genome-wide chromosomal fission/fusions (WGCF), in addition to rearrangement events involving syntenic blocks, after diverging from medaka ~ 70 MYA (Figs. 1, 2a).
After the 3R WGD, 24 ancestral chromosomes in Teleostei (A1, A2, B1, B2,… L1, L2) underwent block recombination, chromosomal loss (e.g., L2), fission (e.g., A1, A2, C2 and G2) and fusion events (Fig. 2a). For example, a Monopterus albus -specific fusion event occurred at approximately 70-75 MYA to form chromosomes 2 and 4. Three major block recombination events (C2 to F2, H1 to E1, and I2 to F2) also occurred on the chromosomes 2 and 4 (Fig. 2a). To confirm the chronological order of these events, we used the Ks value (synonymous substitutions per synonymous site) to measure evolutionary time. The calculated Ks values based on CDS alignments between Monopterus albus and close relatives are consistent with the evolutionary times of species divergence. Thus, these cross-species comparisons indicated four evolutionary stages of the formation in the Monopterus albus genome (Fig. 2b). Stage 1, the first in chronological order, involved the formation of 24 chromosomes through the 3R WGD, approximately 160–298 MYA; stage 2 included the first round of chromosomal fusion events on chromosomes 3 and 10, approximately 86–160 MYA; stage 3 was a chromosomal fusion event on chromosome 9, approximately 75–86 MYA; and stage 4, the most recent, which occurred ~ 70–75 MYA, included the formation of all other chromosomes, e.g. chromosomes 2 and 4, through chromosomal fusion and block recombination events. Thus, after the 3R WGD approximately 300 MYA to the genome ~ 70 MYA, the genomic history of Monopterus albus through WGCF, genomic reorganization and diploidization spanned approximately 230 million years.
Evolutionary trajectories of conserved blocks related to sex determination genes
Comparative mapping of the CSB and chromosome evolution facilitates to trace formation trajectory of sex-associated blocks/chromosomes, and gets insights into sex determination in teleosts in particular. DMRT1 on the Z chromosome is a male-determining gene in chicken . To trace evolutionary trajectory of the bird-Z chromosome, comparative mapping of the CSB was used among genomes in far-distant vertebrates. Genomic analysis indicated that the syntenic Z blocks were shared among diverse vertebrates from birds, reptiles, amphibians to teleost fishes, which is consistent with the conserved dmrt1 on their corresponding Z-associated chromosomes (Fig. 3a, b). Notably, these Z-associated chromosomes were originated from common ancestor chromosome E (Figs. 1, 3b). After 3R WGD, the chromosome E duplicated and diploidized into 2 chromosomes in many teleosts, whereas it has evolved and fragmented in zebrafish genome. In particular, the duplicated E chromosomes fused recently with two other chromosomes (G and F) to form chromosomes 2 and 4 around ~ 70.3 MYA, which carry the highest number of chicken Z genes (227 and 183) in the Monopterus albus genome (Additional file 1: Figs. S5, S6; Table S1). Both birds and the fish Tongue sole have ZZ/ZW sex determination system with the DMRT1 on its Z chromosome . Male-determining gene dmy in Medaka was duplicated from dmrt1 during evolution [43, 44]. However, the other vertebrates retained the chromosomal region with dmrt1 on their autosomes, including chromosome 2 in snake, chromosome 1 in Xenopus, chromosome 5 in zebrafish, and chromosome 2 in Monopterus albus. These results suggested that the conserved syntenic block with dmrt1 retained male-determination from ancestor chromosome after 2R WGD, regardless of the Z or autosomes the block located on approximately ~ 500 MYA.
In mammals, male-determining gene is SRY on the Y chromosome, which evolved from SOX3 on the chromosome X . Evolutionary trajectory of chromosome X indicated that the X was originated from ancestor chromosome G, and the corresponding CSBs were conserved on chromosome X in mammals, but on autosomes in chicken (chromosome 4), zebrafish (chromosomes 5, 10, 14 and 15), stickleback (chromosomes 4 and 7), medaka (chromosomes 10 and 14), and Monopterus albus (chromosomes 7 and 9). Moreover, the corresponding chromosomes in non-mammals retained SOX3, not SRY (Fig. 3c, d).
A great challenge is to search female-determining chromosomes/genes in teleosts in particular. By means of the comparative mapping of conserved blocks, we found that 3R WGD generated a pair of common ancestor chromosome F in the teleost lineage ~ 300 MYA. The ancestor chromosome F is evolutionarily conserved in teleosts, and has evolved into chromosomes 16 in zebrafish, 8 in Tetraodon, 20 in stickleback, 16 in medaka, and 4 in Monopterus albus with Rspo1, a key factor involved in ovary development (Fig. 3e, f). To investigate functions of the descendant chromosomes of the ancestor chromosome F, we analyzed Gene ontology (GO) of the genes shared among these chromosomes in zebrafish, medaka, stickleback, Tetraodon and Monopterus albus, and observed most of these shared genes are enriched in regulation of cell adhesion, tissue morphogenesis and Wnt signaling pathway in biological processes, and nucleoside binding and protein phosphorylation in molecular function (Additional file 1: Fig. S7). Interestingly, the key factors of the Rspo1/Wnt signaling pathway, including Rspo1, Wnt4b, Ctnnb1 and Gsk3a/b, were associated with ovary development (Fig. 3g), which is consistent with functional data [46, 47]. This finding suggests that conserved blocks from the common chromosome F retained genes for female sex determination in the linages of teleosts. Thus, for uniformity, we call these descendants from the common ancestor chromosome F as the Fs (female sex-determinant F) in extant teleosts.
Newly evolved genes in the Monopterus albus
Because the sex determination mechanism in Monopterus albus evolved after its divergence from related species, e.g., medaka and stickleback, we investigated recently evolved new genes that became fixed in Monopterus albus less than 70 MYA, to detect evolutionary innovation that may illuminate how Monopterus albus evolved. In total, we identified 94 new genes from recent gene duplications, 2 new genes from retroposition at a threshold of 40%, and 1533 orphan genes (Additional file 1: Figs. S1, S8, Table S2). The generation mechanisms of DNA-based duplications, RNA-based duplications and orphan genes are quite different. The new gene monal_003703 on chromosome 5 was generated by DNA-based duplication from its parental gene alkbh6 on chromosome 6 (Fig. 4a, b). Both genes shared 100% of amino acid identity. RNA-based duplication produced the new gene monal_008061 with one exon by retroposition from its parental gene rsu1 with eight exons (Fig. 4c, d), and they shared 92.8% of amino acid identity. Syntenic alignments showed that these new genes emerged only in the Monopterus albus lineage. Orphan genes have no homologous sequences in the closely related species medaka, tilapia, stickleback, Tetraodon and zebrafish (Additional file 1: Fig. S9).
To test whether specific chromosomes are associated with certain detectable patterns of new gene distribution, we mapped these genes to the chromosomes (Additional file 1: Table S3). We found that there was a significantly biased distribution of these new genes, mainly on chromosomes 1, 2, 3 and 4 (χ2 test, p < 0.01), whereas a biased distribution of orphan genes was also observed on all 12 chromosomes (χ2 test, p < 0.05) (Fig. 5a).
We examined the expression levels of new genes, parental genes and orphan genes using the RNA-seq data (Gene Expression Omnibus GSE43649) . A comparison of expression levels among the three stages of gonad differentiation revealed an interesting pattern. In general, we found that the expression levels of new genes and orphan genes were lower than those in the overall distribution of all other genes in gonads, and the expression distributions of new genes showed significantly higher levels for the testis than for the ovary or the ovotestis (p < 0.01) (Fig. 5b, c). A significant difference was found in orphan genes, which showed an even stronger trend similar to that found in the new genes. The average expression in the testis was higher than that in the ovary (p < 0.001), whereas that in the ovary was significantly different from that in the ovotestis (p < 0.01). Overall, newly evolved genes showed significantly elevated expression in the testis. These abundant new genes with high expression in testis emerged in the Monopterus albus lineage after split from medaka ~ 70 MYA.
It is becoming increasingly evident that the 3R WGD occurred in the common ancestor of all extant teleosts [4, 29,30,31,32]. Afterwards, several teleost linages probably underwent up to six WGD events, particularly in the carp linages . For example, 4R WGD events have been detected in salmonids ~ 80 MYA  and the common carp ~ 8.2 MYA . These WGD events could have shaped the history of evolutionary lineages of teleosts, because they create genetic diversity and innovation for adaptive radiation. These WGDs and subsequent diploidization events have tremendous impacts on speciation. Taking advantage of comparative genomics and the genome structure of Monopterus albus, here we describe the post-WGD genomic events, evolutionary mechanisms through the whole-genome-wide chromosome fission/fusions, and new gene generations in the Monopterus albus lineage. We also reveal evolutionary trajectories of conserved blocks related to sex-determining genes in teleosts.
Genomic variation accumulation can shape the evolutionary direction of lineages under positive selection pressure. During the 2R WGD, the tetrapod (chicken and human) and Holostei (spotted gar) genomes experienced extensive chromosomal fission and block recombination from the ancestral vertebrate genome. The shuffle effect of CSB could be a major cause of block/gene loss after WGD. Moreover, due to accumulation of genomic recombination events, earlier diverged 2R vertebrates retained small CSBs in comparison with 3R teleosts. The same situation applies to 3R teleosts. Zebrafish diverged from other fishes ~ 160 MYA and had smaller CSBs than other teleosts with a short history (< 90 MYA). The variation accumulation is probably different from the mutation accumulation through Muller’s Ratchet observed in asexual lineages , as the latter effect often contributes to the extinction of lineages. Instead, the variation accumulation of post-WGD reflects sustaining evolution. In fact, the accumulation of genetic mutations in human adult stem cells was observed in a tissue-specific manner in common mutational processes during life , which supports that variation accumulation shapes the direction of lineages during evolution in vertebrates.
The whole-genome-wide chromosome fusion is of particular interest in evolution. The WGCF occurred in the Monopterus albus lineage after the 3R WGD. The Monopterus albus is distributed worldwide in tropical and subtropical rivers and can naturally change its sex from female to male during its life [7, 8]. The Monopterus albus can breathe air, and it is capable of surviving for long periods without water. Its physiological features, including its amphibious abilities, make the species a successful invader around the globe. The WGCF contributes to lower chromosome number among other fish species and speciation through all chromosome fusion. Moreover, a recent finding showed that chromosome fusion events involved another 5 chromosomes that most likely occurred in this species recently, leading to a reduction in chromosome number from 2n = 24 to 2n = 18 . These two types of genetically distinct groups live together in central Thailand. Therefore, sustaining evolution through chromosome fusion events still takes place in the Monopterus albus lineage. Intriguingly, chromosomal fission and fusion events were observed in the astronaut after NASA’s 1-year spaceflight . The astronaut postflight was accompanied by cognitive decline, which is often observed in many chromosome syndromes in humans . Chromosome fission/fusion events are probably a tremendous risk of species variation and new speciation.
Evolution of sex determination has been a topic of great interest since Darwin’s time. Our genomic and evolutionary analysis of post-WGD reveal an evolutionary trajectory of sex determination in fish and highlight a role for the teleost Fs chromosomes in female sex determination. Although sex determination systems are variable in vertebrates, there are common genetic components, from genes to chromosomes, shared by the varied systems, owing to the convergent evolutionary pressure. This could explain the inexorable evolutionary processes and underlying mechanisms from labile to entrenched sex chromosomes. In the present study, we trace the evolutionary history of the regions syntenic to the avian Z chromosome from ancestral chromosome E, to the mammalian X from ancestral chromosome G, and to the teleost Fs from ancestral chromosome F in vertebrates. These chromosomes have evolved from different ancestral chromosomes, indicating independent origins from the 2R ancestor (Fig. 3h).
The syntenic Sox3 block from ancestral chromosome G is conserved, and Sox3 evolved into the male-determining gene Sry on the Y chromosome only in mammals [45, 55], while Sox3 is essential for ovary development in zebrafish . The syntenic Dmrt1 block retains from ancestral chromosome E in near all vertebrates, and Dmrt1 is a major factor for male development in mammals , birds , reptiles , amphibians  and fishes [60,61,62], regardless of its location on the Z/X/Y or autosomes. The Y-linked dmy originated from a duplicate copy of autosomal dmrt1 and can determine male sex in medaka [43, 44]. Over 300 million years, the common ancestral chromosome F evolved a set of descendant chromosomes among the teleost lineages, for example, chromosomes 16 in zebrafish, 8 in Tetraodon, 20 in stickleback, 16 in medaka, and 4 in Monopterus albus, probably including other teleost fishes, on which Rspo1, a pivotal factor for ovary determination [63, 64], has retained. Intriguingly, they inherited historically key components of the Rspo1/Wnt signaling pathway from ancestor chromosome F, including Rspo1, Wnt4b, Ctnnb1 and Gsk3a/b. The pathway is required for ovary development in vertebrates [46, 47]. Consistent with this, breeding experiments showed a female determination signal on chromosome 16 in zebrafish . Together, these data suggest that Rspo1/Wnt signaling components in the common chromosome F are important for female sex determination in the teleost linages.
Molecular mechanisms that generate new genes mainly include DNA-based duplication, retroposition, and de novo origination. Orphan genes have no homologues in closely related lineages. New genes could contribute to the rapid evolution of the genome under positive selection, which favors the success of genetic innovation for adaptive radiation after the 3R WGD. Nevertheless, these candidate new genes identified in Monopterus albus remain to be functionally confirmed. Candidate orphan genes should be further determined when more genomes have been sequenced in the related species in the Monopterus albus lineage. Genetic studies have shown that the evolution of sex and sex chromosomes impacts the fixation of new genes and their distribution between sex chromosomes and autosomes [65, 66]. Sex-dependent selection has been shown to drive expression divergence between species , differentiation of expression level between the X chromosome and autosomes  and new gene evolution . The fixation force of new genes was tested as positive selection in the early stages of new genes in Drosophila  and further confirmed in individual case analyses [70, 71]. The male expression of new genes may be interpreted in terms of male-driven sexual selection [66, 72, 73]. Monopterus albus showed a remarkable consistency with the prediction of the Darwin-Bateman paradigm and observations in mammals and insects that males evolve rapidly by recruiting abundant new genes that show biased expression in the testis .
We describe phylogenomic events in teleost fishes including Monopterus albus and its relatives after whole-genome duplications, reveal evolutionary mechanisms through the whole-genome-wide chromosome fission/fusions, new gene generations in Monopterus albus, and show evolutionary trajectory of conserved blocks related to sex-determining genes, the chromosome Fs for female sex determination in teleosts in particular. These data pave the way for a better understanding of genomic evolution in extant teleosts.
Availability of data and materials
The Monopterus albus whole-genome data were obtained from GenBank under the accession AONE00000000. Raw transcriptome data were obtained from Gene Expression Omnibus as GSE43649.
- alkbh6 :
alkB homolog 6
- amhr2 :
anti-Mullerian hormone type 2 receptor
- AR :
Conserved syntenic blocks
- Ctnnb1 :
Catenin beta 1
- cyp19a1a :
Cytochrome P450, family 19, subfamily A, polypeptide 1a
Doublesex and mab-3 related transcription factor 1
- dmy :
Doublesex- and mab-3-related transcription factor 1Y-like
Female sex-determinant F
- Gsk3a/b :
Glycogen synthase kinase 3 alpha/beta
Leucine-rich repeat-containing G protein-coupled receptor 4/5/6
Millions of years ago
Ring finger protein 43
Reads per kb per million reads
- rsu1 :
Ras suppressor protein 1
SRY-box transcription factor
Sex determining region Y
Whole-genome-wide chromosome fusion
- Wnt4b :
Wingless-type MMTV integration site family, member 4b
Zinc and ring finger 3
Fricke R, Eschmeyer WN, Fong JD. Species by family/subfamily in Eschmeyer’s Catalog of Fishes. San Francisco: California Academy of Sciences; 2019.
Cheng H, Guo Y, Yu Q, Zhou R. The rice field eel as a model system for vertebrate sexual development. Cytogenet Genome Res. 2003;101(3–4):274–7.
Wittbrodt J, Shima A, Schartl M. Medaka–a model organism from the far East. Nat Rev Genet. 2002;3(1):53–64.
Howe K, Clark MD, Torroja CF, Torrance J, Berthelot C, Muffato M, Collins JE, Humphray S, McLaren K, Matthews L, et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature. 2013;496(7446):498–503.
Brenner S, Elgar G, Sandford R, Macrae A, Venkatesh B, Aparicio S. Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature. 1993;366(6452):265–8.
Marques DA, Jones FC, Di Palma F, Kingsley DM, Reimchen TE. Experimental evidence for rapid genomic adaptation to a new niche in an adaptive radiation. Nat Ecol Evol. 2018;2(7):1128–38.
Liu CK. Rudimentary hermaphroditism in the Symbranchoid eel, Monopterus Javanesis. Sinensia. 1944;15:1–8.
Bullough W. Hermaphroditism in the lower vertebrates. Nature. 1947;160(4053):9.
Collins TM, Trexler JC, Nico LG, Rawlings TA. Genetic diversity in a morphologically conservative invasive taxon: multiple introductions of swamp eels to the southeastern United States. Conserv Biol. 2002;16(4):1024–35.
Zhou R, Cheng H, Tiersch T. Differential genome duplication and fish diversity. Rev Fish Biol Fisher. 2001;11(4):331–7.
Bachtrog D, Mank JE, Peichel CL, Kirkpatrick M, Otto SP, Ashman TL, Hahn MW, Kitano J, Mayrose I, Ming R, et al. Sex determination: why so many ways of doing it? PLoS Biol. 2014;12(7):e1001899.
Bull JJ. Evolution of sex determining mechanisms. Benjamin/Cummings Pub Co. 1983.
Matsubara K, Tarui H, Toriba M, Yamada K, Nishida-Umehara C, Agata K, Matsuda Y. Evidence for different origin of sex chromosomes in snakes, birds, and mammals and step-wise differentiation of snake sex chromosomes. Proc Natl Acad Sci USA. 2006;103(48):18190–5.
Vallender EJ, Lahn BT. Multiple independent origins of sex chromosomes in amniotes. Proc Natl Acad Sci USA. 2006;103(48):18031–2.
Ohno S. Sex chromosomes and sex-linked genes. Berlin: Springer; 1967. p. 192.
Smith JJ, Voss SR. Bird and mammal sex-chromosome orthologs map to the same autosomal region in a salamander (ambystoma). Genetics. 2007;177(1):607–13.
Bellott DW, Skaletsky H, Cho TJ, Brown L, Locke D, Chen N, Galkina S, Pyntikova T, Koutseva N, Graves T, et al. Avian W and mammalian Y chromosomes convergently retained dosage-sensitive regulators. Nat Genet. 2017;49(3):387–94.
Charlesworth B. Model for evolution of Y chromosomes and dosage compensation. Proc Natl Acad Sci USA. 1978;75(11):5618–22.
Graves JA. Did sex chromosome turnover promote divergence of the major mammal groups? BioEssays. 2016;38:734–43.
Bachtrog D. Y-chromosome evolution: emerging insights into processes of Y-chromosome degeneration. Nat Rev Genet. 2013;14(2):113–24.
Just W, Rau W, Vogel W, Akhverdian M, Fredga K, Graves JA, Lyapunova E. Absence of Sry in species of the vole Ellobius. Nat Genet. 1995;11(2):117–8.
Sutou S, Mitsui Y, Tsuchiya K. Sex determination without the Y chromosome in two Japanese rodents Tokudaia osimensis osimensis and Tokudaia osimensis spp. Mamm Genome. 2001;12(1):17–21.
Graves JA. Weird animal genomes and the evolution of vertebrate sex and sex chromosomes. Annu Rev Genet. 2008;42:565–86.
Taravella AM, Sayres MA. Fruitful analysis of sex chromosomes reveals X-treme genetic diversity. Genome Biol. 2016;17(1):244.
Stock M, Horn A, Grossen C, Lindtke D, Sermier R, Betto-Colliard C, Dufresnes C, Bonjour E, Dumas Z, Luquet E, et al. Ever-young sex chromosomes in European tree frogs. PLoS Biol. 2011;9(5):e1001062.
da Silva M, Matoso DA, Vicari MR, de Almeida MC, Margarido VP, Artoni RF. Repetitive DNA and meiotic behavior of sex chromosomes in Gymnotus pantanal (Gymnotiformes, Gymnotidae). Cytogenet Genome Res. 2011;135(2):143–9.
Kamiya T, Kai W, Tasumi S, Oka A, Matsunaga T, Mizuno N, Fujita M, Suetake H, Suzuki S, Hosoya S, et al. A trans-species missense SNP in Amhr2 is associated with sex determination in the tiger pufferfish, Takifugu rubripes (fugu). PLoS Genet. 2012;8(7):e1002798.
Zhao X, Luo M, Li Z, Zhong P, Cheng Y, Lai F, Wang X, Min J, Bai M, Yang Y, et al. Chromosome-scale assembly of the Monopterus genome. Gigascience. 2018;7(5):5.
Braasch I, Gehrke AR, Smith JJ, Kawasaki K, Manousaki T, Pasquier J, Amores A, Desvignes T, Batzel P, Catchen J, et al. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons. Nat Genet. 2016;48(4):427–37.
Kasahara M, Naruse K, Sasaki S, Nakatani Y, Qu W, Ahsan B, Yamada T, Nagayasu Y, Doi K, Kasai Y, et al. The medaka draft genome and insights into vertebrate genome evolution. Nature. 2007;447(7145):714–9.
Jaillon O, Aury JM, Brunet F, Petit JL, Stange-Thomann N, Mauceli E, Bouneau L, Fischer C, Ozouf-Costaz C, Bernot A, et al. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature. 2004;431(7011):946–57.
Postlethwait JH, Woods IG, Ngo-Hazelett P, Yan YL, Kelly PD, Chu F, Huang H, Hill-Force A, Talbot WS. Zebrafish comparative genomics and the origins of vertebrate chromosomes. Genome Res. 2000;10(12):1890–902.
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45.
Naruse K, Tanaka M, Mita K, Shima A, Postlethwait J, Mitani H. A medaka gene map: the trace of ancestral vertebrate proto-chromosomes revealed by comparative gene mapping. Genome Res. 2004;14(5):820–8.
Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee TH, Jin H, Marler B, Guo H, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23(21):2947–8.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Met. 2008;5(7):621–8.
Trapnell C, Pachter L, Salzberg SL. TopHat. discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–11.
Long M, VanKuren NW, Chen S, Vibranovski MD. New gene evolution: little did we know. Annu Rev Genet. 2013;47:307–33.
Smith CA, Roeszler KN, Ohnesorg T, Cummins DM, Farlie PG, Doran TJ, Sinclair AH. The avian Z-linked gene DMRT1 is required for male sex determination in the chicken. Nature. 2009;461(7261):267–71.
Chen S, Zhang G, Shao C, Huang Q, Liu G, Zhang P, Song W, An N, Chalopin D, Volff JN, et al. Whole-genome sequence of a flatfish provides insights into ZW sex chromosome evolution and adaptation to a benthic lifestyle. Nat Genet. 2014;46(3):253–60.
Nanda I, Kondo M, Hornung U, Asakawa S, Winkler C, Shimizu A, Shan Z, Haaf T, Shimizu N, Shima A, et al. A duplicated copy of DMRT1 in the sex-determining region of the Y chromosome of the medaka, Oryzias latipes. Proc Natl Acad Sci USA. 2002;99(18):11778–83.
Matsuda M, Nagahama Y, Shinomiya A, Sato T, Matsuda C, Kobayashi T, Morrey CE, Shibata N, Asakawa S, Shimizu N, et al. DMY is a Y-specific DM-domain gene required for male development in the medaka fish. Nature. 2002;417(6888):559–63.
Foster JW, Graves JAM. An Sry-Related Sequence on the Marsupial X-Chromosome - Implications for the Evolution of the Mammalian Testisdetermining Gene. Proc Natl Acad Sci USA. 1994;91(5):1927–31.
Chassot AA, Gillot I, Chaboissier MC. R-spondin1, WNT4, and the CTNNB1 signaling pathway: strict control over ovarian differentiation. Reproduction. 2014;148(6):R97–110.
Zhou L, Charkraborty T, Zhou Q, Mohapatra S, Nagahama Y, Zhang Y. Rspo1-activated signalling molecules are sufficient to induce ovarian differentiation in XY medaka (Oryzias latipes). Sci Rep. 2016;6:19543.
Lien S, Koop BF, Sandve SR, Miller JR, Kent MP, Nome T, Hvidsten TR, Leong JS, Minkley DR, Zimin A, et al. The Atlantic salmon genome provides insights into rediploidization. Nature. 2016;533(7602):200–5.
Xu P, Zhang X, Wang X, Li J, Liu G, Kuang Y, Xu J, Zheng X, Ren L, Wang G, et al. Genome sequence and genetic diversity of the common carp, Cyprinus carpio. Nat Genet. 2014;46(11):1212–9.
Felsenstein J. The evolutionary advantage of recombination. Genetics. 1974;78(2):737–56.
Blokzijl F, de Ligt J, Jager M, Sasselli V, Roerink S, Sasaki N, Huch M, Boymans S, Kuijk E, Prins P, et al. Tissue-specific mutation accumulation in human adult stem cells during life. Nature. 2016;538(7624):260–4.
Supiwong W, Pinthong K, Seetapan K, Saenjundaeng P, Bertollo LAC, de Oliveira EA, Yano CF, Liehr T, Phimphan S, Tanomtong A, et al. Karyotype diversity and evolutionary trends in the Asian swamp eel Monopterus albus (Synbranchiformes, Synbranchidae): a case of chromosomal speciation? BMC Evol Biol. 2019;19(1):73.
Garrett-Bakelman FE, Darshi M, Green SJ, Gur RC, Lin L, Macias BR, McKenna MJ, Meydan C, Mishra T, Nasrini J, et al. The NASA Twins Study: A multidimensional analysis of a year-long human spaceflight. Science. 2019;364:6436.
Vogel F, Motulsky AG. Human genetics: problems and approaches. Berlin; New York: Spinger-Verlag; 1997.
Sinclair AH, Berta P, Palmer MS, Hawkins JR, Griffiths BL, Smith MJ, Foster JW, Frischauf AM, Lovell-Badge R, Goodfellow PN. A gene from the human sex-determining region encodes a protein with homology to a conserved DNA-binding motif. Nature. 1990;346(6281):240–4.
Hong Q, Li C, Ying R, Lin H, Li J, Zhao Y, Cheng H, Zhou R. Loss-of-function of sox3 causes follicle development retardation and reduces fecundity in zebrafish. Protein Cell. 2019;10(5):347–64.
Raymond CS, Murphy MW, O’Sullivan MG, Bardwell VJ, Zarkower D. Dmrt1, a gene related to worm and fly sexual regulators, is required for mammalian testis differentiation. Genes Dev. 2000;14(20):2587–95.
Ge C, Ye J, Weber C, Sun W, Zhang H, Zhou Y, Cai C, Qian G, Capel B. The histone demethylase KDM6B regulates temperature-dependent sex determination in a turtle species. Science. 2018;360(6389):645–8.
Mawaribuchi S, Musashijima M, Wada M, Izutsu Y, Kurakata E, Park MK, Takamatsu N, Ito M. Molecular Evolution Of Two Distinct dmrt1 promoters for germ and somatic cells in vertebrate gonads. Mol Biol Evol. 2017;34(3):724–33.
Webster KA, Schach U, Ordaz A, Steinfeld JS, Draper BW, Siegfried KR. Dmrt1 is necessary for male sexual development in zebrafish. Dev Biol. 2017;422(1):33–46.
Guo YQ, Cheng HH, Huang X, Gao S, Yu HS, Zhou RJ. Gene structure, multiple alternative splicing, and expression in gonads of zebrafish Dmrt1. Biochem Bioph Res Co. 2005;330(3):950–7.
Huang X, Guo Y, Shui Y, Gao S, Yu H, Cheng H, Zhou R. Multiple alternative splicing and differential expression of dmrt1 during gonad transformation of the rice field eel. Biol Reprod. 2005;73(5):1017–24.
Parma P, Radi O, Vidal V, Chaboissier MC, Dellambra E, Valentini S, Guerra L, Schedl A, Camerino G. R-spondin1 is essential in sex determination, skin differentiation and malignancy. Nat Genet. 2006;38(11):1304–9.
Tomizuka K, Horikoshi K, Kitada R, Sugawara Y, Iba Y, Kojima A, Yoshitome A, Yamawaki K, Amagai M, Inoue A, et al. R-spondin1 plays an essential role in ovarian development through positively regulating Wnt-4 signaling. Hum Mol Genet. 2008;17(9):1278–91.
Charlesworth B, Barton NH. The relative rates of evolution of sex chromosomes and autosomes. Am Nat. 1987;130(1):113–46.
Long M, Vibranovski MD, Zhang YE. Evolutionary interactions between sex chromosomes and autosomes. In: Rama SS, Jianping X, Rob JK, editors. Rapidly evolving genes and genetic systems. New York: Oxford Univ. Press; 2012. p. 101–14.
Ranz JM, Castillo-Davis CI, Meiklejohn CD, Hartl DL. Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science. 2003;300(5626):1742–5.
Parisi M, Nuttall R, Naiman D, Bouffard G, Malley J, Andrews J, Eastman S, Oliver B. Paucity of genes on the Drosophila X chromosome showing male-biased expression. Science. 2003;299(5607):697–700.
Chen S, Zhang YE, Long M. New genes in Drosophila quickly become essential. Science. 2010;330(6011):1682–5.
Schrider DR, Stevens K, Cardeno CM, Langley CH, Hahn MW. Genome-wide analysis of retrogene polymorphisms in Drosophila melanogaster. Genome Res. 2011;21(12):2087–95.
Chen S, Ni X, Krinsky BH, Zhang YE, Vibranovski MD, White KP, Long M. Reshaping of global gene expression networks and sex-biased gene expression by integration of a young gene. EMBO J. 2012;31(12):2798–809.
Bateman AJ. Intra-sexual selection in Drosophila. Heredity (Edinb). 1948;2(Pt. 3):349–68.
Darwin C. The descent of man, and Selection in relation to sex. London: J. Murray; 1871.
Authors thank Professor Jianzhi Zhang of University of Michigan for reading the manuscript.
This work was supported by the National Natural Science Foundation of China (31771370, 31771487, and 31970539).
Ethics approval and consent to participate
Consent for publication
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Cheng, Y., Shang, D., Luo, M. et al. Whole genome-wide chromosome fusion and new gene birth in the Monopterus albus genome. Cell Biosci 10, 67 (2020). https://doi.org/10.1186/s13578-020-00432-0