- Open Access
The Mesobuthus martensii genome reveals the molecular diversity of scorpion toxins
Cell & Biosciencevolume 4, Article number: 1 (2014)
Recent complete sequencing of the genome of the Asian scorpion, Mesobuthus martensii, highlights the molecular diversity of its venom neurotoxin/defensin genes and interesting features of their evolution.
Scorpion is a mysterious organism in the animal kingdom. It is a living fossil , has poisonous venom , and can be fluorescent . Such unique features have increasingly attracted scientists’ attention and interests around the world. One exciting progress in scorpion research is a recent report from Dr. Wenxin Li’s group in China for the complete genome sequencing of the Asian scorpion, Mesobuthus martensii. The M. martensii genome has an estimated size of 1323.73 Mb but interestingly contains predicted 32,016 genes, more than those in human . The analysis by Cao et al. also revealed that M. martensii genome has, in addition to a large gene content, a high level of transposable element accumulation, a divergent evolution of venom neurotoxin/defensin genes, and an expansion of gene families often associated with unique biological features of scorpions .
One important finding from the genome sequence analysis is the molecular diversity of venom neurotoxins . For the first time, Cao et al. systematically analyzed the gene structure and organization of neurotoxins at the genome level. They discovered 116 neurotoxin genes present in the M. martensii genome: 61 NaTx (toxins for sodium channels) genes, 46 KTx (toxins for potassium channels) genes, 5 ClTx (toxins for chloride channels) genes, and 4 CaTx (toxins for ryanodine receptors) genes. Compared to the known neurotoxins previously identified through cDNA cloning and biochemical purification, most of the 45 newly discovered neurotoxin genes belong to the NaTx and KTx families. Their analysis thus provided a different but complete picture of the molecular diversity of neurotoxins in M. martensii.
Earlier transcriptome analyses of scorpion venomous gland by Li’s group provided evidence that the scorpion venom has a large variety of biologically active peptides. The Li team began the scorpion research in 1990s by first constructing a cDNA library for the scorpion M. martensii in order to isolate and characterize new toxin genes . Later, Ma et al. carried out a transcriptomic analysis of the venom gland of the scorpion Scorpiops jendeki. Their work revealed that the venom of Scorpiops jendeki has more than 10 known types and 9 atypical types of peptides/proteins . Subsequently, in 2010, they used both transcriptomic and proteomic analyses to determine the toxin content of the venom of the scorpion Heterometrus petersii. In the same year, the Li group published a comparative analysis of the venom transcriptome of the scorpion Lychas mucronatus from two different geographical regions in China: one region in Hainan province and the other region in Yunnan province . Interestingly, this study identified a large number of new venom molecules and also revealed that venom peptides/proteins of the same scorpion species from different geographical regions are highly diversified. These findings suggest the possibility that scorpions evolve in order to adapt to new environment by changing the primary structure and abundance of venom peptides/proteins. Last year, this same group continued their study with transcriptome analysis of three new scorpion species: two Buthidae species (Lychas mucronatus and Isometrus maculatus) and one Euscorpiidae species (Scorpiops margerisonae) . More recently, a similar analysis of the venom glands from two scorpion species of the family Chaerilidae, Chaerilus tricostatus and Chaerilus tryznai revealed 14 types of venom peptides/proteins and 74 atypical venom molecules . Their cumulative transcriptomic analyses were responsible for the majority of new toxin molecules discovered in the field of scorpion toxins. Together, these findings created the genetic resource libraries for scorpion toxin research and will help to accelerate the drug discovery of toxin peptides. In fact, by analyzing molecular diversity of scorpion toxins and their structure-function relationships, they were the first to reveal a critical role of acidic residue function in toxin activity  and developed a novel drug lead .
The analyses of the M. martensii genome by Cao et al.  revealed some unique features in the structure and organization of neurotoxin genes. Out of the 116 neurotoxin genes, 109 are expressed in the venomous gland. Most of the neurotoxin genes contain one intron located at the end of the coding region for the signal peptide, while a smaller number had no or two introns. 44% (51/116) of the neurotoxin genes are present in clusters on seventeen scaffolds. Within each cluster are neurotoxin genes of the same family that share similar gene structure and organization, indicating that frequent gene duplication occurred at the neurotoxin loci. Similar features of gene cluster and structural organization were also found for the six defensin genes in M. martensii, suggesting an evolutionary trajectory parallel to that of neurotoxin genes.
The M. martensii genome sequence also allowed Cao et al.  to investigate the origin of scorpion neurotoxin genes, a subject of intense recent debate. Hierarchical clustering was used to group the related neurotoxin genes from M. martensii, including 54 NaTx, 41 KTx, 5 ClTx and 6 defensins. Two major groups corresponding to pharmacological classes were formed. Group 1 comprises NaTx genes, whereas group 2 contains KTx, ClTx and defensin genes. These findings not only point to monophyly of the neurotoxin and definsin genes, but also implicate a strong structure-function relationship by the association of functional determinants with the sequence homology groups. They also suggest that NaTx likely diverged early from the common ancestor(s) of KTx, ClTx and defensin, and subsequently KTx, ClTx and defensins evolved into their separate families.
Together with the transcriptome analyses of the venomous glands from different Chinese scorpion species, the genome sequence of M. martensii allows the characterization of a large family of venom molecules, in addition to other genes involved in defense and detoxification of the animal species. Such information provides a valuable resource for further study on the biology of the venom toxin, as well as for future therapeutic approach that targets neurotoxins for treatment of human diseases. Clearly, more questions will arise from the completion of the M. martensii genome, such as those on the comparative genomics of scorpion toxins and the biological functions of scorpion toxins, which will attract new generation of scientists to work on this exciting field of research and development.
Polis GA: Introduction. The Biology of Scorpions. Edited by: Polis GA. 1990, 1-8. California: Standford University Press.
Possani LD: Peptides and genes coding for scorpion toxins that affect ion-channels. Biochimie. 2000, 82: 861-868. 10.1016/S0300-9084(00)01167-6
Frost LM: A coumarin as a fluorescent compound in scorpion cuticle. Scorpions 2001: in memoriam, Gary A. Edited by: Fet V, Selden PA. 2001, 365-368. Polis: British Arachnological Society.
Cao Z: The genome of Mesobuthus martensii reveals a unique adaptation model of arthropods. Nat Commun. 2013, 4: 2602.
Zhu S: Molecular cloning and sequencing of two 'short chain' and two 'long chain' K+ channel-blocking peptides from the Chinese scorpion Buthus martensii Karsch. FEBS Lett. 1999, 457: 509-514. 10.1016/S0014-5793(99)01101-1
Ma Y: Transcriptome analysis of the venom gland of the scorpion Scorpiops jendeki: implication for the evolution of the scorpion venom arsenal. BMC Genomics. 2009, 10: 290. 10.1186/1471-2164-10-290
Ma Y: Molecular diversity of toxic components from the scorpion Heterometrus petersii venom revealed by proteomic and transcriptome analysis. Proteomics. 2010, 10: 2471-2485. 10.1002/pmic.200900763
Ruiming Z: Comparative venom gland transcriptome analysis of the scorpion Lychas mucronatus reveals intraspecific toxic gene diversity and new venomous components. BMC Genomics. 2010, 11: 452. 10.1186/1471-2164-11-452
Ma Y: Extreme diversity of scorpion venom peptides and proteins revealed by transcriptomic analysis: implication for proteome evolution of scorpion venom arsenal. J Proteomics. 2012, 75: 1563-1576. 10.1016/j.jprot.2011.11.029
He Y: Molecular diversity of Chaerilidae venom peptides reveals the dynamic evolution of scorpion venom components from Buthidae to non-Buthidae. J Proteomics. 2013, 89: 1-14.
Han S: Protein-protein recognition control by modulating electrostatic interactions. J Proteome Res. 2010, 9: 3118-3125. 10.1021/pr100027k
Han S: Structural basis of a potent peptide inhibitor designed for Kv1.3 channel, a therapeutic target of autoimmune disease. J Biol Chem. 2008, 283: 19058-19065. 10.1074/jbc.M802054200
JM was supported by extramural research grants from the National Institutes of Health (NIH) and YBS was supported by the Intramural Research Program of NICHD, NIH.
The authors declare that they have no competing interests.
JM and YBS wrote the manuscript and approved the final manuscript.