First complete genome assembly of planarian flatworm reveals treasure trove on the function and evolution of genes.
The planarian flatworm Schmidtea mediterranea is an extraordinary animal. Even when cut into tiny pieces, each piece can regenerate back into a complete and perfectly proportioned miniature planarian. Key to this ability are fascinating adult stem cells, a single one of which can restore a complete worm. But how Schmidtea mediterranea achieves these feats is so far poorly understood. An important step towards this goal is the first highly contiguous genome assembly of Schmidtea mediterranea that researchers at the Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG) in Dresden in cooperation with the Heidelberg Institute for Theoretical Studies (HITS) report in the current issue of Nature. The assembly reveals a genome that contains novel giant repeat elements, new flatworm-specific genes, but also the absence of other genes that were so far thought to be absolutely essential for keeping an animal alive. The discovery has potential implications in the fields of regeneration research, stem cell biology and bioinformatics.
A complete and fully assembled genome is critical for understanding the biological characteristics of an organism. Scientists have previously attempted to sequence the genome of Schmidtea mediterranea, but ended up with a collection of more than 100.000 short pieces. The reason for this is that a great deal of the genome consists of many, nearly identical copies of the same sequence that repeats over and over.To overcome this challenge of an exceptionally repetitive genome, the research groups of Jochen Rink and Eugene Myers at the MPI-CBG utilized Pacific Bioscience’s long-read sequencing technology, operated at the DRESDEN-concept Sequencing Center, a joint operation between the MPI-CBG and the TU Dresden. This relatively new technology can directly “read” contiguous stretches of the genome up to 40.000 base pairs (or “letters”) long. Such long reads are dramatically more effective at bridging repetitive stretches in the genome than the more broadly used 100-500 base pair reads, thus resulting in up to 100-fold improvements in genome assembly statistics over previous assemblies.
Siegfried Schloissnig (HITS) was primarily responsible for developing a novel software system, called “Marvel”, that solves more of the jigsaw puzzle posed by the long-reads than previous such systems, and more efficiently. The assembly of the Schmidtea mediterranea genome involved eight terabytes of data that took the high-performance computing cluster at the HITS three weeks to complete.
But what can scientists actually do with the abundance of genetic information in a genome assembly? One of the surprises in the case of Schmidtea mediterranea was the likely absence of highly conserved genes such as MAD1 and MAD2. Both are present in nearly all other organisms because they fulfil a function in a checkpoint that ensures that both daughter cells get the same number of chromosomes after cell division. Yet despite the MAD1/2 gene loss, planarians retained the checkpoint function. How this is possible is one of the questions that the genome will help to answer. But Jochen Rink and his group are especially excited about using the genome assembly for understanding how planarians manage to regenerate from an arbitrary tissue piece. Rink explains: “We already know some of the genes required for regenerating a head, but now we can also search for the regulatory control sequences that activate the head genes only at the front end of a regenerating piece”. Further, the Rink group has assembled a large collection of planarian species from around the world, many of which have lost the ability to regenerate. “With a powerful toolbox for the assembly of difficult genomes now in place, we hope to soon use genome comparisons to understand why some animals regenerate, while so many do not. At least in the case of flatworms”, summarizes Rink.
Markus Alexander Grohme, Siegfried Schloissnig, Andrei Rozanski, Martin Pippel, George Young, Sylke Winkler, Holger Brandl, Ian Henry, Andreas Dahl, Sean Powell, Michael Hiller, Eugene Myers, Jochen Christian Rink: Schmidtea mediterranea and the evolution of core cellular mechanisms, Nature, 24 January 2018
+49 (0) 351 210 2080
+49 (0) 6221 533 245
About the MPI-CBG
The Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG) is one of 83 institutes of the Max Planck Society, an independent, non-profit organization in Germany. 500 curiosity-driven scientists from over 50 countries ask: How do cells form tissues? The basic research programs of the MPI-CBG span multiple scales of magnitude, from molecular assemblies to organelles, cells, tissues, organs, and organisms.
The Heidelberg Institute for Theoretical Studies (HITS) was established in 2010 by the physicist and SAP co-founder Klaus Tschira (1940-2015) and the Klaus Tschira Foundation as a private, non-profit research institute. HITS conducts basic research in the natural sciences, mathematics and computer science, with a focus on the processing, structuring, and analyzing of large amounts of complex data and the development of computational methods and software. The research fields range from molecular biology to astrophysics. The shareholders of HITS are the HITS Stiftung, which is a subsidiary of the Klaus Tschira Foundation, Heidelberg University and the Karlsruhe Institute of Technology (KIT). HITS also cooperates with other universities and research institutes and with industrial partners. The base funding of HITS is provided by the HITS Stiftung with funds received from the Klaus Tschira Foundation. The primary external funding agencies are the Federal Ministry of Education and Research (BMBF), the German Research Foundation (DFG), and the European Union.