生物通报道:来自马普研究院等处的研究人员发表了题为“A de novo assembly of the newt transcriptome combined with proteomic validation identifies new protein families expressed during tissue regeneration”的文章,针对基因组特别大, 难以进行测序的动物,提出了一种新技术策略——将转录组与蛋白质组联系在一起,由此开启解析这些动物奇特基因组的神奇之旅。相关成果公布在1月20日Genome Biology杂志上。
Looso M, Preussner J, Sousounis K, Bruckskotten M, et. al. A de novo assembly of the newt transcriptome combined with proteomic validation identifies new protein families expressed during tissue regeneration. (2013) Genome Biology 14:R16.
A de novo assembly of the newt transcriptome combined with proteomic validation identifies new protein families expressed during tissue regeneration
BACKGROUND: Notophthalmus viridescens, an urodelian amphibian, represents an excellent model organism to study regenerative processes, but mechanistic insights into molecular processes driving regeneration have been hindered by paucity and poor annotation of coding nucleotide sequences. The enormous genome size and the lack of a closely related reference genome have so far prevented assembly of the urodelian genome.
RESULTS: We describe the de novo assembly of the transcriptome of the newt Notophthalmus viridescens and its experimental validation. RNA pools covering embryonic and larval development, different stages of heart, appendage and lens regeneration, as well as a collection of different undamaged tissues were used to generate sequencing datasets on Sanger, Illumina and 454 platforms. Through a sequential de novo assembly strategy, hybrid datasets were converged into one comprehensive transcriptome comprising 120,922 non-redundant transcripts with a N50 of 975. 38,384 putative transcripts were annotated and around 15,000 transcripts were experimentally validated as protein coding by mass spectrometry based proteomics. Bioinformatical analysis of coding transcripts identified 826 proteins specific for urodeles. Several newly identified proteins establish novel protein families based on the presence of new sequence motifs without counterparts in public databases, while others containing known protein domains extend already existing families and also constitute new ones.
CONCLUSIONS: We demonstrate that our multistep assembly approach allows de novo assembly of the newt transcriptome with an annotation grade comparable to well characterized organisms. Our data provide the groundwork for mechanistic experiments to answer the question whether urodeles utilize proprietary sets of genes for tissue regeneration.