REAL

T2prhd: a tool to study the patterns of repeat evolution

Sipos, Botond and Somogyi, Kálmán and Andó, István and Pénzes, Zsolt (2008) T2prhd: a tool to study the patterns of repeat evolution. BMC Bioinformatics, 27. ISSN 1471-2105

[img]
Preview
Text
1471-2105-9-27.pdf

Download (930kB) | Preview

Abstract

BACKGROUND: The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models) have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading. RESULTS: We have developed a novel rapid visual approach to study repeat evolution, that takes into account the exact repeat position in a sequence. The "pairwise repeat homology diagram" visualizes sequence repeats detected by a profile HMM in a pair of sequences and highlights their homology relations inferred by a phylogenetic tree. The method is implemented in a Perl script (t2prhd) available for downloading at http://t2prhd.sourceforge.net and is also accessible as an online tool at http://t2prhd.brc.hu. The power of the method is demonstrated on the EGF-like and fibronectin-III-like (Fn-III) domain repeats of three selected mammalian Tenascin sequences. CONCLUSION: Although pairwise repeat homology diagrams do not carry all the information provided by the phylogenetic tree, they allow a rapid and intuitive assessment of repeat evolution. We believe, that t2prhd is a helpful tool with which to study the pattern of repeat evolution. This method can be particularly useful in cases of large datasets (such as large gene families), as the command line interface makes it possible to automate the generation of pairwise repeat homology diagrams with the aid of scripts

Item Type: Article
Subjects: Q Science / természettudomány > QP Physiology / élettan
Depositing User: János Zsámboki
Date Deposited: 30 Sep 2013 13:15
Last Modified: 19 May 2023 11:06
URI: http://real.mtak.hu/id/eprint/6804

Actions (login required)

Edit Item Edit Item