Complete sequence of a 93.4 kb contig from chromosome 3 of Trypanosoma cruzi containing a strand switch region
1998 (English)In: Genome Research, ISSN 1088-9051, E-ISSN 1549-5469, Vol. 8, no 8, 809-816 p.Article in journal (Refereed) Published
We have initiated large-scale sequencing of the third smallest chromosome of the CL Brener strain of Trypanosoma cruzi and we report here the complete sequence of a contig consisting of three cosmids. This contig covers 93.4 kb and has been found to contain 20-30 novel genes and several repeat elements, including a novel chromosome 3-specific 400-bp repeat sequence. The intergenic sequences were found to be rich in di- and trinucleotide repeats of varying lengths and also contained several known T. cruzi repeat elements. The sequence contains 29 open reading frames (ORFs) longer than 700 bp, the longest being 5157 bp, and a large number of shorter ORFs. Of the long ORFs, seven show homology to known genes in parasites and other organisms, whereas four ORFs were confirmed by sequencing of cDNA clones. Two shorter ORFs were confirmed by a database homology and a cDNA clone, respectively, and one RNA gene was identified. The identified genes include two copies of the gene for alanine-aminotransferase as well as genes for glucose-6-phosphate isomerase, protein kinases and phosphatases, and an ATP synthase subunit. An interesting feature of the sequence was that the genes appear to be organized in two long clusters containing multiple genes on the same strand. The two clusters are transcribed in opposite directions and they are separated by an approximately 20-kb long, relatively GC-rich sequence, that contains two large repetitive elements as well as a pseudogene for cruzipain and a gene for U2snRNA. It is likely that this strand switch region contains one or more regulatory and promoter regions. The reported sequence provides the first insight into the genome organization of T. cruzi and shows the potential of this approach for rapid identification of novel genes. [The sequence data described in this paper have been submitted to the GenBank data library under accession nos. AF052831-AF052833.]
Place, publisher, year, edition, pages
1998. Vol. 8, no 8, 809-816 p.
Medical and Health Sciences
IdentifiersURN: urn:nbn:se:uu:diva-52101DOI: 10.1101/gr.8.8.809PubMedID: 9724326OAI: oai:DiVA.org:uu-52101DiVA: diva2:80010