Detection of genes with atypical nucleotide sequence in microbial genomes
2002 (English)In: Journal of Molecular Evolution, ISSN 0022-2844, E-ISSN 1432-1432, Vol. 54, no 3, 365-375 p.Article in journal (Refereed) Published
Along the gene, nucleotides in various codon positions tend to exert a slight but observable influence on the nucleotide choice at neighboring positions. Such context biases are different in different organisms and can be used as genomic signatures. In this paper, we will focus specifically on the dinucleotide composed of a third codon position nucleotide and its succeeding first position nucleotide. Using the 16 possible dinucleotide combinations, we calculate how well individual genes conform to the observed mean dinucleotide frequencies of an entire genome, forming a distance measure for each gene. It is found that genes from different genomes can be separated with a high degree of accuracy, according to these distance values. In particular, we address the problem of recent horizontal gene transfer, and how imported genes may be evaluated by their poor assimilation to the host's context biases. By concentrating on the third- and succeeding first position nucleotides, we eliminate most spurious contributions from codon usage and amino-acid requirements, focusing mainly on mutational effects. Since imported genes are expected to converge only gradually to genomic signatures, it is possible to question whether a gene present in only one of two closely related organisms has been imported into one organism or deleted in the other. Striking correlations between the proposed distance measure and poor homology are observed when Escherichia coli genes are compared to Salmonella typhi, indicating that sets of outlier genes in E. coli may contain a high number of genes that have been imported into E. coli, and not deleted in S. typhi.
Place, publisher, year, edition, pages
2002. Vol. 54, no 3, 365-375 p.
Biochemistry and Molecular Biology
IdentifiersURN: urn:nbn:se:uu:diva-90105DOI: 10.1007/s00239-001-0051-8PubMedID: 11847562OAI: oai:DiVA.org:uu-90105DiVA: diva2:162310