Proc. Natl. Acad. Sci. U.S.A.

Heartwater, a tick-borne disease of domestic and wild ruminants, is caused by the intracellular rickettsia Ehrlichia ruminantium (previously known as Cowdria ruminantium). It is a major constraint to livestock production throughout subSaharan Africa, and it threatens to invade the Americas, yet there is no immediate prospect of an effective vaccine. A shotgun genome sequencing project was undertaken in the expectation that access to the complete protein coding repertoire of the organism will facilitate the search for vaccine candidate genes. We report here the complete 1,516,355-bp sequence of the type strain, the stock derived from the South African Welgevonden isolate. Only 62% of the genome is predicted to be coding sequence, encoding 888 proteins and 41 stable RNA species. The most striking feature is the large number of tandemly repeated and duplicated sequences, some of continuously variable copy number, which contributes to the low proportion of coding sequence. These repeats have mediated numerous translocation and inversion events that have resulted in the duplication and truncation of some genes and have also given rise to new genes. There are 32 predicted pseudogenes, most of which are truncated fragments of genes associated with repeats. Rather then being the result of the reductive evolution seen in other intracellular bacteria, these pseudogenes appear to be the product of ongoing sequence duplication events.

Source:http://purl.uniprot.org/citations/15637156

Statements in which the resource exists as a subject.
PredicateObject
rdf:type
rdfs:comment
Heartwater, a tick-borne disease of domestic and wild ruminants, is caused by the intracellular rickettsia Ehrlichia ruminantium (previously known as Cowdria ruminantium). It is a major constraint to livestock production throughout subSaharan Africa, and it threatens to invade the Americas, yet there is no immediate prospect of an effective vaccine. A shotgun genome sequencing project was undertaken in the expectation that access to the complete protein coding repertoire of the organism will facilitate the search for vaccine candidate genes. We report here the complete 1,516,355-bp sequence of the type strain, the stock derived from the South African Welgevonden isolate. Only 62% of the genome is predicted to be coding sequence, encoding 888 proteins and 41 stable RNA species. The most striking feature is the large number of tandemly repeated and duplicated sequences, some of continuously variable copy number, which contributes to the low proportion of coding sequence. These repeats have mediated numerous translocation and inversion events that have resulted in the duplication and truncation of some genes and have also given rise to new genes. There are 32 predicted pseudogenes, most of which are truncated fragments of genes associated with repeats. Rather then being the result of the reductive evolution seen in other intracellular bacteria, these pseudogenes appear to be the product of ongoing sequence duplication events.
skos:exactMatch
uniprot:name
Proc. Natl. Acad. Sci. U.S.A.
uniprot:author
Allsopp B.A., Allsopp M.T., Berthier D., Botha M., Brayton K.A., Collins N.E., Corton C.H., Faber F.E., Jongejan F., Josemans A., Joubert F., Liebenberg J., Louw E., Maillard J.C., Pretorius A., Steyn H.C., Thomson N.R., Zweygarth E., de Villiers E.P., van Heerden H., van Kleef M., van Strijp M.F.
uniprot:date
2005
uniprot:pages
838-843
uniprot:title
The genome of the heartwater agent Ehrlichia ruminantium contains multiple tandem repeats of actively variable copy number.
uniprot:volume
102
dc-term:identifier
doi:10.1073/pnas.0406633102