Mol. Microbiol.

We have initiated a project to sequence the 3 Mbp genome of the thermoacidophilic archaebacterium Sulfolobus solfataricus P2. Cosmids were selected from a provisional set of minimally overlapping clones, subcloned in pUC18, and sequenced using a hybrid (random plus directed) strategy to give two blocks of contiguous unique sequence, respectively, 100,389 and 56,105 bp. These two contigs contain a total of 163 open reading frames (ORFs) in 26-29 putative operons; 56 ORFs could be identified with reasonable certainty. Clusters of ORFs potentially encode proteins of glycogen biosynthesis, oxidative decarboxylation of pyruvate, ATP-dependent transport across membranes, isoprenoid biosynthesis, protein synthesis, and ribosomes. Putative promoters occur upstream of most ORFs. Thirty per cent of the predicted strong and medium-strength promoters can initiate transcription at the start codon or within 10 nucleotides upstream, indicating a process of initial mRNA-ribosome contact unlike that of most eubacterial genes. A novel termination motif is proposed to account for 15 additional terminations. The two contigs differ in densities of ORFs, insertion elements and repeated sequences; together they contain two copies of the previously reported insertion sequence ISC 1217, five additional IS elements representing four novel types, four classes of long non-IS repeated sequences, and numerous short, perfect repeats.

Source:http://purl.uniprot.org/citations/8899719

Statements in which the resource exists as a subject.
PredicateObject
rdf:type
rdfs:comment
We have initiated a project to sequence the 3 Mbp genome of the thermoacidophilic archaebacterium Sulfolobus solfataricus P2. Cosmids were selected from a provisional set of minimally overlapping clones, subcloned in pUC18, and sequenced using a hybrid (random plus directed) strategy to give two blocks of contiguous unique sequence, respectively, 100,389 and 56,105 bp. These two contigs contain a total of 163 open reading frames (ORFs) in 26-29 putative operons; 56 ORFs could be identified with reasonable certainty. Clusters of ORFs potentially encode proteins of glycogen biosynthesis, oxidative decarboxylation of pyruvate, ATP-dependent transport across membranes, isoprenoid biosynthesis, protein synthesis, and ribosomes. Putative promoters occur upstream of most ORFs. Thirty per cent of the predicted strong and medium-strength promoters can initiate transcription at the start codon or within 10 nucleotides upstream, indicating a process of initial mRNA-ribosome contact unlike that of most eubacterial genes. A novel termination motif is proposed to account for 15 additional terminations. The two contigs differ in densities of ORFs, insertion elements and repeated sequences; together they contain two copies of the previously reported insertion sequence ISC 1217, five additional IS elements representing four novel types, four classes of long non-IS repeated sequences, and numerous short, perfect repeats.
skos:exactMatch
uniprot:name
Mol. Microbiol.
uniprot:author
Allard G., Chan C.C.-Y., Charlebois R.L., Doolittle W.F., Gaasterland T., Klenk H.-P., Liu Q.Y., Penny S.L., Ragan M.A., Schenk M.E., Sensen C.W., Singh R.K., Young F.
uniprot:date
1996
uniprot:pages
175-191
uniprot:title
Organizational characteristics and information content of an archaeal genome: 156 kb of sequence from Sulfolobus solfataricus P2.
uniprot:volume
22
dc-term:identifier
doi:10.1111/j.1365-2958.1996.tb02666.x