We have initiated a project to sequence the 3 Mbp genome of the thermoacidophilic archaebacterium Sulfolobus solfataricus P2. Cosmids were selected from a provisional set of minimally overlapping clones, subcloned in pUC18, and sequenced using a hybrid (random plus directed) strategy to give two blocks of contiguous unique sequence, respectively, 100,389 and 56,105 bp. These two contigs contain a total of 163 open reading frames (ORFs) in 26-29 putative operons; 56 ORFs could be identified with reasonable certainty. Clusters of ORFs potentially encode proteins of glycogen biosynthesis, oxidative decarboxylation of pyruvate, ATP-dependent transport across membranes, isoprenoid biosynthesis, protein synthesis, and ribosomes. Putative promoters occur upstream of most ORFs. Thirty per cent of the predicted strong and medium-strength promoters can initiate transcription at the start codon or within 10 nucleotides upstream, indicating a process of initial mRNA-ribosome contact unlike that of most eubacterial genes. A novel termination motif is proposed to account for 15 additional terminations. The two contigs differ in densities of ORFs, insertion elements and repeated sequences; together they contain two copies of the previously reported insertion sequence ISC 1217, five additional IS elements representing four novel types, four classes of long non-IS repeated sequences, and numerous short, perfect repeats.
Predicate | Object |
---|---|
rdf:type | |
rdfs:comment |
We have initiated a project to sequence the 3 Mbp genome of the thermoacidophilic archaebacterium Sulfolobus solfataricus P2. Cosmids were selected from a provisional set of minimally overlapping clones, subcloned in pUC18, and sequenced using a hybrid (random plus directed) strategy to give two blocks of contiguous unique sequence, respectively, 100,389 and 56,105 bp. These two contigs contain a total of 163 open reading frames (ORFs) in 26-29 putative operons; 56 ORFs could be identified with reasonable certainty. Clusters of ORFs potentially encode proteins of glycogen biosynthesis, oxidative decarboxylation of pyruvate, ATP-dependent transport across membranes, isoprenoid biosynthesis, protein synthesis, and ribosomes. Putative promoters occur upstream of most ORFs. Thirty per cent of the predicted strong and medium-strength promoters can initiate transcription at the start codon or within 10 nucleotides upstream, indicating a process of initial mRNA-ribosome contact unlike that of most eubacterial genes. A novel termination motif is proposed to account for 15 additional terminations. The two contigs differ in densities of ORFs, insertion elements and repeated sequences; together they contain two copies of the previously reported insertion sequence ISC 1217, five additional IS elements representing four novel types, four classes of long non-IS repeated sequences, and numerous short, perfect repeats.
|
skos:exactMatch | |
uniprot:name |
Mol. Microbiol.
|
uniprot:author |
Allard G.,
Chan C.C.-Y.,
Charlebois R.L.,
Doolittle W.F.,
Gaasterland T.,
Klenk H.-P.,
Liu Q.Y.,
Penny S.L.,
Ragan M.A.,
Schenk M.E.,
Sensen C.W.,
Singh R.K.,
Young F.
|
uniprot:date |
1996
|
uniprot:pages |
175-191
|
uniprot:title |
Organizational characteristics and information content of an archaeal genome: 156 kb of sequence from Sulfolobus solfataricus P2.
|
uniprot:volume |
22
|
dc-term:identifier |
doi:10.1111/j.1365-2958.1996.tb02666.x
|