Nature

The determination of the total 5,224 base-pair DNA sequence of the virus SV40 has enabled us to locate precisely the known genes on the genome. At least 15.2% of the genome is presumably not translated into polypeptides. Particular points of interest revealed by the complete sequence are the initiation of the early t and T antigens at the same position and the fact that the T antigen is coded by two non-contiguous regions of the genome; the T antigen mRNA is spliced in the coding region. In the late region the gene for the major protein VP1 overlaps those for proteins VP2 and VP3 over 122 nucleotides but is read in a different frame. The almost complete amino acid sequences of the two early proteins as well as those of the late proteins have been deduced from the nucleotide sequence. The mRNAs for the latter three proteins are presumably spliced out of a common primary RNA transcript. The use of degenerate codons is decidedly non-random, but is similar for the early and late regions. Codons of the type NUC, NCG and CGN are absent or very rare.

Source:http://purl.uniprot.org/citations/205802

Download in:

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	uniprot:Journal_Citation
rdfs:comment	The determination of the total 5,224 base-pair DNA sequence of the virus SV40 has enabled us to locate precisely the known genes on the genome. At least 15.2% of the genome is presumably not translated into polypeptides. Particular points of interest revealed by the complete sequence are the initiation of the early t and T antigens at the same position and the fact that the T antigen is coded by two non-contiguous regions of the genome; the T antigen mRNA is spliced in the coding region. In the late region the gene for the major protein VP1 overlaps those for proteins VP2 and VP3 over 122 nucleotides but is read in a different frame. The almost complete amino acid sequences of the two early proteins as well as those of the late proteins have been deduced from the nucleotide sequence. The mRNAs for the latter three proteins are presumably spliced out of a common primary RNA transcript. The use of degenerate codons is decidedly non-random, but is similar for the early and late regions. Codons of the type NUC, NCG and CGN are absent or very rare.
skos:exactMatch	http://purl.uniprot.org/medline/78156432, http://purl.uniprot.org/pubmed/205802
uniprot:name	Nature
uniprot:author	Contreras R., Fiers W., Haegeman G., Rogiers R., Volckaert G., Ysebaert M., van Herreweghe J., van Heuverswyn H., van de Voorde A.
uniprot:date	1978
uniprot:pages	113-120
uniprot:title	Complete nucleotide sequence of SV40 DNA.
uniprot:volume	273
dc-term:identifier	doi:10.1038/273113a0