pubmed:abstractText |
We determined the sequence of the 2,138 nucleotides in the Sendai virus genome just following the 3' proximal 3,686 nucleotides which we had previously reported (Nucleic Acids Res. 11, 7317-7330, 1983). This covers the entire third gene of 1,173 nucleotides and the 3' proximal 1,013 nucleotides of the fourth gene. Like the NP and P+C genes, both the third and fourth genes start from consensus sequence R1 (3'-UCCCAC(or UA)UUUC) at the 3' end and the third gene terminates with consensus sequence R2 (3'-AUUCUUUUU) at the 5' end. The third gene was identified as M, and the deduced 348 amino acids indicated that the M protein is rich in basic residues and has hydrophobic domains near the C-terminal. The fourth gene, although sequencing is not complete yet, was identified as F, since a large open reading frame found in the gene contains the characteristic sequence of 20 amino acids located at the N-terminal of the F1 protein. Analyses of the amino acid sequence suggested that the structure of the F gene product is NH2-signal peptide-F2-F1-COOH.
|