The complete nucleotide sequences of two human T-cell leukaemia virus type III (HTLV-III) proviral DNAs each contain four long open reading frames, the first two corresponding to the gag and pol genes. The fourth open reading frame encodes two functional polypeptides: a large precursor of the major envelope glycoprotein, and a smaller protein derived from the 3'-terminal long open reading frame, analogous to the long open reading frame (lor) product of HTLV-I and -II.