pubmed:abstractText |
There are at least three immunoglobulin epsilon genes (C epsilon 1, C epsilon 2, and C epsilon 3) in the human genome. The nucleotide sequences of the expressed epsilon gene (C epsilon 1) and one (C epsilon 3) of the two epsilon pseudogenes were compared. The results show that the C epsilon 3 gene lacks the three intervening sequences entirely and has a 31-base A-rich sequence 16 bases 3' to the putative poly(A) addition signal, indicating that the C epsilon 3 gene is a processed gene. The C epsilon 3 gene sequence is homologous to the five separate DNA segments of the C epsilon 1 gene; namely, a segment in the 5'-flanking region (100 bases) and four exons, which are interrupted by a spacer region or intervening sequences. Long terminal repeat (LTR)-like sequences which contain TATAAA and AATAAA sequences as well as terminal inverted repeats are present in both 5'- and 3'-flanking regions. The 5' and 3' LTR-like sequences do not, however, constitute a direct repeat, unlike transposable elements of eukaryotes and retroviruses. The 3' LTR-like sequence is repetitive in the human genome, but is not homologous to the Alu family DNA. Models for the evolutionary origin of the processed gene flanked by the LTR-like sequences are discussed. The C epsilon 3 gene has a new open frame which codes potentially for an unknown protein of 292 amino acid residues.
|