A novel replication-independent histone H2a gene in mouse

Background An uncharacterized histone H2a-coding transcript (E130307C13) has been cloned from a mouse full-length cDNA library. This transcript is encoded on chromosome 6, approximately 4 kb upstream of a histone H4 gene, Hist4h4. The proteins encoded by this transcript and the human H2afj mRNA isoform-2 have the highest amino acid similarity. In this paper, we characterize it from the expression pattern given by quantitative RT-PCR. Results Quantitative RT-PCR indicated that the gene that encodes E130307C13 (E130307C13) is regulated in a replication-independent manner, and therefore it is H2afj. Certainly, H2afj transcript lacks a stem-loop structure at the 3'-UTR but contains a poly (A) signal. In addition, its promoter region has a different structure from those of the replication-dependent histone H2a genes. Conclusion The bioinformatics imply that E130307C13 is a replication-independent H2a gene. In addition, quantitative RT-PCR analysis shows that it is replication-independent. Thus, it is H2afj, a novel replication-independent H2a gene in mouse.


Background
Replication of the eukaryotic chromosomes requires the synthesis of histones to package the newly replicated DNA into chromatin. Control of the level of histone mRNA accounts for much of the control of histone protein synthesis [1]. Mouse has 18 replication-dependent histone H2a genes [2]. Among these 18 genes, 13 genes are located in the Hist1 cluster on chromosome 13, 4 in the Hist2 cluster on chromosome 3 and 1 in the Hist3 cluster on chromosome 11 [2]. The histone mRNAs that are cellcycle-regulated increase 35-fold as cells progress from mitosis through G1-phase and into S-phase [3]. The promoters of histone genes contain CCAAT and TATA boxes [4]. The replication-dependent histone H2a genes lack introns and a poly (A) signal. They have a conserved stemloop structure (5'-GGCTCTTTTCAGAGCC-3') at the 3'-UTR, which plays an important role in mRNA processing and stability [5][6][7].
Mouse also has two replication-independent histone H2a genes, H2afx on chromosome 9 and H2afz chromosome 3. These two genes encode polyadenylated mRNAs. H2afx mRNA has both a polyadenylated tail and a stem-loop structure [8]. Mouse replication-dependent histone H2a mRNAs and H2afx mRNA have a nuclear export element (5'-ACAACAAGAAGACGCGCATCAT-3') in the protein-coding region that functions to export the mRNA from the nucleus to the cytoplasm [9].
An uncharacterized histone H2a-coding transcript (E130307C13, FANTOM clone ID; NM_177688, Genebank accession number) has been cloned from a mouse full-length cDNA library. E130307C13 lies on chromosome 6, approximately 4 kb upstream of Hist4h4. In this paper, we characterize it from the expression pattern given by quantitative RT-PCR. In addition to this, we compared the structure with the sequences deposited in the international DNA/protein database.

Results and discussion
Comparison of the putative amino acid sequence encoded by E130307C13 with the amino acid sequences deposited in the international DNA/protein database showed that it has the highest similarity to that encoded by human H2afj mRNA isoform-2 (NM_177925, Genebank accession number; Figs. 1, 2). Human has two isoforms of H2afj [10,11]. Isoform-1 (NM_018267, Genebank accession number) is produced after splicing of two introns; isoform-2 does not need intron splicing for maturation.
Interestingly, H2afj also lies near a histone H4 gene, on human chromosome 12. According to the nomenclature of histone genes [2], if E130307C13 is regulated in a replication-dependent manner, it is recognized as Hist4h2a. But if it is regulated in a replication-independent manner, it is recognized as H2afj.
Each product of the quantitative RT-PCR gave a single band on the agarose gel of the expected size (Fig. 3). Observation using the quantitative RT-PCR showed that the expression pattern of E130307C13 is typical of replication-independent histone gene (Table 1, Fig. 3). The expression pattern of E130307C13 is more similar to that of the replication-independent H2afz than to that of the replication-dependent Hist2h2aa2. The expression of Hist2h2aa2 increased along with cell cycle progression from the beginning of S-phase (0 h), peaked at 2 h, and then decreased (Fig. 3). On the other hand, E130307C13 and H2afz were expressed constantly (Fig. 3). These results suggest that E130307C13 is a replication-independent histone H2a gene in mouse. According to the nomenclature of histone genes [2], E130307C13 is recognized not as Hist4h2a but as H2afj.
Upstream of the 5'-end of E130307C13, no TATA box was found. In addition, the first CCAAT box lies 230 bases upstream of the translation start codon (Fig. 4). The other replication-dependent H2a genes have the first CCAAT and TATA boxes within 100 bases upstream of the translation start codon (Fig. 4). The replication-independent genes H2afx and H2afz also have a TATA box (Fig. 4). Thus, histone H2a genes have a TATA box in the promoters, except for E130307C13. Interestingly, the promoter of H2afz lacks CCAAT box but includes TATA box, on the other hand, that of E130307C13 lacks TATA box but includes CCAAT box (Fig. 4).
In addition, the 3'-UTR of the E130307C13 mRNA does not include the conserved stem-loop structure (Fig. 5). But the E130307C13 mRNA has two poly (A) signals at the middle and near the 3'-end. It indicates that the E130307C13 gives rise to two differentially polyadenylated mRNA transcripts. Considering the position at the 3'-UTR, it has a possibility that the poly (A) signal near the 3'-end is functional. Except for the pseudogene Hist1h2aj, the replication-dependent H2a mRNAs have the conserved stem-loop structure at the 3'-UTR. H2afx gives rise to a cell-cycle-regulated mRNA ending in the stem-loop when it is transcribed during S-phase, and a polyadenylated mRNA that is present in G1-phase cells [2,8].
H2afz mRNA lacks the stem-loop structure and has poly (A) signals. H2afz contains four introns in the proteincoding region and needs a splicing mechanism to produce the mature mRNA. Interestingly, E130307C13 lacks introns. Comparing the transcript structure of Phylogenetic relationships among 44 amino acids sequences from human and mouse histone H2a related proteins   RT-PCR products on agarose gel and expression patterns. Lanes 1, 100 bp ladder; 2, RT-PCR product amplified with E130307C13 primer set 2; 3, that with E130307C13 primer set 1; 4, that with Hist2h2aa2 primer set 1; 5, that with Hist2h2aa2 primer set 2; 6, that with H2afz primer set. Hist2h2aa2_2 H2afz E130307C13 with those of the other histone H2a genes suggests that E130307C13 has replication-independent characteristics.

Conclusion
The bioinformatics imply that E130307C13 is a replication-independent H2a gene. In addition, quantitative RT-PCR analysis shows that it is replication-independent. Thus, it is H2afj, a novel replication-independent H2a gene in mouse.

Phylogenetic tree construction
Multiple alignment of 44 amino acids sequences of histone H2a related proteins from human and mouse was created using the CLUSTAL W [12] on the DNA Databank of Japan. The phylogenetic tree by the neighbor-joining method with 1000 bootstrap analyses was constructed based on the multiple alignment using MEGA version 2.1 [13].

Cell cycle synchronization
The cell cycle of Hepa 1-6 cells was synchronized at the end of G1-phase by the addition of thymidine-hydroxyurea. The cell cycle arrest was released by washing out the thymidine-hydroxyurea, then the cells were harvested at intervals of 1 h from 0 h to 11 h.

RNA extraction
Total RNA was extracted by using the RNeasy mini kit (Qiagen) according to the manual for the cell line. After that, each sample was treated with DNase I.

cDNA synthesis
RNA (approximately 0.5 µg) and random hexamer primers were heated to 70°C for 10 min, followed by cooling on ice for 5 min. The cDNA was synthesized using Superscript III First Strand buffer (Invitrogen) according to the manual. The reverse transcriptase was inactivated by a 15min incubation at 70°C.

Quantitative PCR
The following primers were used: 5'-AACTGTAGCCCG-GCCCG-3' and 5'-TTCGTCTGTTTGCGCTTT-3' (primer Sequences between CCAAT and TATA boxes upstream of the histone H2a and E130307C13 coding regions Figure 4 Sequences between CCAAT and TATA boxes upstream of the histone H2a and E130307C13 coding regions. Underlines indicate CCAAT and TATA boxes. The ATGs located at the 3'-end indicate translation start codon. Numbers in parentheses represent numbers of bases not shown. The Hist1h2aj is a pseudogene, lacking a start codon.