- Research article
- Open Access
Genome-wide characterization and expression analysis of MYB transcription factors in Gossypium hirsutum
© The Author(s). 2016
- Received: 29 June 2016
- Accepted: 29 August 2016
- Published: 9 September 2016
MYB family proteins are one of the most abundant transcription factors in the cotton plant and play diverse roles in cotton growth and evolution. Previously, few studies have been conducted in upland cotton, Gossypium hirsutum. The recent release of the G. hirsutum genome sequence provides a great opportunity to identify and characterize the entire upland cotton MYB protein family.
In this study, we undertook a comprehensive genome-wide characterization and expression analysis of the MYB transcription factor family during cotton fiber development. A total of 524 non-redundant cotton MYB genes, among 1986 MYB and MYB-related putative proteins, were identified and classified into four subfamilies including 1R-MYB, 2R-MYB, 3R-MYB, and 4R-MYB. Based on phylogenetic tree analysis, MYB transcription factors were divided into 16 subgroups. The results showed that the majority (69.1 %) of GhMYBs genes belong to the 2R-MYB subfamily in upland cotton.
Our comparative genomics analysis has provided novel insights into the roles of MYB transcription factors in cotton fiber development. These results provide the basis for a greater understanding of MYB regulatory networks and to develop new approaches to improve cotton fiber development.
- Comparative genomics analysis
- Upland cotton
- MYB genes
- Fiber development
Plant growth and development are controlled by multigene families. Transcription factors play a key role in the regulation of gene transcription and commonly comprise four distinct domains: a DNA-binding domain, a nuclear localization signal, a transcription activation domain, and an oligomerization site . These four domains work together to control many aspects of plant growth and development by activating or suppressing the transcriptional process . Additionally, transcription factors are regularly encoded by multigene families which makes analyzing their individual roles more complex . Compared with fungi and animals, the MYB transcription factors of higher plants are more broadly dispersed in the genome . The MYB domain is highly conserved among plants, and proteins usually contain between one and four repeats (SONT domains) named R1, R2, R3, and R4. Each repeat is comprised of 50–53 amino acids which encode three α-helices, the second and third of which form a helix–turn–helix (HTH) structure . The third α-helix forms the transcription factor DNA recognition site and interacts with the major groove of DNA . Moreover, it is comprised of regularly spread triplet tryptophan residues that group together to make a hydrophobic core . In contrast, the C-terminal promoter domain of different MYB proteins is quite diverse, leading to the broad variety of regulatory roles of the MYB gene family [7, 8]. The MYB domain was first identified in the avian myeloblastosis virus (v-myb)  and three additional MYB genes (c-myb, A-myb, and B-myb) were identified in different organisms such as vertebrates, insects, fungi, and slime molds [7, 10, 11]. The corn C1 gene was the first MYB gene identified in plants, and encodes a c-myb-like transcription factor responsible for the regulation of anthocyanin biosynthesis . Generally, R2R3-MYB domain proteins are the predominant form found in higher plants .
To date, the plant transcription factor database (http://planttfdb.cbi.edu.cn/) contains approximately 8746 MYB, and 6410 MYB-related, sequences . These genes may be involved in various plant cell activities such as secondary metabolism, hormone signaling [8, 14], environmental stress, cell development , and organ growth [16, 17].
Recently, several genome-wide analyses of MYB transcription factors have been conducted in Arabidopsis, rice , maize , Salvia miltiorrhiza , soybean , apple , sugarcane , and Chinese cabbage . These studies can be utilized to identify MYB transcription factors in other plants, including cotton. However, very little information about MYB gene diversity and abundance in upland cotton is available. Several studies have shown that the MYB transcription factor family has a role in regulating fiber progress in cotton. The GhMYB109 and GhMYB2 genes, that belong to the R2R3-MYB subfamily, are associated with positive regulation of cotton fiber development [24, 25]. Moreover, silencing of GhMYB25 is involved in the production of short fibers in cotton , while suppression of GhMYB25-like produces fiber-less cotton . The GhCPC gene belongs to the 3R-MYB subfamily and is a negative regulator of cotton fiber elongation . A recent report has revealed that ten MYB (MIXTA-like) genes were highly expressed during early fiber development in Gossypium hirsutum (cultivar). However, examination of gene expression in three naked seed (fiber-less mutants) cotton mutants revealed that only one group of MIXTA-like genes had decreased expression levels .
Upland cotton is one of the most important fiber crops in the world and provides raw material for the textile industry . Transcriptome analyses showed that several pathways were regulated in developing cotton fibers . The status of those up-regulated or down-regulated pathways, and the molecular mechanisms by which they are controlled requires further investigation [32, 33]. As MYB proteins are one of the largest transcription factor families in higher plants, they may play key roles in regulating diverse pathways in cotton during fiber development . Comprehensive analysis of upland cotton MYB proteins and their evolutionary variations, through tetraploid cotton, might help to reveal critical molecular mechanisms of cotton development and growth. In addition, the recent release of the G. hirsutum genome sequence  provides a great tool to identify and characterize the entire MYB protein family in upland cotton.
Here, we conducted a comprehensive genome-wide analysis of the MYB transcription factor family in G. hirsutum. A total of 524 MYB transcription factor encoding genes were identified and subsequently subjected to systematic analyses including: phylogenetic tree analysis, chromosomal location determination, gene structure analysis, conserved motifs identification, transcriptome (RNA-seq) analysis, and qRT-PCR analysis of selected MYB genes. Our genome-wide analysis of the GhMYB gene family might contribute to future studies on the functional characterization of MYB proteins in G. hirsutum. These research findings will provide information fundamental to determining the molecular and regulatory mechanisms of MYB transcription factors in cotton.
Identification of MYB gene family in upland cotton
Upland cotton protein sequences were downloaded from the Cotton Genome Project (http://cgp.genomics.org.cn/page/species/download.jsp?category=hirsutum) for computational analysis. Corresponding protein sequences were downloaded from the Arabidopsis database (TAIR; http://www.Arabidopsis.org/), cacao, mays, Vitis vinifera, Populus trichocarpa and Gossypium raimondii sequences were downloaded from the plant transcription factor database (http://planttfdb.cbi.edu.cn/). A local BLASTP search was performed to identify candidate MYB members, using Arabidopsis, cacao, mays, Vitis vinifera, Populus trichocarpa and G. raimondii MYB protein sequences as the query. Hits with e-values of 1e- 10 were deemed to be members of the MYB family. Furthermore, to confirm the protein sequences derived from the selected cotton MYB, candidate genes were examined using the domain analysis programs of Pfam (Protein family: http://pfam.sanger.ac.uk/) and SMART (Simple Modular Architecture Research Tool: http://smart.embl-heidelberg.de/). All redundant sequences were manually discarded, resulting in 524 MYB protein sequences. Additional analysis was based on cluster W alignment results.
Mapping MYB genes on chromosomes
The chromosomal position of all GhMYB genes was determined through BLASTN searches against the G. hirsutum genome project (Cotton Genome Project of the Institute of Cotton Research of Chinese Academy of Agricultural Sciences). GhMYB genes were mapped on the chromosome using the Map Chart software. Two types of gene duplications were identified: tandem and segment duplication events. Gene duplications were identified provided that the length of the aligned sequence covered >80 % of the longer gene, and that basic on the similarity of the gene alignment regions was >80 % . In addition, to further estimate GhMYB genes duplication events, the synonymous (Ks) and non-synonymous (Ka) substitution rates of evolution were calculated using the DnaSP software (version 5.10) . Ka/Ks calculator was run on those GhMYB gene pairs to estimate their synonymous and non-synonymous rates of evolution. To estimate the evolutionary time of duplicated genes, Ks values were translated into duplication time in millions of years based on a rate of 1 substitution per synonymous site per year. The duplication events time (T) was calculated as T = Ks/2λ × 10−6 Mya (approximate value for clock-like rate λ = 1.5 × 10−8 years for cotton) .
The phylogenetic tree of MYB transcription factor genes was generated using multiple sequence alignments of upland cotton, Arabidopsis and cacao MYB protein sequences using Cluster W (http://www.ebi.ac.uk/Tools/msa/clustalw2/). Phylogenetic and molecular evolutionary analyses were performed using MEGA 6.0 software (http://www.megasoftware.net) with pairwise distance and the Neighbor-Joining (NJ) method. The tree was constructed with the following parameters: Substitution, Poisson Model; data subset to use, the p-distance, complete deletion; replication, bootstrap analysis with 1,000 replicates. Moreover, maximum likelihood and minimum evolution methods were also used in our phylogenetic tree to validate the result from the NJ method. Additionally, a separate phylogenetic tree was constructed with all the GhMYB protein sequences in G. hirsutum for further analysis.
Gene structure analysis and identified motifs
MYB genomic and cDNA sequences were obtained from the Cotton Genome Project of the Institute of Cotton Research of Chinese Academy of Agricultural Sciences (http://cgp.genomics.org.cn/page/species/download.jsp?Category=hirsutum). The Online Gene Structure Display Server (GSDS 2.0) (http://gsds.cbi.pku.edu.cn/index.php) was used to examine gene structure by comparing each cDNA sequence with the corresponding genomic sequence. Conserved protein motifs in G. hirsutum MYBs were identified using the MEME program (version 4.8.2) (http://meme.nbcr.net/meme/intro.html). The following parameters were used: any number of repetitions, the maximum number of motifs-20, and optimum width from 6 to 250.
Plant materials, RNA extraction and qRT-PCR analysis
Where ∆∆Ct = ∆Ct of the treated sample minus ∆Ct of the untreated control sample, and ∆Ct is the difference in threshold cycles for GhNAC18 target and the GhActin1 internal reference.
RNA-seq data analysis
To analyze upland cotton MYB expression patterns, we used Illumina RNA-seq data, including five stages of cotton fiber development (−1, 1, 3, 5, and 10 DPA) from wild-type (TM −1) upland cotton. Gene expression levels were calculated as reads per kilobase of exon model per million mapped reads (RPKM) units (Additional file 1: Table S1). Fold changes of different genes expression analysis and the related statistical computations of the two tested conditions were performed using the DESeq R package (1.10.1). The resulting P-values were adjusted using Benjamini’s and Hochberg’s method to control the false rate . Only genes with an adjusted P-value < 0.05 found using DESeq were categorized as differentially expressed. Heat maps were generated and hierarchical clustering was performed using genesis_v126.96.36.199.09.10-DIGERATI software.
Genome-wide identification of upland cotton GhMYB transcription factors
The different subfamilies of GhMYB transcription factor types distributed on upland cotton chromosomes
Transcription factor Types
Atypical MYB genes (4R-MYB)
Chromosomal distribution and annotation MYB genes
Tandem and segmental duplication events are the main causes of gene-family expansion in upland cotton. Based on the whole genome analysis of gene duplications, 73 duplicated GhMYB gene pairs were made by segmental and tandem duplication, including 40 duplication events within the At and Dt chromosomes as well as 33 duplication events between the At, Dt chromosomes and scaffold (Additional file 5: Table S4), indicating that segmental duplications and tandem duplications contributed to the expansion of GhMYB in upland cotton. At list, two or more GhMYB genes reside on the same chromosome or on different chromosomes. A tandem duplication event is when gene duplication happens within the same chromosome while segmental duplication is when duplicated genes are located in different chromosomes. In this study, clusters formed by GhMYBs in the upland cotton (AD) genome were identified to explain the mechanism behind the expansion of the GhMYB family in cotton. We found that 5 gene pairs duplicated tandemly into chromosomes (At_chr5, At_chr7, At_chr12, Dt_chr5 and Dt_chr7) and 68 gene pairs duplicated segmentally, which deeply contributed to the expansion of the GhMYB transcription factors in upland cotton. The results also indicated that, among the duplication events in the GhMYB transcription factor family, the gene pairs that appeared to be derived from segmental duplication events occurred earlier than those that arose from tandem duplication (Additional file 5: Table S4). A gene duplication event, occurring during the course of cotton evolution, has led to the creation of new gene functions . The origin of multigene families has been attributed to a region-specific gene duplication that occurred in upland cotton . Furthermore, to calculate the evolutionary time of these identified MYB in G. hirsutum, an estimation of their synonymous and non-synonymous substitution rates during evolution, Ks and Ka values were calculated using the DnaSp software. Nucleotide substitutions in protein-coding genes can be categorized as synonymous or non-synonymous substitutions as elaborated in (Additional file 5: Table S4). The Ka/Ks ratio is a measure used to examine the mechanisms of gene duplication evolution after divergence from their ancestors. A Ka/Ks value of 1 suggests neutral selection, a Ka/Ks value of <1 suggests purifying selection, and a Ka/Ks value of >1 suggests positive selection (Hurst 2002). Here, we estimated synonymous and non-synonymous substitutions ratios (Ka/Ks) for the 73 pairs of segmentally and tandemly duplicated genes. It was found that most of GhMYB genes had Ka/Ks values of less than 1, implying that GhMYB genes have evolved under the effect of purifying selection while 17 duplicated genes had the Ka/Ks ratio more than 1, implying that those had evolved under positive selection. We further used Ks to estimate the time of GhMYB genes duplication events during the evolutionary time of upland cotton genome. The tandem and segmental duplication events in upland cotton that occurred between 0.26 (Ks = 0.0078) and 124.42 mya (Ks = 3.7326), with an average of 46.494 mya (million years ago). The Ks of tandem duplications of GhMYB genes occurred from 2.76 (Ks = 0.0828) mya to 44.343 (Ks = 1.3303) mya, with average 26.0493 mya. The results suggest that the expansion of the GhMYB genes in upland cotton which originated from At and Dt genomes mostly arose from whole genome duplication events during their evolution.
Phylogenetic analysis of MYB transcription factors genes in upland cotton
To identify potential relationships between the various GhMYB gene family members, a Neighbor-Joining (NJ) phylogenetic tree was constructed. Examination of protein sequence similarity and phylogenetic tree analysis allowed us to divide the 524 upland cotton MYB genes into 16 subgroups, which ranged in size from 2 to 68 MYB genes (Additional file 6: Figure S2A). The bootstrap values for some subgroups of the NJ tree were low as a result of relatively large number of gene sequences that were also found in earlier study . Supporting the phylogenetic analysis of subgroups, most GhMYB protein domain repeats exhibited high similarities within subgroups. Hence, we strongly sought other evidence to check the reliability of our phylogenetic tree. The phylogenetic trees of MYB TFs were reconstructed with maximum likelihood and minimum evolution methods to validate the result from the NJ and pairwise distance method. The trees constructed by the three methods mentioned above, are almost the same with only minimal differences at some subgroups (subgroup 7 and 16), implying that the tree methods were mainly consistent with each other (Additional file 7: Figure S3).
MYB gene structure analysis and conserved motif identification
Gene structure analysis of 524 GhMYB transcription factors was performed. To provide greater insight into their intron/exon structure, cDNA and corresponding genomic sequences were compared. Approximately 90.84 % of upland cotton GhMYB genes contained between 1 and 12 introns. Similar to that described in Arabidopsis, Vitis vinifera, and Eucalyptus grandis [41, 42], most GhMYB transcription factors contained 1 (17.4 %) or 2 introns (52.7 %). The remaining 22.51 % of GhMYB genes contained more than two introns. However, only 8.2 % of GhMYB transcription factor genes contained no introns. Most of the intronless genes were clustered into the 13 and 21 subgroups. Furthermore, an uprooted phylogenetic tree was constructed using GhMYB protein sequences to assess the similarities in intron/exon structure within GhMYB gene subgroups (Additional file 6: Figure S2A). Within subgroups the majority of GhMYB genes contained similar exon/intron distribution arrangements, particularly related to exon length and intron number. Most of the 2R-MYB transcription factor genes had a conserved gene structure with three exons and two introns. Additionally, the size of the third exon was more variable than that of the first and second exon (Additional file 6: Figure S2B). High levels of variation in the sequence of the third exon are reported to be associated with functional divergence among R2R3-MYB genes . Whilst few R2R3-MYB genes contained no intron, they were clustered into 13 subgroups. It was noted that the duplication of 2R during the early development of MYB proteins containing two repeats gave rise to the 3R-MYB domains. Therefore, it was found that most of the 1R-MYB, 3R-MYB, and 4R-MYB genes were disrupted by more than four introns. This is consistent with previous reports which have suggested that most MYB-related genes in higher plants contained more than 2 introns . Within each subgroup, most of the GhMYB genes generally had similar intron/exon structures, as it was described previously by Jiang, Gu, and Peterson and Matus [42, 45]. There was a strong connection between the phylogeny tree analysis and the intron/exon structure of the GhMYB transcription factor family in upland cotton.
Further investigation of the variation within the conserved motifs of GhMYB proteins using the MEME program identified 20 conserved motifs, which we designated motifs 1 to 20. Most of the GhMYB proteins within the same subgroup showed similar motif compositions, while high variance was observed between the different subgroups. This is consistent with previous reports suggesting that MYB family members with similar protein arrangements were classified into the same subfamily . For example, all GhMYB proteins in subgroup1 possessed motifs 1, 2, 3, and 12 while all members in subgroup 15 contained motifs 2, 3, 4, 6, 7, and 15 (Additional file 8: Figure S4). In addition, some motifs were specific to a distinctive subgroup, indicative of a particular function of that subgroup. Though the functions of most of the conserved motifs remain to be identified, they are likely to play an important role in the transcriptional regulation of target genes, and may indicate further functional diversification in specific species. Our results suggest that these motifs are evolutionarily conserved and functionally important. This result was similar to Stracke and Dubos who suggested that if MYB genes from the same subgroup share similar protein motifs they probably share similar functions [8, 46].
Upland cotton MYB family relationships with other plant
In addition, to gain more insights on divergence of the MYB genes after polyploidization, the non-synonymous (Ka) and synonymous (Ks) nucleotide substitutions and their ratio (Ka/Ks) were analyzed for the homologous gene pairs between G. raimondii (D5) and upland cotton G. hirsutum (Dt). Of the 90 MYB gene pairs D5/Dt homologous, 45 were identical (Ka = Ks = 0 or Ka/Ks ratio = 0), 41 had a Ka/Ks less than 1, suggesting that MYB genes have evolved mainly under the effect of purifying selection. However, only 3 MYB genes had a Ka/Ks ratio more than 1, suggesting that these genes have been evolved by positive selection (Additional file 10: Table S6). This result implying that most of the ancestral MYB genes have been retained in upland cotton G. hirsutum after polyploidization.
Expression profiles of MYB genes in G. hirsutum
MYB gene expression was analyzed using RNA-seq data from different stages of cotton fiber development including −1, 1, 3, 5, and 10 DPA. It was noted that 431/524 (82.3 %) MYB genes were expressed in at least one stage of cotton fiber development and the expression 93/524 (17.7 %) MYB genes were not detected by RNA-seq (Additional file 1: Table S1). In addition, of the 431 GhMYB genes with detectable expression, 190 exhibited low expression levels during different stages of fiber development (Additional file 6: Figure S2C). According to the phylogenetic tree analysis, the expression of MYB transcription factors can also be divided into 16 subgroups. All genes within subgroups 2, 3 and 5 exhibited low or undetectable expression levels during the five early stages of cotton fiber development. In Arabidopsis, this subgroup has been shown to be involved in salt tolerance [47, 48]. GhMYBs showed elevated transcript levels in the five cotton fiber developmental periods, suggesting that they might be important for maintenance of metabolic processes and normal cotton fiber development. Most MYB genes in subgroup 1, 4 and subgroups 12 to 15 were highly expressed in the five analyzed stages of fiber development. Many MYB MIXTA-like transcription factors could be involved in the regulation of epidermal cell differentiation in different plant species, including specifying cell shape in petals, vegetative trichome initiation, and branching and seed fiber initiation [26, 27]. Recently, it has been reported that ten MYB (MIXTA-like) genes were highly expressed during early fiber development in G. hirsutum. In contrast, only one group of MIXTA-like genes had low expression levels in three natural fiber-less mutants . These mutants provide a means to analyze the roles of MYB transcription factors in the control fiber development in upland cotton. Interestingly, expression of the rice MYB91 was stimulated by other abiotic stresses and hormone treatment. Moreover, in Arabidopsis the MYB91 gene has been shown to integrate endogenous developmental signals with different environmental conditions . In addition, MYB88 normally maintains fate and developmental progression throughout the stomatal cell lineage . These results indicate that MYB genes can have multiple functions in plant growth and stress responses. Sixty six genes in subgroup 7, CotAD_42675 (MYB2 or GL1), CotAD_18666 (MYB109), CotAD_46807 (MYB109), CotAD_02818 and ect, were highly expressed during the initiation and elongation stages of fiber development, implying that these genes may be involved in a complex network of fiber development. In fact, MYB2 stimulates cotton fiber development  and MYB109 is specifically expressed in fiber initiation and elongation stages . Recently, it was found that MYB2 and MYB109 promote normal fiber development in cotton . In this study, 15 MYB-3R genes were identified and clustered into subgroups 10 and 13. The expression of three of the 15 MYB-3R genes could not be detected during cotton fiber development. Therefore, the fact that the MYB-3R family is easily identifiable and characterized could make them suitable targets for genetic engineering approaches aimed at improving cotton fiber development. Previous reports suggest that GhCPC, belonging to the MYB-3R subfamily, negatively controls cotton fiber development during the initiation and early elongation stages in mutant cotton [28, 51]. These results are consistent with previous studies which have suggested that MYBs played a crucial role in a wide variety of biological processes including: cell growth, cell cycle control, signal transduction, metabolic and physiological stability, and response to environmental stimuli [37, 52]. Therefore, the MYB family might provide a means to regulate cotton fiber development and offer a path to understanding cell fiber development during the initiation and elongation stages.
Expression verification of GhMYB genes involved in cotton fiber development
The MYB gene family is part of the biggest transcription factor family in higher plants and plays an important role in plant growth and development. We undertook a comprehensive genome-wide characterization and expression analysis of the MYB transcription factor family in cotton fiber development. A total of 524 MYB genes were identified and classified into four subfamilies. Based on phylogenetic tree analysis, these MYB transcription factors were classified into 16 subgroups. Proteins within the same subgroup contained very similar gene structures and protein motifs. Additionally, our results revealed that MYB genes were distributed across the entire upland cotton genome. Moreover, RNA-seq data showed that MYB genes play an important role in plants. The expression profiles of 20 genes during cotton fiber development, obtained by qRT-PCR, show that different MYB genes can positively or negatively regulate cotton fiber development. Additionally, other MYB genes are expressed in both mutant and wild-type fiber, further highlighting the diverse functions of MYB proteins in the development of the cotton fiber cell. This study provides strong evidence that GhMYB genes play a major role in cotton fiber development and provides a platform for the characterization of interesting MYB genes in the future.
The authors appreciate the support of the National Science and Technology Support Program of China (2013BAD01B03).
This study was supported by the National Science and Technology of China and China Scholarship Council (CSC) all contributed with funding towards the research.
Availability of materials and data
All related data are available within the manuscript and its additional files.
HS prepared the manuscript draft and contributed the experiment. SH, GS contributed to the design and performed the statistical analysis. WG assisted to draft the manuscript. JS and XD provided the materials and revised the manuscript. All authors are read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Riechmann JL, Heard J, Martin G, Reuber L, Jiang C, Keddie J, et al. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes. Science. 2000;290:2105–10.View ArticlePubMedGoogle Scholar
- Ptashne M. How eukaryotic transcriptional activators work. Nature. 1988;335:683–9.View ArticlePubMedGoogle Scholar
- Zimmermann IM, Heim MA, Weisshaar B, Uhrig JF. Comprehensive identification of Arabidopsis thaliana MYB transcription factors interacting with R/B-like BHLH proteins. Plant J. 2004;40:22–34.View ArticlePubMedGoogle Scholar
- Stracke R, Werber M, Weisshaar B. The R2R3-MYB gene family in Arabidopsis thaliana. Curr Opin Plant Biol. 2001;4:447–56.View ArticlePubMedGoogle Scholar
- Kanei-Ishii C, Sarai A, Sawazaki T, Nakagoshi H, He DN, Ogata K, et al. The tryptophan cluster: a hypothetical structure of the DNA-binding domain of the myb protooncogene product. J Biol Chem. 1990;265:19990–5.PubMedGoogle Scholar
- Saikumar P, Murali R, Reddy EP. Role of tryptophan repeats and flanking amino acids in Myb-DNA interactions. Proc Natl Acad Sci U S A. 1990;87:8452–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Martin C, Paz-Ares J. MYB transcription factors in plants. Trends Genet. 1997;13:67–73.View ArticlePubMedGoogle Scholar
- Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010;15:573–81.View ArticlePubMedGoogle Scholar
- Klempnauer K-H, Gonda TJ, Michael BJ. Nucleotide sequence of the retroviral leukemia gene v-myb and its cellular progenitor c-myb: The architecture of a transduced oncogene. Cell. 1982;31:453–63.View ArticlePubMedGoogle Scholar
- Lipsick JS. One billion years of Myb. Oncogene. 1996;13:223–35.PubMedGoogle Scholar
- Weston K. Myb proteins in life, death and differentiation. Curr Opin Genet Dev. 1998;8:76–81.View ArticlePubMedGoogle Scholar
- Paz-Ares J, Ghosal D, Wienand U, Peterson PA, Saedler H. The regulatory c1 locus of Zea mays encodes a protein with homology to myb proto-oncogene products and with structural similarities to transcriptional activators. EMBO J. 1987;6:3553–8.PubMedPubMed CentralGoogle Scholar
- Jin J, Zhang H, Kong L, Gao G, Luo J. PlantTFDB 3.0: A portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res. 2014;42:1182–7.View ArticleGoogle Scholar
- Allan AC, Hellens RP, Laing WA. MYB transcription factors that colour our fruit. Trends Plant Sci. 2008;13:99–102.View ArticlePubMedGoogle Scholar
- Geethalakshmi S, Barathkumar S, Prabu G. The MYB Transcription Factor Family Genes in Sugarcane (Saccharum sp.). Plant Mol Biol Report. 2014;512–31Google Scholar
- Pesch M, Hülskamp M. One, two, three…models for trichome patterning in Arabidopsis? Curr Opin Plant Biol. 2009;12:587–92.View ArticlePubMedGoogle Scholar
- Balkunde R, Pesch M, Hu M, Hülskamp M. Trichome patterning in Arabidopsis thaliana: from genetic to molecular models. Curr Top Dev Biol. 2010;91:299–321.View ArticlePubMedGoogle Scholar
- Katiyar A, Smita S, Lenka SK, Rajwanshi R, Chinnusamy V, Bansal KC. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis. BMC Genomics. 2012;13:544.View ArticlePubMedPubMed CentralGoogle Scholar
- Du H, Feng B-R, Yang S-S, Huang Y-B, Tang Y-X. The R2R3-MYB transcription factor gene family in maize. PLoS One. 2012;7:e37463.View ArticlePubMedPubMed CentralGoogle Scholar
- Li C, Lu S. Genome-wide characterization and comparative analysis of R2R3-MYB transcription factors shows the complexity of MYB-associated regulatory networks in Salvia miltiorrhiza. BMC Genomics. 2014;15:277.View ArticlePubMedPubMed CentralGoogle Scholar
- Du H, Yang S-S, Liang Z, Feng B-R, Liu L, Huang Y-B, et al. Genome-wide analysis of the MYB transcription factor superfamily in soybean. BMC Plant Biol. 2012;12:106.View ArticlePubMedPubMed CentralGoogle Scholar
- Cao Z-H, Zhang S-Z, Wang R-K, Zhang R-F, Hao Y-J. Genome wide analysis of the apple MYB transcription factor family allows the identification of MdoMYB121 gene confering abiotic stress tolerance in plants. PLoS One. 2013;8:e69955.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang Z, Tang J, Hu R, Wu P, Hou X-L, Song X-M, et al. Genome-wide analysis of the R2R3-MYB transcription factor genes in Chinese cabbage (Brassica rapa ssp. pekinensis) reveals their stress and hormone responsive patterns. BMC Genomics. 2015;16:17.View ArticlePubMedPubMed CentralGoogle Scholar
- Pu L, Li Q, Fan X, Yang W, Xue Y. The R2R3 MYB transcription factor GhMYB109 is required for cotton fiber development. Genetics. 2008;180:811–20.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang S, Wang J-W, Yu N, Li C-H, Luo B, Gou J-Y, et al. Control of plant trichome development by a cotton fiber MYB gene. Plant Cell. 2004;16:2323–34.View ArticlePubMedPubMed CentralGoogle Scholar
- Machado A, Wu Y, Yang Y, Llewellyn DJ, Dennis ES. The MYB transcription factor GhMYB25 regulates early fibre and trichome development. Plant J. 2009;59:52–62.View ArticlePubMedGoogle Scholar
- Walford S-A, Wu Y, Llewellyn DJ, Dennis ES. GhMYB25-like: a key factor in early cotton fibre development. Plant J. 2011;65:785–97.View ArticlePubMedGoogle Scholar
- Liu B, Zhu Y, Zhang T. The R3-MYB gene GhCPC negatively regulates cotton fiber elongation. PLoS One. 2015;1:1–17.Google Scholar
- Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol. 2015;33:531–7.View ArticlePubMedGoogle Scholar
- Lee JJ, Woodward AW, Chen ZJ. Gene expression changes and early events in cotton fibre development. Ann Bot. 2007;100:1391–401.View ArticlePubMedPubMed CentralGoogle Scholar
- Liu K, Sun J, Yao L, Yuan Y. Transcriptome analysis reveals critical genes and key pathways for early cotton fiber elongation in Ligon lintless-1 mutant. Genomics. 2012;100:42–50.View ArticlePubMedGoogle Scholar
- Gilbert MK, Kim HJ, Tang Y, Naoumkina M, Fang DD. Comparative transcriptome analysis of short fiber mutants ligon-lintless 1 and 2 reveals common mechanisms pertinent to fiber elongation in cotton (Gossypium hirsutum L.). PLoS One. 2014;9:23–5.Google Scholar
- Liang W, Fang L, Xiang D, Hu Y, Feng H, Chang L, et al. Transcriptome Analysis of Short Fiber Mutant Ligon lintless-1 (Li 1) Reveals Critical Genes and Key Pathways in Cotton Fiber Elongation and Leaf. Development. 2015;1:1–18.View ArticleGoogle Scholar
- Li F, Fan G, Lu C, Xiao G, Zou C, Kohel RJ, et al. Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat Biotechnol. 2015;33:524–30.View ArticlePubMedGoogle Scholar
- Yang S, Zhang X, Yue JX, Tian D, Chen JQ. Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol Genet Genomics. 2008;280:187–98.View ArticlePubMedGoogle Scholar
- Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2.View ArticlePubMedGoogle Scholar
- Wang K, Wang Z, Li F, Ye W, Wang J, Song G, et al. The draft genome of a diploid cotton Gossypium raimondii. Nat Genet. 2012. p. 1098–103.Google Scholar
- Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real- time quantitative PCR and the 2 Ϫ ⌬⌬ C T method. Gene Expr. 2001;408:402–8.Google Scholar
- Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc. 1995. p. 289–300.Google Scholar
- Rong J, Feltus FA, Liu L, Lin L, Paterson AH. Gene copy number evolution during tetraploid cotton radiation. Heredity. 2010;105:463–72.View ArticlePubMedGoogle Scholar
- Soler M, Camargo ELO, Carocha V, Cassan-Wang H, San Clemente H, Savelli B, et al. The Eucalyptus grandis R2R3-MYB transcription factor family: evidence for woody growth-related evolution and function. New Phytol. 2015;206:1364–77.View ArticlePubMedGoogle Scholar
- Matus JT, Aquea F, Arce-Johnson P. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes. BMC Plant Biol. 2008;8:83.View ArticlePubMedPubMed CentralGoogle Scholar
- Dias AP, Braun EL, McMullen MD, Grotewold E. Recently duplicated maize R2R3 Myb genes provide evidence for distinct mechanisms of evolutionary divergence after duplication. Plant Physiol. 2003;131:610–20.View ArticlePubMedPubMed CentralGoogle Scholar
- Du H, Wang YB, Xie Y, Liang Z, Jiang SJ, Zhang SS, et al. Genome-wide identification and evolutionary and expression analyses of MYB-related genes in land plants. DNA Res. 2013;20:437–48.View ArticlePubMedPubMed CentralGoogle Scholar
- Jiang C, Gu X, Peterson T. Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp indica. Genome Biol. 2004;5:R46.View ArticlePubMedPubMed CentralGoogle Scholar
- Feller A, MacHemer K, Braun EL, Grotewold E. Evolutionary and comparative analysis of MYB and bHLH plant transcription factors. Plant J. 2011;66:94–116.View ArticlePubMedGoogle Scholar
- Xu R, Wang Y, Zheng H, Lu W, Wu C, Huang J, et al. Salt-induced transcription factor MYB74 is regulated by the RNA-directed DNA methylation pathway in Arabidopsis. J Exp Bot. 2015;66:5997–6008.View ArticlePubMedPubMed CentralGoogle Scholar
- Lippold F, Sanchez DH, Musialak M, Schlereth A, Scheible W-R, Hincha DK, et al. AtMyb41 regulates transcriptional and metabolic responses to osmotic stress in Arabidopsis. Plant Physiol. 2009;149:1761–72.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhu N, Cheng S, Liu X, Du H, Dai M, Zhou DX, et al. The R2R3-type MYB gene OsMYB91 has a function in coordinating plant growth and salt stress tolerance in rice. Plant Sci. 2015;236:146–56.View ArticlePubMedGoogle Scholar
- Lee E, Liu X, Eglit Y, Sack F. FOUR LIPS and MYB88 conditionally restrict the G1/S transition during stomatal formation. J Exp Bot. 2013;64:5207–19.View ArticlePubMedPubMed CentralGoogle Scholar
- Taliercio EW, Boykin D. Analysis of gene expression in cotton fiber initials. BMC Plant Biol. 2007;7:22.View ArticlePubMedPubMed CentralGoogle Scholar
- Ma J, Wang Q, Sun R, Xie F, Jones DC, Zhang B. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii. Sci Rep. 2014;4:1–10.Google Scholar
- Ambawat S, Sharma P, Yadav NR, Yadav RC. MYB transcription factor genes as regulators for plant responses: An overview. Physiol Mol Biol Plants. 2013;19:307–21.View ArticlePubMedPubMed CentralGoogle Scholar
- Tombuloglu H, Kekec G, Sakcali MS, Unver T. Transcriptome-wide identification of R2R3-MYB transcription factors in barley with their boron responsive expression analysis. Mol Genet Genomics. 2013;288:141–55.View ArticlePubMedGoogle Scholar
- Thyssen GN, Fang DD, Turley RB, Florane C, Li P, Naoumkina M. Next generation genetic mapping of the Ligon-lintless-2 (Li2) locus in upland cotton (Gossypium hirsutum L.). TAG. Theoretical and applied genetics. Theor Appl Genet. 2014;127:2183–92.View ArticlePubMedGoogle Scholar
- Salih H, Leng X, He S-P, Jia Y, Gong W, Du X-M. Characterization of the early fiber development gene, Ligon-lintless 1 (Li1), using microarray. Plant Gene. 2016;6:59–66.View ArticleGoogle Scholar