Sparse feature selection identifies H2A.Z as a novel, pattern-specific biomarker for asymmetrically self-renewing distributed stem cells
There is a long-standing unmet clinical need for biomarkers with high specificity for distributed stem cells (DSCs) in tissues, or for use in diagnostic and therapeutic cell preparations (e.g., bone marrow). Although DSCs are essential for tissue maintenance and repair, accurate determination of their numbers for medical applications has been problematic. Previous searches for biomarkers expressed specifically in DSCs were hampered by difficulty obtaining pure DSCs and by the challenges in mining complex molecular expression data. To identify such useful and specific DSC biomarkers, we combined a novel sparse feature selection method with combinatorial molecular expression data focused on asymmetric self-renewal, a conspicuous property of DSCs. The analysis identified reduced expression of the histone H2A variant H2A.Z as a superior molecular discriminator for DSC asymmetric self-renewal. Subsequent molecular expression studies showed H2A.Z to be a novel “pattern-specific biomarker” for asymmetrically self-renewing cells, with sufficient specificity to count asymmetrically self-renewing DSCs in vitro and potentially in situ.
Despite the important role of distributed stem cells (DSCs) in homeostatic tissue cell renewal and tissue repair, and their use in currently available cell replacement therapies (e.g., hematopoietic stem cell transplantation), there are no effective means for quantifying their number. This unmet research and clinical need is due to the lack of biomarkers with sufficient specificity for distinguishing DSCs from other tissue cell types. Previous searches for these elusive biomarkers relied on global gene expression profiles for DSCs that were based on comparisons of genes expressed in embryonic stem cells (ESCs) and genes expressed in cell populations enriched for adult stem cells (ASCs) ( Ivanova et al., 2002, Ramalho-Santos et al., 2002 and Fortunel et al., 2003), a sub-category of DSCs ( Sherley, 2008 and Noh et al., 2011). These ASC-enriched populations also contained a significant fraction of non-stem committed progenitors and differentiating progeny cells that limited their utility for identifying genes whose expression was unique to DSCs, i.e., “stemness genes” ( Ivanova et al., 2002, Ramalho-Santos et al., 2002, Fortunel et al., 2003 and Burns and Zon, 2002). In addition, gene expression profiles based on specific expression in both ESCs and DSC-enriched populations necessarily exclude genes whose expression is specific to either of these distinctive stem cell types. One essential distinction is that ESCs do not undergo asymmetric self-renewal, which is a universal defining property of DSCs ( Noh et al., 2011, Loeffler and Potten, 1997,Sherley, 2002 and Sherley, 2005).
We applied a different strategy to identify DSC-specific genes based on targeting deterministic asymmetric self-renewal, which is one of the two forms of asymmetric self-renewal, the other being stochastic asymmetric self-renewal (see Discussion later). Mammalian DSCs self-renew asymmetrically to replenish cells in tissues that undergo cell turnover but maintain a constant cell mass (Loeffler and Potten, 1997,Sherley, 2002 and Sherley, 2005). Each deterministic asymmetric self-renewal division by DSCs yields a new DSC and a non-stem cell sister. The non-stem cell sister is a committed progenitor of differentiated cells responsible for mature tissue functions (Loeffler and Potten, 1997, Sherley, 2002 and Sherley, 2005). Because asymmetric self-renewal is unique to DSCs, some genes whose expression is highly associated with asymmetric self-renewal may also identify DSCs (Noh et al., 2011).
Here, we describe the application of this strategy using genetically engineered cell lines that adopt deterministic asymmetric self-renewal conditionally. Restoration of normal wild-type p53 protein expression induces these lines to undergo asymmetric self-renewal in a similar way to DSCs (Noh et al., 2011, Sherley et al., 1995a, Liu et al., 1998a, Rambhatla et al., 2001 and Rambhatla et al., 2005). When p53 expression is reduced, the cells switch to symmetric self-renewal. In vivo, symmetric self-renewal by DSCs is regulated to increase tissue mass during adult maturation growth and to repair injured tissues (Loeffler and Potten, 1997, Sherley, 2002 and Sherley, 2005). When controls that constrain DSCs to asymmetric self-renewal are disrupted (e.g., by p53 mutations), the risk of proliferative disorders like cancer increases ( Sherley, 2002, Sherley et al., 1995a and Rambhatla et al., 2005). In vitro, the experimental capture of this functional feature of DSCs served as a surrogate for natural occurring DSC properties that are otherwise inaccessible. The novelty in this work is manifold but derives primarily from the combination these experimental models with a highly effective but unconventional gene microarray bioinformatics approach to identify a small number of genes predicted to have a high degree of biomarker specificity for DSCs.
Materials and methods
Analysis of cell division kinetics and quality control of cell cultures
All studies involving mice were conducted at the Boston Biomedical Research Institute (BBRI), the previous host institution for the Adult Stem Cell Technology Center (renamed Asymmetrex, LLC). All procedures were approved by the BBRI Institutional Animal Care and Use Committee. Euthanasia was conducted according to the guidelines of the U.S. National Institutes of Health Office of Laboratory Animal Welfare.
During asymmetric self-renewal in culture, cells give rise to two different progeny with each cell division, dividing and non-dividing. PDCasym was calculated by PDCasym = (Nt − N0)/(Fd × N0) where N0 and Nt are the number of cells present in culture at time 0 h and time t, respectively (Merok et al., 2002, Sherley et al., 1995a and Sherley et al., 1995b). Fd is the fraction of new sister cells that divide in a population. The PDC value of symmetrically dividing cells (PDCsym) was calculated with the equation, PDCsym = ln(Nt / N0) / ln2, derived from the exponential cell growth equation, Nt = N0ekt (Merok et al., 2002 and Sherley et al., 1995b). To assess whether the cell division kinetics were close to ideal asymmetric self-renewal for transcription profiling, the PDCsym/PDCasym ratio was calculated at the time when cells were harvested (see Supplementary Information). The generation time (GT) for each asymmetric cell growth model was determined from time-lapse microscopy observations, or from the average doubling time of the symmetrically dividing counterpart (Rambhatla et al., 2001 and Sherley et al., 1995b). We used 0.5 for Fd to calculate the cell kinetics parameters for all experiments and generate the ideal ratio curves for asymmetric self-renewal (Supplementary Fig. S1; Supplementary Table S1).
The Ind-8, tC-2, and 1h-3 cells for asymmetric self-renewal and Con-3, tI-3, and 1g-1 cells for respective control symmetric self-renewal were maintained as described (Sherley et al., 1995a, Liu et al., 1998a,Rambhatla et al., 2001, Rambhatla et al., 2005, Merok et al., 2002 and Liu et al., 1998b). For the analyses of the self-renewal kinetics of Ind-8 and Con-3 cells, cells were grown over a 3-day period of until about 50% confluency, trypsinized, and replated in Zn-free medium (DMEM, 10% dialyzed fetal bovine serum [DFBS], 5 μg/ml puromycin) at a cell:plating area:medium volume of 100,000:75 cm2:20 ml. This ratio was held constant for all experiments, unless specified otherwise. Sixteen to 24 h later, the culture medium was replaced with the same volume of Zn-free medium, or medium supplemented to the specified concentration of ZnCl2. This time was designated as 0-hour in the analyses. For the 1h-3 and 1g-1 cell growths, cells were grown to 50% confluency at 37 °C and replenished with fresh growth medium (DMEM, 10% DFBS, 1 mg/ml G418 sulfate). Sixteen hours later, the cells were trypsinized and replated at either 37 °C or 32.5 °C. This time was 0-hour in the analyses. In the case of the growth analyses of tI-3 and tC-2 cells, cells were grown and re-plated in the same way as for Ind-8 and Con-3 cells except for the medium (DMEM, 10% DFBS, 1 mg/ml G418 sulfate plus 5 μg/ml puromycin). Cells were harvested by trypsin treatment and counted with a Model ZM Coulter Counter to confirm the induction of asymmetric self-renewal in culture by PDC ratio analysis.
Human hSATs (strain 18Ubic) and HFSCs (strain 3C5) were cultured and maintained as previously described (Huh et al., 2011 and Homma et al., 2012). To suppress the asymmetric self-renewal and H2A.Z asymmetry of hSATs, their culture medium was supplemented with 1.5 mM xanthine as described for HFSCs (Huh et al., 2011). Mouse p53 KO MEFs were kindly provided by L. Donehower. Mouse p53 wt/cdkn1A KO fibroblasts were kindly provided by T. Jacks. They were subsequently immortalized on a standard 3T3 cell schedule (Todaro and Green, 1963).
cDNA micro-array data analysis
Cells for total RNA extraction were harvested at the time estimated for PDC = 2 (36 h for Ind-8 and Con-3, 48 h for tC-2, tI-3, 1h-3, and 1g-1) and were selected for microarray experiments based on the actual PDC ratio curves derived from replicate cell cultures grown in parallel. The same PDC ratio values were maintained to obtain similar fractions of cycling stem-like cells and non-cycling differentiating cells for all self-renewal pattern comparison models. Total RNA was extracted with the Trizol reagent (Invitrogen, Carlsbad, CA) and impurities were removed with the Qiagen RNeasy kit (Qiagen, Valencia, CA). Fifty to seventy micrograms of total RNA was used for cDNA syntheses. Arabidopsis thaliana mRNAs (Stratagene, La Jolla, CA) were introduced as internal probe standards into reverse transcription reactions to normalize data between different arrays. Cy3- or Cy5-fluorescently labeled cDNAs were hybridized onto the National Institute for Aging 15K mouse cDNA prefabricated arrays ( Tanaka et al., 2000), supplied by the Massachusetts Institute of Technology (MIT)-BioMicro Center, using the procedure provided by the MIT-BioMicro Center. Hybridized microarrays were scanned with the arrayWoRxe™ Biochip Reader (Applied Precision LLC, Northwest Issaquah, WA). The fluorescence intensity of each spot was analyzed from the scanned tiff images by using the DigitalGenome™ software (MolecularWare, Inc. Cambridge, MA). The Cy3 and Cy5 fluorescence intensities were normalized by calculating the normalization factor from total intensity normalization ( Quackenbush, 2001). Analyses for each self-renewal pattern comparison were performed as duplicate independent experiments. For each comparison, we performed two chip hybridizations with reciprocally labeled Cy3 or Cy5 target cDNAs to each biological sample. The entire analysis incorporated data from 16 independent chips, which comprised two dye-swap technical replicate arrays for each of the four asymmetric–symmetric comparisons.
A gene was selected for data analyses only if the mean value of foreground pixels of the spot was greater than the sum of the mean and two standard deviations of the background pixels. For individual gene probe spots, the expression intensities of Cy5 and Cy3 channels were estimated by subtracting mean backgrounds from mean foregrounds. The ratios of the final gene expression intensities for the asymmetrically self–renewing states to the respective symmetrically self-renewing states were calculated. These ratio values were used for sparse feature selection. The ratio data were deposited for public access in National Center for Biotechnology Information Gene Expression Omnibus Database the under accession number GSE40183.
Sparse feature selection
The EM algorithm was applied to the cDNA array data provided. The data were aggregated so that all asymmetric cell division array data were given a dependent variable class label of − 1 and all symmetric cell division array data were given a class label of + 1. The different culture treatments used to promote symmetric or asymmetric division were not modeled separately in the computational experiments. All symmetrically self-renewing cells were assigned to the symmetric class, and all those self-renewing asymmetrically were assigned to the asymmetric class, regardless of how the symmetric was controlled experimentally. This was to avoid artifacts caused by the different methods of inducing symmetry or asymmetry of division.
The cDNA micro-array dataset (GEO Accession number GSE40183) was screened to remove missing or zero expression values. We subsequently removed genes whose expression across replicates was less than the mean expression of the entire array dataset plus two standard deviations of the expression of the entire array data. This filter removed genes whose expression was not significantly different than the array background noise fluctuation at the 95% confidence limit. This processing resulted in 1648 genes available for EM algorithm analysis (see Supplementary Information for mathematical details of the method). The selection of genes was found to be quite robust, with very similar subsets of genes being selected for varied filtering models with varying degrees of imposed sparsity. After the filters were applied, the EM algorithm reduced the pool of candidate genes to 4–7 genes at the higher levels of sparsity control applied. These genes were able to classify the self-renewal division pattern with very high efficacy, with r2 values exceeding 0.99. Most of the selected genes made negative contributions to the model, implying that they would be down-regulated in cells self-renewing asymmetrically compared to those self-renewing symmetrically. Conversely, genes making positive contributions to the model would be up-regulated in cells self-renewing asymmetrically compared to those self-renewing symmetrically. The signs and contributions of selected genes to the models were also consistent across multiple calculations using varied filtering criteria, suggesting our novel gene selection algorithm was quite robust and reproducible.
Phosphorylated histone H3–H2A.Z asymmetry detection
For mitotic cell analysis of H2A.Z with phosphorylated histone H3 (pH3; phospho-S10), cells grown on glass chamber slides were washed with ice cold PBS and immediately fixed for 15 min at room temperature in PBS containing 3.7% formaldehyde. Thereafter, dual indirect in situ immunofluorescence (ISIF) analyses were performed for H2A.Z and pH3 as described for mitotic cell analysis of H2A.Z and α-tubulin (Huh and Sherley, 2011). In brief, after blocking, cells were incubated overnight at 4 °C in a humidified chamber with an a rabbit anti-H2A.Z polyclonal antibody (Cell Signaling Technology, Inc., Danvers, MA) diluted 1:200 in PBS containing 2% goat serum. After five rinses in PBS containing 0.5% BSA, the cells were incubated for 1 h at room temperature with Alexa Fluor® 488-conjugated goat anti-rabbit IgG antibodies (Invitrogen, Inc., Carlsbad, CA), diluted 1:300 in the blocking solution. Next, the cells were washed five times with PBS containing 0.5% BSA. Cells stained for H2A.Z detection were subsequently incubated in the same manner with a mouse anti-pH3 monoclonal antibody (Abcam, Inc., Cambridge, MA) at a 1:1000 dilution followed by Alexa Fluor® 568-conjugated goat anti-mouse IgG antibodies (Invitrogen, Inc., Carlsbad, CA) diluted 1:500. After washing, the cells were mounted with 4′-6-diamido-2-phenylindole (DAPI)-containing VectaShield® mounting media (Vector Laboratories, Inc., Burlingame, CA). Epifluorescence images were captured with a Leica DMR microscope and Leica DC300F digital camera system. Pair-wise control analyses that omitted anti-H2A.Z and anti-pH3 antibodies separately and together were evaluated to ensure that all detected fluorescence required the specific antibodies.
For tissue analyses, sections (10 μm) of paraffin-embedded skin from adult FVB mice (Taconic Farms, Inc., Hudson, NY; Institutional Animal Care and Use Committee-approved procedures) that included hair follicles were cut on a Leica RM2255 microtome (Leica Microsystems, Inc., Bannockburn, IL) and picked up on Superfrost/Plus glass slides (Fisher Scientific, Inc., Pittsburgh, PA). For ISIF, sections were deparaffinized in xylene and re-hydrated with a descending series of ethanol solutions (100%, 95%, 90%, 80%, 70%). Sections were then washed three times with PBS and incubated with phosphate buffered sodium citrate solution (0.01 M, pH 7.4) for 20 min in a food steamer (Oster) to expose epitopes blocked during fixation and embedding. After washing in PBS, the sections were permeabilized with 0.2% Triton X-100 (v/v in PBS) at room temperature for 10 min and afterwards washed once with PBS. Thereafter, sections were blocked with 10% goat serum for 1 h at room temperature and dual ISIF analyses were performed for H2A.Z and pH3 as described above for cultured cells.
An orthogonal-intersection strategy for identification of asymmetric self-renewal associated genes
Previously, we used non-tumorigenic, immortalized mouse mammary epithelial C127 cells and mouse embryo fibroblasts (MEFs) to derive lines with conditional self-renewal patterns. The self-renewal pattern of these cells can be reversibly switched between symmetric and asymmetric by varying either culture temperature or Zn concentration, respectively, as a consequence of controlling p53 expression from respectively responsive promoters (Noh et al., 2011, Sherley et al., 1995a, Liu et al., 1998a, Rambhatla et al., 2001, Rambhatla et al., 2005 and Liu et al., 1998b). These and related cell lines were used to design a 2 × 4 orthogonal-intersection microarray analysis for the purpose of identifying genes whose expression consistently showed the same pattern of change between asymmetric self-renewal versus symmetric self-renewal (See Fig. 1). Four different pair-wise comparisons were developed in which a state of asymmetric self-renewal was compared to a congruent state of symmetric self-renewal. The first 3 comparisons were based on a difference in p53 expression, but each had a different biological context. For asymmetric versussymmetric, respectively, these comparisons were:
Zn-responsive p53-inducible MEFs (Ind-8 cells) versus p53-null control MEFs (Con-3 cells), both cultured in Zn-supplemented medium;
Low temperature (32.5 °C)-dependent p53-inducible C127 cells (1h-3 cells) versus vector-control C127 cells (i.e., no p53 transgene; 1g-1 cells), both cultured at 32.5 °C ( Sherley et al., 1995a,Rambhatla et al., 2005 and Liu et al., 1998b).
The fourth comparison had a special purpose. It provided a comparison of asymmetric versus symmetric self-renewal that was not due to a difference in p53 expression ( Noh et al., 2011). Two previously described derivatives of the Zn-responsive p53-inducible MEFs were used to make this comparison. One line (tI-3 cells) is stably transfected with a constitutively expressed type II inosine monophosphate dehydrogenase (IMPDH II) mini-gene ( Noh et al., 2011, Liu et al., 1998a and Rambhatla et al., 2005). IMPDH II is the rate-limiting enzyme for guanine ribonucleotide biosynthesis. Its down-regulation by p53 is required for asymmetric self-renewal ( Liu et al., 1998a and Rambhatla et al., 2005). Therefore, even in Zn-supplemented medium, which induces normal p53 expression, cells derived with a stably expressed IMPDH II transgene continue to undergo symmetric self-renewal ( Liu et al., 1998a and Rambhatla et al., 2005). This abrogation of p53 effects on self-renewal pattern occurs even though other p53-dependent responses remain intact ( Liu et al., 1998a). Under the same conditions, control vector-only transfectants (tC-2 cells) continue to exhibit asymmetric self-renewal ( Liu et al., 1998a and Rambhatla et al., 2005). Thus, this fourth comparison could be used to exclude genes whose change in expression was primarily due to changes in p53 expression and not specifically transitions in self-renewal pattern as well ( Noh et al., 2011).
We identified genes whose expression varied in the same manner for all 4 orthogonal comparisons of cells in states of asymmetric self-renewal versus symmetric self-renewal. In order for a gene to be considered, its average expression ratio – based on the two dye-swapped technical replicates – had to lie consistently among the top 5% of all average gene expression ratios determined (i.e., up-regulated) or among the bottom 5% of all average gene expression ratios (i.e., down-regulated). Moreover, this requirement had to be met for all 4 compared biological contexts.
We developed a new quality control metric called the population division cycle ratio (PDC ratio) to ensure consistent degrees of asymmetric self-renewal and symmetric self-renewal across all 4 orthogonal comparisons (Merok et al., 2002) (See Supplementary Information; Supplementary Fig. S1; Supplementary Table S1). Prefabricated cDNA micro-arrays constructed with the National Institute for Aging mouse 15K mouse clone set were used for the analysis (Tanaka et al., 2000 and Kargul et al., 2001). As detailed in Materials and methods, for each of the 4 experimental comparisons, we isolated two independent samples of RNA; and each of the RNA samples was labeled independently with Cy5 and Cy3 fluorescent dyes. Each independent set of fluorescent RNA samples was used to develop two reciprocal co-hybridizations for each experimental comparison. Thus, data from 4 microarrays, representing 2 independent experiments, were available for each of the 4 orthogonal contexts for asymmetric versussymmetric self-renewal.
A sparse feature approach to identification of an asymmetric self-renewal biomarker
Conventional analysis of the microarray data identified 21 genes that were consistently up-regulated in all 4 contexts of deterministic asymmetric self-renewal and 31 genes that were consistently down-regulated based on comparisons to symmetric self-renewal (see e.g. Supplementary Table S2) ( Noh, 2006). Even a set of genes of this limited number poses a significant challenge to traditional methods for identifying DSC biomarkers. Therefore, we re-evaluated the entire microarray dataset (GEO Accession number GSE40183) using the expectation maximization (EM) algorithm method, a previously described sparse feature approach ( Burden and Winkler, 2009a and Burden and Winkler, 2009b). This very sparse feature selection method is ideally suited to select small sets of relevant genes from very large numbers of candidates, in a context-dependent manner.
Figueiredo reported an EM algorithm using a sparse (non-informative) prior that provided parameter-free adaptive sparseness methodology (Figueiredo, 2003). This algorithm iteratively and progressively sets irrelevant parameters exactly to zero. Figueiredo’s approach does not involve any (hyper)parameters to be adjusted or estimated. This is achieved by the adoption of a Jeffreys’ non-informative hyperprior. Figueiredo showed that this approach yields state-of-the-art performance that outperforms support vector machines, and performs competitively with the best alternative techniques. The EM method is set out very clearly in Figueiredo’s paper (Figueiredo, 2003) and ours (Burden and Winkler, 2009b). (See Material and methods and Supplementary Information [Fig. S2] for details of the method). In order to control manually the sparsity of the model, we added additional hyperparameters χ and ζ (Burden and Winkler, 2009b).
The analysis identified a very small set of genes whose expression patterns were significantly related to changes in cell state between asymmetric self-renewal and symmetric self-renewal (Table 1). One gene that was down-regulated during asymmetric self-renewal with respect to symmetric self-renewal had clearly superior significance as a discriminator of the two self-renewal states. This gene encoded H2A.Z (also known as H2afz), a variant of the nucleosomal core histone H2A.
cDNA probe ID Gene name Contribution factor ± SEa pb H3009F03 H2A.Z − 18.4 ± 1.1 7.0 × 10− 6 H3056B11 None to date − 15.8 ± 4.2 < 0.007 H3132G04 DNAjb11 11 ± 3.2 < 0.01 H3085E10 TMEM39B 12.1 ± 4.5 0.02 H3114D11 BTG1 8.7 ± 3.7 0.03
- The mean contribution, in descending order of statistical significance, that a gene’s expression change makes to the underlying model discriminating symmetric versus asymmetric self-renewal induced by any change in culture conditions. Positive values indicate up-regulation when cells shift to asymmetric self-renewal; and negative values indicate down-regulation when cells shift to asymmetric self-renewal. SE, the standard error.
- Student’s t-test confidence level for the gene’s expression contribution to the discrimination model.
H2A.Z DSC biomarker properties
A priori, H2A.Z seems an impractical candidate as a DSC biomarker, because in the engineered cell lines, both H2A.Z mRNA ( Noh, 2006) and protein (Supplementary Fig. S3) are down-regulated during asymmetric self-renewal. H2A.Z protein is also reduced when natural DSCs undergo asymmetric self-renewal (3C5 and 5B8 mouse hair follicle stem cells [HFSCs]; Supplementary Fig. S3). However, H2A.Z expression is not only reduced during asymmetric self-renewal, but the protein also takes on a unique molecular character. During asymmetric self-renewal, DSCs adopt a non-random pattern of chromosome segregation. With each asymmetric division, the DSC retains the set of sister chromatids with the older template DNA strands, which are called immortal DNA strands ( Sherley, 2008 and Cairns, 1975). Although H2A.Z is readily detected on immortal chromosomes by specific antibodies, on the corresponding mortal chromosomes it is masked from detection ( Huh and Sherley, 2011). This unique molecular character, “H2A.Z asymmetry,” represents a potentially highly specific biomarker for DSCs, because non-random chromosome segregation has not been described in any other cell type to date, including committed progenitors, whose detection compromises the specificity of previously reported DSC indicators.
We adapted detection of H2A.Z asymmetry to cultures of expanded DSCs, cultures of pre-crisis mouse embryo fibroblasts (pc-MEFs), and tissue sections containing adult mouse hair follicles. We paired the mitosis-specific biomarker phosphorylated histone H3 (pH3) with H2A.Z to develop a highly sensitive and specific assay for detecting rare mitotic tissue cells with H2A.Z asymmetry. Fig. 2 shows examples of the images obtained in evaluations of cultured cells. Ninety-three percent of prophase, symmetrically self-renewing, engineered cells that were pH3-positive had H2A.Z immunofluorescence that was symmetrically distributed with respect to pH3 immunofluorescence (e.g., see Fig. 2, hSAT SYM as an example; Table 2). In contrast, 32% and 20% of pH3-positive prophase cells in cultures of engineered cells and expanded mouse HFSCs ( Huh et al., 2011), respectively, under conditions that promote asymmetric self-renewal, had a smaller, asymmetric distribution of H2A.Z immunofluorescence ( Fig. 2; Table 2).
Cell type Promoted self-renewal patterna
pb (Basis) ASYM (Basis) SYM p53-MEFsc (p53 wt) 32 (p53 null) 7 < 0.024 pc-MEFsd (p53 wt) 7 (p53 KO) < 1 < 0.009 immMEFse N/af N/a (Immort) < 1 n/a HFSCs (− Xng) 20 Ndh nd n/a hSATs (− Xn) 12 (+ Xn) 4 < 0.004
- Listed are the observed percentages of pH3-positive prophase nuclei showing H2A.Z asymmetry for the indicated cell types under conditions that promote either asymmetric self-renewal and non-random chromosome segregation (ASYM) or symmetric self-renewal and random segregation (SYM).
- Statistical confidence, determined by the two-tailed Fisher’s exact test, for the differences in percentage between states promoting asymmetric self-renewalversus symmetric self-renewal.
- Wild-type p53-inducible MEFs and p53-null MEFs grown in ZnCl2-supplemented medium, corresponding to micro-array Comparison 1 (see also Fig. 1).
- Pre-crisis mouse embryonic fibroblasts from mice with a wild-type p53 genotype were compared to analogous cells derived from mice with a p53 gene knockout (KO) after 17 and 12 population doublings in culture, respectively. (The p53 wt cells had a transgenic KO of the cdkn1A gene.)
- The same strain of p53 wt/cdkn1A KO MEFs was evaluated after spontaneous immortalization and culture for a total of 50 population doublings.
- N/a = not applicable.
- Xn = xanthine.
- Nd = not done.
Early passage pre-crisis cultures are closer in cell make-up to primary dissociated tissue cell preparations than are immortalized cell lines. In analyses with cultures of early passage pc-MEFs (17 population doublings), 7% of prophase cells displayed H2A.Z asymmetry (Table 2; Fig. 3; also Fig. 2, pc-MEF). In contrast, in later immortalized cultures (50 population doublings), none of 110 examined prophase cells showed H2A.Z asymmetry (< 0.9%). Similarly, none of 123 examined prophases in early passage cultures from p53 gene knockout mice showed H2A.Z asymmetry (12 population doublings; < 0.8%). This significant difference in detection (p < 0.009) is consistent with our proposal that immortalization, which is highly associated with p53 gene mutation, manifests a permanent conversion of long-lived DSCs from asymmetric self-renewal to symmetric self-renewal, with loss of non-random segregation (Rambhatla et al., 2001, Rambhatla et al., 2005 and Merok et al., 2002).
We used the pH3–H2A.Z asymmetry assay to provide evidence for non-random segregation in normal human DSCs. Earlier human studies evaluated cancer stem cells (Pine et al., 2010, Hari et al., 2011 and Xin et al., 2012). Pre-senescent cultures containing human skeletal muscle satellite stem cell cultures were evaluated (Fig. 2, hSAT) (Homma et al., 2012). Non-random segregation by mouse skeletal muscle satellite stem cells was reported earlier (Shinin et al., 2006 and Conboy et al., 2007). When grown under routine culture conditions, 12% of mitotic cells in hSAT cultures exhibited H2A.Z asymmetry (Fig. 2, hSAT ASYM; Table 2). Asymmetry was detected throughout mitosis. Consistent with indicating non-random segregation, the frequency of H2A.Z asymmetric mitotic cells in hSAT cultures was reduced to 4% by supplementation with 1.5 mM xanthine (p < 0.004; Fig. 2, hSAT, SYM; Table 2), as occurs for mouse HFSCs (Huh et al., 2011).
Recently, we showed that Lgr5-expressing DSCs expanded from mouse hair follicles undergo non-random segregation with H2A.Z asymmetry (Huh et al., 2011 and Huh and Sherley, 2011) (also Fig. 2, HFSC). To evaluate whether cells with H2A.Z asymmetry were also found in tissues, we applied the pH3–H2A.Z asymmetry assay to evaluate 10-micron thick sections of adult mouse skin that contained hair follicles. Cells positive for pH3 were rare in these sections, as expected for mitotic cells. However, we found that 22% of the identified pH3-positive cells (i.e., 14 of 64 detected in 183 sectioned hair follicles) showed H2A.Z asymmetry as shown in the examples given in Fig. 4 (p < 0.0001; two-tailed Fisher’s exact test with respect to no detection). The mitotic cells with asymmetric H2A.Z were consistently found at the base of the hair shaft in the secondary germ region, the location in which Lgr5-expressing HFSCs have also been shown to reside ( Jaks et al., 2008), whereas mitotic cells with symmetric H2A.Z localization were found both in hair follicles and epidermal regions.
Our results show the potential of H2A.Z to provide a highly specific biomarker for identifying cells in tissues that are undergoing two tightly linked processes, deterministic asymmetric self-renewal and non-random segregation (Rambhatla et al., 2005 and Huh et al., 2011), which specify DSCs. The key evidence for this conclusion is based on the cell culture data presented with expanded HFSCs. We have shown previously that these expanded cell strains exhibit many of the unique properties of DSCs (Huh et al., 2011 and Huh and Sherley, 2011). These include long-term symmetric self-renewal; asymmetric self-renewal with production of multi-lineage differentiated cells based on loss of Lgr5 expression and expression of K5, K10, or filaggrin; expression of Lgr5, which is asymmetrically limited to the stem cell sister during asymmetric self-renewal divisions; and non-random chromosome segregation (Huh et al., 2011 and Huh and Sherley, 2011). In addition, we show several additional examples of H2A.Z asymmetry detection in murine and human cell types, both in cell culture and in tissue sections, which are consistent with the specific detection of asymmetrically self-renewing DSCs.
Supporting a general functional role for H2A.Z in DSC function in many human tissues, we found that in human cancer cell lines of diverse tissue origin higher expression of H2A.Z mRNA was significantly correlated with higher expression of IMPDH II mRNA (Supplementary Fig. S4). Up-regulation of IMPDH II is predicted to shift tissue DSCs from asymmetric self-renewal to symmetric self-renewal, with loss of non-random segregation, constituting an important carcinogenic mechanism (Rambhatla et al., 2005 and Huh et al., 2011).
Whereas H2A.Z is predicted to be a highly specific biomarker for DSCs, its sensitivity for detecting DSCs may vary from tissue to tissue because of differences in DSC self-renewal kinetics programs. DSCs undergoing symmetric self-renewal with random sister chromatid segregation would not be detected. Such self-renewal pattern excursions by deterministic asymmetrically self-renewing DSCs are predicted to be limited to periods of tissue mass expansion (e.g., during adult maturation) and repair of injuries. Quiescent DSCs might also go undetected, if they are arrested without H2A.Z asymmetry ( Li and Clevers, 2010). This possibility might be evaluated in cell culture studies.
A third possible cause of lower sensitivity awaits more extensive tissue investigations with the biomarker. An ongoing discussion in tissue stem cell biology is the balance of deterministic asymmetric self-renewal programs, reported here, compared to stochastic asymmetric self-renewal programs employed by DSCs in mammalian tissues (Loeffler and Potten, 1997, Enver et al., 1998 and Klein and Simons, 2011). In stochastic programs for asymmetric self-renewal by DSCs, tissue renewal is accomplished with nearly equivalent frequencies of symmetric self-renewal divisions and divisions that give rise to one or two lineage-committed cells. If tissues were renewing primarily by stochastic asymmetric self-renewal programs, H2A.Z asymmetry might have low sensitivity for detecting DSCs. However, this detection shortcoming would still serve to advance knowledge of the cell kinetics properties of DSCs in diverse mammalian tissues.
The identification of H2A.Z, a down-regulated gene, by the sparse feature approach underlines how this method can detect genes of unique biological significance that would often be overlooked. A down-regulated protein would hardly be considered as a practical expression biomarker candidate in the absence of any other distinguishing properties. Conceptually, the quantitative extraction of H2A.Z from the complex 16-dimensional data set by the sparse feature analysis reflects the detection of a consistent relationship of H2A.Z mRNA expression level and expression variance to two biologically linked cellular phenotypes (i.e., asymmetric self-renewal versus symmetric self-renewal) over an experimentally devised varying cellular landscape. Although conventional bioinformatics methods also identified H2A.Z, it was a relatively low priority member of a much longer list of candidate genes. Of the 930 genes identified as being up-regulated or down-regulated with respect to self-renewal pattern in the previous work ( Noh et al., 2011), H2A.Z appeared towards the bottom of the list sorted by expression fold ratio and would normally be overlooked. In contrast, the sparse feature selection method identified just a handful of candidate genes among which H2A.Z was the most prominent. This capability will undoubtedly be very powerful in many other types of investigations in which gene expression is evaluated with respect to specific physiological conditions of biomedical interest. As an additional proof of concept, we very recently used the same sparse feature selection approach to identify genes implicated in a novel mechanism for the strontium-induced differentiation of mesenchymal stem cells down the osteogenic pathway ( Autefage et al., 2015).
Currently, because of the lack of biomarkers that identify DSCs specifically, the only way to monitor relative changes in their number and quality is by functional assays. In animal model research, changes in DSCs are monitored by the ability of transplanted donor cells to reconstitute recipient tissues after ablation of host DSCs. Similarly, the measure of human hematopoietic stem cells is a successful reconstitution of patients whose hematopoietic stem cells were either deficient due to disease or iatrogenic ablation. These function-based approaches are not quantitative. They cannot discern differences in stem cell number versus stem cell quality; and they evaluate stem cells under conditions of physiological stress (e.g., transplantation). The research tools required to address these important deficiencies, specific and universal biomarkers for DSCs, have not been described previously. The availability of such biomarkers would greatly accelerate tissue stem cell research and advances in regenerative medicine. Stem cell research would join the ranks of other quantitative biological sciences; and transplant physicians would have a first quantitative metric for predicting transplant efficacy. This report of an H2A.Z asymmetry biomarker for DSCs is the beginning of this new era of DSC biology and biomedicine.
Our paper contains several sources of novelty. Primarily, our results show how an unconventional computational feature selection method can be applied effectively to large microarray gene expression datasets to identify sparse sets of a few genes that have a high degree of functional significance for a biological process under study. In this work the small set of genes were implicated in the pattern of cell division. These genes were only some of many that would have been selected using standard expression level/fold ratio bioinformatics criteria. The efficacy of this sparse feature approach has important advantages in functional genomics research.
Among our set of candidate markers was a down-regulated gene encoding the histone H2A variant H2A.Z. We specifically chose to validate experimentally H2A.Z because it was identified by the sparse feature selection properties of the EM algorithm to have the highest Bayesian relevance. We showed that it was a specific indicator of asymmetric self-renewal in cell lines genetically engineered to model DSC asymmetric self-renewal in several experimental contexts. Cellular analyses further revealed that down-regulated H2A.Z gene expression was associated with asymmetric detection of the H2A.Z protein on only one set of segregating mitotic chromosomes in DSCs undergoing non-random segregation, which is functionally linked to asymmetric self-renewal. Using expanded natural hair follicle tissue stem cell strains, we confirmed that this H2A.Z asymmetry was a specific biomarker for DSCs. Studies with pre-senescent mouse cell populations, human muscle satellite stem cell-enriched cultures, and mouse hair follicle tissue sections showed the potential of H2A.Z asymmetry to be a universal, specific biomarker for asymmetrically self-renewing mammalian DSCs with the capability of quantifying DSCs for the first time. Our work identifies the potential for sparse feature selection bioinformatics methods to uncover novel crucial insights into previously impenetrable biological systems and processes.
We also presented new stem cell methods and data:
A new H2A.Z/phospho-histone H3 assay for detecting H2A.Z asymmetry, prospectively, before a cell divides. We applied this new assay in the studies of in vitro cultured hair follicle stem cells.
We also applied this assay to show that H2A.Z asymmetry can also be detected in cultures containing human adult tissue stem cells, and that it is regulated by xanthine — evidence that the asymmetry is occurring in asymmetrically cycling human muscle tissue stem cells.
In primary MEF cultures, we showed that H2A.Z is regulated by p53 expression — additional evidence of its efficacy in identifying asymmetrically self-renewing primary tissue stem cells.
We extended our prior in vitro studies on H2A.Z asymmetry in asymmetrically self-renewing cultured hair follicle stem cells to the in vivo situation, showing the presence of cells with H2A.Z asymmetry in a region known to contain hair follicle tissue stem cells.
We showed the utility of the PDC ratio for evaluating the cell kinetics of cultures containing asymmetrically self-renewing cells.
These new data support our hypothesis that H2A.Z, identified by the EM algorithm, has potential to be a general highly-specific marker of asymmetrically self-renewing tissue stem cells both in mouse tissues and human tissues. No biomarker described previously has ability to identify tissue stem cells exclusively.
This work was supported by NIH-NHGRI grant # PSO HG 003170, NIH-NIEHS grant # 689471, and NIH-NIGMS Director’s Pioneer Award # 5DP1OD000805. JCC was supported by the Eunice Kennedy Shriver National Institute of Child Health & Human Development of the National Institutes of Health under Award Number U54HD060848. We thank Shirley A. Bohn and Hetal Parekh for the technical assistance in deriving immortalized p53 wt/cdkn1A KO MEFs. DAW also acknowledges support from the CSIRO Newton Turner Award for Exceptional Senior Scientists.