Characterization of subsets of cells in a heterogeneous population has long been the forte of flow cytometry. With the identification of new markers on the cell surface, facilitated by continuing development and use of novel monoclonal antibodies, more and more unique subsets of cells are being identified. As the collection of antibodies to the various cell surface molecules grew, it became evident that simultaneous use of all or most of these antibodies could yield an information-rich cellular expression profile. BDFACS™ CAP is one such semi-automated, high-throughput flow cytometry screening tool that enables the rapid characterization of cells through surface protein expression profiles. Similar to other high-throughput proteomics profiling methods, BD FACS™ CAP yields an abundance of information that is very useful for identifying complex protein expression profiles of cells. In this respect, BD FACS™ CAP is currently being used to characterize the surface expression of hundreds of cell surface proteins on many different cell types. This profiling technology will enable users to discover subsets of surface protein markers that can be used to identify uniquely specific cellular populations that were previously uncharacterized. Because BD FACS™ CAP is a flow cytometry-based screening technology, it has all of the advantages that flow cytometry offers, most importantly the ability to collect multiparametric protein expression data on single cells rapidly.
The current screening plate configuration consists of 229 directly conjugated antibodies, with the extension of this configuration inevitable as additional antibodies and fluorochrome conjugates become available. The antibodies in the present format are arrayed in a 96-well plate as three-color "cocktails", using the fluorochromes, FITC or Alexa488, PE, and APC or Alexa647. Several wells have been intentionally left empty, reserved for compensation and various controls. Isotype controls for selected immunoglobulin classes and subclasses have been included in the screening plates that are used for setting gating thresholds. Of the 229 surface markers, 208 are specific to a single protein, 11 bind small sets of related proteins, and 10 bind to uncharacterized proteins or carbohydrate antigens. The majority of cell surface targets are surface receptors. Each individual cell type is analyzed on triplicate plates to ensure reproducibility of the data acquired on a flow cytometer equipped with a high-throughput sampler (HTS). The expression level of each marker for the stained cells is then calculated using semiautomated custom flow cytometry software. The process of characterizing these surface markers in a highly efficient manner using BD FACS™ CAP is enabled by automated liquid handling for staining, automated flow cytometry for data acquisition, and standardized algorithms for flow data analysis.
High dimensional analysis of cell surface protein profiles using BD FACS™ CAP can be applied to a wide spectrum of experimental objectives. The main objective of many of these experiments is to identify a differential set of surface markers that are present on one set of cells compared to another, whether this is due to differentiation, different treatment regimens, or other conditions such as varying culture conditions, etc. More specifically, the primary goal may be to identify a set of biomarkers that may be used to isolate different cellular subtypes from a primary cell culture or to identify a unique set of biomarkers that would aid in the clinical diagnosis or predicting prognosis of a certain disease condition.
Additionally, this technology may be used to obtain surface fingerprints for various cells or tissue types, to evaluate starting and ending cellular products for cell therapy or research purposes, or to characterize contaminating cells in a heterogeneous cell line. For stem cell therapy or cell banking, in particular, BD FACS™ CAP is a valuable tool for (1) documenting phenotypic variants or changes due to different donors, or phenotypic differences in cells derived from different sources (e.g., bone marrow vs. fat vs. umbilical cord, etc.); (2) evaluating isolation protocols, passages, and culture or storage media/processes (e.g., examining the correlation between the function and phenotype of a given primary cell type in different culture stages; therefore, identifying a set of markers that can serve as quality control of a specific culture environment); and (3) studying stem cells for differentiation, carcinogenesis, and drug targeting. In addition, this technology may be applied in tandem with the BD Discovery Platform (BD Technologies, Research Triangle Park, NC), a high-throughput screening platform used to develop and optimize cell culture environment for any given cell type. By surveying the expression of integrins and growth factor receptors on the surface of the cell type of interest, appropriate selections of candidate factors can be made to facilitate the development and optimization of the cell culture environment for any specific cell type.
In the future, the BD FACS™ CAP platform may be expanded to include more than the 229 individual antibodies currently used. With the ever-expanding availability of new antibodies and fluorochromes, it is foreseeable that other fluorescence channels could be used to add possibly hundreds of more individual antibodies, or to use "anchor" antibodies to first identify a specific cellular subset of interest, e.g., T regulatory cells, and then to analyze the surface expression of all 229 markers on the specific cellular subtype. With further development of this technology, it may also be possible to include activation state analysis of the cells by using other flow-based technologies that interrogate intracellular proteins such as cytokines or phosphorylated proteins in combination with the BD FACS™ CAP platform. Additionally, this technology may be used to obtain surface fingerprints for various cells or tissue types, to evaluate starting and ending cellular products for cell therapy or research purposes, or to characterize contaminating cells in a heterogeneous cell line. For stem cell therapy or cell banking, in particular, BD FACS™ CAP is a valuable tool for (1) documenting phenotypic variants or changes due to different donors, or phenotypic differences in cells derived from different sources (e.g., bone marrow vs. fat vs. umbilical cord, etc.); (2) evaluating isolation protocols, passages, and culture or storage media/processes (e.g., examining the correlation between the function and phenotype of a given primary cell type in different culture stages; therefore, identifying a set of markers that can serve as quality control of a specific culture environment); and (3) studying stem cells for differentiation, carcinogenesis, and drug targeting. In addition, this technology may be applied in tandem with the BD Discovery Platform (BD Technologies, Research Triangle Park, NC), a high-throughput screening platform used to develop and optimize cell culture environment for any given cell type. By surveying the expression of integrins and growth factor receptors on the surface of the cell type of interest, appropriate selections of candidate factors can be made to facilitate the development and optimization of the cell culture environment for any specific cell type.
In the future, the BD FACS™ CAP platform may be expanded to include more than the 229 individual antibodies currently used. With the ever-expanding availability of new antibodies and fluorochromes, it is foreseeable that other fluorescence channels could be used to add possibly hundreds of more individual antibodies, or to use "anchor" antibodies to first identify a specific cellular subset of interest, e.g., T regulatory cells, and then to analyze the surface expression of all 229 markers on the specific cellular subtype. With further development of this technology, it may also be possible to include activation state analysis of the cells by using other flow-based technologies that interrogate intracellular proteins such as cytokines or phosphorylated proteins in combination with the BD FACS™ CAP platform.
Custom scripts written using the R software (1) were utilized for analysis. R is an Open Source environment for the implementation of statistical computing and graphics. It provides a large, well-integrated collection of tools for data management, data storage, data analysis, and graphical displays. There is a very large community of R users that provide custom packages to address specialized problems such as those discussed here. R includes a very flexible and effective programming language for automating many complex procedures, which provides the basis for our custom scripts. This combination of specialized analysis packages and custom scripts allows for a highly specialized, robust, efficient, semiautomated approach to flow cytometry data analysis.
Data can be acquired on a BD FACSCanto™ or BD LSRII flow cytometer equipped with a HTS. Data acquisition is performed with BD FACSDiva™ software.
Data Analysis
Successful data management is one of the most critical aspects of plate-based high-throughput flow cytometry. Depending on the design of the plate, there may be as many as 96 separate FCS files. For the example used in this chapter, the FCS files require about 130 Mb of storage space per plate. The authors' data management strategy requires manual copying of files from the cytometer to a central hard drive that supports our core flow laboratory. From there, the files are copied one more time to a server in our corporate data center that provides a secure, version controlled location. We use Subversion and Eclipse along with standard backup processes implemented by our corporate data center to ensure the persistence and integrity of the original data.
Software
BD FACS™ CAP output from the cytometer is analyzed using semiautomated routines in R to ensure that the data are analyzed in an efficient, consistent, and objective fashion. The automated methods were developed to reflect the analysis that would be performed by an expert cytometrist using FlowJo. The results of the R-based analysis were validated by direct comparison to FlowJo results.
Compensation
Compensation can be done using either single-color staining of the same cells that are used for analysis, or BD CompBeads.
Morphology Gating
It is of critical importance to examine the health status of the samples used for the FACS™ CAP assay. Compromised viability of cells contributes to increased nonspecific binding of the antibodies and can confound interpretation of flow data. For special applications, a near-infrared LIVE/DEAD? fixable dead cell stain kit can be incorporated to evaluate the viability of the cells after the surface marker staining process. For most applications, however, it is sufficient to restrict data analysis to a cell population where dead cells, debris, and aggregates are excluded based on cell morphology.
Figure 1. A gating strategy is used to exclude dead cells, debris, and aggregates prior to data analysis.
Autofluorescence Correction
Autofluorescence is one of the major concerns in flow cytometry. One characteristic of cultured adherent cells is that they have a significant level of autofluorescence. Thus, an additional signal is present in each channel that cannot be corrected for by compensation and confounds any attempt to identify positive cells. Additionally, the output from R has been corrected by autofluorescence, which facilitates automated gating for cells with high autofluorescence.
Determining Gates Based on Isotype Controls
On each BD FACS™ CAP plate, there are a variety of controls, including isotype controls for selected immunoglobulin classes and subclasses. These isotype controls are used to set thresholds (negative control gates). Any cell that has a measured fluorescence intensity (FI) above the threshold is classified as "positive".
Quality Control
The use of replicate plates of cells in the BD FACS™ CAP analysis primarily serves the purpose of quality control, as under normal circumstances, there is very little difference in the results among replicate plates. Although still relatively rare, the most common quality issue stems from fluidics problems that occur during the acquisition of cells by the flow cytometer.
Summary Statistics
The primary summary statistic is the Percent Positive; that is, the percentage of cells that have a reading greater than the value of the negative control gate, as determined from the isotype controls. Depending on the particular application, a variety of other statistics may be of interest. Due to the highly reliable nature of flow cytometry, the variability in Percent Positive from plate to plate is usually quite small. For a typical donor in the present example, the mean standard error across three plates over all markers for Percent Positive was less than 1%. From a theoretical viewpoint, we would expect the Percent Positive to follow a binomial distribution when generated from independent samples of the same cells. The observed variability from plate to plate is generally lower than what would be expected if the plates represented true biological variability rather than simply technical replicates.
Normalization
For the purposes of understanding and visually comparing marker expression profiles, use the smoothed density. For the purposes of comparing multiple marker expression across multiple donors or multiple samples, combine the data from the replicate plates to get a single density.
Categorization and Interpretation of Expression Levels
Because BD FACS™ CAP is a screening assay, the reported Percent Positive values should be interpreted with caution. In any case in which it is important to have an accurate estimate of the actual Percent Positive value for a given cell type, it is recommended that an optimized assay be used. Consequently, Percent Positive values are typically classified into one of four fairly broad categories; namely, negative, low, medium, or high.
(a) Negative corresponds to values of markers with Percent Positive £10%.
(b) The low category designates markers with 10% < Percent Positive £50%.
(c) The medium category designates markers with 50% < Percent Positive £85%.
(d) The high category designates markers with Percent Positive >85%.
Markers falling in the negative and high categories can be expected to stay in those categories when an optimized assay has been developed. The low and medium categories are expected to be less stable.
In many applications of flow cytometry in which a Percent Positive value is calculated, there are two distinct populations. In this case, one population is typically negative for the marker in question and one population is positive. The Percent Positive is the percent of cells falling in the positive population. In the case of adherent cells, it is more likely that there is only a single population, and the negative control gate falls in the interior of this population. When there is only a single population, a more accurate interpretation of the Percent Positive is "the percentage of cells with a measure intensity value greater than the negative control gate". Comparing Samples In the present example, BD FACS™ CAP was performed on human MSCs derived from three donors. Three replicate plates were assayed for each donor. As discussed earlier, the replicate plates represent technical (not biological) variability; the replication is primarily used for quality purposes. Data from replicate plates are combined before comparing results among the three donors. The test to compare the Percent Positive values among donors is based on the null hypothesis that the donors are not biologically different in regard to expression of a specific marker. In this case, the assumption is that the observed values for Percent Positive will follow a binomial distribution. The ratio of the observed variation in Percent Positive to the expected variation should follow a Chi-squared distribution. In this particular case, the distribution will only be approximate because the true Percent Positive is estimated from the data. Based on this chi-square distribution, a P-value is calculated for the test of the null hypothesis for each marker. Since there are more than 200 markers in the BD FACS™ CAP screen, the multiplicity problem must be addressed before computing final P-values. Multiplicity problem means that the risk of making false discoveries is extremely high when making many independent tests. To control for this false discovery rate, apply the method of Hochberg and Benjamini. After making this correction, an adjusted P-value of less than 0.05 will indicate that the samples (donors in this case) have different expression profiles.
Comparison of the Results to the Literature
Based on a review of the literature, the authors identified 12 markers that are commonly reported as being negative for bone marrow-derived MSCs and 15 markers that are commonly reported as being positive (see Note 4). Ten of the markers reported as negative in the literature are clearly negative for these donors. One marker (CD24) shows a low expression level and one marker (SSEA4) shows an intermediate level with differences among donors. Of the 15 markers reported as positive in the literature, 9 were approximately 100% positive, while 2 markers were negative. The other four markers were expressed at varying levels.
Heterogeneous Populations
A population is designated as heterogeneous in regard to expression of a particular marker if there are at least two distinct populations of cells. In this case, the density function will be bimodal, with each peak representing a different population of cells.
Reference