Cancer Biomarker Technical Guide
Table of Contents
Letter from the Editor
Index of Experts
Breast cancer: Harold Garner, Dong-Young Noh, and David Rimm
Ovarian Cancer: Andrei Drabovich
Leukemia: Charles Mullighan and Richard Wilson
Lymphoma: Izidore Lossos
Circulating Tumor Cells: Paul Hofman
List of Resources
Letter from the Editor
Forty years ago, US President Richard Nixon declared a “war on cancer,” signing into law the National Cancer Act in 1971, which strengthened the National Cancer Institute and made the search for treatments a federal priority as well as scientific one.
Since then, researchers have worked to develop new diagnostic and prognostic tools, new treatments, find out more about various cancer subtypes, and create individualized drug regimens to help cancer patients. New studies have shown that these efforts have resulted in longer lives, lower mortality rates, and more efficient treatment of patients.
But cancer is a pernicious disease — it finds ways to hide, grow, migrate, develop, and mutate. It affects adults and children alike, and doesn’t discriminate. So, many researchers have made it their life’s work to find the hallmarks of cancer wherever they may be and figure out what these hallmarks indicate about how, where, and when the cancer is going to develop, so they can head it off at the pass.
The researchers featured in this technical guide all have different specialties, from breast and ovarian cancer to cancers that affect children and to the relatively new field of cancer stem cells. But the thing they all have in common is their determination to find biomarkers that could lead to earlier detection of cancer, better treatments, longer survival times, and lower mortality rates for patients. They utilize various approaches — searching for low-abundance proteins, rare variants, new gene pathways that lead to oncogenesis — in order to find biomarkers that will be of use to clinicians and possibly to biotech and pharmaceutical companies looking to create new therapies.
And take a look at the list of resources, Web tools, and upcoming conferences at the end of the guide for more information, and suggested reading.
— Christie Rizk
Index of Experts
Many thanks to our experts for taking the time to contribute to this technical guide, which would not be possible without them.
Andrei Drabovich
Mount Sinai Hospital, Toronto
Harold Garner
Virginia Bioinformatics Institute, Virginia Tech
Paul Hofman
Centre Hospitalier Universitaire de Nice
Izidore Lossos
Sylvester Comprehensive Cancer Center, University of Miami
Charles Mullighan
St. Jude Children’s Research Hospital
Dong-Young Noh
Korea Institute of Science and Technology
David Rimm
Yale University School of Medicine
Richard Wilson
The Genome Institute, Washington University School of Medicine
Breast Cancer: Harold Garner, Dong-Young Noh, and David Rimm
Genome Technology: What method or technology do you use to discover breast cancer biomarkers? Why?
Harold Garner: This technology is not just for breast cancer — breast cancer is our largest and most effective data, set but it appears to have potential to be more universally applicable to a lot of cancers. The basic technology is a unique microarray approach that looks at global changes in the repetitive or microsatellite content in the genome of the cancer patients versus non-cancer patients. The other important thing is that we’re finding this global change not only in the tumors, but also in the germline of the patients, so they’re born with a predisposition in their genomes that shows up in their tumors as well. In the human genome, which is 3 billion bases long, there are about 2 million loci that have repeats — repetitive motifs like CAG CAG CAG CAG CAG — in which case they’ll have some variable number of copies of that motif. However, these microsatellites are dramatically under-studied or under-appreciated as a causative agent in disease, in phenotype, even in speciation, and this is largely because there hasn’t been a technology to be able to look at them efficiently en mass. We developed a unique microarray that allows us to look at all 2 million microsatellites at one time — not at individual places, but the sum of all the contributions of them — and so what this does is the microarray reads out if there are any particular motifs that are highly different in one population versus another. For example, in the paper in Genes, Chromosomes, and Cancer (Galindo et al., 2011. See resource list), there’s a number of motifs, especially AT-rich motifs like the motif TTA, where in the germline or the blood of the cancer patients as well as in the tumor of the cancer patient, there are elevated amounts of that in the genome. The entire genome has changed in all the positions that include TTA motif, and so our array records that.
Therefore, we really create two kinds of biomarkers. One biomarker is simply the pattern on the array, and that pattern can be used to distinguish the people who are at elevated risk for cancer or not. The other thing it does is it provides us with leads for looking at individual loci — for example, the TTA motif may exist in thousands of places spread across the genome, and we know where they are because we’ve sequenced the genome. The array simply reads out that these things have changed, but we don’t know exactly which ones have changed, and the array tells us that the TTA-containing motifs are expanded. Then we use bioinformatics and the Human Genome Project to go and look where these TTA motifs are. And then we can select to look at those that happen to be in genes that might be implicated in cancer, and find individual places in the genome where an expansion could cause for you to have a cancer phenotype. In other words, the array is both a biosignature or a biomarker [detector], and also a lead generator. There’s a companion paper that came out in Breast Cancer Research and Treatment (Galindo et al., 2010. See resource list) — what we did is we found this motif to be changed on the array as well and then we traced it down and looked at this individual place and found out that it correlated and is probably implicated in causing cancer itself.
Dong-Young Noh: We use both genomic and proteomic approaches to identify novel biomarkers. Each method has its own advantages and disadvantages. We are currently focusing on the next-generation sequencing for the genomic approach and multiple reaction monitoring for the proteomic approach.
David Rimm: We use the AQUA method of quantitative immunofluorescence for biomarker discovery in breast cancer. QIF maintains spatial information (nuclear versus cytoplasmic, stromal versus epithelial, et cetera) but unlike immunohistochemistry can be easily multiplexed to optimize information obtained from colocalization. The biggest disadvantage of this method is that it requires a candidate target approach, and does not have the discovery potential of genomic-based methods.
Genome Technology: Do you use a multiplexed approach? Why or why not?
DYN: We have some experience in multiplex approach such as Luminex (Kim, Lee, et al., 2009. See resource list), but as mentioned before, our major interest these days is the MRM method to identify a novel set of biomarkers for breast cancer diagnosis.
DR: AQUA is based on multiplexing and co-localization. Thus, multiplexing is a critical underpinning of the approach. Multiplexing using AQUA can be achieved in two ways. True multiplexing is limited by the number of distinguishable fluorophores (about five using conventional methods), but we also do virtual or serial section multiplexing. We have shown that the reproducibility of AQUA scores between serial sections has an R value between 0.97 and 0.99. Thus, for practical purposes serial sections can be used to extend multiplexed queries. For example, we have serially multiplexed more than 40 biomarkers on a tissue section by using serial sections (Dolled-Filhart et al., 2006; Giltnane et al., 2009; Rothberg et al. , 2009. See resource list). However, when true multiplexing is used, it is a critical step in the validation of the approach to show that the multiplexed result is equivalent to the single-plexed result (Harigopal et al , 2010. See resource list).
GT: How do you validate those putative biomarkers?
HG: The way we do that is we create an expanded cohort. In the 2011 paper, basically, we expand the study from looking at what we discover on the arrays to looking at 500 or so patients. So really we validate in two ways — we do an expanded study, to be assured of the statistics for a given risk or pharmacogenomic informative marker. And the second thing we do is we undertake a mechanistic study so the best biomarkers are not only ones that are informative but also so you know why. What we wanted to do in this particular case is we found out this repeat expansion is in the promoter of this gene, and indeed the repeat itself is the promoter and so therefore when the repeat expands, the promoter activity expands. We understand more about the mechanism by which changes in the repeat length can change the performance of this gene leading to cancer.
DYN: The method of validation differs on the purpose of the biomarker we are looking at. For example, we commonly use a cohort of breast cancer patients and normal healthy control to validate diagnostic biomarkers. Mostly, we prefer blood samples such as plasma for early detection biomarkers, and usually we use the plasma after we deplete the abundant proteins in the blood by using commercial filters. For the validation of predictive markers, the platform for validation differs as the source of the biomarker, and mostly we use tissue samples such as fresh frozen tissues or FFPE for predictive biomarkers.
DR: Validation of biomarkers is critical and should be based on Hayes’ Levels of Evidence (Hayes et al., 1996, Simon et al., 2009. See resource list). It is challenging to reach level 1 or 2 evidence, but that is the best validation of a biomarker. To reach that level, typically assessment of two or three cohorts using the putative biomarker will provide sufficient data to apply for cohorts that can achieve level 2 (or 1) evidence. Single marker, small cohort studies are level 4 evidence and can often be misleading. They should be considered preliminary data and not validation data.
GT: What kind of samples, such as fresh, fresh-frozen, FFPE, or other, ensure an optimal screen for breast cancer biomarkers? Why?
HG: Any of them – basically we’re looking at genomic material and not transcriptomic material, so we’re looking at DNA instead of RNA, and we also are principally looking for markers in the germline. Tumors are unstable and their genome’s continuously shifting, and they’re particularly shifting in microsatellites, so of the most value here is to find something that is a predisposition marker that will predispose you to increased risk of cancer, predispose you to particular therapeutics that might be effective or not effective, things like that. Therefore, we need very little material depending on which assay — for the array we need 10 micrograms and for the follow-on looking at individual loci, as little as you can start with any PCR reaction.
DYN: We prefer fresh frozen tissues for the validation of gene signatures discovered by the genomic methods, however, in cases where there are limited numbers of fresh frozen tissues, we prefer RT-PCR using FFPE. Our opinion is that despite what tissue we are using, the most important thing is to have a standardized method of collection and storage and this principle applies to even FFPE where the time interval between resection and fixation is important for the antigen preservation.
DR: Breast cancer biomarkers can, in theory, be derived from fresh, frozen, or FFPE tissue. Historically, we have seen that to be translated to common usage in the US, biomarkers must work on FFPE tissue. However, in the EU, frozen tissue is more readily available. Often fresh and frozen tissue can be used for discovery experiments, but once candidate biomarkers are identified, the assays must be converted to use FFPE.
GT: What method or technology is best to detect low-abundance biomarkers?
HG: Microarray or sequencing.
DR: Low-abundance biomarkers, for example rare circulating tumor cells, or low-abundance message or peptide levels in serum, generally depend on specialized methods. While mass spec is promising for low-abundance serum markers, it has yet to be broadly validated in breast cancer.
GT: How do you determine which biomarkers are both sensitive and specific for use in the clinic?
HG: Out of our studies on expanded cohorts, the main goal is to understand what is the statistical sensitivity or specificity for this marker. Again, in your genome the markers are not dependent on the local environment of the tumor or what you happen to have eaten earlier today — things like that, that tends to change your transcriptome. It really comes down to larger extended studies.
DYN: We commonly use ROC curves to determine the usefulness of diagnostic biomarkers. To do so, it is very important to have a high-quality healthy control because while it is easy to have correct cancer patients, it is really difficult to have purely healthy control. Therefore, we use healthy control from the women who visit our comprehensive health care system and the overall health screening results show no evidence of cancer.
DR: Sensitivity and specificity required for clinical usage is a function of the clinical question. For example, in a screening setting, a test must have very high sensitivity and specificity. This has been a barrier for new serum-based screening in breast cancer since low specificity would result in significant numbers of unnecessary and expensive workups for false-positive tests. For companion diagnostic tests, the sensitivity must be very high, but specificity can be sacrificed and is often shockingly low. For example, a positive FISH or IHC test for HER2 in determining who should get trastuzumab must have high sensitivity since the clinician does not want even a single patient to miss the opportunity to benefit from this drug. However, the number of positive patients that do not benefit from the drug is probably close to 50 percent or more. Therefore, the current tests are designed to maximize the sensitivity at the expense of specificity. Once a more specific test is validated, it will be interesting to see whether or not it is adopted. One can imagine clinicians who want to give their patients “every chance” who may not be willing to withhold trastuzumab, even if some new test (with both high sensitivity and specificity) suggests that a HER2+ patient will not respond.
Ovarian Cancer: Andrei Drabovich
GT: What method or technology do you use to discover ovarian cancer biomarkers? Why?
Andrei Drabovich: To discover cancer biomarkers, we use an integrated proteomic platform to analyze multiple types of biological samples. We also employ gene expression and literature data mining to confirm our findings. Mass spectrometry-based proteomic approaches allow us to identify in biological samples as many as several thousand proteins, some of which may be putative cancer biomarkers. Typically, we use bottom-up proteomics and tandem mass spectrometry to identify proteins, while ELISA or immuno-mass spectrometry assays are used for accurate quantification of candidate biomarkers in blood serum. To narrow the candidate list to a manageable size, we consider proteins discovered in all types of biological samples such as human cancer cell lines, proximal fluids, and cancer tissues.
GT: Do you use a multiplexed approach? Why or why not?
AD: Absolutely. We use multiplex selected reaction monitoring assays to verify long lists of candidate biomarkers in proximal fluids and select candidates for further verification and validation. ELISA and immuno-mass spectrometry assays allow multiplexing only a few proteins in a single analysis, but are superior when verification in blood serum is required. Taking into account very high heterogeneity of cancer, there is a lot of potential for multiplex assays and panels of biomarkers to provide accurate cancer diagnosis and prognosis.
GT: How do you validate those putative biomarkers?
AD: Ultimate validation of biomarkers requires accurate protein quantification in blood serum and is typically done by ELISA. Validation of cancer-specific biomarkers by mass spectrometry is limited due to its insufficient sensitivity and low nanogram-picogram per milliliter concentration range of candidate biomarkers, in blood serum. Additional purification or enrichment steps increase sensitivity but decrease sample throughput. However, most medium-to-high abundance proteins are amenable to validation by targeted mass spectrometry in biological fluids and even in blood serum without additional purification. Hopefully, future advances in mass spectrometry instrumentation will increase its sensitivity to measure even low-abundance proteins.
GT: What kind of samples, such as fresh, fresh-frozen, FFPE, or other, ensure an optimal screen for ovarian cancer biomarkers? Why?
AD: We typically use fresh frozen samples for biomarker discovery, verification and validation. Formalin-fixed paraffin-embedded samples may not be suitable for mass spectrometry-based proteomics, but are a great source to verify tissue biomarkers by immunohistochemistry. Standardized sample collection and handling is crucial for accurate evaluation of biomarker performance. Accurate clinical information for each sample should be available and should include as many parameters — stage, grade, demographics — as possible to allow not only for biomarker verification but also for prospective or retrospective clinical studies.
GT: What method or technology is best to detect low-abundance biomarkers?
AD: ELISA is still a superior method to quantify low-abundance biomarkers in a large number of clinical samples. However, development of two highly specific antibodies suitable for ELISA is a long and challenging process that may not be practical if quantification of dozens of proteins is required. There is a lot of hope that the following alternative technologies will mature to improve or complement ELISA: (i) high-throughput development of engineered affinity ligands (aptamers, antibody mimetics); (ii) targeted mass spectrometry with increased sensitivity and automated sample preparation; (iii) immuno-mass spectrometry approaches such as SISCAPA.
GT: How do you determine which biomarkers are both sensitive and specific for use in the clinic?
AD: To determine biomarker specificity and sensitivity for use in the clinic, the proper validation with a very large number of samples should be performed. Such validation would first require development of a specific, sensitive, and high-throughput analytical assay. When such an assay is available, proper validation will require asking a very specific clinical question (biomarker for diagnosis, prognosis, or treatment selection), collecting a large cohort of samples (blood, proximal fluids, tissues) with clear and accurate clinical diagnosis, and following standardized study designs (double-blind, PRoBE). Extensive national and international collaborations between hospitals are crucial to collect the sufficient number of clinical samples and thus facilitate proper biomarker validation.
Leukemia: Charles Mullighan and Richard Wilson
GT: What method or technology do you use to discover leukemia biomarkers? Why?
Charles Mullighan: My work uses genomic profiling — SNP arrays, gene expression arrays, methylation, and DNA sequencing. The goal is to find new genetic alterations that drive the development of leukemia, and influence outcome. We study large cohorts of well-annotated ALL samples, so we can correlate new genetic changes with outcome. Once we find new genetic changes of clinical relevance from genome-wide approaches, we investigate the most appropriate platform for diagnostic testing. Other investigators at St. Jude use complementary approaches, such as flow cytometry to detect low levels of leukemic cells in patient samples during treatment (minimal residual disease measurement).
Richard Wilson: Whole genome sequencing. At the time we began, sequencing of selected genes was not telling us anything we didn’t already know about leukemia. We knew we had to somehow sequence all genes in the genome. It was about this time that next-gen sequencing (Solexa) came along. We presently use the Illumina HiSeq technology, and also utilize exome hybrid capture as an ancillary approach to WGS.
GT: Do you use a multiplexed approach? Why or why not?
CM: The genome-wide approaches incorporate assays for millions of markers. Specific assays tend not to be multiplexed, but it highly depends on the nature of the alteration: e.g. deletions, gains, or sequence alterations may be detected by qPCR, FISH, DHPLC, sequencing etc.
RW: Not for WGS. As the run yield of the Illumina HiSeq has increased, we have begun to multiplex exome sequencing. For capture-based targeted sequencing, we make extensive use of multiplexing.
GT: How do you validate those putative biomarkers?
CM: By studying large cohorts of patients in clinical trials.
RW: First by additional DNA or RNA sequencing, then using any one of a number of laboratory assays, expression in tissue culture cells, and mouse models to understand function and mechanism.
GT: What kind of samples, such as fresh, fresh-frozen, FFPE, or other, ensure an optimal screen for leukemia biomarkers? Why?
CM: We typically use any sample that can yield non-degraded material suitable for DNA and RNA analyses. Most commonly, these are fresh or cryo-preserved leukemic cells. Flow cytometry assays require viable cells.
RW: Fresh-frozen samples typically yield DNA and RNA of the highest quality and quantity. FFPE blocks are fine, although there is quite a bit of variability depending on how the samples are prepared and/or stored.
GT: What method or technology is best to detect low-abundance biomarkers?
CM: It depends on the marker. QPCR may be suitable for some variants. Flow cytometry is suitable for biomarkers expressed on the leukemic cell surface, and this approach is widely used for monitoring of levels of minimal residual disease in leukemia. This is well established in management of patients with ALL.
RW: We are primarily focused on discovering somatic mutations in tumor genomes. We find that Illumina sequencing allows us to work with as little as 10 nanograms of DNA. If whole genome amplification or PCR (for selected genes) were added to the front end, fewer input tumor cells would be required. Once mutations have been discovered and validated, there are several methods and technologies that one could employ to detect their presence in patient samples.
GT: How do you determine which biomarkers are both sensitive and specific for use in the clinic?
CM: This primarily depends on which biomarkers are shown to have independent prognostic or diagnostic value that complements the wide range of existing molecular and related tests that are currently in use. We have detected many new genetic alterations, many of which are of great interest in leukemia biology, but not all are clinically useful. To determine which may be used in the clinic, careful trials assessing the clinical utility (association with outcome, and the ability to measure robustly in a CLIA environment) are required.
RW: Our goal is to discover and characterize the driver mutations in leukemia, and other cancers. Key characteristics include genes that are frequently mutated in patients (7 percent) and appear to give some clue as to outcome. In acute myeloid leukemia, for example, mutations in DNMT3A and IDH1 are correlated with poor outcomes. In fact, presence of DNMT3A mutations may be the best indicator of which AML patients will require bone marrow transplant. The assay for IDH1 mutation is relatively straightforward as nearly all mutations occur at the same site in the gene; furthermore, the gene product can be detected by biochemical methods. In contrast, detection of DNMT3A mutations in patient samples will require sequencing all exons and splice sites in the gene.
Lymphoma: Izidore Lossos>
GT: What method or technology do you use to discover lymphoma biomarkers? Why?
Izidore Lossos: We are personally using three technologies. We are using microarrays, we are using real-time PCR for genes and microRNAs, and we are using immunohistochemistry mostly for validation of the results from the various other techniques. There are different kinds of biomarkers — when you want to have a global picture of the tumor, it’s very easy to do gene expression arrays because you are analyzing the whole genome, and once you have all the data from that, you can decide which specific biomarkers you want to use or to validate or to try to eventually translate to the clinic. Now, real-time PCR allows you quicker, cheaper, more precise with more quantities of analysis than gene expression, so once you know your targets, it’s definitely significantly easier to analyze them by real-time PCR, it’s more quantitative if you want to transform it to clinical applications. It would be easier to use real-time PCR than gene expression array [in this case] so that’s why we take it there.
GT: How do you validate those putative biomarkers?
IL: To validate it we use immunohistochemistry. You need to validate it several times. One validation is usually not enough so you need to validate in several settings. … Many people ask why biomarkers are stuck at the investigator level and are not getting to the clinic. And one of the reasons for this, in my opinion, is that many of these biomarkers are developed by academic institutions and we unfortunately don’t have the capabilities — we can develop the assay, we can develop the biomarkers, but we cannot make it available for routine use for whoever wants it. At a certain point in time, companies need to pick up the ownership of these biomarkers and make the assay available and unfortunately, very frequently, the companies’ considerations are how frequent the disease is, how frequently the test will be used for a specific biomarker. If you are not dealing with breast cancer or with something that is very, very common, many companies will be reluctant to establish the methodology or make the methodology for routine use and frequently many of the biomarkers are stuck at this transitional step.
GT: What kind of samples, such as fresh, fresh-frozen, FFPE, or other, ensure an optimal screen for lymphoma biomarkers? Why?
IL: It depends on what you’re using it for. If you have a serum biomarker, you use serum. But definitely your ability to retrieve expression of genes at RNA levels are frozen or fresh specimens is the best approach, but the problems is that unfortunately, this is not very practical. The majority of the patients are treated not in academic centers but in private practice and nobody will deal with preservation of frozen samples or fresh samples. You eventually will need to use paraffin samples. We’ve developed techniques, and there are commercial techniques, that allow you to measure RNA expression in paraffin samples without any problem and quite reproducibly. You can go both ways but it’s more practical to go with paraffin samples.
GT: What method or technology is best to detect low-abundance biomarkers?
IL: Once again, real-time PCR. There are new methodologies — I don’t have personal experience with them, but Nanostring seems to be a possible technology that will allow you to detect low-abundance biomarkers.
GT: How do you determine which biomarkers are both sensitive and specific for use in the clinic?
IL: Once again, validation, and validation, and validation. Usually the way to do it, you discover the biomarker and you need to independently validate it. Discovery is not usually a big problem, but when you discover you always have a problem that you may over-feed your data. You can decide that you are using this cutoff or this cutoff, because that’s how you’re getting the best result, and that’s correct, and that’s the first step to do it. But eventually you need to make sure with 100 percent integrity that that’s the best cutoff, so you need to have a sample in which you have a different sample of patients or different cohorts of patients, in which you need to apply it independently and see if your cutoff or your methods reproduces your results. For me, for example, if your P values are better in your validation set, I will be very, very surprised. Usually you have a lower P value and less clear separation in the validation set. But even then, there is always a possible bias in sample collections, so we usually try to go into prospective trials and to test specimens routinely with specific criteria and uniform protocols at every step and then to validate the biomarker in such a prospective trial. That’s why it takes some time.
Circulating Tumor Cells: Paul Hofman
GT: What method or technology do you use to discover ovarian circulating tumor cell biomarkers? Why?
Paul Hofman: Several methods can be used for this proposal. We use in the Laboratory of Clinical and Experimental Pathology (at the University of Nice Sophia in Nice, France) different methods: A direct method — namely, isolation by size of epithelial tumor cell or ISET technology — and an indirect method — the cell search method using more specifically, the epithelial Cell Search kit. We believe that the combination of different methods can greatly optimize the potential for the detection of circulating tumor cell in blood samples. Some CTC can weakly express the family of cytokeratin antigens since these cells can demonstrate an epithelial to mesenchymal transition phenomenon. Therefore, the cell search method cannot detect these latter cells which do not express cytokeratin but only vimentin. By contrast, the ISET method is able to detect these cells morphologically. Moreover, ISET method can potentially allow better characterizing the CTC by using an associated immunocytochemistry approach and/or by doing molecular biology such as FISH, for example. However, this latter method needs also to be improved in order to better quantify the detected CTC, and thus to increase its sensitivity.
GT: Do you use a multiplexed approach? Why or why not?
PH: Currently, we do not use a multiplexed approach in our laboratory. We believe, of course, that it is a very interesting approach, very challenging, and probably it is a cutting-edge format in different areas for the characterization of different mutations in the same tumor sample, for example. It will be very important to develop this approach in the field of molecular biology, in particular in the health care domain. However, to my point of view, this technology is not totally well developed in the health care field and, other more “classical” methods — such as direct sequencing, pyrosequencing, Taqman, et cetera — currently available in most of the laboratory, are more developed in routine practice.
GT: How do you validate those putative biomarkers?
PH: The putative biomarkers are validated by doing large series of patients, with well characterized cohorts of patients — matched in age, gender, pTNM staging, etc. Correlation with the overall survival, specific survival and disease-free survival parameters is systematically performed in a “training set of patients” then in a “validation set” of patients, which is an independent set of patients. The specificity and sensitivity are systematically evaluated for each biomarker. Ideally the validation of some biomarkers can be made using a tissue microarray approach built with several hundred of tumor samples.
GT: What kind of samples, such as fresh, fresh-frozen, FFPE, or other, ensure an optimal screen for these biomarkers? Why?
PH: There is not ideal “type of sample” for an optimal screening of new biomarker. However, comparison of the same biomarker obtained from different samples — such as for example from the plasma or the sera, and from the tissue fixed in formaldehyde or frozen tumors — is usually of great interest. Ideally, some biomarkers can be expressed in fixed tissue, allowing the pathologist to easily look for its expression. However, it can be also of interest to make xenograft in mice by using fresh sample of tumor and to look for the behavior of cancer cells after tumor implantation in mice. So, the best way is to have different collections of samples in the same laboratory: fixed and frozen tumor samples, plasma, sera, whole blood samples, and primary cell cultures plus mice xenograft.
GT: What method or technology is best to detect low-abundance biomarkers?
PH: Certainly new high throughput technology such as microarrays (DNA, microRNA microarrays, etc.) and new generation of deep sequencing.
GT: How do you determine which biomarkers are both sensitive and specific for use in the clinic?
PH: The correlation with clinical data is essential for this determination. It depends also if this biomarker is a diagnostic, prognostic, or theranostic biomarker. This approach needs to include in the data management a very strong biostatistics approach. It is necessary also to do a meta-analysis consideration to look if the same biomarker is described by different teams and using different technology platforms all around the world. In fact a very few meta-analyses exist for the biomarkers. A strong biomarker can be used in the clinic if the same results are obtained in different countries, with different cohorts of patients, and sometimes by different techniques.
List of Resources
Sometimes you need to know more. Here are more sources that may help you out.
Publications
Dolled-Filhart M, Ryden L, Cregger M, Jirstrom K, Harigopal M, Camp RL, Rimm DL. (2006). Classification of breast cancer using genetic algorithms and tissue microarrays. Clinical Cancer Research. 12, 6459-6468.
Galindo CL, McCormick JF, Bubb VJ, Abid Alkadem DH, Li LS, McIver LJ, George AC, Boothman DA, Quinn JP, Skinner MA, Garner HR. (2010). A long AAAG repeat allele in the 5′ UTR of the ERR-γ gene is correlated with breast cancer predisposition and drives promoter activity in MCF-7 breast cancer cells. Breast Cancer Research and Treatment. E-pub.
Galindo CL, McIver LJ, Tae H, McCormick JF, Skinner MA, Hoeschele I, Lewis CM, Minna JD, Boothman DA, Garner HR. (2011). Sporadic breast cancer patients’ germline DNA exhibit an AT-rich microsatellite signature. Genes, Chromosomes, and Cancer. doi: 10.1002/gcc.20853.
Giltnane JM, Moeder CB, Camp RL, Rimm DL. (2009). Quantitative multiplexed analysis of ErbB family coexpression for primary breast cancer prognosis in a large retrospective cohort. Cancer. 115, 2400-2409.
Harigopal M, Barlow WE, Tedeschi G, Porter PL, Yeh IT, Haskell C, Livingston R, Hortobagyi GN, Sledge G, Shapiro C, et al. (2010). Multiplexed assessment of the Southwest Oncology Group-directed Intergroup Breast Cancer Trial S9313 by AQUA shows that both high and low levels of HER2 are associated with poor outcome. American Journal of Pathology. 176, 1639-1647.
Hayes DF, Bast RC, Desch CE, Fritsche H Jr, Kemeny NE, Jessup JM, Locker GY, Macdonald JS, Mennel RG, Norton L, et al. (1996). Tumor marker utility grading system: a framework to evaluate clinical utility of tumor markers. Journal of the National Cancer Institute. 88, 1456-1466.
Kim BK, Lee JW, Park PJ, Shin YS, Lee WY, Lee KA, Ye S, Hyun H, Kang KN, Yeo D, Kim Y, Ohn SY, Noh DY, Kim CW. (2009). The multiplex bead array approach to identifying serum biomarkers associated with breast cancer. Breast Cancer Research. E-pub.
Rothberg BE, Berger AJ, Molinaro AM, Subtil A, Krauthammer MO, Camp RL, Bradley WR, Ariyan S, Kluger HM, Rimm DL. (2009). Melanoma prognostic model using tissue microarrays and genetic algorithms. Journal of Clinical Oncology. 27, 5772-5780.
Simon RM, Paik S, Hayes DF. (2009). Use of archived specimens in evaluation of prognostic and predictive biomarkers. Journal of the National Cancer Institute. 101, 1446-1452.
Upcoming Conferences
Translational Approaches to Cancer
May 23-27, Suzhou, China
Cold Spring Harbor Laboratory Asia Conferences
2011 ASCO Annual Meeting
June 3-7, Chicago
American Society of Clinical Oncology
Cancer Proteomics
June 20-23, Dublin
Select Biosciences
13th Breast Cancer Milan Conference
June 22-24, Milan, Italy
European Institute of Oncology
Cancer Genomics
September 17-19, Heidelberg, Germany
EMBO-EMBL
Web Resources
Virginia Bioinformatics Institute
http://innovation.vbi.vt.edu/
ChipDX Tumor Analysis Platform
http://www.chipdx.com