But developing blood-based cell-free DNA tests is tricky. First, cell-free DNA doesn’t float in long, easy-to-sequence strings. It’s made up of small snippets that must be reassembled like a multimillion-piece jigsaw puzzle. Secondly, most of the DNA in our blood is from healthy cells, even in patients with large tumors. To top it off, most of the DNA in cancer cells is the same as the DNA from healthy cells.
To parse out the tumor DNA and discover its important information, Joe Hiatt, MD, PhD, a research associate in MacPherson’s lab teamed up with Anna-Lisa Doebley, PhD, while she was an MD/PhD student in Ha’s lab. (Doebley conducted the research portion of her graduate work with Ha and is currently finishing her MD at the University of Washington School of Medicine.) Ha and his group specialize in developing computational approaches to re-assemble the DNA jigsaw and find informative patterns in circulating tumor DNA.
Ha doesn’t focus on mutations in individual genes. Instead, he looks at larger patterns, particularly of DNA packaging, which can tell him about which genes are turned on (transcribed) and turned off (silenced).
The most basic unit of DNA packaging is a wheel-shaped protein complex called the nucleosome. DNA strands wrap around nucleosomes like yarn around a spool. The more nucleosomes in a section of DNA, the tighter the packaging and the more “silent” the gene region is. Looser areas where genes are turned on have fewer nucleosomes.
When DNA is released by cells (both cancerous and healthy) into the blood, nucleosomes protect it. The snippets of DNA floating in our blood reflect the regions with more protective nucleosomes. Ha has developed methods, called nucleosome profiling, to glean gene expression patterns from nucleosome-protected snippets of DNA in blood.
So Doebley and Hiatt decided to develop a targeted strategy to discover them. The team looked at regions of DNA called transcription start sites, or TSSs, where the “reading” of genes starts.
Transcription is orchestrated by transcription factors, a wide-ranging group of proteins that help turn genes on and off. The team suspected that the different genetic programs used by the different SCLC subtypes would be reflected in different patterns of DNA packaging at these genes’ start sites — and that these patterns could be found in circulating tumor DNA.
Cell-free tumor DNA reveals lung cancer types
To give themselves a leg up, Hiatt and Doebley started with preclinical models in which cell-free tumor DNA is easier to sift out from DNA released by healthy cells: patient-derived xenograft models, or PDXs.
In PDX models, tumor tissue taken from patients is implanted into mice. This means that all the tumor DNA circulating in the blood of a PDX mouse is of human origin.
The team drew from eight NSCLC PDX models and 20 SCLC PDX models (including one from a patient whose tumor had transformed from NSCLC to SCLC) to draw from. The team had detailed molecular information about each tumor.
“They had ground truth about activation of transcription factors, expression of all genes, and how we can correlate that with the TSS [transcription start site] signal,” MacPherson said.
They created a focused sequencing strategy to detect DNA from relevant regions, and Doebley built her probabilistic models by tailoring Ha’s team’s nucleosome profiling methods to this targeted panel. She formulated probabilistic models that account for the fact that the amount of tumor DNA in blood plasma can vary, and which accurately estimate the fraction of each SCLC subtype even when in samples with 5% tumor DNA.
Doebley and Ha built one model to predict likelihood that cell-free DNA came from an NSCLC or an SCLC tumor. The second model distinguished different SCLC subtypes.
All told, the predictive models examined more than 13,000 transcription start sites and more than 1,000 transcription factor binding sites. The assays captured the sequences of nearly 850 genes.
Doebley and Hiatt then tested the models against cell-free DNA in patient samples to see how they fared in a more clinically relevant context.
They found that their computational model was very good at predicting whether DNA had come from an NSCLC or SCLC tumor, suggesting that their approach has potential for detecting when a patient’s tumor transforms from NSCLC to SCLC, MacPherson said.
The model that predicted SCLC subtype performed well, but was hampered by the limited selection of subtypes in the patient samples. Certain subtypes were well represented, but one SCLC subtype was not included in the cohort.
Validating and refining
Though preliminary, the results show that liquid biopsies based on large-scale patterns in DNA packaging have potential as tools to monitor SCLC, MacPherson said.
The team is working toward further refining and validating their models to improve and expand their predictive capabilities. They will likely be able to winnow down the key TSSs, transcription factor binding sites and mutations to a smaller, but equally informative panel.
“It was remarkable to us that only a smaller set of informative genomic regions was needed for our computational models,” Ha said. “This has implications for more cost-effective and easier translation into the clinic.”
The refined panel will be tested against larger, more comprehensive sets of patient samples, including more from patients whose tumors change from NSCLC to SCLC.
“The other future direction is to broaden the types of phenotypes that we want to try to capture with this assay,” MacPherson said.
This would allow them to do more than merely assign patient SCLC tumors to particular subsets, he said. The team may be able to use the patterns in cell-free tumor DNA to qualities that oncologists could use to someday direct a patient’s treatment, like targets for therapies like antibody-drug conjugates or genetically engineered immune cells.
The investigators also want to link the patterns they’ve detected to clinical responses, which will also help tailor treatment regimens in the future.
The approach has implications for other tumor types as well, Ha and MacPherson noted.
“This was the first study to comprehensively assay thousands of [gene transcription] start sites. An important conclusion from our paper is that cell-free DNA contains information about the activation of a lot of these sites,” MacPherson said.
This information will likely be as important and informative for other tumors as it is for SCLC.
In the short term, MacPherson envisions similar assays being used to improve clinical trials, helping identify patients who are the best candidate for a new therapy, or giving researchers information about why certain patients respond and others do not. He’s also interested in discovering whether a similar assay could be used to detect at the molecular level tumors that are responding positively to treatment even if the response may not yet be clinically apparent.
“A clinical assay is our ultimate goal, and the next steps of our research directions are focused on that,” MacPherson said.
This work was supported by the National Institutes of Health, the Kuni Foundation and the Conquer Cancer Foundation.