Catching ovarian cancer with time to spare

July 2002
Cover Story

William Check, PhD

Discussing his research goal, Richard Roden, PhD, assistant professor of pathology at Johns Hopkins University, speaks of "our desperate need" for a good marker to detect ovarian cancer early. Echoing Dr. Roden, Lance Liotta, MD, PhD, chief of pathology at the National Cancer Institute and co-director of the FDA/NCI Clinical Proteomics Program, says, "Ovarian cancer is a disease where there is great need for an early detection method." Ovarian cancer is usually diagnosed in its late stages, when disease has spread beyond the ovary. "If we could switch detection from late to early stage we could make a tremendous impact on survival," Dr. Liotta says.

David Fishman, MD, expresses a similar view. "We are not doing our job," says Dr. Fishman, who is professor of obstetrics and gynecology and director of gynecologic oncology research, Northwestern Memorial Medical Center, and director of the National Ovarian Cancer Early Detection Research Network. "We need to be able to find out whether a woman’s ovaries are abnormal before cancer has spread." The reason is simple, according to Dr. Fishman—today, as in 1960, less than 20 percent of women diagnosed with ovarian cancer are alive five years later. "That to me is unacceptable," he says. "If we can apply our scientific intelligence clinically to detect ovarian cancer confined to the ovary, 90 percent of those women will be alive five years later." In addition, most women diagnosed early will need much less aggressive surgery and may not require chemotherapy.

In short, Dr. Fishman says, "We need early detection of early stage ovarian cancer."

If this were the ideal world envisioned by Karl Marx—"To each according to her needs"—we would have had a marker for early ovarian cancer many years ago. In our real world, however, this goal has proved to be elusive.

Steven Skates, PhD, a biostatistician at Massachusetts General Hospital and assistant professor of medicine at Harvard Medical School, lists three approaches that have been advocated for early detection of ovarian cancer in normal-risk women:

An annual blood test followed by an imaging test—typically transvaginal sonography-for those positive on the blood test.
Annual imaging, typically with transvaginal ultrasound.
A blood test and transvaginal ultrasound on all women, now being investigated as part of NCI’s PLCO (prostate, lung, colorectal, and ovarian cancer) screening trial.

In this article we discuss only blood tests, while noting Dr. Fishman’s caveat: "No one is going to have surgery based only on a blood test. They will also need an ultrasound imaging test to evaluate the ovaries. A decision on surgery will come from those combined results."

The best-known serum marker for ovarian cancer, CA 125, has been studied for many years, but it is not sensitive or specific enough. A more complex approach, looking at CA 125 levels over time, appears to be more promising. "Longitudinally following CA 125 significantly increases preclinical sensitivity," says Dr. Skates. At the present time, however, CA 125 alone does not suffice. At best, it detects only 40 to 50 percent of early ovarian cancers. So researchers are looking at several other markers.

Yan Xu, PhD, associate staff member in the department of cancer biology at Cleveland Clinic, is investigating lysophosphatidic acid. Based on preliminary data, Dr. Xu says, "potentially LPA is a big improvement" over CA 125. Larger trials of LPA are ongoing.

Dr. Roden’s research follows a third approach—attempting to identify ovarian tumor-associated antigens that are immunogenic. He has discovered several candidate antigens. "Immunogenic ovarian tumor-associated antigens might be useful both as biomarkers for detection and as therapeutic targets," he says.

A fourth distinct tack is being pursued by Dr. Liotta and his co-director in the FDA/NCI Clinical Proteomics Program, Emanuel Petricoin, PhD, senior investigator at FDA’s Center for Biologics Evaluation and Research. Called proteomics, this method seeks characteristic serum protein expression patterns. Looking at the serum proteome of ovarian cancer patients was a high priority, says Dr. Petricoin, "because of its obvious need and its potential ability to make a dramatic impact."

"The wealth of information in the serum proteome is astounding," says Dr. Liotta. "It contains details and subtleties that tell you what is going on in internal body sites." He reports that the proteome patterns that they have observed have been "exquisitely specific," differing between ovarian and prostate cancer and between malignant, benign, and inflammatory conditions. "It is extremely exciting to think of where we will be in the future with this," he says.

Still, data acquired to date indicate that no one marker—even the serum proteome—will be adequate. So Drs. Skates and Fishman and others are collaborating on an NCI-funded study to try to formulate a panel of markers that is simultaneously sensitive and specific.

Clive Taylor, MD, PhD, chairman of pathology at the University of Southern California School of Medicine, has written extensively on tumor markers and what evidence is needed to show they are applicable to patient care. He agrees that CA 125 has not been specific or sensitive enough. "It is a bit like the CEA story," he says. "CEA was initially thought to be a good diagnostic marker for carcinoma of the colon, but it was not specific or sensitive. Now we know that it is acceptable for monitoring and followup.

"If you are an optimist," Dr. Taylor continues, "there is some encouragement that these new looks at protein profiles in serum are of some real value. But," he cautions, "the same story has been told before. Fifteen years ago people were making the same claims about site-directed monoclonal antibodies against peptide products in serum." Only the appropriate clinical studies will tell whether the newer tests can stand up to scientific rigor.

Two paramount facts underlie the intense effort to find a biomarker for early detection of ovarian cancer and the intense difficulty of this search. As already noted, the motivation is to save lives. Each year in the United States about 24,000 women are diagnosed with ovarian cancer, and about 14,000 die of it. This high mortality rate is due to the fact that at least three-fourths of women are diagnosed with advanced disease, which has a very low survival rate. "The fact that most cases are picked up in late-stage disease and the contrast between early and late-stage survival rates makes early detection of ovarian cancer an appealing approach to decreasing mortality from that disease," says Dr. Skates.

However, the annual incidence of ovarian cancer in the target population—typically considered as all postmenopausal women—is quite low, about 40-50/100,000. As a consequence, Dr. Skates says, "Any early detection program has to find one case out of 2,000 or more women without falsely identifying too many women who don’t have the disease." Some have suggested that this demands a screening program with a positive predictive value-the number of women who have the disease out of those identified as positive—of at least 10 percent. Many believe it should be higher.

Overall specificity of a screening program must exceed 99.6 percent. "That is what makes screening for ovarian cancer so difficult," Dr. Skates says. Clinical trials have shown that it is possible to achieve such high overall specificity if the specificity of the blood test is around 98 percent, with followup imaging ruling out most false positives. At the same time, a test should pick out at least 75 percent of true cases.

While a single assay of CA 125 turned out not to meet these criteria, Dr. Skates and others have been looking at the value of measuring CA 125 over time. A statistical model converts the longitudinal CA 125 profile to a single number called "risk of ovarian cancer," which is the risk of having the disease at a given time. One study, conducted by Ian Jacobs at St. Bart’s Hospital, London, included 22,000 women who had an initial screen with CA 125. Half had three further annual screens. A retrospective analysis of the data from this clinical trial indicated that longitudinal CA 125 values increased sensitivity for ovarian cancer from about 70 percent to 85 percent while maintaining specificity.

To determine whether this approach increases preclinical detection of early stage disease—which is the crucial question—Jacobs is now conducting a randomized trial of longitudinal CA 125 assays followed by ultrasound compared to annual ultrasound and to a control group.

Discovery of LPA as a possible biomarker for ovarian cancer grew out of work in the early 1990s trying to isolate a tumor growth factor from ascites of patients with ovarian cancer. "Everyone thought it was a protein," Dr. Xu says, "because known regulators of cancer cell growth were usually proteins." However, this one turned out to be a lipid, LPA, which was present in high concentration in ascites. Further work showed that LPA was also elevated in blood: It was increased in samples from 47 of 48 ovarian cancer patients, including nine of 10 with stage 1 disease. "That shows that LPA is a potential marker for early detection of ovarian cancer," Dr. Xu says. In contrast, CA 125 was elevated in only two of the nine patients with stage 1 cancer who were tested.

Dr. Xu is continuing to work with LPA, measuring it prospectively in women coming to Cleveland Clinic for any reason. It continues to have a high sensitivity and specificity in her hands. However, she has collected only about 50 samples, since most samples are being entered into a seven-center clinical trial sponsored by Atairgin Technologies, the company that has licensed this technique. Atairgin’s trial is enrolling high-risk women—those who have a suspicious pelvic mass.

Dr. Roden’s approach to identifying immunogenic ovarian tumor-associated antigens uses the recombinant expression cloning method called SEREX, in which cDNA from a patient’s tumor are recombined into the coliphage lambda. The sum of the bacterial viruses carrying the human genetic material is called a cDNA expression library. Each clone’s cDNA was expressed in bacteria, and the resulting proteins were screened using antibodies from an ovarian cancer patient.

Five clones whose proteins reacted with the patient’s antibodies were further investigated. Most efforts so far have focused on two of these proteins, called HOX A7 and HOX B7. (HOX stands for "homeobox.") "Homeobox proteins are transcription factors that have very potent effects during development," Dr. Roden says, "so we sought to assess the association with ovarian cancer of these two human proteins."

In a preliminary study, Dr. Roden and his colleagues looked at the prevalence in patients and controls of antibodies reactive with HOX A7 or HOX B7. For HOX A7, sera from healthy women were minimally reactive, while sera from 16 of 24 women with moderately differentiated ovarian serous carcinoma had reactivity. Reactivity to HOX A7 was also present in serum from 13 of 19 patients with benign serous cystadenoma and one of 24 with poorly differentiated ovarian carcinoma.

For HOX B7, serum from 13 of 39 ovarian cancer patients and one of 29 healthy women were reactive. One complication was that HOX B7 RNA expression was elevated in all 15 ovarian carcinomas tested, even though not all patients made anti-B7 antibody. "It looks as though in some cases a patient can have elevated expression without antibody formation," Dr. Roden says.

He is also intrigued by finding strong antibody levels to HOX A7 even with localized disease, such as benign serous cystadenoma. On the negative side, that could reduce the test’s ability to distinguish benign from malignant ovarian disease. On the positive side, Dr. Roden notes, "That observation holds promise for detection of localized carcinoma." However, he says, "We have not yet tested blood samples from patients with early stage ovarian carcinoma—which is really what is wanted."

Detecting potential biomarkers by measuring the immune response to tumor-associated antigens rather than tumor-associated antigens themselves has several advantages, Dr. Roden says. One is that the test for antibodies in a patient’s serum is very simple. Another is that the tumor-associated antigen does not have to be released from the tumor to stimulate an immune response. "Most important," Dr. Roden says, "recognition of an immune response represents a powerful amplification of the signal—the tumor-associated antigen—that the tumor presents."

He is continuing to screen for new antigens with an improved methodology. Expression of antigens in bacteria through the lambda system does not allow for posttranscriptional modification of proteins, such as glycosylation. "Such changes may occur in tumors and be recognized by antibodies," Dr. Roden notes. He is now using patient antibodies to precipitate tumor antigens, which are then identified by mass spectrometry.

Amplistar Inc. is validating the utility of the five immunogenic ovarian cancer gene products for diagnostic purposes by conducting a larger case-control series.

Dr. Roden says that it is still "early days" in the assessment of the clinical value of this approach.

To identify proteomic patterns characteristic of ovarian cancer, a computer algorithm is employed. Dr. Liotta explains that the first step is to scan the thousands of proteins in a drop of serum with mass spectroscopy looking at varied subsets of proteins to detect any reproducible pattern of difference between normal and ovarian cancer. However, the computer must first be trained to recognize such patterns by showing it a training set of samples.

"In training the computer, selecting the training set is probably the most important step," says Dr. Petricoin. "These types of AI-based machine learning systems are not like having HAL on the other side of the computer screen saying, ’Hello, Dave,’" he notes. "These are dumb systems. If you don’t show it the right data set, it can’t learn."

For ovarian cancer, the training set comprised 50 blood samples from women with various stages of ovarian cancer as well as 50 samples from healthy women—those who had been followed for five years after the serum sample was taken and had no evidence of ovarian cancer, although some did have ovarian cysts.

Once the algorithm could distinguish benign from malignant samples with complete accuracy, it was shown a test set of 50 samples from women with ovarian cancer and 50 from women free of malignant disease, all distinct from the training set. In addition, the testing set included 16 samples from women with a variety of nonmalignant disease conditions, such as sinusitis and rheumatoid arthritis. "It is important to show that your system can discriminate malignant from benign disease," Dr. Petricoin says. In a general population, nonmalignant conditions such as ovarian cysts, fibroids, endometriosis, and general inflammatory conditions will be seen much more frequently than ovarian cancer. Attaining adequate specificity demands that the algorithm can distinguish between ovarian cancer and these other disorders.

In the test set, the algorithm correctly identified all cancers, including 18 stage 1 cases. It erroneously classified three of 66 (4.5 percent) benign samples as cancerous. These results yield a sensitivity (100 percent) exceeding the criteria cited by Dr. Skates, but a specificity (95 percent) that falls short of the 98 percent benchmark. A 4.5 percent false-positive rate is statistically impressive. However, given the low prevalence of ovarian cancer, among 20,000 screened women a 4.5 percent false-positive rate translates into 1,000 women wrongly sent for further workup to detect 10 who have ovarian cancer. "This is not exactly what we wanted," Dr. Petricoin concludes. "There is still some work that needs to be done." He notes that these numbers would be more acceptable in a high-risk population, so initial clinical testing will be done in women at increased risk of ovarian cancer.

In a refinement of the algorithmic approach, these investigators have subsequently evaluated combinations of models. Dr. Petricoin explains that the model that they published was only one of 12 potential models derived from the initial analysis of the spectroscopic datastream. They attempted to combine two or more models to increase specificity while maintaining sensitivity. Selecting the optimal combination of models is crucial. "If each model gets a different subset wrong, all you do ultimately is to get more wrong with the combination," Dr. Petricoin says. Eventually they did find a combination of models that was completely sensitive and specific for the test data set.

Validating a combination of models is the next step. "We are still trying to lock on to the best combination of models," Dr. Petricoin says. "We are excited to actually test this model in a true clinical setting." Later this year they anticipate opening the world’s first clinical reference laboratory for serum pattern proteomic diagnosis, which will handle serum samples for ovarian, breast, and prostate cancer in a clinical trials context. Analyses will be done in Dr. Liotta’s laboratory, which has CLIA and CAP licenses and provides clinical test results. "We will run all of these samples under a rigorous QC/QA process that we are developing to generate very reproducible results," Dr. Petricoin says.

Initially the combination of models will be evaluated in a clinical trial of high-risk women that will be supported by NCI and FDA. Subjects will be women with one of three risk factors for ovarian or breast cancer. One group will be women who are being followed for recurrence. A second group will be women who are at increased risk of ovarian and breast cancer because they have a first-degree relative with breast or ovarian cancer or because they have mutated BRCA genes. These women are being followed in high-risk clinics. "This is a very anxious and worried group of women," Dr. Petricoin says. "Some are choosing to have prophylactic oophorectomies without any medical justification to get rid of their anxiety because they saw their mother die of ovarian cancer."

In the third group will be women with a pelvic mass. In these women, serum will be collected before surgical analysis to see whether the proteomic pattern can distinguish benign from malignant cases. Pelvic masses seen on ultrasound turn out to be ovarian cysts or fibroids a very large percent of the time. Since prudent management dictates waiting two to three months to see whether the mass spontaneously resolves, there is a delay in the patient seeing a gynecologic oncologist, after which a laparoscopy may be done. "When we are talking about ovarian cancer, a delay of two to three months could translate into a real difference in survival probability," Dr. Petricoin says. "In that population, we might risk more false positives to get those with true cancer to be seen by a gynecologist oncologist as soon as possible."

In the clinical trials, proteomic testing will be done without charge to patients. Once the method is approved, any company can commercialize it using data gathered in the clinical trial for PMA filing and obtaining a commercial license from Correlogic Systems Inc. Dr. Petricoin says the proteomic method presents "a unique situation." License to the underlying intellectual property is held jointly by the U.S. government and Correlogic Systems, a software company that developed the basic algorithms and holds exclusive commercialization rights. Correlogic Systems and the U.S. government are coinventors and licensees of the patent.

Both Dr. Liotta and Dr. Petricoin express caution about how the proteomic technique is applied. "Right now we can’t envision this as being a general population screen for ovarian cancer," Dr. Liotta says. "Even if we had 99 percent sensitivity and specificity, we would have wrong results in one percent of persons screened. We couldn’t tolerate that level of misdiagnosis in a screen of millions of women. We don’t want to make any mistakes."

"It would be very dangerous to think about this as a general screening test," Dr. Petricoin concurs. "That is where many biomarkers have failed—zealous investigators pushed their favorite biomarkers as clinical screening tests. Just look at the conundrums that mammography is creating right now. We don’t want to increase confusion, but to try to help physicians make diagnoses."

Dr. Skates suggests that maximizing early stage preclinical sensitivity at a fixed specificity of 98 percent will require looking at multiple markers. First it will be necessary to work out a way to translate time series of measurements for one or more markers into a risk value. Then the level of risk can be set for a specificity of 98 percent. "We need to try both to identify markers that will maximize sensitivity and to find a way to combine that information into one number," Dr. Skates says.

Right now Dr. Skates and colleagues are comparing preoperative serum in cases to serum from control subjects. "We are not ready to implement this approach in a prospective trial," he says. "But through statistical analysis of data we are trying to identify the best panel of biomarkers to give optimal sensitivity for the target specificity." One question they are addressing is how many markers to use. "I don’t think there is an agreed-upon number of markers," Dr. Skates says. "It will be a trade-off between complexity of method and interpretation and diminishing returns from adding a new marker to the panel. Our hope is that these complex methods will be built into a system that measures multiple markers over time and summarizes the information in a single number that indicates the patient’s risk of having the disease."

Dr. Fishman, who is one of the principal investigators on this NCI-funded study, says that it will include both general and high-risk populations, as well as subjects with cancer. "We will evaluate biomarkers independent of each other and in combinations," he says. "We already have a fair amount of preliminary data showing that these markers are real. Now we are trying to validate them in larger populations—thousands of women—and to compare and combine them."

Dr. Skates emphasizes that, for now, "All of these results must be viewed in the context of a research study only, not for general clinical use, since none of these tests has been shown to have an impact on mortality." He adds, "We want to temper our enthusiasm to looking at these approaches through research studies—to determine their effect before advocating them for the general population."

As to when definitive data might be available, Dr. Skates says, "I am always very tentative when it comes to predictions, but I would hope we can identify a promising candidate panel by the end of the year and incorporate it into a prospective trial. Or at least begin the process of incorporating it into a prospective trial, which may be a pilot trial, sometime next year."

Dr. Fishman has a more ambitous goal for the next year. "My hope," he says, "is that a year from now we will have the answers. I anticipate at least two of the blood tests—which are in FDA evaluations right now—will be approved by this time next year. It would be absolutely fantastic if that would be the case."

William Check is a medical writer in Wilmette, Ill.