Suggested Citation:"7. Adopting New Technology in the Face of Uncertain Science: The Case of Screening for Lung Cancer." Institute of Medicine and National Research Council. 2003. Fulfilling the Potential of Cancer Prevention and Early Detection. Washington, DC: The National Academies Press. doi: 10.17226/10263.

7
Adopting New Technology in the Face of Uncertain Science: The Case of Screening for Lung Cancer¹

Lung cancer, an uncommon type of cancer at the start of the 20th century, is the leading cause of cancer death in the United States at the start of the 21st. An estimated 155,000 people died of lung cancer in 2002, more than died of breast, colon, and prostate cancer combined (ACS, 2002a). The prognosis after diagnosis is dismal. Five-year survival rates remain less than 15 percent and have changed little over the past 30 years (Travis et al., 1995). While lung cancer is mostly preventable through avoidance of tobacco products, smokers, health care providers, and scientists have unsuccessfully tried other preventive approaches, such as screening for early disease with chest radiographs and sputum cytology (secondary prevention).

Finding cancer earlier by screening seems intuitively appealing. Successful early detection of cervical cancer with Pap testing, breast cancer with mammography, and colon cancer through finding and removing polyps has lowered mortality from these cancers, thus providing impetus to search for early detection methods for other cancers. Unfortunately, the value of screening for other cancers, such as prostate-specific antigen (PSA) testing for prostate cancer, is less clear and has become more contentious (see also Chapter 5). Some cancers may be more amenable to early detection methods than others. For lung cancer, the prominent failures of chest radiographic and sputum cytology screening to lower disease mortality have led most organizations to recommend against screening for it.

¹	This chapter is based on a background paper prepared by Parthiv J. Mahadevia, Farin Kamangar, and Jonathan M. Samet (www.iom.edu/ncpb).

Recently a “high-tech” medical imaging device called spiral or helical computed tomography (CT) scan has renewed hope for finding an early detection method that can reduce mortality from lung cancer (Brice, 2000). Promising preliminary studies report that spiral CT scans can detect lung cancers at a smaller size than can chest radiographs (Henschke et al., 1999; Henschke et al., 2001; Sobue et al., 2002; Sone et al., 2001; Swensen et al., 2002). However, the clinical significance of these findings is unclear since long-term outcome data are unavailable. Randomized controlled trials evaluating spiral CT screening for lung cancer have only recently begun and conclusive efficacy data could be 5 to 10 years away.

Despite the lack of clear benefit, entrepreneurial radiology practices are marketing spiral CT screening directly to consumers (Lee and Brennan, 2002). Early dissemination of an unproven screening test raises many concerns and questions. Concerns include false-positive and false-negative tests, harms from subsequent invasive procedures or treatments, and sizable costs to consumers, payers, and society. Decision makers have many questions. Should consumers get these scans? How should health care providers counsel high-risk individuals interested in this technology? Should managed care organizations and other third-party payers cover the costs of screening? What experimental or observational study designs provide the best data in the most efficient manner?

The case study presented in this chapter evaluates this high-technology screening test through a review of past and current scientific evidence. Clinical studies of lung cancer screening techniques have close to a 50-year history. Taking a historical perspective, we review the lessons learned from past attempts in order to assist individuals, clinicians, and policy makers in making decisions on the use of lung cancer screening technology despite the uncertainty about its effectiveness.

SCREENING FOR LUNG CANCER BY CHEST RADIOGRAPHY AND SPUTUM CYTOLOGY

In the early 1950s several researchers noted that “X-ray surveying” of the population detected lung cancers in asymptomatic individuals (Lilienfeld, 1966), raising the possibility that screening for lung cancer by chest radiography might detect cancers at earlier stages, when there might be hope of operative resection and cure. At that time there was already lengthy experience with mass screening for tuberculosis with a similar goal: identification of cases at a stage when intervention was most likely to be effective. Screening for tuberculosis was a major public health activity, and screening clinics with mobile radiographic facilities were successfully used for this purpose. It seemed reasonable to extend these same approaches to an emerging epidemic of another fatal pulmonary disease.

Four prospective cohort studies of lung cancer screening were started in the 1950s to determine if screening by chest radiography could improve lung cancer survival rates: the Veterans Administration-American Cancer Society (VA) Study (Lilienfeld, 1966), the Philadelphia Neoplasm Research Project Study (Weiss et al., 1982), the South London Lung Cancer Study (Nash et al., 1968), and the Tokyo Metropolitan Government Study (Hayata et al., 1982). Those studies used survival data to evaluate effectiveness and found 5-year survival rates that ranged from 8 to 20 percent (Table 7.1), not a meaningful improvement from the historical lung cancer survival rate. Survival rates among patients who had undergone surgical resection were 12 to 44 percent, higher than the overall survival rate. Unfortunately, the four studies did not incorporate control groups, and any improvement in survival from screening could not be assessed.

Two other nonrandomized studies, the North London Lung Cancer study (Brett, 1969) and the Erfurt County, Germany, study (Wilde, 1989), evaluated screening by chest radiography and did have control groups (Table 7.1). Both studies included an intervention group that received chest radiographs every 6 months and a control group that had either no screening or less frequent screening than the intervention group. In both studies, the 5-year survival rate was higher among the intervention group than the control group (15 versus 6 percent in the North London Lung Cancer study and 14 versus 8 percent in the Erfurt County study). However, lung cancer

TABLE 7.1 Summary of Nonrandomized Prospective Trials of Lung Cancer Screening (1950s to 1970s)

Study	Veterans Administration-American Cancer Society Study, 1958–1961 (Lilienfeld, 1966)	Philadelphia Neoplasm Research Project, 1951–1965 (Weiss et al., 1982)	Tokyo Metropolitan Government Study (Hayata et al., 1982)	South London Lung Cancer Study, 1959–1963 (Nash et al., 1968)	North London Cancer Study, 1959 (Brett, 1969)	Erfurt County Study, 1953–1979 (Wilde, 1989)
Design, Population, and Number Screened^a	Uncontrolled prospective study; 14,607 males ages 45 and older	Uncontrolled prospective study; 6,136 males ages 45 and older	Uncontrolled prospective study; 1,871,374 radiographs	Uncontrolled prospective study; 67,400 males ages 45 and older	Controlled prospective study; screened group, 29,733; control group, 25,311; males ages 40 and older	Controlled prospective study; screened group, 41,532; control group, 102,348; males
Screening Interval and Method	6-month chest radiographs and sputum cytology	6-month chest radiographs	Annual chest radiographs	6-month chest radiographs	6-month chest radiographs	Screened group, 6-month chest radiographs; control group, 18-month chest radiographs
Incidence Rate (per 1,000 person-years)	0.52 percent^b	2.3 percent	10.3 cases/100,000 radiographs	1.4 percent	Screened group, 1.1 cases; control group, 1.0 case	Screened group, 0.9 percent; control group, 0.65 percent
Number of Cancers Found	73 cases	121 cases	193 cases	147 cases	Screened group, 101 cases; control group, 77 cases	Screened group, 374 cases; control group, 667 cases
Overall 5-Year Survival Rate	17 percent^c	8 percent	20.6 percent	27 percent^e	Screened group, 15 percent; control group, 6 percent	Screened group, 14 percent; control group, 8 percent
Percentage of Cancers Resected	36 percent	27 percent	56 percent	56 percent	Screened group, 44 percent; control group, 29 percent	Screened group, 28 percent; control group, 19 percent
5-Year Survival Among Those Who Had Resection	12 percent	18 percent	43.6 percent	47 percent^e	Screened group, 32 percent; control group, 23 percent	Screened group vs. control group, 52 vs. 27 percent; 10-year: 39 percent vs. 19 percent
Lung Cancer Mortality Rate (per 1,000 person-years)	0.7 percent^b	47 percent	NR	NR	Screened group, 0.7 percent; control group, 0.8 percent	Screened group, 0.8 percent; control group, 0.6 percent
Number of Cancers Found Between Screenings	5 cases	NR^d	67 cases	87 cases (estimated)	36 cases	Screened group, 199 cases; control group, 485 cases
Comments	High attrition rate; VA domiciliary sample	Volunteer sampling; high attrition rate	No mention of attrition or compliance	High attrition rate	High attrition rate	County-specific study
^aNumbers are incidence screened.
^bReported as a proportion (percent) of all patients only rather than as a rate.
^cReported as the 32-month survival rate.
^dNR = not reported.
^eReported as 4-year survival rate.

mortality rates were the same in both the intervention and the control groups. The discrepancy between the improved survival rate and the unchanged mortality rate was later explained by the previously mentioned biases that often affect screening data (see also Chapter 5).
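
The gap between improved survival and unchanged mortality can be illustrated with a small worked example. The sketch below is not drawn from any of the studies: the ten patients, the 1.5-year lead time, and the names Patient, five_year_survival, and deaths_by are all hypothetical. It assumes that screening moves the date of diagnosis earlier but leaves the date of death untouched, which is what pure lead-time bias means.

```python
from dataclasses import dataclass

LEAD_TIME_YEARS = 1.5  # hypothetical: how much earlier screening finds the tumor


@dataclass
class Patient:
    clinical_dx: float  # year in which symptoms would prompt diagnosis
    death: float        # year of death from lung cancer (same with or without screening)


def five_year_survival(patients, screened):
    """Fraction of patients still alive 5 years after their diagnosis date."""
    survivors = 0
    for p in patients:
        dx = p.clinical_dx - LEAD_TIME_YEARS if screened else p.clinical_dx
        if p.death - dx >= 5.0:
            survivors += 1
    return survivors / len(patients)


def deaths_by(patients, year):
    """Number of lung cancer deaths on or before the given study year."""
    return sum(1 for p in patients if p.death <= year)


# Hypothetical cohort: (year of clinical diagnosis, years lived after it).
cohort = [Patient(dx, dx + lived) for dx, lived in [
    (3, 1.0), (3, 2.0), (3, 3.0), (3, 4.0), (3, 4.5),
    (4, 0.5), (4, 3.6), (4, 5.5), (5, 2.0), (5, 6.0),
]]

print(five_year_survival(cohort, screened=False))  # 0.2
print(five_year_survival(cohort, screened=True))   # 0.5
print(deaths_by(cohort, year=10))                  # 9, with or without screening
```

In this toy cohort the 5-year survival rate rises from 20 to 50 percent, yet the same nine patients have died by year 10, because no death was postponed. The pattern resembles the one reported for the North London and Erfurt County comparisons.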

These early studies were nonrandomized intervention studies that might be called “demonstration projects” today. Although the clinical trial was an established method for the evaluation of therapeutic interventions at the time, it had not yet been applied to the evaluation of screening. The landmark randomized controlled screening trial—the Health Insurance Plan of New York, which studied breast cancer—was not started until the mid-1960s (Shapiro, 1997).

Although most of the early lung cancer screening studies used chest radiography as the principal screening test, the VA study also evaluated sputum samples as another method for the detection of cancer. Oscar Auerbach (Auerbach, 1969), a pathologist, showed that a spectrum of histologic abnormalities could be found in the respiratory epithelia of smokers, ranging from normal cells to frank malignancy. Geno Saccomanno and colleagues (Saccomanno et al., 1974), who developed the techniques needed for the preparation of specimens of respiratory cells for cytological examination, showed that this spectrum of abnormalities was mirrored in exfoliated cells from the lung. These observational studies provided a rationale for screening for lung cancer by cytological examination of sputum (sputum cytology), which was considered a screening technique complementary to chest radiography. Radiography was presumed to be better at finding radiographically visible peripheral cancers, which originate in the small airways and alveoli (air sacs) of the lung, whereas sputum cytology would find centrally located and hence radiographically invisible cancers arising from the larger airways of the lung, the bronchi. The VA study estimated that the addition of sputum cytology increased the rate of detection of lung cancer by 50 percent compared with that by the use of chest radiography alone.

On retrospective assessment, these early lung cancer-screening studies had serious flaws, including a failure to have a control group and to randomize the participants to screened and nonscreened groups. Consequently, the results may have been affected by the time-related biases that arise in screening studies. Their results were also limited by attrition of the study populations, poor compliance with the screening regimen, difficulties with sputum collection, and high rates of mortality from surgery.

For screening to be effective, most enrollees should be compliant with the screening regimen. In the VA study, roughly 30 percent of the initial enrollees failed to return for a second chest radiograph. By the third year, only 685 of the initial 14,607 enrollees received their recommended chest radiographs. The Philadelphia Neoplasm Research Project study noted that noncompliant individuals had a 76 percent higher rate of lung cancer than participants who complied. If dropouts are more likely to have the disease, the effectiveness of any screening program may seem to be lower, as the individuals at greatest risk are less likely to receive the intervention.

The quality of screening by sputum cytology in these studies was not optimal. International standards for cytological classification were not yet developed, and there was a high degree of variability in interpretation of abnormalities that fell between the normal and the malignant states (Fullmer, 1970). The significance of finding “atypical” cells was unclear. In 1970, Fullmer noted that priority areas for the enhancement of sputum cytology as a screening method included further refinements in the sample collection technique, education of the technicians who performed the cytological examination, reductions in costs, and establishment of international standards. A positive sputum cytology result requires follow-up by another test to localize the cancer. The VA study had difficulty finding the lung cancer when the sputum cytology result was considered positive but the chest radiography result was negative. The poor localization of cancer made surgical resection less effective, if not impossible. Finally, postoperative death rates were high, approaching 30 percent in the Philadelphia Neoplasm Research Project study.

Randomized Controlled Trials of Chest Radiographic and Sputum Cytology Screening

Building on the earlier studies, the National Cancer Institute (NCI) sponsored three randomized controlled trials of lung cancer screening in the 1970s: the Johns Hopkins Lung Project (the Hopkins study) (Tockman, 1986), the Memorial Sloan Kettering Lung Project (the Memorial study) (Melamed et al., 1984), and the Mayo Lung Project (the Mayo study) (Fontana et al., 1986). A fourth randomized controlled trial was performed in the Czech Republic (Kubik and Haerting, 1990). The studies addressed the key design deficiencies of the earlier studies: assignment to screening was by randomization, and careful conduct of the studies addressed issues of compliance and attrition (Berlin et al., 1984). Technological advancements such as CT and flexible fiberoptic bronchoscopy improved the ability to localize the cancer in persons with positive screening results. In addition, surgical techniques had improved, and the postoperative mortality rate had declined since the earlier studies.

The individuals in the intervention arms of all three NCI studies underwent both chest radiography and sputum cytology every 4 months. The individuals in the control arms of the Hopkins and Memorial studies also underwent chest radiography annually. The Mayo study had a different control arm; enrollees were given advice only at the time of enrollment to have chest radiography and sputum cytology performed annually. In the Czech study, which lasted 3 years, the intervention group underwent chest radiography and sputum cytology every 6 months, whereas the control group had both tests at the beginning and at the end of the study.

TABLE 7.2 First Screening (Prevalence) Results from NCI-Sponsored Randomized Controlled Trials of Lung Cancer Screening Using Chest Radiographs and Sputum Cytology

Study	Johns Hopkins Lung Project, 1973–1978 (Tockman, 1986)	Memorial Sloan Kettering Lung Project, 1974–1978 (Melamed et al., 1984)	Mayo Lung Project, 1971–1976 (Fontana et al., 1986)	NCI composite results of the three preceding trials^a (Berlin et al., 1984)	Czech study (Kubik and Haerting, 1990)
Population and Numbers Screened	10,387 male volunteers, ages 45+ with median 28.5 pack-year history of smoking	10,040 male volunteers, ages 45+ with median 31.2 pack-year history of smoking	10,933 male volunteers, ages 45+ with median 20 pack-year history of smoking	31,360 males	6,364 males ages 40–64 with 32-year smoking history
Screening Intervention (number of subjects in each group)	I (5,226): CXR and SC; C (5,161): CXR	I (4,968): CXR and SC; C (5,072): CXR	All enrollees received CXR and sputum cytology	I (21,127): CXR and SC; C (10,233): the control cases in the Hopkins and Memorial studies	All enrollees received CXR and sputum cytology
Prevalence Rate (per 1,000 persons)	Overall: 7.6; I: 7.5; C: 7.8	Overall: 5.3; I: 6.0; C: 4.5	Overall: 8.3	Overall: 7.1; I: 7.6; C: 6.2	Overall: 3.0
Number of Cancers Detected	I: 39; C: 40	I: 30; C: 23	Overall: 91	I: 160; C: 63	Overall: 19
5-Year Survival Rate in Study Group(s)	I: 59 percent; C: 35 percent	I: 47 percent; C: 31 percent	Overall: 40 percent	Hopkins/Memorial I: 55 percent; Mayo group: 40 percent; Hopkins/Memorial C: 35 percent; overall: 45 percent	Overall: 26 percent
Number of Stage 1 Cancers Detected	I: 26; C: 16	I: 14; C: 8	Overall: 41	Overall: 105; I: 81; C: 24	Overall: 5
5-Year Survival Rate for All Stage 1 Disease^b	90 percent	85 percent	70 percent	80 percent	NR
Percentage of All Cancers Resected	I: 69 percent; C: 42 percent	I: 60 percent; C: 48 percent	54 percent	I: 76 percent; C: NR^c	33 percent
Cancers Detected by Sputum Cytology Alone	11	9	17	37	NR
Number of Second Primary Lung Cancers	8	6	7	21	NR
Comments	2 postoperative deaths	2 postoperative deaths; 16 surgeries for non-malignant lesions	3 postoperative deaths; 28 surgeries for non-malignant lesions
NOTE: Results are reported separately for groups receiving the intervention (I) and those that were controls (C). CXR = chest radiography; SC = sputum cytology.
^aThe NCI intervention group includes all of the Mayo study subjects and the intervention groups in the Hopkins and Memorial studies. These are then compared with the control groups in the Hopkins and Memorial studies.
^bThese survival rates reflect those that were resected, not all stage I disease.
^cNR = not reported.

TABLE 7.3 Incidence Screening Results from Randomized Controlled Trials of Lung Cancer Screening Using Chest Radiographs and Sputum Cytology

Study	Johns Hopkins Lung Project, 1973–1978 (Tockman, 1986)	Memorial Sloan Kettering Lung Project, 1974–1982 (Melamed et al., 1984)	Mayo Lung Project, 1971–1976 (Fontana et al., 1986)	Czech study (Kubik and Haerting, 1990)
Population and Numbers Screened	10,387 male volunteers ages 45+ with median 28.5 pack-year history of smoking	10,040 male volunteers ages 45+ with median 31.2 pack-year history of smoking	10,933 male volunteers ages 45+ with median 20 pack-year history of smoking	6,364 males ages 40–64 with 32-year smoking history
Screening Intervention (numbers in each group)	I (5,226): CXR and SC; C (5,161): CXR	I (5,072): CXR and SC; C (4,968): CXR	I (4,618): CXR and SC; C (4,593): annual advice to get CXR	I (3,171): CXR and SC every 6 months for 3 years; C (3,174): CXR and SC 3 years apart
Incidence Rate (per 1,000 person-years)	I: 4.6^b; C: 4.9^b	I: 3.7; C: 3.8	I: 5.5; C: 4.3	I: 6.0; C: 4.5
Number of Cancers Detected	I: 155; C: 162	I: 114; C: 121	I: 206; C: 160	I: 108; C: 82
5-Year Survival in Study Groups	I: 20 percent^c; C: 20 percent^c	I: 36 percent; C: 33 percent	I: 33 percent; C: 15 percent	I: 18 percent; C: 18 percent
Number of Early vs. Advanced Cancers Found^a	I: early vs. advanced, 83 and 111; C: early vs. advanced, 93 and 109	I: early vs. advanced, 54 and 85; C: early vs. advanced, 68 and 86	I: early vs. advanced, 99 and 107; C: early vs. advanced, 51 and 109	I: early vs. advanced, 55 and 53; C: early vs. advanced, 36 and 46
Percentage of All Cancers Resected	I: 47 percent^c; C: 44 percent^c	I: 51 percent; C: 53 percent	I: 46 percent; C: 32 percent	I: 23 percent; C: 23 percent
5-Year Survival for Cancers That Were Resected	NR^d	80 percent^e	50 percent	26 percent
Mortality Rate (per 1,000 person-years)	I: 3.4; C: 3.8	I: 2.7; C: 2.7	I: 3.2; C: 3.0	I: 3.6; C: 2.6
Number of Cancers Found Between Screenings or Due to Symptoms	193 total	I: 44; C: 56	I: 116; C: 160	I: 47; C: 44
Additional Number of Cancers Found by SC	22	18	18	2
NOTE: Results are reported separately for groups receiving intervention (I) and those that were controls (C). CXR = chest radiography; SC = sputum cytology.
^aEarly cancers are those staged as 0, 1, or 2, and late cancers are those staged as 3 or 4.
^bThese results are based on interim results; the final results did not report these statistics.
^cEight-year survival.
^dNR = not reported.
^eThese results are for stage 1 cancers only.

The Hopkins and Memorial studies thus evaluated the contribution of sputum cytology to early detection, whereas the Mayo and the Czech studies were designed to assess the combined effects of sputum cytology and chest radiography.

Results were reported for the first screening interval, also called the prevalence screening interval, separately from subsequent or incidence screening intervals (see Table 7.2 versus Table 7.3). This distinction was appropriate since prevalence data measure the burden of disease in the population as the study starts. Prevalence information indicates the potential benefit of one-time or short-term screening, but prevalence gives no insights into the effect of screening for new cases or for reductions in the rate of mortality from the disease. Length-time bias typically introduces apparent screening effectiveness into the prevalence screening results (Melamed et al., 1984). The effectiveness of screening tests is best measured by determining whether those individuals who were negative after the first screening benefit from ongoing screening. When this initially negative population is screened repeatedly, any subsequent new cancers are thought to be more representative cases of the disease in terms of the growth characteristics of the disease. This subsequent screening, or incidence screening, provides the needed estimate of the value of long-term screening.

Prevalence data from these randomized controlled studies showed 3 to 8 cancers per 1,000 persons screened (Table 7.2). The Hopkins and Memorial studies randomized their study populations from the start of the study. The combined 5-year survival rates for both studies were 55 percent for the screened groups and 35 percent for the control groups. The Czech and Mayo studies, which did not separate the groups into screened and control groups at the first screening, had overall survival rates of 26 and 40 percent, respectively. Composite results for all three NCI-sponsored trials show that 51 percent (81 of 160) of the lung cancer cases among the intervention group were stage I, whereas 38 percent (24 of 63) of the lung cancer cases in the control group were stage I. The 5-year survival rates among all individuals with stage I cancers ranged from 70 to 90 percent. Forty-two to 76 percent of the cancers were surgically resected. Chest radiography was more sensitive than sputum cytology for the detection of peripheral cancers. The addition of sputum cytology as a screening technique was considered complementary to chest radiography because 37 cancers were detected by sputum cytology alone. Nearly all cancers detected by cytology were centrally located squamous cell carcinomas. Persons with these cancers had the best survival rates compared with the survival rates for the persons in the other groups, suggesting that these cancers tend to have slower growth rates.

NCI concluded that the preliminary data were encouraging. The composite 5-year survival rate was 45 percent, much higher than the historical survival rate. The NCI investigators wrote optimistically, “It is probable that some of the patients who had lung cancer detected by screening and successfully resected, and are alive and free of cancer today, would have died of their cancers had they not been screened.... Chest radiographs were the most sensitive method for detecting lung cancer” and sputum cytology for squamous cell cancers (Berlin et al., 1984). They cautioned that the high 5-year survival rate was artificially elevated because of the effects of lead-time bias, length bias, and overdiagnosis bias. Mortality rates from subsequent incidence data would determine whether screening was effective.

Incidence data from the randomized controlled trials showed annual new cancer rates of 4.3 to 6.0 per 1,000 persons. The Hopkins and Memorial studies found similar numbers of cancers in each group (intervention group versus control group in the Hopkins study, 155 versus 162; intervention group versus control group in the Memorial study, 114 versus 121) (Table 7.3). The Mayo and Czech studies found more cancers among the intervention group than the control group (206 versus 160 in the Mayo study and 108 versus 82 in the Czech study).

As mentioned earlier, an effective screening test should result in the detection of a greater proportion of early-stage cancers and a lower proportion of late-stage cancers (a stage shift). Among all three of the NCI studies, the total numbers of early cancers detected were 240 in the intervention groups and 212 in the control groups; that is, 28 more early-stage cancers were found in the intervention groups. Unfortunately, the gain in the numbers of cases of early-stage cancer was not offset by a decrease in the number of cases of advanced, late-stage cancer: 303 in the intervention groups and 304 in the control groups. Thus, a stage shift did not take place and the increased number of cases of early-stage cancer suggests possible overdiagnosis bias (Eddy, 1990b). The large number of interval cases—that is, cases not detected by screening but diagnosed between scheduled visits—is further evidence of the ineffectiveness of the screening interventions. Either the prior screening missed the tumors or the tumors represent new, aggressively growing tumors not amenable to screening.
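
The stage-shift argument reduces to simple arithmetic on the four totals quoted in the preceding paragraph. The sketch below uses only those totals; the function name stage_shift_summary and its field names are illustrative, and the interpretation assumes intervention and control arms of comparable size, as randomization is meant to produce.

```python
def stage_shift_summary(early_i, early_c, advanced_i, advanced_c):
    """Compare early- and advanced-stage case counts between trial arms."""
    early_gain = early_i - early_c             # extra early-stage cases with screening
    advanced_change = advanced_i - advanced_c  # negative if screening removed late cases
    return {
        "early_gain": early_gain,
        "advanced_change": advanced_change,
        # Cases found with screening that have no counterpart in the control arm.
        "net_excess_cases": early_gain + advanced_change,
    }


# Totals for the three NCI trials as quoted in the paragraph above.
summary = stage_shift_summary(early_i=240, early_c=212, advanced_i=303, advanced_c=304)
print(summary)
# {'early_gain': 28, 'advanced_change': -1, 'net_excess_cases': 27}
```

A true stage shift would show the advanced-stage count falling by roughly as much as the early-stage count rises, leaving the net excess near zero. Here 27 of the 28 additional early-stage cases are unmatched by any drop in advanced disease.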

The survival and mortality rate data were disappointing. Of the four randomized controlled studies, only the Mayo study had a favorable 5-year survival advantage for participants in the intervention group: 33 percent for the intervention group compared with 15 percent for the control group. These data need to be interpreted with consideration of potential biases, as described above. The other studies showed no survival advantage for screened participants, and all four studies showed no meaningful reduction in the rate of mortality from lung cancer as a result of screening. In the Mayo and Czech studies, the mortality rates were actually worse for those receiving the screening intervention.

Other studies that have evaluated screening for lung cancer by chest radiography include three case-control studies and a historical cohort study (Ebeling and Nischan, 1987; Hillerdal, 1996; Okamoto et al., 1999; Sobue et al., 1992b). The case-control study of Okamoto and colleagues (Okamoto et al., 1999) was the only one to find a statistically significant beneficial effect of screening. The other three studies did not show significant benefits for screening. The case-control studies are subject to many biases inherent to retrospective data collection, in addition to the time-related biases that can affect interpretation of any data on screening. Selection of an appropriate control group is often difficult, and information obtained from records and interviews may be flawed.

Lessons Learned from Lung Cancer Screening Studies

Why was screening with chest radiographs and sputum cytology unsuccessful? Trial investigators and epidemiologists have offered several explanations. In her analysis of the lung cancer screening studies, Hulka (1986) offered two scenarios that could have explained the lack of mortality reduction in these trials. First, the duration of the preclinical phase of lung cancer may be short, implying that most of these malignancies are too aggressive to benefit from early detection. Secondly, sputum cytology and chest radiography may not have had the accuracy needed for early detection and subsequent mortality reduction.

With regard to the preclinical phase, the rate of mortality from a fatal disease can be reduced by screening only if the preclinical phase is sufficiently long and early detection leads to more effective interventions. For clinically diagnosed lung cancer, which is invariably fatal unless it is treated, the duration of the preclinical phase is not well characterized. Walter and colleagues (Walter et al., 1992) estimated the preclinical phase from the data from the Czech study, offering an estimate of 7 to 8 months (95 percent confidence interval, 6 months to a year), which is shorter than that for other cancers. They recommended that, at a minimum, biannual screening would be needed to detect the majority of cancers during the preclinical phase. Flehinger and colleagues (Flehinger et al., 1993), using data from the Mayo study, estimated that the mean duration of early-stage disease is 4 years. The rate of detection of disease at the early stage was low (less than 25 percent) (Flehinger et al., 1993). An analysis of the data from the Hopkins and Memorial studies by some of the same investigators found similar results (Flehinger and Kimmel, 1987). The actual duration of the average preclinical phase of lung cancer remains unclear.

Was the choice of diagnostic tests an issue? Inadequate sensitivity² of the tests was found in the studies. The VA study reported the sensitivities of chest radiography and sputum cytology to be 42 and 33 percent, respectively. The Hopkins study and a study conducted in Osaka, Japan (Sobue et al., 1991), estimated the sensitivities of chest radiography to be 50 and 57 percent, respectively. The corresponding estimates for the sensitivity of sputum cytology were 25 and 31 percent. All three trials estimated the combined sensitivities for both screening modalities: 63 percent in the VA study, 67 percent in the Hopkins study, and 72 percent in the Osaka study. A systematic review of the sputum cytology literature, including studies done as early as 1935, found a wide range of test sensitivities, 22 to 98 percent, with the average being 64 percent (Bocking et al., 1992). Tockman (2000) reviewed the accuracy of chest radiography and sputum cytology for all cancers diagnosed in the NCI randomized trials. Both tests together detected 49 percent of all cancers, and sputum cytology alone detected 11 percent. The specificities of chest radiography or sputum cytology, or both, were generally high, about 95 percent.

²	See Chapter 5 for the definitions of screening parameters such as sensitivity, specificity, and predictive value.
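One way to see how two individually weak tests reach the reported combined sensitivities is to assume that each test misses cancers independently of the other. That independence assumption is ours, for illustration only; the trials measured combined sensitivity directly. The sketch below compares the resulting estimate with the reported figures, which it approaches to within about five percentage points.

```python
def combined_sensitivity(sens_a, sens_b):
    """Sensitivity of 'either test positive', ASSUMING the two tests miss independently."""
    return 1 - (1 - sens_a) * (1 - sens_b)


# (chest radiography, sputum cytology, reported combined sensitivity), from the text.
studies = {
    "VA": (0.42, 0.33, 0.63),
    "Hopkins": (0.50, 0.25, 0.67),
    "Osaka": (0.57, 0.31, 0.72),
}

estimates = {}
for name, (radiography, cytology, reported) in studies.items():
    estimates[name] = combined_sensitivity(radiography, cytology)
    print(f"{name}: independence estimate {estimates[name]:.3f}, reported {reported:.2f}")
```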

The inaccuracy of chest radiography for lung cancer detection has also been shown by studies evaluating its rate of false-negative results. False-negative results represent the proportion of cancers present but missed by the screening test. Chest radiographs are known to detect cancers as small as 6 millimeters (mm) in diameter if the cancers happen to lie in the intercostal spaces (the clear area between the ribs). However, radiologists often miss cancers smaller than 10 mm on a chest film. In a study of missed lung cancers, the average size of the tumors was 16 mm and the largest tumor was 34 mm (Austin et al., 1992). In the Mayo study, only 1 of 50 peripheral cancers detected by screening was less than 10 mm. Seventeen measured between 10 and 20 mm, and 19 were between 20 and 30 mm (Sanderson and Fontana, 1982).

Other potential problems with these studies have been described. Using the Mayo study as an example, the study’s investigators gave insightful comments regarding the contamination of study groups, a lack of stage shift, and possible overdiagnosis (Fontana et al., 1991). They noted that the control group had high rates of screening by chest radiography during the last 2 years of the screening phase. At the time it was common practice to screen for lung cancer, and contamination of the control group could not be avoided. Nearly 75 percent of control subjects underwent radiography. In addition, the intervention group complied with the 4-month regimen of sputum cytology and chest radiography only 75 percent of the time, thereby reducing the possibility of observing the full impact of the screening. Even under conditions of no contamination, the statistical power of the Mayo study was limited (statistical power refers to the likelihood that a study will find a particular effect if the effect exists). Small reductions in the rate of mortality from lung cancer (reductions on the order of 10 to 20 percent) could easily have been dismissed as not statistically significant (Flehinger et al., 1993).

Excess cases were diagnosed among the screened groups, suggesting the possibility of overdiagnosis. If both groups were appropriately randomized, the risk for lung cancer would be equivalent, and similar numbers of cancer cases should be detected in each group. Yet, at the end of 10 years of follow-up, the intervention group had 46 more cases of disease than the control group. It is possible that some in the control group, who were less closely monitored than the screened group, died from other causes before a diagnosis of lung cancer could be made (Type II Pseudodisease); therefore, the 46 cases could have been overdiagnosed. Fontana et al. (1991) noted that the death rate from all causes was high among this population. “Predictably, the great majority of deaths were attributable to ischemic heart disease (which alone accounted for 50 percent of all deaths)” (p. 1160).

Strauss and colleagues (Strauss et al., 1995) argue that overdiagnosis bias is an unlikely problem in lung cancer, given the aggressive course of lung cancer and the rarity of undiagnosed lung cancers found in autopsy studies. The rate of survival among patients with stage I cancers who refused surgery is dismal, suggesting that practically all lung cancers are lethal if left untreated (Sobue et al., 1992a). Others contend that the natural history of early-stage cancers is unknown and that overdiagnosis bias is possible, even though it is difficult to document (Black, 2000).

Mortality from surgery and unnecessary therapy took place in the trials. Seven deaths were reported after the 122 surgeries. The Mayo and Memorial studies reported that 40 surgeries were performed for conditions that mimicked lung cancer. Common diagnoses among these cases included hamartomas, healed infarcts, granulomatous lung diseases such as tuberculosis, and rare tumors such as mesothelioma and thymoma. One man with a benign condition died from a myocardial infarction after surgery. Given the persistent risks of thoracic surgery, the harm that may result from false-positive diagnoses cannot be dismissed.

In general, survivors of a first lung cancer are at higher risk for the development of another lung cancer and may need more frequent screening. Among the studies evaluated here, second primary lung cancers occurred in 21 (17 percent) of the 122 lung cancer patients who had surgical resections (Berlin et al., 1984; Fontana et al., 1972; Frost et al., 1984; Melamed et al., 1984). Other lessons from these studies include the difficulty of motivating smokers to participate in regular screening, the need for close follow-up since the requirements for additional workups are high, and the costs attendant to such care.

Screening Recommendations from Medical Organizations

The American Cancer Society, which had recommended the annual screening of “heavy cigarette smokers” and workers exposed to asbestos by chest radiography, dropped this recommendation in 1980 (Eddy, 1980a).

Eddy wrote, “The Society has changed its policy and does not recommend any tests for the early detection of cancer of the lung, but urges a focus on primary prevention: helping smokers to stop (or switch to low tar and nicotine cigarettes), and keeping nonsmokers from starting. People with signs and symptoms of lung cancer should consult their physicians” (p. 205). The rationale for the policy change was that screening techniques must reduce morbidity and mortality from the disease, which these trials clearly did not establish. Furthermore, harm from screening due to false-positive workups and iatrogenic complications with corresponding costs would make such screening unattractive. The NCI trials were done at respected academic medical centers with well-trained health care professionals. If widespread mass screening were incorporated, the rates of workups because of false-positive results and subsequent harm would rise, given the inexperience and wide variability in the quality of care. A lack of experienced cytologists to read the sputum smears was cited as an example of the limitations of the infrastructure available for the implementation of widespread mass screening.

The American Cancer Society’s position paper did leave the door open for change: “Although at present there is insufficient evidence that screening is effective in reducing lung cancer mortality, there is no proof that it is not effective. As stated before, every case is different, and it may be that even knowing the lack of evidence of benefit and the potential risks, some individuals may choose to have early detection examinations. The Society’s recommendations are not meant to discourage this” (Eddy, 1980a, p. 206).

A U.S. Preventive Services Task Force (1990) position paper stated, “screening asymptomatic persons for lung cancer with routine chest radiographs or sputum cytology is not recommended” (p. 1763). The Task Force noted that “accuracy of the chest radiograph is limited by the capabilities of the technology and by variation in interpretation among radiologists” (p. 1763). “Furthermore, the yield of screening chest radiography to detect cancer is low, largely because of the low prevalence of lung cancer in the general population and even among asymptomatic smokers” (p. 1763). The NCI prevalence data indicate that only 0.39 percent of the screened population had lung cancer. Sputum cytology was a less effective technique since chest radiography detected the majority of cancers. The paper concluded that $1.5 billion would be spent if mass screening for high-risk groups was advocated and that there would be significant harm from follow-up testing. It concluded, “Primary prevention may be more effective ... cigarette smoking is responsible for more than 90 percent of lung cancers and should therefore be the principal focus of clinical efforts to help prevent this disease” (U.S. Preventive Services Task Force, 1990, p. 1765).
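The link between low prevalence and low yield can be illustrated with Bayes' rule. The sketch below uses the 0.39 percent prevalence quoted above; the sensitivity (63 percent) and specificity (95 percent) are assumptions borrowed from figures reported earlier in this chapter, not values the Task Force used for this calculation.

```python
def positive_predictive_value(prevalence, sensitivity, specificity):
    """Probability that a person with a positive screen actually has the disease."""
    true_positives = prevalence * sensitivity
    false_positives = (1 - prevalence) * (1 - specificity)
    return true_positives / (true_positives + false_positives)


prevalence = 0.0039   # NCI prevalence data quoted in the text
sensitivity = 0.63    # assumption: combined sensitivity reported for the VA study
specificity = 0.95    # assumption: "about 95 percent" as reported earlier

ppv = positive_predictive_value(prevalence, sensitivity, specificity)
print(f"Share of positive screens that are true cancers: {ppv:.1%}")
```

Under these assumptions, fewer than 1 in 15 positive screens would reflect a true cancer, which is why workups for false-positive results dominate at this prevalence.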

The NCI, the Food and Drug Administration, the American College of Radiology, the Royal College of Radiologists, the World Health Organization, and the Canadian Task Force on Periodic Health Examinations reached the same conclusions. All strongly endorsed smoking cessation as the principal method of prevention.

A NEW MEDICAL IMAGING SCREENING TEST: THE SPIRAL CT SCAN

As a lung cancer screening test, chest radiography lacked the sensitivity to find early-stage cancers. When a cancer is seen on a radiograph it is now common clinical practice to obtain a CT scan to evaluate the lesion. The conventional CT scan is more precise in measuring the size of the lesion, the shape of the lesion, the number of microcalcifications contained in or around the lesion, and enlargement of lymph nodes (for spread of malignancy). CT scans are more sensitive than chest radiographs in finding small cancers hiding around blood vessels and old scars (Sone et al., 2000). If CT scans are superior imaging tests, why could they not be used to screen for lung cancer?

Conventional CT scanning emits a higher dose of ionizing radiation than chest radiography. Chest radiographic radiation exposure ranges from 13 to 20 millirads (mrads). Conventional CT scans can emit 1,400 mrads of exposure, or 70 times the dose of a single chest radiograph (Naidich et al., 1990). The average radiation dose from natural sources, which comes primarily from indoor radon, is estimated to be 300 mrads per person per year (Black, 1999b). Widespread mass screening with this level of ionizing radiation could cause harm in many individuals and could even increase the incidence of lung cancer (Eddy, 1980a). Furthermore, conventional CT scans are time-consuming for radiologists to read, are costly to produce, and produce more false-positive results than chest radiographs.

Spiral CT scanning has some advantages over conventional CT scanning for screening purposes. First, spiral CT scans emit less radiation, estimated around 260 mrads, than conventional CTs, prompting the description “low-dose” CT. Several studies have investigated spiral CT scanners that emit yet lower radiation (Anderson et al., 1991; Diederich et al., 1999; Kanazawa et al., 1998; Nitta et al., 1998; Nitta et al., 1999), so-called ultra-low-dose CT scanners. The radiation doses from these machines are lower and the diagnostic accuracy is reportedly not compromised. Secondly, by scanning in a spiral fashion, spiral CT scans are fast and scanning can be completed in a single breath hold of 15 to 30 seconds. Finally, out-of-pocket charges to the consumer for a spiral CT screen are approximately $300 (Brice, 2000), which is less than for a conventional CT scan but more than for a chest radiograph. Similar to conventional CT scanning, radiologist time for interpretation and false-positive test results remains a problem with spiral CT scanning as a screening test.
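The dose figures quoted in this section can be put on a common scale. The sketch below is illustrative only: it uses the upper end of the chest radiograph range (20 mrads) as the reference, which is the value implied by the "70 times" comparison, and the variable names are ours.

```python
CHEST_RADIOGRAPH_MRAD = 20      # upper end of the 13-20 mrad range quoted in the text
NATURAL_BACKGROUND_MRAD = 300   # average yearly dose from natural sources, per the text

doses_mrad = {
    "spiral (low-dose) CT": 260,
    "conventional CT": 1400,
}

ratios = {}
for modality, dose in doses_mrad.items():
    vs_radiograph = dose / CHEST_RADIOGRAPH_MRAD
    vs_background = dose / NATURAL_BACKGROUND_MRAD
    ratios[modality] = (vs_radiograph, vs_background)
    print(f"{modality}: {vs_radiograph:.0f}x a chest radiograph, "
          f"{vs_background:.2f}x a year of natural background")
```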

Four nonrandomized, uncontrolled studies have reported spiral CT screening data for both prevalence and incidence screening (Tables 7.4 and 7.5). Two studies, the Early Lung Cancer Action Project (ELCAP) study (Henschke et al., 1999; Henschke et al., 2001) and a Mayo Clinic study (Swensen et al., 2002), were performed in the United States, while the other two, the Anti-Lung Cancer Action (ALCA) study (Kaneko et al., 1996; Sobue et al., 2002) and a Shinshu University study (Sone et al., 2001; Sone et al., 1998), were conducted in Japan.

All studies initially compared lung cancer detection rates using spiral CT scanning to those using chest radiographs. Three of the studies, ELCAP, the Mayo Clinic, and Shinshu University, performed annual spiral CT screening, and the ALCA study recommended screening every 6 months. Participants who had abnormal scans, defined as having an indeterminate lung nodule, were asked to return for surveillance with conventional CT scans on a periodic basis. For example, the ELCAP trial recommended that those with indeterminate nodules have conventional CT scans at 3, 6, 12, and 24 months. If nodule growth was detected, then surveillance was stopped and a definitive invasive diagnostic procedure was performed. This nodule triaging process aimed to limit the number of unnecessary invasive tests, and any resulting harm, among those without lung cancer.

The demographic composition of the participants varied widely across the studies. The United States studies had combinations of current and former smokers, with the ELCAP trial enrolling the highest-risk population: older individuals (mean age 66 years) with a heavy smoking history (median 44 pack-years). The Japanese studies included never smokers, who are at low risk for lung cancer. Also, many participants in one study, ALCA, had undergone screening with chest radiographs in prior years, making this a more prescreened population than in the other three trials.

While all of the studies used spiral CT scan technology, the radiological parameters of the spiral CT scans were slightly different for each trial. Notably, the Mayo Clinic study used newer spiral CT scan technology, a multi-detector scanner with thinner slice thickness (5 mm in the Mayo Clinic study vs. 10 mm in the other studies), which significantly improves the resolution and detection capability of the spiral CT scan.

During the prevalence screening, spiral CT scans detected 30 lung cancers among 1,000 participants in the ELCAP study for a cancer detection rate of 30 per 1,000 screenings. The cancer detection rate at the Mayo Clinic study was 13.8 per 1,000 screenings. For the Japanese studies the spiral CT cancer detection rate was lower, 8.1 per 1,000 screenings for ALCA and 4.0 per 1,000 screenings for Shinshu University. ELCAP, which enrolled a high-risk population, had the highest lung cancer detection rate. The Shinshu University study, which enrolled the largest numbers of never smokers, had the lowest lung cancer detection rate. Nonetheless, every study reported that chest radiographs detected fewer lung cancers than spiral CT scans. Altogether, 89 lung cancers were found during prevalence screening, of which 3 were missed by spiral CT scans.

TABLE 7.4 Spiral CT Lung Cancer Screening Trials: Prevalence Data

	Baseline Screening Data
Trial	Henschke et al. (1999)	Swensen et al. (2002)	Sone et al. (1998)	Kaneko et al. (1996)	Weighted Average
Study location	United States	United States	Japan	Japan
Study design	Prospective uncontrolled cohort study	Prospective uncontrolled cohort study	Prospective uncontrolled cohort study	Prospective uncontrolled cohort study
Study years	1993–98	1999	1996–98	1993–98
Recommended screening interval	1 year	1 year	1 year	6 months
Participant demographics	54% male, 44 pack-year, Mean age 66	52% male, 61% CS, 39% FS, Mean age 59	54% male, 46% ES, 54% NS, Med. age 64	88% male, 62% CS, 25% FS, 14% NS, Mean age 59
Spiral CT scan parameters (all protocols used 120–140 kVp and between 40–50 mA)	Single detector 10 mm ST	Multi-detector 5 mm ST	Single detector 10 mm ST	Single detector 10 mm ST
Participants screened	1,000	1,520	5,483	1,611	9,614
Number of screening tests performed	1,000	1,520	5,483	1,611	9,614
New indeterminate lung nodules detected	233	782	676	192	1,883
Rate of new indeterminate nodules per screening, %	23	51	12	12	20
Benign biopsies or surgeries (harm)	3	4	7	8	22
Clinical Stage Distribution
IA NSCLC	22	13	21	10	66
IB NSCLC	1	1	2	1	5
IIA NSCLC	1	4			5
IIB NSCLC					0
IIIA NSCLC	2	2		2	6
IIIB NSCLC	1			1	2
IV NSCLC	1				1
Unclassified NSCLC	2				2
SCLC		2			2
Total lung cancers	30	22	23	14	89
NS = Never smoker, ES = Ever smoker, FS = Former smoker, CS = Current smoker. CT = Computerized Tomography, NSCLC = Non-small-cell lung cancer, SCLC = Small-cell lung cancer. ST = slice thickness, kVp = kilovolt peak, mA = milliamperes.

TABLE 7.5 Spiral CT Lung Cancer Screening Trials: Incidence Data

	Annual Repeat Screening
Trial	Henschke et al. (2001)	Swensen et al. (2002)	Sone et al. (1998)	Sobue et al. (2002)	Weighted Average
Study location	United States	United States	Japan	Japan
Study design	Prospective uncontrolled cohort study	Prospective uncontrolled cohort study	Prospective uncontrolled cohort study	Prospective uncontrolled cohort study
Study years	1993–98	1999	1996–98	1993–98
Recommended screening interval	1 year	1 year	1 year	6 months
Participant demographics	54% male, 44 pack-year, Mean age 67	52% male, 61% CS, 39% FS, Mean age 60	54% male, 46% ES, 54% NS, Med. age 65+	88% male, 62% CS, 25% FS, 14% NS, Mean age 60
Spiral CT scan parameters (all protocols used 120–140 kVp and between 40–50 mA)	Single detector 10 mm ST	Multi-detector 5 mm ST	Single detector 10 mm ST	Single detector 10 mm ST
Participants screened	841	1,464	8,303	1,180	11,788
Number of screening tests performed	1,184	1,464	8,303	7,891	18,842
New indeterminate lung nodules detected	35	191	518	770	1,514
Rate of new indeterminate nodules per screening, %	3	13	6	10	8
Benign biopsies or surgeries (harm)	1	3	9	27	40
Clinical Stage Distribution
IA NSCLC	5		32	18	55
IB NSCLC					0
IIA NSCLC	1	1	1	1	4
IIB NSCLC		1	1		2
IIIA NSCLC	1		1	1	3
IIIB NSCLC			1	1	2
IV NSCLC			1	1	2
Unclassified NSCLC					0
SCLC	2			1	3
Total number of lung cancers	9	3	37	22	71
Spiral CT Test Performance
Sensitivity, %	78	67	92	86	87
Specificity, %	98	87	94	90	92
Rate of lung cancers detected per 1,000 screenings	5.9	1.4	4.1	2.4	3.3
Rate of benign biopsies/surgeries per 1,000 screenings	0.8	2.0	1.1	3.4	2.1
NS = Never smoker, ES = Ever smoker, FS = Former smoker, CS = Current smoker. CT = Computerized Tomography, NSCLC = Non-small-cell lung cancer, SCLC = Small-cell lung cancer. ST = slice thickness, kVp = kilovolt peak, mA = milliamperes.

The lung cancers detected by spiral CT scans were mostly at localized clinical stages. Of the 89 lung cancers found, 66 were clinical stage IA non-small-cell lung cancers (NSCLC), 5 were stage IB NSCLC, 5 were stage IIA NSCLC, 6 were stage IIIA NSCLC, 2 were stage IIIB NSCLC, 1 was stage IV NSCLC, 2 were NSCLC classified only as being in the right and left main stem bronchus, and 2 were small-cell lung cancers. Viewed another way, 82 percent of all NSCLC found in these studies were at clinically localized stages (stage IA and IB).

For the incidence screening years, the ELCAP study once again found the highest lung cancer detection rate, 5.9 cancers per 1,000 screenings, compared to 1.4 per 1,000 screenings at the Mayo Clinic, 4.1 per 1,000 screenings at Shinshu University, and 2.4 per 1,000 screenings at ALCA. Out of 68 NSCLC, 55, or 81 percent, were localized stage IA or IB. Furthermore, the average size of these localized-stage cancers was small, usually less than 14 mm in diameter (Sone et al., 2001).
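The localized-stage percentages for prevalence and incidence screening follow from the pooled counts in Tables 7.4 and 7.5. A minimal sketch of that arithmetic, with an illustrative helper name of our choosing:

```python
def localized_share(total_cancers, sclc, stage_ia, stage_ib):
    """Fraction of non-small-cell lung cancers found at stage IA or IB."""
    nsclc = total_cancers - sclc  # small-cell cancers are excluded from the denominator
    return (stage_ia + stage_ib) / nsclc


# Pooled counts from the "Weighted Average" columns of Tables 7.4 and 7.5.
prevalence_share = localized_share(total_cancers=89, sclc=2, stage_ia=66, stage_ib=5)
incidence_share = localized_share(total_cancers=71, sclc=3, stage_ia=55, stage_ib=0)

print(f"Prevalence screening: {prevalence_share:.1%} of NSCLC localized")
print(f"Incidence screening:  {incidence_share:.1%} of NSCLC localized")
```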

Negative findings such as the frequency of indeterminate lung nodules and harms were also reported. During the Mayo Clinic’s prevalence screening, 782 out of 1,520 participants (51 percent) had indeterminate lung nodules that required further periodic surveillance with one or more conventional CT scans. Out of these 782 individuals only 22 had lung cancer; that is, more than 97 percent of those with indeterminate lung nodules were believed to be not cancerous (false positives). The average rate of new indeterminate lung nodules across all studies was 196 per 1,000 screenings (19.6 percent) for the prevalence screenings. This rate decreased but remained sizable, 80 per 1,000 screenings, during incidence screenings.

Harms from screening include invasive diagnostic testing or treatment for individuals without disease (false positives). During the prevalence screening years, 22 participants underwent an invasive test or surgery for a benign lesion. For incident screenings, 40 participants had invasive procedures resulting in a benign diagnosis. The ratio of individuals potentially benefiting from screening to those potentially harmed can be obtained by comparing the rate of lung cancers detected by spiral CT screening to the rate of benign diagnoses from unnecessary invasive procedures. Screening with spiral CT scans found 3.3 lung cancers per 1,000 screenings and 2.1 benign diagnoses per 1,000 screenings (unnecessary testing). Therefore, for every 3 lung cancers successfully found (potential benefit), almost 2 individuals received unnecessary testing and potential morbidity and mortality. In the spiral CT trials, there were no reported deaths among the disease-free individuals who underwent unnecessary testing.
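The benefit-to-harm comparison for incidence screening can be reproduced from the pooled figures in Table 7.5. The sketch below is illustrative; the variable names are ours, and the detection rate of 3.3 per 1,000 is taken as reported rather than recomputed.

```python
incidence_screenings = 18_842        # total incidence screening tests (Table 7.5)
benign_procedures = 40               # invasive procedures ending in a benign diagnosis
cancers_per_1000 = 3.3               # reported spiral CT detection rate per 1,000 screenings

benign_per_1000 = benign_procedures / incidence_screenings * 1000
cancers_per_benign = cancers_per_1000 / benign_per_1000
benign_per_3_cancers = 3 / cancers_per_benign

print(f"Benign invasive procedures per 1,000 screenings: {benign_per_1000:.1f}")
print(f"Cancers detected per benign procedure: {cancers_per_benign:.2f}")
print(f"Benign procedures per 3 cancers detected: {benign_per_3_cancers:.1f}")
```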

As with past chest radiographic screening trials, non-compliance with spiral CT screening is a major problem. Sizable numbers of participants did not return for incidence screenings in some of these spiral CT screening trials. For example, in the Shinshu University study, 1,035 of 5,460 participants without identifiable cancer, 19 percent of all eligible participants, did not return for their second-year screening. In contrast, the Mayo Clinic study reported a low non-compliance rate of only 3 percent per year, suggesting that low rates of non-compliance are achievable.

One study, ALCA, has reported 5-year survival estimates. Among lung cancers found with prevalence screening, the 5-year survival was 76.2 percent, and among those found on incidence screening, the 5-year survival was lower, 64.9 percent. The authors acknowledged that these estimates could be influenced by length, lead-time, and overdiagnosis biases. The lower survival among participants whose lung cancers were detected during incidence screenings suggests the presence of length bias among the cancers found at prevalence screening.

Implications of Lung Cancer Screening with Spiral CT Scans

The results from these uncontrolled trials are promising and leave little doubt that spiral CT is more sensitive in detecting lung cancers than chest radiographs. However, while encouraging, they do not provide conclusive evidence for long-term efficacy. Even had they provided long-term outcome data, a full understanding of the clinical significance of the results would not have been gained as these trials, like the demonstration projects of the 1950s, all lack a control group. Simply finding smaller-sized cancers does not mean mortality is lowered. In one study, among localized stage cancers, smaller tumor size was not associated with better outcomes, demonstrating that some biologically aggressive forms of lung cancer may metastasize early, even when they are 1–5 mm in size (Patz et al., 2000b).

Elevated survival rates also do not prove efficacy. Survival rates are affected by selection, lead-time, length, and overdiagnosis bias (see Chapter 5 for definitions); hence, inferences about screening efficacy made from these data alone are speculative. Cancers with longer latency periods or with the potential for length bias are likely to be over-sampled in the early screening years, and inflated survival estimates may result, as shown in the ALCA results. Overdiagnosis bias is also a concern, since very small cancers have been found and the natural course of disease in individuals with tumors of this size is not known. Could some of these cancers grow very slowly, fail to progress, or perhaps regress? Excluding the possibility of overdiagnosis bias will be difficult, as the identification of a lung cancer mandates curative therapy and observation without therapy would not be considered ethical. Estimates of overdiagnosis bias may best be gained after the fact through autopsy screening (Black, 2000). Despite the many weaknesses of the CT studies, they have spurred interest and investment in randomized controlled trials of spiral CT screening.

Concerns such as physical and psychological harms from screening have been raised in the spiral CT screening studies. A case of a 73-year-old woman, Mrs. S., who decided to undergo lung cancer screening with spiral CT, illustrates the downside of screening. As described in the New York Times, Mrs. S.’s spiral CT scan showed a collapsed lung, presumably due to an obstructing lung cancer. However, after open lung surgery, surgical pathology showed no evidence of lung cancer. Her doctor writes, “While this news is welcome, Mrs. S.’s surgery and a rocky postoperative course had drained her both physically and emotionally. When she returned home, it took her several months to recover. She is still paying her hospital bills” (Lerner, 2002, p. D6). This individual did not benefit from screening and her case describes the possibility of unintended consequences of spiral CT screening.

Psychological harms can affect many individuals in these trials. Uncertainty as to the diagnosis of an indeterminate lung nodule can cause much anxiety, as the affected individuals may have had to wait months to years before learning that their nodules were not growing and hence not of concern. More than half of all participants in the Mayo Clinic Study had indeterminate nodules. This study, which used the most updated screening technology, found a higher rate of lung nodules than in other studies. Trials of spiral CT face the difficult challenge of appropriately triaging these very common nodules and counseling participants. Wardle and Pope (1992) pointed out that psychological costs from early detection are worrisome, and Reich argued that so far lung cancer “screening does no good and may do much harm” (1995, p. 557).

Edward Golub (1999) wrote, “It is not overdramatic to say that the entire nature of the future life of a patient can depend on the results of...[these] tests, what the person can and cannot do, how much time the person has to do it in, what the person’s self-perception is, and how others think of and behave toward the person” (p. 13). The implications for otherwise healthy participants of screening tests are even more striking since they are at risk of being “transformed” from wellness into sickness. Another potential harm of screening would occur if smokers receiving a negative screening test decided that they did not need to quit smoking.

These spiral CT studies have been performed at referral centers with highly motivated staff, investigators, and participants. If widespread mass screening by spiral CT were adopted today, could their results be replicated elsewhere? It is possible that they could, but the costs of developing the infrastructure required to deliver top-quality care would be substantial. Standards for the detection and measurement of lesions by spiral CT scanning need to be established. Personnel would need to be trained, and scanning instruments would need to be purchased. Furthermore, capacity would also be needed to carry out diagnostic workups and follow-up.

Given the potential harms, logistical hurdles to minimize harm and costs, and, most importantly, the lack of evidence of efficacy, judgment on using spiral CT as a lung cancer screening test should be reserved until evidence from well-designed clinical trials can be evaluated.

Sputum Cytology as an Adjunctive Screening Test

While spiral CT scans are superior to chest radiographs for detecting lung cancer, CT could still miss cancers hiding in endobronchial locations. Sputum cytology is considered a good adjunctive screening test since it frequently detects endobronchial cancers, which are usually squamous cell carcinomas. The randomized controlled trials that were started in the 1970s showed that of the four major histologic types of lung cancer, the best prognosis was for squamous cell carcinoma. Squamous cell cancers represented 25 percent of all cancers found, and the 5-year survival rate for those with squamous cell cancers detected by cytology only was 85 to 90 percent (Berlin et al., 1984). Another cohort study that monitored lung cancer patients with radiographically occult malignancies identified by sputum cytology reported 5-year survival rates of 74 percent (Bechtel et al., 2000). Kennedy and colleagues (2000) pointed out that the number of deaths caused by squamous cell carcinoma of the lung is similar to the number caused by breast or colon cancer, for both of which screening is recommended. Thus, if squamous cell carcinoma is the slowest growing of the lung cancers and therefore the most likely to be detected early, screening for this particular type of lung cancer by sputum cytology may be warranted (Kennedy et al., 2000).

Advances in the screening of sputum samples hold promise, as do new bronchoscopic methods for examination of the lung. Researchers have identified precancerous and early cancerous states in sputum by identification of certain abnormal genes in sputum cells and improved localization of early cancers through fluorescent bronchoscopy, which consists of the identification of malignant cells by bronchoscopic examination under fluorescent light (Lam et al., 1998; Palmisano et al., 2000; Tockman, 2000). For example, Palmisano and colleagues (2000) found that certain cancer-fighting genes in the sputum cells of smokers had an abnormality called hypermethylation. On examination of sputum specimens several years before cancer developed, they found that hypermethylation antedated lung cancer in each of 21 persons who eventually developed lung cancer. They reported that “aberrant methylation ... can be detected in DNA from sputum in 100 percent of patients with squamous cell lung carcinoma up to 3 years before clinical diagnosis” (Palmisano et al., 2000, p. 5954). This specific molecular abnormality is only one of many that appear to be promising as targets for early detection, and new bronchoscopic methods should eventually improve the ability to find very small tumors.

Fluorescent bronchoscopy is more sensitive than the traditional means of examination under white light for the detection of cancers. Kennedy and colleagues (2000) pointed out “the increased sensitivity [of fluorescent bronchoscopy] is associated with decreased specificity, resulting in many false positive biopsies” (p. 76S). A low specificity adds to the costs of using the fluorescent bronchoscope and increases the time of the procedure. Bias from overdiagnosis might also be introduced by the detection of very small cancers. Whether molecular or genetic markers in sputum, accompanied by bronchoscopic examination, can find curable lung cancers has yet to be shown.

FUTURE DIRECTIONS IN LUNG CANCER SCREENING

The promise of new technologies has led to the initiation of randomized controlled trials. Randomized studies of chest radiographs and spiral CT scanning for lung cancer screening are under way (Patz et al., 2000a). The Prostate, Lung, Colorectal, and Ovarian (PLCO) study has randomized 152,000 participants to receive screening chest radiographs or no screening (Simpson et al., 2000). The National Lung Screening Study, using a subset of the PLCO, will randomize 50,000 participants to chest radiographs or spiral CT screening. This NCI-sponsored trial is collaborating with the American College of Radiology Imaging Network (ACRIN) to enroll heavy smokers between the ages of 55 and 74. The trial will screen for 3 years with a 4-year follow-up, also collecting sputum samples in 10,000 participants. The trial will conclude data collection in 2009 (Sullivan, 2002).

How long before conclusive effectiveness data will be published? The NCI chest radiographic screening trials started in the early 1970s, and reports were published in the mid-1980s. Conclusive data for spiral CT may not be available for at least 5 to 10 years. Japan, in contrast to the United States, has already adopted spiral CT for lung cancer screening, despite a lack of evidence supporting its effectiveness. Ecological population data on the effect of widespread mass screening on lung cancer incidence and mortality rates could provide clues as to whether such screening is effective, although this type of data, due to its limitations, usually does not provide sufficient evidence to recommend screening.

Already, techniques that are being used in practice are more advanced than those being considered in the most recent and ongoing studies. The initial spiral CT trials used single-detector CT technology, which is now outdated (National Cancer Institute and ACS, 2001). General Electric Medical Systems is selling multi-detector spiral CT scanners. Multi-detector scanners offer better resolution than prior machines and offer the option of magnifying parts of the lung without rescanning the participant. Within the next 5 to 10 years, even this technology will be updated (Fox, 2001). The NCI/ACRIN study, which is using multi-detector technology, is at risk of reporting data based on technology that will be considered obsolescent before the trial is completed.

These new scanners can detect 1- to 4-mm lesions called “ground glass opacities.” These lesions are so small that they cannot be characterized as nodules. As the sensitivity of scanning improves, specificity is likely to decline, so false-positive results are likely to increase. The Mayo Clinic used multi-detector technology and found a higher rate of lung nodules than trials using single-detector technology. Clinicians will face the challenge of distinguishing between false-positive and true-positive results in order to prevent unnecessary morbidity and mortality. On a short-term basis, antibiotic therapy can be administered followed by repeat scanning to see if the opacity was of an infectious etiology. Software that provides computer-aided diagnosis (CAD) can be used to enhance the accuracy of reading. Software algorithms can estimate the likelihood of a malignancy on the basis of certain characteristics of the lesion and the participant. CAD software can also double-check the readings of radiologists and pathologists. PAPNET, a CAD device for the reading of Pap smears, has been shown to reduce the number of smears with false-negative results (Halford et al., 1999). Similar software is being developed for mammograms and CT scans (National Cancer Institute and ACS, 2001).

CT imaging technology is also being used more widely. Virtual or three-dimensional means of bronchoscopy and colonoscopy imaging are being evaluated as noninvasive alternatives to conventional endoscopy (Black, 1999a). Spiral CT angiography evaluates arteriosclerosis without placing the individual at risk from the use of the contrast dye that is required by conventional angiography (Siegel and Evens, 1999). Spiral CT angiography has incidentally found lung cancers. Some radiology sites are offering screening by multiphasic imaging for cancer and arteriosclerosis (National Cancer Institute and ACS, 2001). If screening is not targeted, there is a high likelihood of finding many false-positive lesions.

Public and Policy Reactions to New Technologies

When the initial findings from the ELCAP study were reported in medical journals, major newspapers and weekly newsmagazines published articles about the findings. The articles mentioned that lung cancer death rates could be greatly reduced if smokers and former smokers were routinely given a CT test that can detect tumors when they are small enough to be cured (Brice, 2000). The public response to the news was dramatic. The voice mail systems at the hospital centers participating in the ELCAP study were overwhelmed, and at least 3,000 calls were placed to the Mayo Clinic by the next day. One ex-smoker whose husband died from lung cancer stated “I’m so grateful that the technology is present ... lung cancer is just an awful, awful thing.”

The public has long been worried about lung cancer. In an American Cancer Society survey that asked participants to “list the body sites susceptible to cancer that first come to mind,” respondents most commonly mentioned the lung, breast, and skin (ACS, 1980b, p. 93). “About seven out of 10 smokers (71 percent) believe that if lung cancer is detected early, there is a good chance that it can be cured” (ACS, 1980b, p. 98). As lung cancer is a dreaded disease and screening is viewed as bringing hope for its detection and cure, it should come as no surprise that the ELCAP study results were enthusiastically received.

Some community physicians routinely perform screening chest radiographs, despite their lack of effectiveness as determined in clinical trials, and despite the lack of endorsement of their use by policy organizations (Black, 1999a). Adoption of CT technology by some providers is likely given their prior beliefs and practice patterns.

Radiology groups have adopted this technology and are advertising their use of the technology in local newspapers and on television and radio (Brice, 2000). One group in Skokie, Illinois, found one cancer in 120 screening procedures. The average price per scan was $325, all of which was paid out of pocket by the screening participant. The premature promotion of spiral CT as a lung cancer screening tool raises many questions. Should spiral CT be promoted even though its effectiveness is not established? Does the public understand the consequences of having this screening test? How well informed is the public concerning the unwanted consequences of false-positive test results? What are the conflicts of interest when providers who could make financial gains by screening advertise on the promise of an unproven technology?
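
The Skokie figures can be turned into a rough cost per cancer found. The sketch below uses only the three numbers quoted above and counts scan charges alone; it leaves out follow-up imaging, biopsies, and surgery, so it understates the true cost. The function name is illustrative and does not come from the source.

```python
# Figures quoted in the text for the Skokie, Illinois, radiology group.
SCANS_PERFORMED = 120
CANCERS_FOUND = 1
PRICE_PER_SCAN_USD = 325  # average out-of-pocket price per scan


def scan_cost_per_cancer_found(scans, cancers, price_per_scan):
    """Return total scan charges divided by the number of cancers detected."""
    if cancers <= 0:
        raise ValueError("at least one detected cancer is required")
    return scans * price_per_scan / cancers


cost = scan_cost_per_cancer_found(
    SCANS_PERFORMED, CANCERS_FOUND, PRICE_PER_SCAN_USD
)
print(f"Scan charges per cancer found: ${cost:,.0f}")
```

With these inputs, participants collectively paid $39,000 in scan charges for the single cancer detected.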

These reports suggest rapid and premature adoption of this unproven technology. Cautions have been raised about spiral CT screening for lung cancer. Christine D. Berg of NCI’s Division of Cancer Prevention and Suburban Cancer Hospital Center stated, “We had leeches in the 1800s, radium elixirs in the 1900s and radiation treatment for enlarged thyroids in the 1950s. A long list of medical fads have come and gone. Spiral CT has great promise. That’s why it deserves further study” (Brice, 2000, p. 49). The Society of Thoracic Radiology, which included researchers of the Mayo trial and the ELCAP study, issued the following consensus statement: “It is the consensus of this committee that mass screening for lung cancer with CT is not currently advocated. Suitable subjects who wish to participate should be encouraged to do so in controlled trials, so that the value of CT screening can be ascertained as soon as possible” (Aberle et al., 2001, p. 65). The American College of Radiology has appointed a task force to evaluate spiral CT and withholds recommendation at the time of this writing (Zinninger MD, American College of Radiology, personal communication, 2001).

Although the initial results of evaluations of spiral CT scanning are encouraging, it is now known that radiographic detection of asymptomatic lung cancer provides no assurance that benefit in terms of the prevention of mortality from lung cancer will ensue. The same level of evidence that was available for chest X-ray and sputum cytology decades ago is now available for spiral CT, and any judgment on spiral CT as a screening modality should await the findings of trials and the availability of mortality rate data. Therefore, at this time, the evidence does not support a recommendation for either widespread screening or the screening of selective high-risk groups for lung cancer. This conclusion is based on the historical record and its lessons, the potential for harm from widespread screening, and the lack of proven effectiveness from current evidence on spiral CT.

Given that spiral CT is already in use, what specific policy recommendations can be implemented now? Public education, mutual decision-making, and a focus on primary prevention are needed to aid consumers, providers, researchers, payers, and policy makers.

Consumers

Smokers, current or former, are at far greater risk of lung cancer than those who have never smoked. However, the absolute risk of developing lung cancer among even the at-risk smoker groups is relatively low. SEER Program data show that the incidence rate of lung cancer among those age 65 and older is about 3 to 6 per 1,000 persons per year. Given the relatively low incidence rates, any decision to be screened for lung cancer by spiral CT should be made with full consideration of the rates of true-positive versus false-positive results and the attendant benefits and risks. Knowledgeable providers must adequately communicate these complexities so that patients can weigh the risks and benefits of screening. Educational materials that clearly explain risk can assist with informed decision-making.
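
To make the true-positive versus false-positive trade-off concrete, the sketch below applies Bayes' rule to the incidence range quoted above. Two things here are assumptions rather than facts from this chapter: the sensitivity and specificity values are hypothetical placeholders, and the annual incidence rate is used as a stand-in for the chance that a screened person has a detectable cancer.

```python
def positive_predictive_value(prevalence, sensitivity, specificity):
    """Share of positive screening results that are true positives (Bayes' rule)."""
    true_positives = sensitivity * prevalence
    false_positives = (1.0 - specificity) * (1.0 - prevalence)
    return true_positives / (true_positives + false_positives)


# Hypothetical test characteristics, chosen only for illustration.
ASSUMED_SENSITIVITY = 0.90
ASSUMED_SPECIFICITY = 0.80

# Incidence range from the text: about 3 to 6 per 1,000 persons per year.
for cases_per_thousand in (3, 6):
    prevalence = cases_per_thousand / 1000
    ppv = positive_predictive_value(
        prevalence, ASSUMED_SENSITIVITY, ASSUMED_SPECIFICITY
    )
    print(f"{cases_per_thousand} per 1,000: {ppv:.1%} of positive results are true positives")
```

Under these assumed values, only a few percent of positive results would reflect true disease. The exact figures depend entirely on the placeholder inputs; the point is that low incidence pulls the predictive value of a positive result down even when a test performs well.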

Consumers who choose to undergo screening by spiral CT, despite the current lack of evidence, should be informed of the chance of finding true disease and the attendant risks of screening. Topics that should be covered include the costs of initial and follow-up tests, the potential physical harms from radiation and invasive secondary testing and surgery, and the potential psychological harms from misdiagnosis or mislabeling. An informed-consent form detailing these facts should be administered before the scan is performed. Consumers should be informed of the possibility that screening may result in their learning of a cancer earlier without lowering their risk of death. In other words, they could live longer with a diagnosis without actually living longer. Examples from past screening studies can be used to illustrate this point.
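
The idea of living longer with a diagnosis without living longer can be shown with a small worked example. All ages below are hypothetical and chosen only to illustrate the point; they are not drawn from any study cited in this chapter.

```python
# Hypothetical ages for one person whose age at death is unaffected by screening.
AGE_AT_DEATH = 70
AGE_AT_SYMPTOMATIC_DIAGNOSIS = 68  # diagnosed when symptoms appear
AGE_AT_SCREEN_DIAGNOSIS = 65       # same cancer, found earlier by screening


def survival_after_diagnosis(age_at_diagnosis, age_at_death):
    """Years lived between diagnosis and death."""
    return age_at_death - age_at_diagnosis


without_screening = survival_after_diagnosis(AGE_AT_SYMPTOMATIC_DIAGNOSIS, AGE_AT_DEATH)
with_screening = survival_after_diagnosis(AGE_AT_SCREEN_DIAGNOSIS, AGE_AT_DEATH)
lead_time = with_screening - without_screening

print(f"Survival after diagnosis without screening: {without_screening} years")
print(f"Survival after diagnosis with screening: {with_screening} years")
print(f"Lead time added by earlier diagnosis: {lead_time} years")
```

In this illustration, survival measured from diagnosis rises from 2 to 5 years, yet the person dies at the same age. This is why mortality, not survival time after diagnosis, is the appropriate measure of whether screening helps.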

Providers

Current clinical practice guidelines advise against routine lung cancer screening. Neither chest radiographic nor spiral CT screening should be offered on a routine basis. Preferably, a primary care provider or specialty physician familiar with lung cancer screening and its likely limitations should appropriately counsel patients who desire testing. Consumers who lack primary care or who bypass their primary care provider and go directly to radiology groups should receive appropriate counseling before undergoing the procedure. Radiologists will need to arrange for appropriate follow-up care. If comorbid conditions such as heart disease or emphysema are present and could prevent eligibility for surgical resection, screening should be delayed until an appropriate evaluation is completed.

Radiology groups endorsing or advertising their use of spiral CT scanning should acknowledge its experimental nature and should clearly state the current lack of evidence. Follow-up mechanisms for individuals with indeterminate results should be established so that noncompliance is minimized. To ensure the highest standards of care, professional organizations and provider groups should develop guidelines on the management of pulmonary nodules.

Researchers

Since there is substantial uncertainty about spiral CT’s effectiveness, resources should be made available for research. A randomized controlled trial will provide the clearest, most convincing evidence of possible effectiveness, and the level of evidence from trials with this design is considered the strongest for policy development. Some have called such trials unethical (Henschke and Yankelevitz, 2000), which would be the case if spiral CT were clearly efficacious and such trials were simply testing a new or altered protocol. At present, however, randomized controlled trials of screening are ethically sound; the balance of evidence about screening by spiral CT (or other modalities) has not yet tipped in favor of screening. Since observational studies are subject to selection bias and uncontrolled confounding, uncertainty often clouds observational evidence of effectiveness, leaving the question of whether the benefit was due to the intervention or to some unmeasured difference between groups. Observational studies are useful for evaluations of the natural history of disease, tumor doubling times, and biomarker analysis and should be done for these purposes. Neither study design, randomized or observational, would provide a result faster since mortality from the disease is the common and appropriate endpoint. While waiting for trial results, researchers could evaluate decision support models that simulate the natural history of disease and spiral CT. These models can identify important disease and test thresholds at which screening can be a viable option.

Additional research is needed to find high-risk participants most likely to benefit from lung cancer screening, find methods to improve compliance with cancer screening, risk stratify the numerous lung nodules that are found on screening, and prevent harms induced by screening.

Policy Makers and Payers

For coverage decisions, policy makers will want to know the cost-effectiveness of spiral CT. David Eddy (1981) estimated that primary prevention by smoking cessation is over 400 times more effective than screening by chest radiography. The direct costs to provide chest radiographic screening to 100,000 40-year-old smokers were estimated to be $500 million in 1980 dollars. Clearly, if resources are limited, primary prevention is more cost-effective. Given that the cost of spiral CT screening is higher than that of chest radiography, the budgetary impact of implementing systematic screening is likely to be much higher today.

During periods of uncertainty, policies should be implemented that educate consumers, providers, and policy makers about the risks of early adoption. Widespread mass screening for lung cancer has not been endorsed by professional organizations, given the history of past attempts and the difficulties with present attempts. Rapid dissemination of screening technology before establishment of its effectiveness and prior to establishment of quality control standards could result in more harm than benefit. Currently, for every three cancers detected by spiral CT screening, two participants with false-positive results will undergo invasive procedures and face potential subsequent harm.

CONCLUSIONS

Lung cancer is a dreaded disease with a difficult and frequently fatal course. It is now the leading cause of cancer death in the United States. Smoking prevention and cessation can prevent most cases. Yet, one-quarter of adults in the United States are still regular smokers, and youths continue to experiment with tobacco and often become addicted. Although rates of smoking have declined, continuation of the current lung cancer epidemic can be anticipated for decades to come. In addition, lung cancer death rates will soon surge in the developing world (Hoel et al., 1992; Pandey et al., 1999; Parkin et al., 1999).

Logically, investigators have looked for approaches other than tobacco control to reduce the numbers of deaths from lung cancer. Screening was first attempted in the 1950s, soon after the scope of the lung cancer epidemic was recognized. The use of chest X-rays for screening appeared to be appropriate at the time and fit with the approach already in use for tuberculosis. Sputum cytology added another screening tool that also appeared promising, as the abnormalities in expectorated cells mirrored the abnormalities in the respiratory epithelium where they originated.

The first studies were observational rather than experimental in design. Although the randomized clinical trial is now the standard means for the assessment of screening tests, it had not yet been used for that purpose in the 1950s. The first studies of screening tests used a design equivalent to a demonstration project, inviting participants to have the screening test and then monitoring them over time. Viewed by today’s standards, those studies had poor quality control measures, inaccurate standardization and poor means of interpretation of test results, and high rates of noncompliance with the screening regimen by participants. The designs of the studies were also flawed because they did not include randomization to a control group and one or more screening modalities; in fact, comparison or control groups were lacking in several of the studies.

In the 1970s four randomized controlled trials on lung cancer screening using chest radiography and sputum cytology were performed. Unfortunately, these randomized controlled trials of lung cancer screening showed no evidence of early detection of lung cancers by these tests (Fontana et al., 1991; Melamed et al., 1984; Tockman, 1986). The numbers of early-stage and late-stage cancers did not meaningfully differ between the study groups. Most disappointing was that on follow-up, the rates of mortality were the same among the screened and the unscreened participants.

An effective screening test for lung cancer has not yet been identified; therefore, organizations that develop screening guidelines do not recommend screening for it (Aberle et al., 2001; Biesalski et al., 1998; Eddy, 1980a; Eddy, 1980b). Although the studies of screening to date have failed to show mortality reduction, the lessons learned from these trials can pave the way for future research with newer technologies such as spiral CT scans. The conceptual framework for cancer screening illustrates its complexity and provides criteria for the evaluation of new technologies such as spiral CT. Future studies need to account for biases when measuring survival data. These biases are lead-time bias, length bias, and overdiagnosis bias (described in Chapter 5). Future studies not only must show evidence that screening can detect smaller cancers but also must show evidence that screening can produce stage shifts, including both more early cancers and fewer late cancers, followed by lower lung cancer mortality rates (Patz et al., 2000a).

Potential concerns related to the psychological and physical harms from the finding of positive results and the management of patients with positive results need to be considered, as there is strong demand for screening by spiral CT and spiral CT has been adopted into practice, despite the uncertain and incomplete scientific evidence.

Clinicians and the public are eager to have new approaches to the prevention of lung cancer; consequently, any new tool for prevention is likely to be hailed and perhaps not receive a sufficiently critical evaluation. The lessons learned from past experience with lung cancer screening call for restraint in the use of spiral CT screening. Findings from uncontrolled trials conducted to date do not provide the level of evidence required to endorse systematic screening with spiral CT scans.