3 State of U.S. Climate Modeling
An important task of this study was to quantitatively assess the computational and human resources presently directed toward climate modeling in the United States. To accomplish this goal, two surveys were developed (Appendix C and Appendix D): one was sent to large and intermediate-size modeling centers and one to small centers. After these surveys were drafted, a specialist in social surveying edited them to ensure that the information collected was as free of bias as possible. The surveys were sent to 50 modeling institutions and groups, and 42 responses were received. The panel does not claim to have surveyed all groups or institutions operating small-scale modeling efforts; because of the varied and extensive use of modeling in many areas of earth science, it would be extremely difficult to identify all of these small centers. Thus, the responses that were received were taken to be indicative of smaller efforts. A good estimate of resources could be obtained for the largest centers because they were easier to identify, and all responded. Survey responses are discussed below and tabulated in Appendix E.
3.1 MODELS
The information collected on current modeling activities shows the robust and varied nature of climate and weather modeling in the United States. Smaller modeling centers enjoy a level of resources equivalent to what would have been considered supercomputer resources only a decade ago, allowing them to run in-house (regional and global) component models, component models from the larger centers, or a combination of the two. Smaller centers can even run coupled climate models, but only at coarse resolution (e.g., 800 km), or at higher resolution (300 km) for shorter time periods. Responses to the question about improvements planned for models at all centers were varied, but most involved a mixture of improvements to model physics, dynamics, numerics, efficiency, and applicability. Many respondents also noted the desire to better incorporate new types of satellite and radar data.
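The tradeoff the smaller centers face between resolution and run length reflects how steeply compute cost grows as a grid is refined: halving the grid spacing roughly quadruples the number of horizontal grid cells and, through the CFL stability condition, also halves the allowable timestep. A minimal sketch (the cubic scaling and the fixed vertical/physics costs are simplifying assumptions, not survey data):

```python
# Rough relative cost of refining a climate model's horizontal grid.
# Assumption: cost ~ (1/dx)^3 -- two horizontal dimensions plus a
# timestep that shrinks linearly with grid spacing (CFL condition).
# Vertical resolution and physics costs are held fixed, a simplification.

def relative_cost(dx_coarse_km: float, dx_fine_km: float) -> float:
    """Factor by which compute cost grows when refining the grid."""
    return (dx_coarse_km / dx_fine_km) ** 3

# Moving a coupled model from 800 km to 300 km, as in the survey responses:
print(f"~{relative_cost(800, 300):.0f}x more compute per simulated year")  # ~19x
```

This is why a center that can afford long 800 km integrations may only manage short runs at 300 km.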
In general, most respondents stated that their code was portable to platforms other than those on which they normally operated, although some models required a moderate amount of optimization to run with minimal performance loss. Most centers release their modeling results to the wider scientific community without restriction. A few centers freely release their data but stipulate that the results be used only for research purposes; others limit the release of modeling results to collaborators.
Large and intermediate-size modeling centers were asked whether there were plans to convert model code to run on massively parallel processing (MPP) architectures. Most institutions responded that this conversion had already taken place, although those that have converted or are in the process of converting noted the difficulty of transferring certain models to an MPP architecture. Many respondents also noted that this conversion required significant programmer time and drained resources that could have been devoted to other activities. When asked to comment on the relative merits and hindrances of MPP versus VPP architectures, the majority of respondents preferred VPP architecture for the following reasons:
- MPP systems are generally more difficult to program and require increased computer expertise. There are therefore significant training issues involved in the use of these systems. These difficulties are particularly significant for university centers, as they often rely on graduate student labor that is characterized by high turnover.
- Data assimilation and processing are more difficult on MPP systems.
- VPP systems are more stable and reliable.
- There are significant scalability problems on MPP systems.
- Current MPP systems lack mature compilers, which makes these systems difficult to use.
Despite these difficulties, some respondents felt that MPP systems had significant benefits over VPP systems (e.g., lower memory cost and increased aggregate CPU power).
3.2 COMPUTING [1]
Most small and many intermediate-size modeling centers either rely on workstations or clusters of workstations for their modeling efforts or collaborate with the larger centers and use their computational facilities. The larger modeling centers rely primarily on supercomputers for their climate and weather simulations. Of the large modeling centers surveyed, half share their computational time with the wider community. The computing capacity of large and intermediate-size modeling centers is described in Table 3-1, which also includes planned upgrades to existing systems.
When asked what upgrades would be incorporated if funds were available, the responses were varied (Table 3-1; Appendix C and Appendix D), although the majority of centers noted the need for increased capabilities such as additional processors, nodes, disk space, or some combination of these. Some centers also noted the need for additional network bandwidth to more rapidly acquire data sets from remote sources. Some of the smaller centers said they would prefer to devote any new funds to the purchase of a PC cluster, or the enlargement of an existing one, rather than pooling those funds to upgrade shared supercomputing resources.
Most centers (large, intermediate, and small) responded that computing capabilities were limiting model resolution, the number of model runs, and the production of real-time forecasting products. Although it can be argued that modelers will always want more runs at higher resolution and complexity regardless of the available computational capacity, the ability to accurately model weather and climate at finer spatial and temporal scales depends on obtaining a robust estimate of climate model uncertainty, which in turn requires the analysis of a large number of cases and of ensemble members per case. Increased model quality will lead to increased predictive skill and higher-quality operational products for climate and weather prediction. Thus, the computational limitations noted in the survey affect not only current research activities and model development but also the production of outputs required for operational use.
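The compute implications of robust uncertainty estimation follow from simple arithmetic: total cost grows linearly with the number of cases and the number of ensemble members per case. A minimal sketch with purely illustrative numbers (none of these values come from the survey):

```python
# Back-of-envelope compute budget for uncertainty estimation: cost
# scales linearly with the number of cases and ensemble members per
# case. All numbers below are illustrative, not taken from the survey.

def machine_days(days_per_run: float, n_cases: int, n_members: int) -> float:
    """Dedicated machine-days for a full set of ensemble experiments."""
    return days_per_run * n_cases * n_members

# e.g., a run costing 10 machine-days, 5 cases, 15 members per case:
print(machine_days(10, 5, 15))  # 750
```

Even modest ensemble requirements thus multiply a single run's cost by one to two orders of magnitude, which is why centers reported computing capacity as a limiting factor.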
It is important to note that, in addition to the need for additional computing capabilities, many respondents discussed the critical need for qualified scientists, modelers, and hardware and software engineers. This need is discussed more fully in the next section.
[1] The information in Table 3-1 was accurate at the time the survey results were assembled. Since then, information detailing the upgraded computing capabilities at NCEP was provided: the recently upgraded machine uses IBM's Power 3 Winterhawk-II technology operating at 375 MHz, and the system has 2208 processors in 40 frames and 512 compute nodes, with 2 GB of memory per node.
TABLE 3-1 Computing Resources Located at Large Modeling Centers (a)

Fields reported for each institution (b): Computer System; Processors; Last Upgrade; Sustained System Performance; Central Memory/Secondary Disc Storage; Future Upgrades Planned.

CIT-JPL
  Systems: (1) Cray T3D/T3E; (2) SGI Origin 2000
  Processors: (1) 512; (2) 128
  Last upgrade: 1999
  Performance: (1) 10-50 Gflops
  Memory/disk: No information provided.
  Planned upgrades: No information provided.

COLA
  Systems: (1) SGI Origin 2000; (2) Compaq ES40; (3) Compaq DS20
  Processors: (1) 16 CPUs; (2) 4 CPUs
  Last upgrade: 1999
  Performance: (1) 2.5 Gflops; (2) 1.25 Gflops
  Memory/disk: (1) 4 GB; (2) 4 GB/node; disk capacity 2.3 TB (shared via gigabit-switch LAN)
  Planned upgrades: None.

CSU
  Systems: (1) SGI Origin 2000; (2) Octane
  Processors: (1) 10; (2) 12
  Last upgrade: 20% of inventory upgraded per year
  Performance: No information provided.
  Memory/disk: No information provided.
  Planned upgrades: 8-processor Origin in 2000 (Chance).

FSU
  Systems: IBM SP2 with 9 nodes running on a fast interconnect bus; 6 RS6000 model 260/270 series machines
  Processors: 2 of the 260 series are dual-processor; the remaining 4 units are 4-processor machines
  Last upgrade: No major upgrades.
  Performance: Unknown.
  Memory/disk: Each machine has approximately 2 GB of memory; the 270s have ~50 GB of disk space, the other machines ~9 GB each
  Planned upgrades: (not specified)

UCLA
  Systems: (1) Compaq XP1000 cluster
  Processors: (1) 5
  Last upgrade: (1) 1999
  Performance: (1) 2 Gflops
  Memory/disk: (1) 2 GB/0.1 TB
  Planned upgrades: None planned.

UH
  Systems: (1) Cray SV-1; (2) SGI Origin 2000; (3) SGI Origin 2000; (4) SGI Origin 2000
  Processors: (1) 24 at 300 MHz; (2) 32 at 250 MHz; (3) 16 at 195 MHz + 8 at 30 MHz; (4) 4 at 180 MHz
  Last upgrade: (1) March 1999; (2) March 1999; (3) March 2000; (4) December 1999
  Performance: (1) 28.8 Gflops; (2) 16 Gflops; (3) 6.2 Gflops; (4) 1.4 Gflops
  Memory/disk: (1) 16.0 GB RAM/156 GB; (2) 14 GB RAM/180 GB; (3) 4.5 GB RAM/36 GB; (4) 1.0 GB RAM/1 TB RAID5 (capacity extended by Veritas HSM using a tape library with 13.6 TB capacity)
  Planned upgrades: No information provided.

UI
  Systems: (1) NekoTech Jaguar 333 MHz; (2) DCG Computers Viper 500 MHz; (3) DCG Computers LX 533 MHz; (4) DCG Computers LX 533 MHz; (5) MicroWay Alpha 600 MHz
  Processors: 1 each
  Last upgrade: (1) 1995; (2) 1997; (3) 1997; (4) 1998; (5) 1999
  Performance: No information provided.
  Memory/disk: (1) 64 MB/9 GB; (2) 128 MB/18 GB; (3) 128 MB/18 GB; (4) 128 MB/18 GB; (5) 256 MB/18 GB
  Planned upgrades: Three AlphaStation-type workstations in the next five years.

IRI
  Systems: (1) Cray J-9; (2) SGI O2000; (3) NEC SX-4B
  Processors: (1) 8 and 16; (2) 64; (3) 2
  Last upgrade: 4 years ago for the Crays; nearly 1 year for the Origin upgrade; just over 1 year for the SX4
  Performance: (1) 1.5 Gflops; (2) 5 Gflops; (3) 2.5 Gflops
  Memory/disk: (1) 32 GB, 1.4 TB; (2) 16 GB, 0.1 TB; (3) 8 GB, 0.2 TB; additional mass store available (10 TB at LDEO, larger system at SDSC)
  Planned upgrades: Crays will be replaced within the next year; new system not yet known.

LANL
  Systems: (1) SGI Origin 2000
  Processors: (1) 1024
  Last upgrade: (1) 1999
  Performance: (1) 100 Gflops (theoretical sustained) (c)
  Memory/disk: (1) 256 MB/processor, or 256 GB for the system
  Planned upgrades: Unknown.

NASA-DAO
  Systems: (1) SGI Origin 2000 clusters
  Processors: (1) six 64-CPU machines, one 32-CPU machine
  Last upgrade: (1) 2000
  Performance: (1) ~3-4 Gflops on each of the 64-CPU clusters
  Memory/disk: (1) 16 GB central memory; disk space varies
  Planned upgrades: Only minor upgrades planned.

NASA-GISS
  Systems: (1) SGI Origin 2000
  Processors: (1) 96
  Last upgrade: (1) 1998
  Performance: (1) ~75 Gflops for mostly single-processor runs and ensembles of runs
  Memory/disk: (1) central memory 20 GB/1000 GB
  Planned upgrades: Upgrade to 128 processors, an upgrade of chip speed to the current state of the art, and increased disk storage.

NASA-GSFC
  Systems: (1) Cray T3E/600; (2) DEC Alpha 4100
  Processors: (1) 1024; (2) 12
  Last upgrade: (1) 2000; (2) 1999
  Performance: (1) 40 Gflops; (2) 1 Gflop
  Memory/disk: (1) 128 GB memory, 750 GB disk; (2) 3.5 GB memory, 1800 GB disk, 20 TB mass storage system
  Planned upgrades: (1 and 2) Doubling of capability for the current systems in 2001 and again in 2003.

NCAR-M. Blackmon
  Systems: (1) Cray C-90; (2) Cray J-90; (3) SGI Origin; (4) IBM SP
  Processors: (1) 16; (2) 16-20; (3) 32, 64, or 128; (4) variety of configurations
  Last upgrade: (1) decommissioned in late 1999; (4) spring 2000
  Performance: (1) ~5 Gflops; (3) ~5 Gflops (both using 64 processors)
  Memory/disk: Unknown.
  Planned upgrades: New system to be installed in early 2001.

NCAR-W. Washington
  Systems: (1) Cray T3E900; (2) SGI Origin; (3) Origin 2000/128; (4) HP SPP2000; (5) IBM SP2; (6) Sun Starfire; (7) DEC/Compaq; (8) Alpha cluster; (9) Linux cluster
  Processors: Unknown.
  Last upgrade: Unknown.
  Performance: Unknown.
  Memory/disk: No information provided.
  Planned upgrades: NCAR will soon be involved in procurement of a new system to be installed in early 2001.

NOAA-CDC
  Systems: (1) Compaq AlphaServer DS10; (2) Sun Enterprise 4500; (3) Sun Ultra 60; (4) Sun Enterprise 450
  Processors: (1) 12 machines, each with a single 466 MHz Alpha 21264 processor; (2) 2 machines, one with 8 UltraSparc II 400 MHz processors, the other with 4; (3) 6 machines, each with 2 360 MHz UltraSparc II processors; (4) 4 machines, each with 4 300 MHz UltraSparc II processors
  Last upgrade: (1) May 2000
  Performance: (1) 6.3 Gflops; (2) 3.6 Gflops; (3) 3.25 Gflops; (4) 3.6 Gflops (LINPACK Gflops for the aggregate of each system type)
  Memory/disk: (1) each node has 512 MB/50 GB; (2) 4 GB on the 8-processor machine, 2 GB on the 4-processor machine; (3) 1 GB RAM on 3 machines, 2 GB on the others; (4) 2 GB; 2928 GB of disk storage shared by the Sun systems
  Planned upgrades: AlphaServer cluster will be upgraded as faster processors become available (resources permitting).

NOAA-GFDL
  Systems: (1) SGI/Cray T932; (2) SGI/Cray T94; (3) SGI/Cray T3E (water-cooled chassis)
  Processors: (1) 22; (2) 4; (3) 128 at 450 MHz
  Last upgrade: (1) upgraded to 26 processors in 1996, de-rated to 22 processors in 1999 because of irreparable damage to the inter-processor network; (3) the air-cooled T3E with 40 450-MHz processors (128 MB of memory each) was replaced with a water-cooled T3E with 128 450-MHz processors (256 MB each)
  Performance: approximately 14-15 Gflops sustained for the laboratory's actual workload
  Memory/disk: central memory (1) 0.004 TB (shared); (2) 0.001 TB (shared); (3) 0.033 TB (distributed); secondary storage (1) 32 GB; (2) 2 GB; (3) 0 GB; rotating disc secondary storage (1) 450 GB; (2) 770 GB; (3) 430 GB
  Planned upgrades: Acquire a balanced high-performance system to replace the current SGI/Cray systems. The first phase of this new system is expected to provide at least a three- to four-fold increase in performance; the second phase should deliver a substantial further increase over the phase-one system.

NOAA-NCEP
  Systems: (1) IBM SP; (2) SGI Origin 2000
  Processors: (1) 768; (2) 256
  Last upgrade: (1) Nov. 1998, with a major upgrade due in Sept. 2000; (2) fall 1999
  Performance: Unknown.
  Memory/disk: (1) 256 MB/node on 384 nodes, ~96 GB total; (2) 128 GB total
  Planned upgrades: The IBM SP will be upgraded to a 128-node (2048-PE) system in Sept. 2000, with further upgrades to increase capacity in 2001; NAVO MSRC will continue to increase its total capacity by installing new systems such as a Sun server and an IBM SP.

NPGS
  Systems: (1) T3E; (2) SGI Origin 2000; (3) IBM SP2 (all off-site)
  Processors: (1) 256; (2) 128; (3) 64
  Last upgrade: 0-3 years old
  Performance: (1) 10 Gflops; (2) 10 Gflops; (3) 5 Gflops
  Memory/disk: 0.5-1.0 GB
  Planned upgrades: The remote systems have upgrades of 2x to 5x in computing power in the works.

NRL
  Systems: (1) Cray C90 (2 systems at FNMOC); (2) DEC Alpha (NRL system); (3) SGI O2K (FNMOC); (4) T3E (DoD HPC/NAVO)
  Processors: (1) 16/8; (2) 8; (3) 128; (4) 1088
  Last upgrade: (1) 1999; (2) 1999; (3) 2000; (4) 1998
  Performance: (1) 6.4/3.2 Gflops; (2) 2.0 Gflops; (3) 40 Gflops; (4) 50 Gflops
  Memory/disk: (1) 8 GB/3 TB; (2) 8 GB/1 TB; (3) 256 GB/3.7 TB; (4) 387 GB/1.5 TB
  Planned upgrades: The SGI O2K will be upgraded to an SGI SN1 during fall 2000; the DoD HPC systems undergo constant upgrades.

PNNL-S. Ghan
  Systems: (1) ~3 Sun Ultra 5 workstations; (2) Beowulf cluster
  Processors: (1) 1; (2) 16
  Last upgrade: (1) 1999; (2) 2000
  Performance: (1) 0.2 Gflops; (2) 2 Gflops
  Memory/disk: (1) 512 MB/30 GB; (2) 4 GB/320 GB
  Planned upgrades: Upgrade the Beowulf network to gigabit.

PNNL-R. Leung
  Systems: (1) IBM SP2
  Processors: (1) 512
  Last upgrade: (1) 1999
  Performance: (1) 247 Gflops
  Memory/disk: (1) 262 GB/5 TB
  Planned upgrades: Upgrade the IBM SP by replacing all existing processors with faster ones.

PSU
  Systems: (1) Cray SV-1; (2) IBM RS6000 SP (8 Winterhawk nodes)
  Processors: (1) 16 (each 1.2 Gflops); (2) 8 nodes of 4 CPUs each (32)
  Last upgrade: (1) 2000 (the Cray SV-1 replaced a J-class machine); (2) brand new
  Performance: (1) 6 Gflops; (2) 6 Gflops
  Memory/disk: (1) 4 GB/220 GB; (2) 16 GB/292 GB
  Planned upgrades: The IBM is an effort to match the architectures of recent U.S. laboratory purchases; if codes are successfully transitioned to this machine, the plan is to increase the number of CPUs, hopefully by a factor of 3.
3.3 HUMAN RESOURCES
The survey responses revealed an overwhelming need at many of the modeling centers for highly qualified technical staff (modelers, hardware engineers, computer technologists, and programmers), who are difficult to find because private industry lures them away with higher salaries and other financial incentives.
An interesting point to note from the survey responses is that staffing levels at all three sizes of centers are similar despite differences in the scale of effort. This is likely because at the smaller centers many of those listed as staff are students and post-docs, whose numbers vary depending on funding levels. There are approximately 550 full-time employees dedicated to climate and weather modeling in the United States. This number is likely to be low because not all small modeling centers were surveyed, and a few intermediate-size centers did not respond.
Most centers, regardless of size, indicated the likelihood of increasing the number of staff in the near future. Although many of the staffing increases listed were in the area of software development and computational support, a number of institutions were also increasing the scientific staff devoted to model interpretation and parameterization. Larger centers tended to be more satisfied with their staffing numbers. In part, this difference appears to be due to difficulties in finding stable, long-term funding for permanent staff at the small centers.
Respondents from universities differed over whether the availability of high-quality graduate students entering the atmospheric sciences is decreasing. Those centers that felt there were sufficient students noted that the greater difficulty was finding continued funding to support the highest-quality students available.
3.4 THE HIGHER-END CENTERS
Table 3-1 gives a synoptic view of the computer resources available to the higher-end centers in the United States. In general, most of the centers have computer capabilities on the order of 20 Gflops, with one or two having twice that. With these resources most coupled climate models are run at about 300 km resolution in the atmosphere and about 100 km in the ocean.
In contrast, the European Centre for Medium-Range Weather Forecasts (ECMWF) has a 100-processor Fujitsu VPP5000 rated at a sustained 300 Gflops, a 116-processor Fujitsu VPP700 rated at a sustained 75 Gflops, and a 48-processor VPP700E rated at a sustained 34 Gflops. Its forecast model is run at 60 km resolution globally, while its seasonal-to-interannual predictions are run at about 130 km resolution globally in a one-tiered sense and with ensembles of 15 per month. For more detailed information refer to http://www.ecmwf.int/research/fc_by_computer.html.
The Japanese Frontier Program is developing a 10 km global atmospheric model and has contracted for a supercomputer (“The Earth Simulator”) having a sustained speed of 5 Tflops (http://www.gaia.jaeri.go.jp/OutlineOfGS40v3_1.pdf).
3.5 ORGANIZATIONAL BACKGROUND
The earlier modeling report (NRC, 1998a) pointed out the basic health of small-scale climate modeling and the lagging progress of high-end climate modeling: these findings were confirmed above. That report summarized the difficulties faced by high-end climate modeling as follows: “The lack of national coordination and funding, and thus sustained interest, are substantial reasons why the United States is no longer in the lead in high-end climate modeling.” It also identified the United States Global Change Research Program (USGCRP) as the only available mechanism to coordinate and balance the priorities established by individual agencies, but pointed out that the USGCRP did not have the means to do this.
More background is appropriate, and again the organizational comparison of weather and climate proves valuable. The government organization for weather and weather forecasting was solidified about 1970, when NOAA and its Weather Service were placed in the Department of Commerce. The Weather Service embodied a specific agency structure with a well-defined mission that could be evaluated by progress in the production, accuracy, and delivery of weather forecast products.
The development of climate research in the United States was hastened by concerns over the perceived problem of global warming, but was constrained by the existence of an agency structure that had solidified by 1970. No additional government re-organizations occurred after 1970 and previous ones did not have climate as a tangible concern. Because no single agency could address all the aspects of climate (or more precisely, because many agencies claimed different aspects of climate but none were founded with climate as a mission), the Global Change Research Act of 1990 established the U.S. Global Change Research Program (USGCRP) “aimed at understanding and responding to global change, including the cumulative effects of human activities and natural processes on the environment, and to promote discussions toward protocols in global change research and for other purposes” (Appendix A of NRC, 1999a). It set into motion the USGCRP interagency process that addressed the following research elements:
- global observations of “physical, chemical and biological processes in the earth system”;
- documentation of global change;
- studies of earlier global change using paleo proxies;
- predictions of global change, including regional implications;
- “focused research initiatives to understand the nature of and interactions among physical, chemical, biological, and social processes related to global change.”
It also called upon the National Research Council to evaluate the science plan and provide priorities for future global change research. This was the motivation behind the NRC “Pathways” report (NRC, 1999a).
The Pathways report pointed out the flaws in the conception and implementation of the USGCRP—in particular that “in practice, the monitoring of climate variability is not currently an operational requirement of the USGCRP nor is there an agency of the U.S. government that accepts climate monitoring as an operational requirement or is committed to it as a goal.” It also expanded the domain of climate research to include variability on seasonal-to-interannual and decadal-to-centennial time scales.
A group of agencies, each devoted only to research and combined in the USGCRP, is currently the only institutional arrangement for performing climate research; for establishing and sustaining a climate observing system; for identifying, developing, and producing climate information products; for delivering these products; and for building the general infrastructure needed to accomplish these tasks. The USGCRP is currently the only entity organized to develop climate models and to secure the computational and human infrastructure needed to respond to the demands placed on the climate modeling community. About 6% of the $1.8 billion annually allocated to the USGCRP is devoted to modeling, and this includes the major data assimilation efforts of the NASA Data Assimilation Office.
3.6 SUMMARY OF HIGH-END CAPABILITIES IN THE UNITED STATES
With a sustained computer capability of 20 Gflops, the current capability of some of the U.S. high-end centers, a climate model consisting of a 300 km resolution atmosphere with 20 levels in the vertical, a land model, and a 100 km ocean model, all coupled together and well coded for parallel machines, can simulate 5–10 years per wall-clock day (see http://www.cgd.ucar.edu/pcm/sc99/img002.jpg). A 1000-year run would therefore take roughly 3 to 7 months to complete as a dedicated job. As we will see in the next section, these run times are too long to address some of the recent demands placed on the U.S. climate modeling community.
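The throughput arithmetic can be checked directly. A minimal sketch (the simulation rates are those quoted above; a 30-day month is an approximation):

```python
# Wall-clock time for a long climate integration at the throughput
# quoted in the text: 5-10 simulated years per dedicated machine-day
# for the coupled 300 km atmosphere / 100 km ocean configuration.

def wallclock_months(sim_years: float, years_per_day: float) -> float:
    """Months of dedicated machine time needed for a run of sim_years."""
    days = sim_years / years_per_day
    return days / 30.0  # approximate 30-day months

for rate in (5, 10):
    print(f"{rate} yr/day -> {wallclock_months(1000, rate):.1f} months")
# 5 yr/day -> 6.7 months; 10 yr/day -> 3.3 months
```

At these rates, a millennium-scale control run monopolizes a machine for a substantial fraction of a year, leaving little capacity for the ensembles and sensitivity studies discussed earlier.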