Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter.
Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.
OCR for page 100
D
Questionnaire Responses
In formulating the lessons learned described in Chapter 3, the committee interviewed experts, heard
presentations, and created an online questionnaire that was distributed to the broader community.
The online questionnaire received responses from national and foreign individuals who had
an interest in spatial data infrastructure and proved to be a rich source of opinions from users,
planners, and policy-makers. The responses are grouped below by issue.
Lessons Learned Questionnaire Responses
Issue Lesson
Standardization Full acceptance of the organization of the need for rigid standardization
of its data and information products to agreed international standards
Organizational commitment to internationally agreed metadata standards
[Successful organizations] work within the community to make/improve
standards...
[Challenges] Let a thousand flowers bloom In the past year we are
witnessing a global convergence in thinking on how spatial data should
be integrated. This is occurring in all technical fields as well as in the
library sciences. There are scores (hundreds??) of projects moving
forward and we risk duplicating effort or worse, creating divergences
in standards, protocols, processes and methods that will make later data
integration much more difficult or effectively impossible.
[Challenges] The amount of data. Data from countless agencies and
data formats, gathered with varying standards, documented with varying
accuracy and amount. It can be difficult to get many people and agencies
to agree on standards and work together unless some plan and mutual
benefit is in place.
[Challenges] Technical - appropriate standards (metadata, vocabulary)
100
OCR for page 101
APPENDIX D 101
Issue Lesson
[Does not work] Implementation of OGC standards suffers from
performance issues. There is lack of leadership.
[Worked well] The development and adoption, though limited, of open
standards for geospatial Web Services is a key capability that promotes
interoperability. This, in turn, allows for neighboring or overlapping SDIs
to work well together without special agreements or translators.
Data Scientists must make available data that underpin knowledge products
Federal data are created to some minimum achievable standard
Census using local roads data
Landsat 7...we've done the best we could. We need the continuity
mission now.
Everything online [what has worked well] - We have seen a paradigm
shift in thinking about data and especially spatial data, in the last four
years or so. Prior to this, data owners were generally unwilling to share
their data for fear of them being misused, losing control over the data,
of them being used to scoop the originators of the data, or others getting
credit for the data. In the past few years however, there is widespread
recognition of the value of making ones data more widely available for
others to use. This coincides generally with the release of Google Earth
and the rapidly growing expectation that everything, including scientific
data, should be readily available online at no cost.
[what has not worked well] Mandated uniformity Everyone has
invested vast resources in their databases and spatial data infrastructure.
So, when discussion of data integration came up, the fear was that we
would all be forced to convert what works for us, into some format (and
operating system, and server configuration) that would be imposed.
Many data providers have custom systems and applications that will
be prohibitively expensive to re-do. Plus, with cyberinfrastructure in a
constant state of change, how could we adopt a system that would not be
obsolete before it was implemented.
[Do differently] Put some good people into cataloging existing
reports and data sets. Build better metadata tools. Make management
accountable to publish geospatial data from all projects. Make all projects
identify spatial data results, plan for, and publish them before project is
considered complete.
[Challenges] Geography has spent the last decade trying to justify their
existence, rather than meeting customer needs. Many of Geography
programs have become largely irrelevant, with some very notable
exceptions, all of which are long-term commitments of resources focused
on data content, such as NED, NHD, NLCD.
OCR for page 102
102 APPENDIX D
Issue Lesson
[Challenges] The major challenge is to establish a NSDI organization
that is viewed by authors as a robust clearinghouse of their spatial
datasets. Authors should be glad to submit their data and should be
delighted that others will have easy access, instead of having to handle
`data requests' every time someone wants it. The current cultural views
the NSDI as an unfunded mandate, with a lot of hassles to submit data,
with very little benefits in return. Trying to establish a new organization
- or revamping the existing one - is always extremely difficult, and the
establishment of this one is even more difficult because of the lack of the
basic understanding of the true value.
[What has not worked?] US Topo is a solution looking for a problem.
The focus should be more on content, rather than packaging. GeoPDF
may satisfy a certain niche, but without excellent content, it serves little
purpose.
[has worked] Several national seamless datasets have been very
successful, including NED, NHD, NLCD, and NHDPlus. These are
providing very useful data that is nationally consistent, well organized,
and easy to access.
[not worked] The WRD NSDI node is just a tabular list of 646 datasets
- some datasets are listed by theme, such as, ag, aquifers, etc; a lot are
listed with obscure names, such as , darea, diffus, etc.; some are listed
by OFR #, by SIR # and by WRIR#. How in the world can anyone find
what their [sic] after? We need a better way of assigning searchable `key
words' to the datasets and tools that can search and retrieve datasets that
meet a specified query.
[not worked] Main sticking points are access to updated, high-quality
satellite-derived imagery, and access to sufficient field-based observations
of vegetation (i.e., we need 500,000 current georeferenced samples with
sufficient vegetation composition and structure documented - maintained
and accessible) to support map development and accuracy assessment.
[Challenges] technical challenges come mainly from a lack of certain
critical data sets required to develop robust spatial models. We work
across local/regional/national/ continental scales, so access to data that
are standardized across these scales presents the greatest challenge.
[Domain] Our work is centered on biological data, including the
characterization and assessment of ecological systems and habitats for
species of concern. But in order to address this domain successfully, we
rely on a wide range of non-biological data inputs, such as imagery (of
varying types and resolutions), digital elevation, synthesized climate data
(past, current, future), surficial geology, soils, surface drainages, wetland
location, hydrography, land use, land ownership, and land use policy.
[worked well] The understanding that data needs to be consistent and of
known quality so that decisions can be more easily made on how the data
can/should be used.
OCR for page 103
APPENDIX D 103
Issue Lesson
[didn't work] We are severely lacking in field-based observation data for
about 10,000 plant and animal species in the U.S. that are of conservation
concern. With these observations and a foundation, habitat models can
be developed to apply to many forms of environmental decision making
processes.
[not worked well] Data content standards and schemas have not been
widely adopted for key base data sets. This makes the exchange and co-
development of data, at least in the US, more difficult.
Metadata [what works] State and local grants for data creation and metadata
training.
[what did not work well] Difficult metadata standards It is such a
challenge to generate FGDC-compliant data that many individuals,
programs, and agencies, do not even try, and instead have their own
internal systems. The head of a geoscience division in a large federal
agency told us that, yes, the USGS standards are nice, but they could not
invest the time and resources to meet them, so they developed their own
in-house way of doing things, because they had to get things done. The
chief scientist for one of the world's largest multinational oil company
described to me how they were scraping their fourth internal attempt to
create a company-wide data base, after spending millions on it. Before
they could make significant progress, various offices and branches had
gone off in other directions because they could not wait, and they had
their own needs to address.
[What has not worked?] It has been too hard to develop metadata [for
water data], and the process is usually left to the end of a project and then
not done. Management bears much of the blame for this, as they have not
enforced the requirement to publish metadata, even though it has been
required by executive order since 1994. Although much of our work is
supported by geospatial data, much of the data supporting that work is
never published because of this.
[What has worked?] Publishing metadata through the NSDI node, when
it is done, does work fairly well. I can find datasets I published in the
past easier with Google than I can find them on my hard drive or backup
media.
[not worked well] Outside the Federal government: metadata collection
and dissemination are often non-existent.
Distribution, [what works] The National Map accepting state data and using it on ...
Serving servers
Having the National Map portal at EDC is very useful
OCR for page 104
104 APPENDIX D
Issue Lesson
Distribution, Work with Google [private industry in general] to make it even more
Serving Continued useful and friendly. Market expansion for geospatial data could be
exponential if the tool people sue to access the data is easy enough to use
and does enough analysis. (all we need is overlay analysis and we are in
the GIS business online)
[What has worked well] Market driven solutions the pervasive
adoption of free online visualization tools for spatial data (e.g., Google
Earth, Google Maps, Bing, ArcGIS Explorer, etc) by even the smallest
retailer and organization has made it the norm to share (and promote)
your data online. Also, data providers find these free tools of tremendous
value, so that when more data are made public, more tools and
applications become available to use them.
[Do differently] Make the USGS SDI more powerful by giving it better
search and data discovery mechanisms. Requires support from the top,
a budget, and a dedicated team - not just USGS but from all agencies.
Standards need to be defined. Robust software tools need to be developed
to create standard metadata, and to provide the ability to search all NSDI
nodes. This really requires a lot of coordination from all agencies to
enforce the standards so `searches' have the potential to retrieve all data
that meets a query.
[What has not worked?] The National Map is [ineffective]. Viewer 2 is
better, but it still lacks compelling content. Much of the current content
is not much better than we had in the early 1990's from TIGER/Line and
100K DLGs.
[worked well] New map services are providing data access in new ways.
These include NWIS web services, NWIS Mapper, real-time earthquake
maps, and StreamStats. StreamStats provides analytical services rather
than just raw data, and is a good example of how far the web service
model can be taken.
We were able to work successfully with USGS to access global
climate, digital elevation, lithology, and other data sets for critical new
advancements in classification and mapping of terrestrial ecosystems.
Much new work has been advanced in the U.S., Latin America, Canada,
and Africa, due to the accessibility of key data sets from USGS.
[Worked well] The idea of centralizing data so that it can be easily
accessed.
[Did not work well] Up until the National Map, we had access to
localized lists of available spatial data from both the USGS and other
agencies. You had to read through title upon title to find things and
getting a grip on what was available for a given geographic region was
difficult. You would have to visit each agency where you think data
might be available and then it may not be documented very well.
One-stop portals have not materialized and efforts such as the National
Map have failed to reach their potential.
OCR for page 105
APPENDIX D 105
Issue Lesson
It has worked OK for me. I use several sources for data and those sources
are reliable and well documented. However, certain USGS efforts have
not reached a level of usefulness (e.g., the National Map).
[Worked well] The increased use of and migration to database
technology for storing spatial data.
Tools [Challenges] Technical - appropriate standards (metadata, vocabulary)
and tools (gazeteers, spatial and keyword search) are still lacking.
Existing systems need to lessen reliance on proprietary software. Tools to
integrate diverse data types need development.
[worked well] Cutting edge development of tiled map services by
Google and others
Public Relations Under promise and over deliver
The more successful organizations in building SDIs are the ones that
have a long history of collaborating with other organizations and ha[ve] a
culture which is focused on making data and information available to the
broader community.
State liaisons. Relationships in the field cannot be beat...should be some
of the most intelligent and motivated folks...part of their performance
evaluation (not sure if this is possible) should come from the people they
serve in the states.
Partnership maintenance, state and local venues...local professional
organizations...representation on state and local geospatial decision
making bodies
[Challenges] High expectations - Increasingly, scientists as well as
decision makers, business and the public not only want, but expect all
data will be instantly available online at no cost, and fully interoperable.
Such systems are standard on a number of popular network television
crime shows where all data of any kind sought are brought to the desk
top instantly and fully integrated with no need to convert, process, or
interpret them.
[Do differently] Demonstrate the results and benefits earlier Until
recently, we have been talking among ourselves primarily and not to
the users of the infrastructure. Audiences glaze over instantly with the
mention of data exchange standards and semantic ontologies. So, we are
starting to showcase what the system will look like and deliver to the
average user. A demonstration of the Geoscience Information Network
(GIN http://usgin.org) to the Arizona Legislature in November 2009
was hugely successful, not only in showing decision-makers the potential
but to many of our stakeholders and participants who are still somewhat
fuzzy about how this will all work and what it will do.
OCR for page 106
106 APPENDIX D
Issue Lesson
Public Relations [do Differently] Make the USGS SDI more powerful by giving it better
Continuted search and data discovery mechanisms. - Requires support from the top,
a budget, and a dedicated team - not just USGS but from all agencies.
Standards need to be defined. Robust software tools need to be developed
to create standard metadata, and to provide the ability to search all NSDI
nodes. This really requires a lot of coordination from all agencies to
enforce the standards so `searches' have the potential to retrieve all data
that meets a query.
Planning Develop a roadmap that encompasses the business , information,
technology, computation and engineering viewpoints, and consistently
review and update as required
Well developed business case that articulates to the organization what the
value of the proposition of SDIs are
Successful projects are done incrementally...low hanging fruit
Successful projects initially focus only on those projects that are staffed
by fully committed people.
[do differently] Appropriate data management starts at the planning
phase and proceeds through data collection, processing and use. Tools
must be provided that reduce the burden to individual projects/users
throughout this process - and that ultimately provide them access to more
data than would be otherwise available (or easily discovered/accessed).
[Do differently] 1) Promote data lifecycle management objectives
and outcomes as performance indicators for federal agencies, 2) create
government centers of excellence for highest priority data sets and
require cross agency funding mechanisms for collection and maintenance,
3) promote standards-based, optimized, geospatial data service hosting
for federal agencies to increase capacity and uptake.
Organization FGDC or some other entity has not been given adequate authority to
carry out the mission they were put in place to do. If they are meant to
be successful, they need to be put in some place other than USGS...like
OMP.
OCR for page 107
APPENDIX D 107
Issue Lesson
[What has not worked] Central or concentrated control (e.g. Data Czars)
in the early days of the Web, researchers starting creating centralized
databases for each domain or sub-domain. These required scouring the
archives and literature for analog ("legacy") data, digitizing them, and
building an ongoing capability to update and maintain the repository.
Very soon, data providers could be barraged by multiple data base owners
for copies of their data and constant demands for the latest updates.
No one had the time or resources to be repeatedly feed the demands
of external bodies for their data. As the number, size, and diversity of
data bases grew rapidly, the communities wrestled with how to share
and integrate data from disparate sources. Proposals to `coordinate' data
integration or oversee standards were viewed skeptically or hostilely by
many as creating the potential for `data czars' to impose their will on the
rest of the community. This concern was one of the biggest stumbling
blocks to getting community consensus in building cyberinfrastructure
for the earth sciences in the past decade.
[Challenges] This new organization is not just [about the] USGS, but
all stakeholders from all agencies. Since the current organization is
disjointed, it almost appears the past approach was to allow agencies to
do whatever they wanted, and the `best practice' would float to the top
becoming the de facto standard. But the reality is, nothing floated to the
top and it is still disjointed.
Security needs and concerns also challenge most government programs
[worked well] Recognition of benefits of web services
[Challenges] The major challenges are primarily organizational,
confounded by financial challenges. USGS has not had consistent
leadership with the goals of leveraging our geospatial data and the
enterprise licenses The majority of geospatial issues in the Department
of the Interior (DOI) and USGS is the result of too little attention to
the fundamentals of data standards and data applications across the
spectrum of spatial data services in USGS. There is a partitioning of
data collection among themes and funding of these themes, as well as
partitioning of support services for Geospatial Data collections and the
research scientists requiring GIS support to use our enterprise license.
A very small part of the GIO is able to see the big picture and the result
is that GIS application support has fallen through the cracks of constant
reorganization.
[Challenges] The most significant challenges to success of SDI are:
1) Clarity of responsibility and government-wide recognition of
the stewardship responsibilities, 2) clear governance with regard to
collaborative development and stewardship within and beyond the federal
government, 3) greater leverage of public and private data resources
and value-add capabilities, 4) lack of wide adoption of Web Services
infrastructure.
OCR for page 108
108 APPENDIX D
Issue Lesson
Organizational Executive level support as well as commitment from senior, middle, and
Commitment junior levels of staff
Champion who is knowledgeable and respected by the community
Full understanding of the impacts of the introduction of what is
essentially a disruptive technology
Collaboration in sharing of data, agreement of standards etc is critical to
the development of an SDI
Scientists must make available data that underpin knowledge products
... properly articulated polices can be an enabler for SDIs....needed at
the organizational ...whole government level.
[Challenges] Sustainability Hundreds of millions of dollars have
been spent on myriads of projects that, while individually successful,
have not led to the creation of an integrated or sustainable spatial data
infrastructure. Hundreds of stove-piped projects have been funded, but
too many disappeared when they could not get funding renewals. Or the
technology has changed and the results are in obsolete formats or buried
on a hard drive somewhere. NSF is requiring new informatics projects to
address the question of sustainability but having recently reviewed a large
stack of proposals on an external panel, the community practitioners are
not even close to dealing with this problem realistically or satisfactorily.
[Do differently] Once the new infrastructure is in place, all projects
should be required to budget time and money to prepare and submit all
spatial data - as intended.
[What has not worked?] The NSDI was initial established in 1994
and was intended to be a repository of all spatial data referenced in
reports/publications. I'm not sure how many spatial datasets have been
referenced in Water Resources Division (WRD) publications since 1994
to the present, but I would estimate well over 10,000. Keep in mind, GIS
started to become main-stream in WRD in the mid 80s.WRD currently
only has 646 datasets in the WRD NSDI (http://water.usgs.gov/cgi-bin/
lookup/getgislist ), so as you see, there is a huge problem getting authors
to participate and I'm glad to see this finally getting addressed. To the
authors defense, the reasons listed below are why they did not participate.
[worked well] Not much has worked well; no support; standards not well
defined; very little guidance; very little incentive; software tools to create
consistent metadata lacking; datasets are almost considered a burden,
especially large ones; search mechanisms of data in NSDIs lacking.
[Challenges] Cultural - Incentives, if not mandates, need to be provided
and a culture needs to be developed that recognizes data management
and provision as part of the public trust responsibility of federal and state
agencies. This culture will not arise because of theoretical benefits, it will
develop when real benefits accrue to users through a) facilitation of data
access and use and b) when systems provide relief from burdens of data
and metadata development and management.
OCR for page 109
APPENDIX D 109
Issue Lesson
[do differently] Make a real commitment to Enterprise GIS and
geospatial data management, development, and integration. Current
support is nominal and based on the minimum support required to fulfill
requirements of enterprise GIS licensing agreements.
[do differently]The USGS Geospatial programs are primarily outward
looking, and driven what they feel is public demand. This does very
little to support USGS science. USGS Management needs to define a
geospatial science commitment and plan
Personnel Tertiary trained professionals who understand the technology and are
respected
Important to accept the high level of technical skills required to develop
an SDI. ...people become overnight experts...can annihilate a project
very quickly.
[do differently] Requirements for, and funding for, comprehensive data
management within a shared infrastructure should be explicitly required
in funding requests and performance evaluation.
USGS lacks staff that are as skilled as in the private sector. The USGS
is very salary burdened and as such has limited funds to go to outside
vendors who could develop infrastructure.
I think it is important to make sure that USGS researchers have a clear
stake in the development and maintenance of world class data bases. In
line with one of the recommendations of the NRC report (Finding the
Forest in the Trees: The Challenge of Combining Diverse Environmental
Data Committee for a Pilot Study on Database Interfaces, National
Research Council 1995) I think that USGS has to find a way to enable
researchers to RGE "credit" for ongoing involvement in the development
and maintenance of databases. Leaving database development/
management to IT people or masters-level scientists will inhibit the
researcher-driven experimentation, brainstorming, and interdisciplinary
mindset needed for the creation and ongoing development a database that
serves [an] ambitious science agenda.
Resources Adequate funding but not over funding
Avoid big projects with big funding that promise to deliver everything to
everyone
SDIs work that have provided economic revenue...easy to get additional
funding. Is economic revenue the only benefit that will work?
It works when funding is applied from the fed state level supplement
long term partnerships between fed and state. And it doesn't take many
$$...
Work with Google [private industry in general] to make it even more
useful and friendly
OCR for page 110
110 APPENDIX D
Issue Lesson
Resources Uncoordinated federal/state/local geospatial budgets and expenditures do
Continued not work
Funding geospatial data programmatically rather than strategically does
not work
The USGS has not been adequately funded to carry out their mission of
civil domestic mapping over the U.S.
What I would do first and immediately is figure out what the SDI is
worth in the U.S. and to whom it is worth what? Once you know what
everyone does with it, where the gaps are and put a $$ value on closing
those gaps you could begin creating the necessary partnerships both
programmatically and fiscally to complete a sturdy and useful SDI.
We worry so much about the sexy technologies that we forget people
just need this stuff to get their jobs done. Those who have worked
beside me for years have heard this before. We need to understand the
econometrics of our SDI to be able to spread the cost and responsibility
in a useful and meaningful way. Maybe we need to get economists and
intergovernmental programmatic folks together to monetize the SDI.
[what has not worked well] Non-sustainable business models - Early on,
NSF and other agencies funded the creation and population of databases
but after a few years it became clear that NSF did not have the mission
or the resources to maintain this infrastructure permanently. Data bases
shut down for lack of funding. Resources disappeared and people moved
on to other projects. Even today, many funding proposals describe their
sustainability plans as simply returning to the original funding agency
and asking for more money.
[Do differently] Integrate with other domains To say we would do
things differently may be misleading. The problem has been finding
resources to do all the things we know need to be done, including
integrating our work with that being done in other domains.
[Do differently] Once the new infrastructure is in place, all projects
should be required to budget time and money to prepare and submit all
spatial data - as intended.
[Worked well] Spatial data infrastructures (SDI) have worked well at
the federal level, and have mostly worked well at the state level. With
funding problems, SDI has faltered somewhat at the state level, and for
the same reason, many counties and other local jurisdictions have had
mixed results varying from robust SDIs to non-existent SDIs.
[Not worked well] Outside of the federal government: un-funded
mandates for SDI tend to be ignored;
[Challenges] The major challenge is financial: support for SDI requires
additional personnel, with changes technology and cultural behaviors.
Many academic and non-governmental organizations (as well as a
number of governmental entities below the federal level) will not
undertake participating in SDI unless the financial support is available
since it would take time away from existing activities.
OCR for page 111
APPENDIX D 111
Issue Lesson
[did not work] Broad and generic mandates or reliance on "good will" to
drive participation in development of community information resources.
[Challenges] Financial - Data management, provision, and integration
are the infrastructure for both science and management applications. The
resources to build this infrastructure are lacking.
Coordination It doesn't work when there is competition within the state to be the single
point of contact. i. e. a state GIS coordinating council. Helping the states
get coordinated is a very useful activity for USGS. Through their liaisons
and field offices. (suggest NSGIC for these activities - they live and die
by coordination and cooperation).
[does not work] States who are not coordinated and have a state level
geospatial coordinating body. There must be an entity who can speak
with authority on funding issues for geospatial data at the state level,
otherwise fed state partnerships are very difficult to put together. The
state entity must be recognized by state agencies, and the executive and
legislative branches of govt. along with the local governments.
[Challenges] Agency cultural, data, fiscal you name it....silos. I was a
fed and a state person for a long time. I know first hand how difficult
it is to do intra and inter-agency coordination of anything, let alone
intergovernmental cross coordination. But it is critical to the success
of an SDI. If geospatial funds and programs were (pipe dream here)
coordinated (not consolidated) across the fed level - by OMB the only
people with a big stick in the fed govt. - just the slosh factor of $$ being
expended on geospatial activities at the fed level could fund coordination
activities at the state level.
I always did think that if we took the lines of business (or whatever the
current lingo is at the fed level) not just across the bureaus and down
through the fed agencies but on down to the state and local level there
would be a logical pathway of responsibilities. In those pathways there is
a common need for the same kind of data, geospatial data and practices.
How hard would it then be to monetize the value of the necessary data
and applications to get the job done at every level it needs doing??
Something like the old a-16 process.[OMB Circular A-16 Coordination
of Geographic Information and Related Spatial Data Activities Revised
2002]
OCR for page 112
112 APPENDIX D
Issue Lesson
Coordination [Challenges] Community adoption and buy-in The geoscience
Continued community has been wary of cyberinfrastructure (including spatial
data infrastructure) due to concerns over control of and access to data,
recognition of data ownership, costs of converting data and systems,
mandates, and how decisions are made. Every domain is dealing with
similar issues, and coming up with generally similar approaches. Yet,
we are all still mostly working within our community stovepipes. We
have much to learn from each other and much we can share so we don't
have to duplicate or relearn what others have done. The library sciences
in particular are making dramatic strides in aggregation, archiving, and
disseminating digital data in a multitude of formats. We have not made
the connections yet with them.
Even within the geosciences community, we are only part way there. Our
network is based on geological surveys with only a few example external
partners. The NSF-funded National Geoinformatics System (NGS)
project to evaluate community needs and wishes has been dormant for
more than a year. Could they be watching to see how GIN (and NGDS)
develop and serve as core elements of an NGS? We also need to nurture
preliminary linkages with the biological, oceanographic, atmospheric,
and geographic communities as well as computer sciences
[don't do well] We also need much greater coordination and dialogue
across this community to minimize wasted effort and maximize
accomplishment of shared goals.
[Challenges] ...the most critical challenge stems for the inadequate
dialogue and coordination among developers and users of these critical
data. This is a combination of policy (e.g., stovepiped federal agencies),
cultural (basically, a `stovepiped' mindset), and financial issues (we're all
scrambling for resources).
[Do differently] The FGDC, USGS, and other bodies need to be better
supported, more open in membership (i.e., to science NGOs), and
empowered to support more robust dialogue, clarify shared goals, and
facilitate sharing of financial resources.
[what worked] Development of systems/processes that engage the "user
community" in defining requirements and reflect the technical capabilities
available to the users. And, in response, focusing on provision of tools
that facilitate use of existing systems (FGDC/GOS) by reflecting the
particular search, discovery, and access needs of the users. Working with
a specific but broad user community (coastal and marine researchers and
managers) to develop tools that facilitate integration of data and model
output using open source standards in response to identified needs.
OCR for page 113
APPENDIX D 113
Issue Lesson
Vision [Do differently?] Could we have gotten here earlier? The debates at
workshops, forums, professional meeting sessions, and in the corridors
over the past decade were part of a process of exploring and testing
ideas in a fast-changing technical and social environment. It is only in
hindsight that we see where we were heading. But I doubt that if we
presented our current model to ourselves 10 or even 5 years ago, that we
would be ready to embrace it. There has been an evolution in thinking
that was crucial to developing current models. Based on conversations
with colleagues in other fields, and in tracking the literature superficially,
it appears that the solid earth geosciences are just a bit ahead of other
communities in coming to our present realization and acting effectively
on it.
a. Interoperable data should be seamlessly delivered to desktops
regardless of the originating database software, version, operating
system, or server.
b. Open-source data and services need to be compliant with open-
source standards such as OGC and ISO. This will help avoid the problem
of data that cannot be accessed in obsolete or priority software
c. Distributed data providers should provide their latest available data
directly into the network. They decide what is made public and when.
They do not have to continually pass along their revisions to a growing
number of data aggregators or central databases.
The data network then looks more like the Web each provider is
responsible for what they want to share. There will be a continuing
need for archive and orphan data repositories for data that do not have
permanent homes, and for data scavenged from historical and analog
sources. But even these central databases will be another layer of
distributed nodes in the network.
d. Web-based (SOA) services and applications are increasingly being
served on the Web rather than being on the desktop. This allows for large
resources beyond the standard desktop to handle and greatly diminishes
bandwidth requirements. Referencing an online resource also means you
are using the latest version as are others.
e. Flexible, dynamic, organic, modular the system has to open to users
to choose what tools and applications they want to use and to allow
them to develop and implement their own applications. Just as there is
not only one Web browser, there should not be components beyond the
most fundamental standards and protocols that are mandated to users.
Technology is moving too fast to be locked into restrictions that will
limit and ultimately make the system obsolete. A modular approach
allows anyone with a better idea to link into the network and make their
service available. It also means the network developers don't have to
build everything. They can choose among the best work done by others
in order to quickly assemble a functioning system, while leaving open the
potential for alternatives to be networked
OCR for page 114
114 APPENDIX D
Issue Lesson
Vision f. User-friendly The first Web sites had to be tediously programmed
Continued in html, but now user-friendly commercial software and ubiquitous
free applications, allow everyone to easily and quickly build Web sites,
including specialized sites like blogs.
The early stages of the spatial data infrastructure will require fairly
sophisticated developers but emphasis should be on off-the-shelf
cookbooks and guides, and eventually smart applications that almost
anyone can use to provide data or services to the network.
g. Community of practice The changes being brought about by the
widespread use of digital data delivered via the Web requires that
we develop new communities of practice in how we qualify the vast
amounts of data that we might otherwise use indiscriminately and how
we recognize and reward those who provide data and services in data
networks.
Improved search engines should make it easier to find everything. A web
service should index everything we have, allowing users to subscribe
to any content desired. The system should be distributed, and should
aggregate datasets from Science Centers. The Science Centers would go
through a streamlined process to document and publish their data sets,
and to set access, e.g. local use only, USGS only, or public dissemination.
From that point on, it should be automatically harvested and pushed out
to the appropriate user groups. The content could be live services, or
could be extracted to a local geodatabase, and this could be maintained
and updated automatically. Most of the pieces of such a system exist and
could be implemented today.
[Do differently] Put some good people into cataloging existing
reports and data sets. Build better metadata tools. Make management
accountable to publish geospatial data from all projects Make all projects
identify spatial data results, plan for, and publish them before project is
considered complete.
[Vision] Very simply, my vision for SDI is that it should enable the
scientific community to freely access and exchange spatial data with
sufficient metadata to allow an interchange.
To look at an image of the U.S./globe, zoom in on an area, and get a
listing of ALL the available data for that patch of land. Then be able
to view detailed documentation on what the data are and how it should
be used and then be able to download a single geodatabase of that
information for the patch of land I am interested in.
USGS has a critical role to play in facilitating dialogue among the federal
agency, academic, non-government science, and state agency sectors
to clarify shared goals, data standards, and data sharing technology.
Success in this area will allow us to collectively maximize utility in our
investments in spatial data.
OCR for page 115
APPENDIX D 115
Issue Lesson
SDI should benefit data collectors and users from planning (evaluation
of existing data), collection (standards and requirements), metadata
development, archiving, search & discovery, and integration. The system
will not be seen as an "overhead" on research activities - but rather as
a way to facilitate research, ensure data preservation, and will enhance
and expand the application and integration of information resources.
Performance will be evaluated not simply on "availability" of data - but
on success in enhancing data application to meet diverse research and
application needs.
A system that is integrated that provides readily available information
from local to national scales. A on-stop integrated portal would be a nice
start. Also, the SDI should have a set of tools and interfaces that permit
the integration of data ... e.g., downscaled climate data and models.
Promotion and development of fast, reliable, web services that provide
discovery and access to geospatial data. The users will figure the rest out.
Better use of and support to the users of Enterprise GIS tools.
An NSDI that supports the USGS Science Strategy would include
relevant base and thematic data that are refreshed at an appropriate rate
and yet are maintained as time-accessible snapshots to allow change and
context evaluation. The SDI would provide a geographic framework for
the publication of most scientific data of the USGS, allowing for easy
visual analysis of geographic and temporal phenomena.
OCR for page 116