A Collaborative Agenda for Improving International Comparative Studies in Education
A Collaborative Agenda for Improving International Comparative Studies in Education
A COLLABORATIVE AGENDA FOR IMPROVING
INTERNATIONAL COMPARATIVE STUDIES IN EDUCATION
Interest in international comparisons in education has increased rapidly
over the past two decades, perhaps now exceeding that of the earlier era of
public discussion in the late 1950s that was stimulated by the successful
launching of Sputnik. This renewed interest offers both potential benefits and
considerable risks. Earlier research has been cited in the press and in
reports prepared for policy makers, but much of this public discussion has been
selective, overly simplified, and a poor guide to public policy making. The
increasing scrutiny of earlier studies has revealed their limitations and the
consequent need for improvement in the planning, execution, and dissemination
of international comparative research.
If the situation is to be improved, the tasks of establishing a more
coherent and effective worldwide system for cross-national research and data
collection in education must be shared among many organizations in many
countries. This document does not presume to set the agenda for international
research, or even to indicate the priorities for such research-- activities
that would be appropriately carried out only within the framework of a
worldwide system for cross-national education studies. This document was
prepared to stimulate interest and a willingness to engage in establishing such
a system by suggesting a vision of what international comparative research and
data collection might produce in the next 20 years.
By the year 2010, this vision calls for those with an interest in learning
about education in other countries to have access to the following sources of
data, information, and analysis:
- A set of widely accepted and widely understood indicators of the status
and conditions of education in various countries;
[1]
- Syntheses of empirical research throughout the world, bringing such research
to bear on broad comparative questions of wide interest;
- Specialized comparative monographs on questions of importance and interest
to all educators;
- Documentation of teaching practices and school organization in a form that
will be stimulating and useful for discussion and study by specialists and
nonspecialists, including, for example, videotape archives of a wide range of
classroom practice;
- Widely distributed publications of comparative information on issues of
public policy, written in nontechnical language and responsive to the interests
of journalists and the public;
- Routine translation, distribution, and use of primary documents, such as
textbooks and syllabi, when these documents are sufficiently distinctive and
important to be of interest to educators outside the country of origin; and
- Archives necessary to strengthen the methods used in cross-national studies,
including both technical reports on what works methodologically and banks of
instruments, items, and other applicable research elements and models for a
variety of cross-national studies.
This document is primarily intended to focus attention on the nature of
collaborative efforts to attain the vision. It includes discussion of the
value of international comparative studies in education; how they could be
improved; and a suggestion for consultation and forming alliances among
potential collaborators.
WHAT IS THE VALUE OF INTERNATIONAL COMPARATIVE STUDIES?
Why should the United States or other countries be interested in
supporting or participating in international education studies? From the Board
on International Comparative Studies in Education's viewpoint, the most
important reason for U.S. participation in international studies of education
is to improve understanding of our own education system (Bradburn and Gilford,
1990). Since the philosophy, organization, and practice of education varies
across cultures and societies, a comparative perspective will increase our
understanding of its many forms. Every aspect of education might benefit from
comparison, from issues of teaching and learning, to matters of curriculum and
organization, to broader issues of education policy. Since there are no
absolute standards in education, comparison informs the standard we set for our
children. Comparative studies also help policy makers monitor the success of
education systems. And given that many people are reluctant to conduct
controlled experiments with our children's education, comparison of natural
variation is usually the most feasible way to study the effects of differing
policies and practices.[2]
Nevertheless, much of education research is parochial, uninformed by the
comparative knowledge that cross-national education research can provide.
Researchers should be encouraged to develop broad and in-depth knowledge about
cross-cultural similarities and differences and similarities in the way people
learn. They should also undertake studies that examine the context of and
participation in formal and informal education at all levels. Research is also
needed to generate more narrowly focused knowledge that speaks to pressing
problems of practice, such as identifying pedagogies and schools that best help
children at risk to avoid becoming dropouts or knowing the role of algebra in
curricula designed for today's students and how they learn it best.
It would be important, therefore, to seek ways to engage education
researchers, practitioners, and consumers in the search for international
comparative understanding of what education is and how it should be judged.
Researchers should capitalize on what can be learned from a natural worldwide
laboratory of education systems, which over generations have come to embody
many important differences in opportunities to learn, students' desire to
learn, curricular structures, and teaching practices. Addressing these
differences involves making inferences about what different cultures and
organizations around the world consider to be excellent outcomes and excellent
practice. Education policy makers should focus on success wherever they find
it and could also learn from what does not seem to work. It is especially
important to seek data over time in order to examine trends and the impact of
changing contextual conditions. In addition, the collection and sharing of the
artifacts of education (e.g., textbooks, children's writings, teachers'
journals, and videotapes of classrooms) should be encouraged so that
discussions of education in other places and at other times can give the users
of such artifacts a better sense for the social texture and cultural
intricacies that are so central to knowledge of teaching, learning, and human
development.
The balance of this section focuses on the two broad types of international
comparative knowledge about education; each type satisfies different needs for
information. First is the knowledge needed to address issues of policy and
practice currently on the public agenda. Approaches in this category focus on
problems for which researchers have arrived at a working definition and for
which they seek alternative solutions to compare and evaluate. Second is
knowledge without reference to a particular current problem or issue--such
knowledge is gathered to enhance general understanding in the long term.
An important purpose of cross-national research is to respond to
particular policy needs. For example, an issue of particular importance to
U.S. policy makers at present is whether all our students have been given
adequate opportunities, appropriate standards, and justifiable expectations for
learning, relative to what is done in other countries. In this respect, it is
important to examine out-of-school opportunities as well as those provided by
schools. Questions range from "What can be expected of our best students?" to
"What can be done about students who are poorly prepared and poorly motivated?"
In short, what can be done to make sure that all students truly have adequate
opportunities to learn?
Such policy issues can be addressed in three ways:
- Tracking progress within and among nations;
- Comparing the quality of education offered while explicating and
attempting to understand its context; and
- Policy-driven comparative studies of education institutions and
practices.
Tracking Progress
The lack of an adequate system of education indicators to inform education
policy making has become increasingly apparent. Data are not collected
regularly, systematically, or with enough coordination either to satisfy
natural curiosity about education systems around the world or to answer the
questions of researchers and policy makers about changes over time in education
in a variety of countries. Trend data are needed on many aspects of education.
We illustrate with three types of trend data that are needed:
What children in other countries are learning. International
comparisons provide comparisons beyond the limits of U.S. national experience.
For example, recent studies have shown that students in other countries study
some mathematics topics at an earlier age than students in the United States
and that consequently it may be feasible to modify U.S. curricula accordingly.
In the absence of absolute standards for education, comparisons help policy
makers set realistic standards. Periodic collection of data from a diverse
group of countries can serve both descriptive and monitoring purposes (Bradburn
and Gilford, 1990).
Factors that may be related to school achievement. Information
from different countries on policy-manipulable variables that are associated
with student performance can identify potential policy improvements. Such
variables include the time that children in other countries spend studying
important subject matters such as mathematics, science, mother tongue, history,
and foreign languages; the knowledge and expertise teachers bring to their
work; and classroom size.
School finance data related to the delivery of education.
Education expenditures such as teacher salaries, expenditures for in-service
and other continuing education for teachers, expenditures on textbooks and
other instructional materials, expenditures for special services, and class
size (a determinant of the total expenditure for teacher salaries) can reflect
various education policies. Education policy makers find it useful to compare
trends in these indicators for the United States (a reflection of U.S.
education policy) with trends in these expenditures for various countries that
have education policies different from those in the United States, although the
data must be used with caution because of the difficulties in obtaining
comparable financial data.
Not only should national averages and other measures of central tendency
be investigated, but also indices and graphic representations of variation
within and between countries and trends over time as they become
available.
International agencies have begun to provide information to answer
questions about some aspects of education. For the past three decades a
nongovernmental consortium known as the International Association for the
Evaluation of Educational Achievement (IEA) has endeavored to fill a major gap
by providing cross-national data on educational achievement, augmented by
extensive information concerning the curriculum, classroom process, and other
contextual information necessary to its interpretation. IEA studies have
covered a broad range of topics: mathematics, science, reading literacy,
written composition, literature, English as a foreign language, French as a
foreign language, civic education, classroom environment, computers in
education, and early childhood care and education. IEA has not identified or
attempted to deal with all the important issues that should be addressed by
indicators, nor has it been sufficiently well financed or well organized to set
up regularly scheduled international assessments in all important school
subject matters. Recently, however, IEA has released plans for a schedule of
4-year cycles of studies that provide for reports: a cycle for mathematics and
science starting in 1995, a cycle of special studies such as preprimary
education or computers in education starting in 1996, a language cycle starting
in 1996 and alternating between second or foreign language and reading
literacy, and a cycle starting in 1997 alternating between civics and the
arts.
The Organization for Economic Cooperation and Development (OECD) has
organized an extensive effort, growing out of preparatory conferences held in
1987 and 1988, to plan for the development of comparable education indicators
across its member countries. Six networks were formed to deal with student
achievement, education and labor market participation, features of school
systems, attitudes and expectations, student flows, and costs and resources.
Participation in these networks ranges between 6 and 20 countries. In 1992,
OECD published a first set of international education indicators (OECD, 1992)
developed by the Centre for Educational Research and Innovation (CERI). It
provides comparative information for the OECD countries for three clusters of
indicators: the demographic, economic, and social context of education
systems; costs, resources, and school processes; and outcomes of education.
Future publications of OECD indicator data, which are expected to be published
at about 16-month intervals, will establish important trend information on OECD
education systems for use by policy makers.
The United Nations Educational, Scientific and Cultural Organization
(UNESCO) also collects country data on key aspects of education in over 160
countries. During the 1950s and 1960s, UNESCO issued a periodic publication,
A World Survey of Education, which provided statistics on the state of
education throughout the world. In 1992, UNESCO published World Education
Report 1991 (UNESCO, 1992), the first in a new biennial series that will
present major trends and current policy issues in education. Focusing on basic
education, the 1991 report describes world education growth since 1970 and
includes information on demographic trends, adult literacy, participation in
formal education, teachers, and finance. The report focuses on basic education
and on continuing challenges in that area. Looking toward the future, the
report identifies two major issues: teachers and teaching (e.g., the need for
additional teachers, recruitment, and training) and the assessment of students'
learning achievement (e.g., school-based assessment, changes in assessment
practices). The report includes education indicators for the years 1980 and
1988, thereby providing measures of change in education characteristics, for
over 160 countries on a number of topics: population and gross national
product; literacy, communications, and media; school entry and participation;
enrollment ratios and internal efficiency in first-level education; enrollment
ratios in second level education; teaching staff in preprimary, first- and
second-level education; enrollment ratios and teachers and students by broad
field of study in 1988 in third-level education; public expenditure on
education and private enrollment; and current public expenditure on education.
These OECD and UNESCO reports are welcome additions to education
information. In planning for future indicator sets, a 1991 report by the
Special Study Panel on Education Indicators for the U.S. National Center for
Education Statistics (NCES, 1992) is particularly relevant. The five
principles advocated in this report and listed below were developed for
national indicator sets, but they could be applied to cross-national sets as
well:
- Indicators should address enduring issues. Although a model for
indicator development oriented to education goals (i.e., define goals, specify
objectives related to the goals, and develop indicators to measure progress
toward the objectives and goals) could be used, it would lose its value over
time as goals change. Researchers should assess what they think is important,
not settle only for what they can measure.
- The public's understanding of education can be improved by high-quality,
reliable indicators.
- An effective indicator system must monitor education outcomes and
processes wherever they occur (not just in schools).
- An indicator system built solely around achievement tests will mislead
the people in the United States.
- An indicator system must respect the complexity of the education process
and the internal operations of schools and colleges (National Center for
Education Statistics, 1991:19-21).
Moreover, for use in informing policy and practice, sets of indicators are
more useful than individual indicators, and these sets must be developed for a
specific context, form, and level of education.
Comparing Quality While Understanding Context
The interested layperson naturally wants to know how the quality of
education in the United States compares with that of other countries. Often
this concern is addressed in ways that pay scant attention to the complexities
of education and the particular contexts in which children are educated.
Cross-national studies in education have attempted to deal with this dilemma by
developing methods and measures that avoid comparing the incomparable. Thus,
for example, the concepts and measures developed for IEA studies include: the
intended curriculum (what the curriculum specifies), the implemented curriculum
(what the teacher actually taught), and the attained curriculum (the student
performance on a test of content). This refinement represents one way to
achieve more comparability in the explanation of achievement outcomes (although
much more improvement is called for in the implementation of these curriculum
measures). In general, a great deal of additional work on explanatory
variables is needed to increase the comparability of education processes and
outcomes.
Still another approach to dealing with the difficulties of comparability
is illustrated by earlier IEA research that focused on how to adjust results to
take account of more and less selective secondary school systems to avoid
comparing the elites of one system with a larger mass of students in another.
The goal of these studies was to help policy makers consider the advantages and
disadvantages of a more or less comprehensive secondary school system. The
findings were used to support policy changes making a number of secondary
school systems more comprehensive (e.g., Marklund, 1989; Bathory, 1989).
Curriculum and selectivity differences are not the only issues in
understanding the context for cross-national comparisons. Similar issues
include differences in language spoken, religion, laws, implements (e.g.,
tools, utensils, instruments) used, and values held, as well as attitudes
toward testing that might influence motivation. Differences in cultural
context can affect what is taught and when it is taught. Research is needed on
the effects of these cultural differences on student achievement and the
development of a theory explaining the contextual difference among nations.
Policy-Driven Comparative Studies of Educational Institutions and
Practices
U.S. national education goals for the 1990s have provided strong
motivation for continued interest and discussion of cross-national comparisons
in mathematics and science (U.S. Department of Education, 1991). Additional
motivation has been provided by the Goals 2000: Educate America Act
(introduced in the U.S. House of Representatives and the U.S. Senate), which
formally establishes the education goals and calls for a National Education
Goals Panel and a National Education Standards and Improvement Council, which
would develop model standards for student performance, curriculum, and school
quality (H.R. 1804 and S.R. 846, 1993). Several implicit causal assumptions
underlie public and political interest in cross-national studies of mathematics
and science: (1) increased and improved curricular emphasis on mathematics or
science in precollegiate education for all students will result in greater
achievement for a greater number of students; (2) this will result in a greater
number of better prepared students entering the natural sciences and
engineering; and (3) this will in turn lead to a greater number of more
productive scientists and engineers in the labor force. A final assumption is
that more and better scientists and engineers will increase economic growth or
productivity. Each of these premises has its critics, and each could be more
systematically evaluated. At a minimum, conferences should be organized to
ensure that all reasonably well-informed parties with opinions on such issues
are heard, reports should be issued, and, if possible, strategies should be
developed to keep the contending parties from speaking past or ignoring one
another's point of view.
However, a focus on mathematics and science is not sufficient; other
subjects are also important. It is critical to ensure that the complexities of
education, including its variation across contexts and multiple content areas,
are taken into account. Virtually every policy issue can be illuminated
through study of how policy and practice vary among countries. For example,
recent cross-national comparisons highlight gender differences and similarities
at both the individual and societal levels of analysis. These studies can
facilitate our understanding of the implications of the gender gap in academic
achievement (e.g., they allow us empirically to evaluate arguments that suggest
that more centralized education systems are more equitable and/or more
efficient). These studies also help us understand the effects on economic
growth of the expanded access of women to varying levels of education. Data
from the studies can be used to evaluate arguments that suggest that the
undereducation of women negatively affects economic growth (King, 1990).
The range of issues ripe for comparative inquiry includes questions of
finance, decentralization versus centralization, the professionalization of
teachers, the need for formal programs of teacher education, the effects of
education on productivity, the effectiveness of measures whose goal is to
enhance education opportunities for minority populations, the value of
bilingual education, differing mixes of public and private schools (including
religious schools), and even the feasibility of emphasizing moral education in
public schools. Although much has been said about the world's constituting a
natural laboratory, in fact the availability to U.S. audiences of current,
relevant, and in-depth information on policy issues in other countries remains
far too limited. Although it is true that studying education in other
countries will not tell us what we ought to do, such study will help us
to become clearer about what we value and the policy options for putting these
values into practice.[3]
What does it mean to pinpoint areas in which cross-national
differences are important to education policy and practice and to attempt to
provide reasonably persuasive explanations for such differences? Heretofore,
this goal has meant an attempt in cross-national surveys to estimate how much
education outcomes are affected by factors controlled through education policy.
This effort has been motivated by the idea that comparisons of achievement
levels are not meaningful unless one can, first, identify the educational
inputs and processes that contribute to observed outcome differences between
countries; second, make some estimate of the contribution of each educational
input to realized outcome levels, and third, consider how these effects vary by
context.
But surveys by themselves are not adequate. They have focused too much on
narrow measures of achievement, neglecting other important outcomes. They are
not very sensitive to contextual differences, and as a result it is often very
difficult to give plausible explanations for observed differences. The
complexities and difficulties of acquiring a more adequate understanding of
education in other countries and its possible implications for the United
States can be illustrated by looking at Japan. Since progress has been made in
U.S. understanding of Japanese education, let us look carefully at how this
understanding has evolved, at what has been learned, and at the gaps and
ambiguities that remain in understanding. Turning back to the increasing
exposure to Japanese education over the past 20 years, it becomes clear that
there are different phases, which have gradually led to more, though still
inadequate, comparative understanding.
During the 1980s, policy makers became aware that achievement levels on
conventional multiple-choice tests in mathematics and science were higher for
Japanese than for U.S. students at certain grade levels. For example, the IEA
Second International Mathematics Study, which gathered data in the 1981-82
school year, reported that Japanese eighth-grade students consistently scored
higher than students in other countries in arithmetic, algebra, geometry, and
measurement, whereas U.S. students scored in the middle of the group of
countries on arithmetic and algebra and below the international average on
geometry and measurement (McKnight et al., 1987:20-22) In science, in grades 5
and 9, Japanese students also scored at the top of the group, although at grade
12 their scores were lower (International Association for the Evaluation of
Educational Achievement, 1988:7, 26-27, 32-36, 48-54). This awareness was
accompanied by debates over the significance of and reasons for such results.
There was much disagreement over whether these measures were meaningful when
taken in isolation from other evidence about education outcomes. These debates
stimulated public interest, yet they were initially largely uninformed by
further empirical research on these issues.
Gradually, however, there have been more focused and data-based studies
that were intended to provide more in-depth understanding of Japanese practices
(e.g., studies by Cummings, Peak, Rohlen, Shimahara, Stevenson, Tobin, and
others). As a result, relatively uninformed endorsement or rejection of
Japanese practices has partially yielded to attempts to explain these practices
more thoroughly within the Japanese cultural context. More attention has been
directed to the many interactive factors that might influence a particular
practice or outcome, such as the amount of time spent on a task or on homework,
teacher quality, and parental beliefs about the relative importance of factors
influencing success in school.
As such studies become more generally known, noncomparativists can be
expected to cite these findings more frequently and to incorporate them in a
more general understanding of mathematics education. Perhaps in the near
future there will even be a sizable market for publications about how
mathematics and science are taught in Japan as well as other topics.
To the extent that such writings bear fruit in explicating practices that
are effective in Japan, there will inevitably be more debate about whether
similar effects might be anticipated in the United States. Although the
borrowing of educational policies and practices is much criticized for
infringing on cultural values and indigenous practices and for its undeniable
risks of unintended consequences, it can also positively influence classroom
practices. It has been and will no doubt remain widely attempted in
education.
The same principles, in many respects, apply to other countries and
subject areas.
However, it is very difficult to carry out multivariate analysis of natural
variation that is reliable and valid. For example, the high scores of Japanese
eighth-grade students relative to those of the U.S. students in the IEA Second
International Mathematics Study raised questions about the quality of
precollege mathematics teachers in the United States. But, although poor
student outcomes may be correlated with teacher quality in terms of training
characteristics or classroom method, the cause may be due to factors unrelated
to teacher quality, such as the curriculum, time spent on mathematics topics in
school, and the influence of the home environment. The effects of all these
factors on student achievement need to be understood before we understand the
relative performance of Japanese and U.S. students. Although international
studies have provided detailed information on student achievement in various
subjects in many countries, the causal analyses of the comparative results have
been inadequate. Advances in survey analytic techniques, such as hierarchical
linear modeling, offer promise of addressing those weaknesses.
A second important purpose of cross-national research is the long-term
development of knowledge. Scholars often gather knowledge for its own sake,
sometimes to expand the knowledge base in their field and at other times to
develop their capacity for grasping trends and advancing ideas. In the long
run, this exploration enriches and expands their understanding of the world and
its complexities. They make an intellectual investment, to be drawn on later,
as they interpret their research findings and the results of other related
investigations.
To underwrite such investments, it is important to collect cross-national
data at societal levels over reasonably long periods of time. Such data
facilitate the identification of worldwide, regional, and national trends and
permit analysis of the sources and effects of cross-national variation in
education organization, policy, and practice. Systematic cross-national
inquiry, motivated by theory and carried out over time, is needed to discourage
parochial folklore and inappropriate inferences rooted in a spatially and
temporally limited vision of what education is as well as what it could or
should be.
More specifically, long-term development of knowledge can affect
comparative inquiry in a number of important ways:
- Becoming more cumulative and theoretical by providing synthetic works
and bringing about consensus on a cross-national system of indicators, which in
turn will illuminate changes across time;
- Highlighting areas in which cross-national differences are important to
education policy and practice as well as provide reasonably persuasive
explanations for such differences;
- Stimulating the development of a comparative dimension within all major
educational specialties (e.g., empirically based comparative education has had
far too little to say about the realities of classroom practice);
- Communicating the state of comparative education knowledge more
effectively than at present to nonspecialists so that the thinking and
discourse of educators in other fields will, as a matter of course, be informed
by international comparative studies;
- Drawing attention to potential weaknesses or strengths of education
systems;
- Identifying models or practices of education in other countries that
have rarely, if ever, been used in U.S. education; and
- Expanding the categories and solutions that researchers consider when
they think about education problems and thereby contribute to the contextual
understanding of education.
In designing and carrying out such long-term efforts, a better
understanding will be gained not only of contextual variation in education
policies, institutions, and practices and their effects on learners, but also
of more fundamental aspects of human and social development, including:
- The developmental processes of children, youth, and adults and the
implications of these processes for learning;
- Education issues related to gender, race, and social class;
- The importance of cultural values in shaping the identities for which
children are educated; and
- The role of education in relation to subsequent social and economic
stratification in adult life.
Knowledge of a nation's education system will reveal what a nation values
in its people. Is education for some or for all? What cultural norms does the
education system embody? Are assumptions made that some people are unworthy of
education investment? What are the personal and national payoffs of formal
schooling? Who are the intended beneficiaries of the system, in addition to
students? Such issues are of paramount importance in considering how nations
determine who is to be educated, what is to be learned, and how all this can be
accomplished. The central role of teachers in the success of an education
program should be studied, as should issues concerning their recruitment,
retention, training, rewarding, development, and deployment. Equally important
are issues related to formal and informal sources of control and
autonomy.
1 Some members of the comparative
education community question the usefulness of indicators. They question
whether the benefits such indicators produce are worth the cost, and they are
concerned that the financial and political cost of producing truly comparable
indicators exceeds the political willingness of many countries to subject their
educational system to the potentially harsh comparison that would ensue.
2 Some critics have noted, however, that controlled
experiments have the potential to improve the methodological sensitivity of
education studies. Recommendations for greater use of experimental methods are
discussed in a recent National Research Council report on bilingual education
studies (Meyer and Fienberg, 1992). An exclusive reliance on correlational
research designs and data analysis techniques is a methodologically limited
outlook. Thus, it is important to continue to weigh the pros and cons of the
arguments against controlled education experiments.
3 It should be noted that countries with decentralized
education systems such as the United States can be considered to be a
self-contained natural laboratory. Cross-state studies also have the potential
to contribute to knowledge about the impact of education practices and policies
on education outcomes.
Previous Section |
HTML Home Page |
Next Section
NAS Home Page |
NAP Home Page |
Reading Room |
Report Home Page
|