At the Intersection of Official Statistics and Public Policy:

Confronting the Challenges

Katherine K. Wallman, Chief Statistician of the United States


On January 19, 1999, the President of the United States opened his State of the Union Address as follows:

 

- America has ... nearly 18 million new jobs,

- wages rising at more than twice the rate of inflation,

- the highest home ownership rate in history,

- the smallest welfare rolls in 30 years,

- and the lowest peacetime unemployment since 1957.
 

    This exceptionally high profile attention to the products of our Nation's statistical system bears witness to the extent to which our policy makers and our public depend on data to measure and understand conditions and trends in our economy and society.
 

Our democracy and economy demand that public and private leaders have unbiased, relevant, accurate, and timely information on which to base their decisions.Data on real Gross Domestic Product, the Consumer Price Index, and the trade deficit, for example, are critical inputs to monetary, fiscal, trade, and regulatory policy.They also have a major impact on government spending, budget projections, and the allocation of public funds.Economic data, such as measures of price change, have as well a significant influence on interest rates and cost-of-living adjustments that affect every American who runs a business, saves for retirement, or mortgages a home.Taken together, official statistics on demographic, economic, and social conditions and trends are essential to inform decisions that are made by virtually every organization and household.


Among the most influential statistics produced by our Federal government is the Consumer Price Index, or CPI, which measures the average change in prices over time for a fixed market basket of goods and services.As an economic indicator, the CPI is used by the Executive Branch, the Congress, and the Federal Reserve Board to determine and evaluate government economic policy.As a consequence of statutory requirements, the CPI directly affects the incomes of millions of our citizens through Federal programs that deliver benefits to individuals.These include more than 48 million Social Security beneficiaries, close to 20 million food stamp recipients, and almost 4 million military and Federal Civil Service retirees and survivors.Changes in the CPI also affect more than 26 million children through adjustments to the School Lunch program, and more than 2 million workers whose wages are tied to the CPI under collective bargaining agreements.According to a recent estimate, a one percent reduction in the CPI's annual growth rate would reduce Federal outlays by approximately 80 billion dollars over a five year period.

    Similarly, information from our Decennial Census of Population and Housing affects Americans every day.Data on the number and characteristics of the population are used by State and local governments to plan schools and highways, by the Federal government to distribute close to $200 billion annually for health care and other programs, and by businesses in making their economic plans.Census data are used to reapportion Congressional seats among the States and to draw legislative districts within the States.

    More specific examples of the uses of official statistics in informing policy discussions and fostering policy changes affecting the Nation's population abound.

    - Statistics from the National Health Interview Survey in the late 1950's, and a more sharply focused Social Security Administration Survey of the Aged in 1963, showed that only 50 percent of the population over 65 years of age had any form of comprehensive health insurance coverage, and that much of that insurance coverage was inadequate, as measured by the proportion of the hospital bill that was paid.Testimony and legislative reports leading to amendments to the Social Security Act in 1965 reflected these facts, with the result that nearly everyone in this age group now has Medicare or Medicaid coverage for at least a major part of the costs of hospital care.Similarly, more recent information is informing proposals such as those that would allow Americans in the 62 to 65 year age cohort to purchase Medicare coverage and would offer a Medicare "buy-in" to workers between the ages of 55 and 62 who have lost company-sponsored health care coverage.As we look to the future, the Medicare population is expected to double over the next 30 to 35 years.But will the 85 year old of tomorrow look like the 65 year old of today in terms of health care needs?And what will be the financial status of this population?To keep the program relevant, relevant data will be crucial. 


    - In the early 1980's, statistics from the National Health and Nutrition Examination Survey (NHANES) provided, for the first time, national data on levels of lead in the blood.These data indicated that 78 percent of Americans had blood lead levels that were considered too high.Over the course of the survey, a clear relationship was shown between the lower amounts of lead in gasoline sold and the lower blood lead levels being observed in study results.The results from an analysis of these data by an expert committee of statisticians were presented as primary evidence at Congressional and Environmental Protection Agency (EPA) hearings, which in turn led to a decision by the Office of Management and Budget to reverse its proposal to relax requirements for reducing lead in gasoline and to instead support the EPA proposal to set a firm timetable for the complete elimination of sales of leaded gasoline.Additional programs were implemented to remove all lead from paint and other commercial products.By 1991, the next NHANES survey showed that the percentage of Americans with high blood lead levels had dropped sharply to just over 4 percent.The same survey also identified groups still at risk, and formed the basis for restructuring prevention efforts in the 1990s.

    - During the 1980s and early 1990s, statistics garnered from the Nation's vital records system together with data from hospitals highlighted high and rising rates of cesarean deliveries in the United States.When compared with data from other industrialized countries, the analyses indicated that U. S. cesarean delivery rates generally were higher, and that the reliance on this procedure had not translated into better health for the mother or the infant.The data provided by the National Center for Health Statistics prompted a review of obstetrical practice and revision of standards for performance of cesarean deliveries.Using the new guidelines and new procedures in hospitals, cesarean rates leveled off in the late 1980s and have declined during the 1990s. 

    - In 1983, the publication "A Nation at Risk" called into question the quality and effectiveness of the entire educational enterprise in the United States.A major reason that this publication gained attention from both policy makers and the public stemmed from the data presented -- the fact that our students scored worse on international assessments of mathematics and science than those in most other countries participating in the study.The statistical information initiated what is now more than a decade and a half of reform movements to improve educational performance.


    In recent years, international assessments have indicated that while our 4th graders outperform their peers in other countries in mathematics and science, our 8th graders are near the average, and our 12th graders rank near the bottom.These findings have triggered changes in our laws to support State and local standards-based reform efforts and speed the use of technology in education.Early results from these policy shifts in the mid-1990s indicate that all but one of the States have now developed content standards at least in the areas of reading and mathematics.
 
    Similarly, statistical analyses have been at the crux of education policy changes affecting our youngest Americans.Data have indicated that preschool programs for disadvantaged children improve the participants' subsequent performance in regular school, reduce their rates of criminal activity, reduce their chances of premature parenting, and improve their likelihood of attending college.Proposals to achieve full funding that would permit every eligible child to participate in Headstart are firmly grounded in statistics concerning the effectiveness of the program. 
 
As these examples -- and many others we could cite -- indicate, the terms of reference for interaction between statisticians, policy makers, and the public have changed markedly since the days when members of the statistics profession were largely concerned with "coding, tabulation, and recognition and correction of obvious errors in data collection."Whether we as statisticians like it or not, we are faced with what economist James Bonnen has described as "a growing, intimate embrace between statistics and public policy decision making" that "has greatly increased the significance and decision value of the statistics we produce."As sociologists William Alonso and Paul Starr observed several years ago, 
 
"So well institutionalized are statistics such as the unemployment rate, the money supply, and various price indices that the date and even the hour of their release are regular events in the political and economic calendar, setting off debates on the performance of administration policy and influencing both stock markets and elections."

    The public policy uses of statistical data have increased exponentially over the last half century.Today, we use statistical data to redistribute income and to fine tune the economy.We mandate the use of official statistics in many of our country's laws, and we use government statistics to evaluate policy alternatives.Competing interpretations of statistics on income, inflation, and unemployment figure prominently in our election campaigns.We use statistics to evaluate risk in legislative policy on toxicity and the environment, and we use statistics in our governmental efforts to expand the coverage and contain the costs of health care.


    With each session of Congress, the list of uses gets longer; our legislators have for some time been in the process of essentially shifting the burden of detailed decision making to formulas based on statistical series.In many cases, the data are not designed for such tasks -- and are woefully inadequate for the legislated purposes -- but still are called upon to bear the burden.It is little wonder, then, that the decibel level continues to rise when questions are raised about the accuracy of the Consumer Price Index or the Decennial Census population counts.
 
    At this intersection of statistics and politics, we find some ironies.Some concern desires (or a seeming lack thereof) for accuracy of data.Others have to do with interest (or an apparent lack thereof) in adequately funding statistical programs.One irony that has always struck me is the debate that took place a decade ago concerning the content and sample size for the 1990 decennial census.While that census ultimately captured virtually all of the information that data users have come to rely on, the concerns that were laid on the table in 1987 (about the burden and cost of data collection) continue to engage the attention of decision makers in the Administration and the Congress.Indeed, as we have prepared for Census 2000, a cardinal rule for justifying any question we will include has been that the data must be required by some Federal statute.
 
    Such disputes about the content of the Federal government's statistical endeavors are not as new to the scene as some embroiled in the debates of recent decades may have thought.At the time of the first decennial census, according to an account provided by historian Margo Anderson, Congress discussed whether the census should collect any information beyond a mere head count necessary for apportionment.James Madison suggested a rather elaborate schedule to categorize the population.The Senate deleted the proposed census of occupations, because, Madison wrote to Thomas Jefferson, they considered it "a waste of trouble and supplying materials for idle people to make a book.
 
    The debate a decade ago over the need for the decennial census long form data brings clearly into focus the disconnect between the substantial uses of official statistics that are called for in legislation and the support that is provided by the Congress for producing the data on which these legislatively based programs rely.

    Somewhat hazier at present, but appearing on our horizon, is the statistical gap we must bridge in an era of devolution.National mandates are bringing new responsibilities to State and local governments to plan and evaluate programs.One is generally hard pressed, however, to find within these recent laws any provision of resources to support the statistical infrastructure required to efficiently and effectively administer programs - let alone any allocation of resources to ultimately evaluate these initiatives with nationally comparable statistical data.As programs devolve to the States, we have from the statistical point of view a "double whammy" - not only do we lose the data sources (administrative records) that previously were available, we also stand to lose comparability of data across States and local areas.
 
    Yet another key issue that demands our attention concerns what I believe is the thorniest - and potentially most consequential - issue facing our national statistical system.This is the growing tension between the need to protect the confidentiality of individual responses, and the desire to provide the broadest possible access to information.Government statistical offices derive their mandate for data collection and dissemination from a citizenry that demands at once both quality information to drive public policy and protection of the individual respondent from privacy invasion and administrative harm.In response to these cross-pressures, statistical agencies are actively pursuing approaches such as encryption and licensing agreements that extend responsibility for confidentiality protection to the data user.Striking the proper balance between permitting access to accomplish compelling and legitimate research, and incurring the risk, however remote, of inadvertent revelation of individual information is, I believe, a fundamental concern and challenge for official statisticians. 
 
    Many researchers find access to government data increasingly desirable.The newer data bases are more comprehensive, of better quality, and - with improved data base management techniques  -  better structured.At the same time, the individuals and institutions that provide the data residing on government data bases  -  as well as the agencies that sponsor the collection of such information  - are well aware that the same technologies that extend analytical capabilities also furnish the tools that threaten the confidentiality of data records.This awareness has the potential to erode -- or at least to undermine-- our respondents' confidence in the privacy of the information they provide. 

    Electronic dissemination is truly a boon to national statistical offices anxious to make their data more accessible and useful -- and to user communities equipped to handle the wealth of available information.But this technology remains to a degree a bane, for while we have taken monumental strides in making our nation's statistics electronically available, attention to documentation in electronic media has lagged.And, I continue to argue, as I have for almost a decade, that the gap between our citizens' computer literacy and their "statistical literacy" remains significant. 
 
    As U. S. government statisticians seek to confront the challenges at the intersection of official statistics and public policy, we do so in an exceptionally charged partisan political environment.A confidant of the Speaker of the U. S. House of Representatives recently stated, for example,
 
    The Speaker and virtually every GOP leader believe no single vote will have greater ramifications on the future of the Republican majority than the vote to block President Clinton from changing the way we conduct the Census.
 
    I believe we have two critical assignments:

    - We must ensure that our science is relevant to our society, and
    - We must ensure that our society understands our science.

    Personally, as I consider our challenge to ensure our relevance to society, I find myself an ardent subscriber to the thesis that our Census Bureau Director Kenneth Prewitt espoused more than a decade ago (at the conclusion of his article on APublic Statistics and Democratic Politics@ in Alonso and Starr's The Politics of Numbers):

    The task involves analysis of the political role of numbers as well as a professional commitment to make the numbers perform according to the burden that a democracy places upon them.

    If there is one pre-eminent credo for my role, and for other leaders in our national statistical system, it is that we must at all times be politically sensitive, i.e., aware of the political milieu in which we carry out our work, yet we must always be vigilant to ensure that our official statistics are not in fact - and are not perceived to be - overwhelmed, or even influenced by, political concerns.As Senator Grassley observed at a recent forum on statistics related to the aging population, 

    We need sound data for sound decisions.Sound data won't ensure sound decisions -  but let us be sure we can blame the political process for unsound decisions. 


    I find myself struck by a paradox.On the one hand, our data are so relevant to policy that policy makers who use our statistics to allocate power and money - sometimes in applications the data were not designed to support - increasingly question our science and our product.Yet, these same policy makers criticize us for not having available from our statistical offices exactly the data they need, right then, for consideration in policy discussions and decisions.As we look to the future, we hardly need a crystal ball to give us the message - we will continue to be challenged to ensure that our science is relevant to our society.
 
    Even more challenging as we look ahead - and perhaps here a crystal ball would indeed help - is the task of ensuring that our society understands our science.Almost 50 years ago, the American Statistical Association's president Aryness Joy Wickens challenged her colleagues as follows:
 
    We owe the public ... a clear description of the general nature and limitations of statistics, with a simple measure of their accuracy, if it can be measured, and a warning if it cannot.  We cannot, and should not try, to explain every technical detail, or to make statisticians of everyone ...
 
    We still owe the public this debt.
 
    Our citizens encounter statistics at every turn in their daily lives.Yet often they are unequipped with the statistical literacy required to evaluate the information presented to them.The "rationale for teaching statistics" set forth at the beginning of this decade in the American Statistical Association's Guidelines for the Teaching of Statistics K-12 Mathematics Curriculum, elaborated this encounter between citizen and quantitative information:
 

    Raw data, graphs, charts, percentages, probabilities, averages, forecasts, and trend lines are an inescapable part of our everyday lives.They affect decisions on health, citizenship, parenthood, employment, financial concerns, sports, and many other matters. ... Informed citizens should understand the latest news polls, the consumer price index, and unemployment rates.

 

    Back in 1992, when I was privileged to serve as the American Statistical Association's president, I defined "statistical literacy" as the ability to understand and critically evaluate statistical results that permeate our daily lives - coupled with the ability to appreciate the contributions that statistical thinking can make in public and private, professional and personal decisions.


    Addressing the ASA membership on this topic at that time, I noted, among other things, that "policy makers and the public have raised issues that focus not so much on the scientific merit of our method but rather on matters (such as respondent burden and data confidentiality) that attend the collection and use of statistical information."Regrettably, as the policy implications of statistical data have become more widely appreciated - to wit, the applications of the Consumer Price Index and the Decennial Census population counts - even our methods have become grist for public debate, often leaving citizens in a quandary concerning the scientific versus the political aspects of arguments presented to them.As my colleague Edvard Outrata, President of the Czech Statistical Office, has said, "we face a newly skeptical public."
 
    And little wonder the public is confused.  In the current debate surrounding our Census 2000, we hear Citizens Against Government Waste explaining why the Census Bureau's plan is all a plot:
 
The Clinton-Gore Administration has announced that they think the last Census failed to count 10 percent of the population. ...To correct this problem, they developed a plan known as "sampling", whereby a poll, like a public opinion poll, would be conducted.This poll would use information from a group of people to make projections about the population as a whole.  Based on the outcome of this "sampling", they would devise an estimate of how many Americans fall into various groupings, such as how many poor, how many of each minority, and so on. ... The net effect is that the Census outcome would be controlled by sampling, which can be manipulated to meet the political goals of the Administration.
 
    Or, the public may have heard the recent assessment by Republican National Committee Chairman Jim Nicholson, "The Clinton Administration is implementing a radical new way of taking the next census that effectively will add nearly four and one-half million Democrats to the Nation's population."
 

    What must the public think when they hear Representative Dan Miller, who now chairs the House Census Subcommittee, say,  "I am a former statistics professor. ... I know all the statisticians say sampling is great.Statisticians would not have a job if we did not have sampling.That is what statistics is based on.Statisticians are biased to start with. "  "Every governor, mayor, and county executive should be gravely concerned," Mr. Miller warned recently. "The Census Bureau is peddling snake oil and they're headed for your neighborhood."
 

    We simply must extend our efforts to ensure that society understands our science.As former Office of Management and Budget Director Franklin Raines wrote in the July 12, 1998, issue of Science magazine, 
 

    "Not every American will become a scientist, and most will not be interested in the arcane details that so excite the scientific community.Yet scientists should make a difference where they can, most importantly by improving the science taught at the K-12 levels."


    I take hope from signs that our professions' quantitative literacy initiatives are increasingly pervading the curriculum, and thus the statistical literacy, of the next generation.We must build on this foundation.Much remains to be done.It continues to be my goal to find ways for the U. S. Federal statistical system to contribute more directly to enhancing our Nation's statistical literacy and our citizens' ability to use official statistics wisely and well.