MCDC Public Data Archive
Help: What is Uexplore/Dexter? | Tutorials | 10 things about the archive
American Community Survey Data: ACS (general) | 2017 |2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 (& earlier)
Decennial Census Data: 2010 | 2000 | 1990 | 1980
Other Data: Population estimates | Economic indicators | Geography/GIS | Compendia | Other
American Community Survey
- acs — American Community Survey general info
- The American Community Survey has replaced long-form (sample-based) decennial census data. This generic directory is for holding materials about the ACS in general. Date for specific years ("vintages") are stored in the acs20[YY] directories (below) or in the acspums directory.
- acs2017 — American Community Survey data, 2017 vintage
- We now have the single-year 2017 data. Watch for the five-year data this December.
- acs2016 — American Community Survey data, 2016 vintage
- Once again we have the single-year 2016 and five-year 2012-2016 data. Looks almost identical to what we did with the vintage 2015 and other recent vintages.
- acs2015 — American Community Survey data, 2015 vintage
- Once again we have the single-year 2015 and five-year 2011-2015 data. Looks almost identical to what we did with the vintage 2014 and other recent vintages.
- acs2014 — American Community Survey data, 2014 vintage
- Single-year 2014 and five-year 2010-2014 data. No three-year 2011-2013 data this year (product was canceled by the Bureau). For the first time we now have non-overlapping five-year data (this and the 2005-2009 data in acs2005).
- acs2013 — American Community Survey data, 2013 vintage
- The usual single-year 2013 data, three-year 2011-2013 data and five-year 2009-2013 data.
- acs2012 — American Community Survey data, 2012 vintage
- The usual single-year 2012 data, three-year 2010-2012 data and five-year 2008-2012 data. These data were released over the period from September to December, 2013.
- acs2011 — American Community Survey data, 2011 vintage
- Once again we have the single-year 2011 data, three-year 2009-2011 data, and five-year 2007-2011 data. New this year is data at the block group level for the entire U.S. and a complete set of five-year base tables in the btabs5yr subdirectory.
- acs2010 — American Community Survey data, 2010 vintage
- This was the first year the Census Bureau took advantage of the results of the 2010 census counts in order to refine their weighting algorithm, hopefully improving the estimates. The format in which we got the data from the Census Bureau was different from previous years and has required us to do considerable recoding on our (us)mcdcprofiles data sets. Once again we have single-year 2010 data, three-year 2008-2010 data, and five-year 2006-2010 data.
- acs2009 — American Community Survey data, 2009 vintage
- This was the first ACS collection to include five-year period estimates with data for all geographic units regardless of population size.
- acs2008 — American Community Survey data, 2008 vintage
- These data were released in September and October of 2009. There are both single year summary data for 2008 and three-year period estimates data based on 2006-2008 stored here. There is no five-year data as part of this collection — that does not begin until next year, with filetype acs2009.
- acs2007 — American Community Survey data, 2007 vintage
- These data were released in September thru December of 2008. There are both single year summary data for 2007 and three-year period estimates data based on 2005-2007 stored here. The single-year data are limited to geographic entities of 65,000 or more population, while the three-year period estimates lower the threshold to around 19,000. Our collection includes data for the entire U.S. and includes complete detailed (base) tables as well as profile datasets that are similar to the data found in the Census Bureau's profile reports accessible via American FactFinder.
- acs2006 — American Community Survey data, 2006 vintage
- These data are the tabulated results of the surveys for the 2006 calendar year, released by the Census Bureau in 2007. Unlike the 2005 tabulations, these include persons in group quarters as part of the universe. These data are still limited to single-year estimates for geographic entities of 65,000 or more population.
- acs2005 — American Community Survey data, 2005 vintage
- Data tables from the ACS for the calendar year 2005 are summarized here. These are the first substantial set of data tables ever to appear based on the ACS. Summaries are for geographic areas of at least 65,000 population. The group quarters segment of the population was not covered in the 2005 survey, so all figures here summarize just the household population. There are no moving averages here, just tables based on a single year of surveys. These data were released in "waves" during the summer and fall of 2006. The general structure of these data is not consistent with the way we processed data for later years.
- acspums — Public use microsample data from the ACS (all available years)
- This is American Community Survey's 1% public use microsample (PUMS) data. These data are only of direct interest to researchers with access to and skill using a statistical software package. We have complete national collections broken down by state for 2005 through 2015 or later.
[Return to top of page]
2010 Decennial Census
See the Census Bureau's 2010 Census Data Products At a Glance for an overview of data products based on the 2010 census, with release dates and technical documentation. Unless otherwise noted, all of these data collections are nationwide.
- sf12010x — Summary file 1 standard extract
- These data sets access the detailed table data stored in the sf12010 collection and create much smaller extract data sets. Standard practice and naming conventions for all our decennial summary (tape) filetypes. The Census 2010 Profile Report web application displays the data from these data sets. See also the SF1 Extract Assistant web application to make extractions easier.
- sf12010 — Summary file 1, complete tables
- This is the primary data product based on the 2010 decennial census. There was only a short form questionnaire in 2010 with just seven questions, so the tables here deal with just basic demographics (age, sex, race, hispanic origin, household types, etc.). This data directory contains the full tables. See sf12010x, above, for the standard extracts based on these complete tables.
- pl942010 — Redistricting data from the 2010 census
- Also referred to as "PL-94" or "Public Law 94-171" files. The purpose of these data is primarily for states to redraw their congressional and state legislative district boundaries using the result of the 2010 census. We have a complete collection of 51 state summary files and some national files based on selecting, concatenating, and sometimes aggregating the data from the state datasets.
- pums2010 — Public use microsample data
- Released late (in December of 2014) these data allow researchers to access individual census questionnaire data (rather than the summary data we see on all of the other data files based on the census). A 10% sample. With limited data, however, because of the very limited subject matter covered on the 2010 short form.
- dpro2010 — Demographic profile data from the 2010 census
- These data files were released in May of 2011 to provide users with early access to some of the more frequently-used data items extracted from the much more detailed Summary File 1 collection which was going to be released later in the summer. Geographic detail on these data sets is limited to governmental units.
[Return to top of page]
2000 Decennial Census
- sf32000x — Summary file 3, standard extract
- This has been our most popular (frequently accessed) filetype since it was created in 2002. In these data files we have compressed the 16,000+ cells of tabular data on a full summary file 3 (filetype sf32000) summary record down to just over 200 key items (and another 200 or so corresponding percentages). These files are the basis for our hugely popular Census 2000 Profiles. Be sure to look at the Variables.pdf file, which provides an excellent overview of the data items contained in these datasets.
- sf32000 — Summary file 3, complete data tables (long form/sample data)
- Summary file 3 (2000 decennial census) is probably the most widely used of all the summary data files ever produced by the Census Bureau. "SF3" contains detailed tables based on responses to the long form questionnaire. Here is where you can find data on topics such as income, poverty, housing value, occupation, education, etc. These data are available for a wide array of geographic units. The MCDC has data down to the tract and block group levels for at least Missouri, Illinois, Kansas, Minnesota, Michigan, and Delaware (state files). We also have the final national file with data for every state, county, ZCTA, place, UA, MSA, etc. in the U.S.
- sf12000x — Summary file 1, standard extract
- Basic demographic counts based on the short-form census questionnaire. One of the few places where you can find census block level data. For an overview, see the Readme.html file.
- sf12000 — Summary file 1, complete data tables (short form/complete count data)
- The first detailed set of tables from the 2000 U.S. Census, derived from responses to the short form questionnaire. Does not have data based on long-form questions regarding items such as income, housing value, occupation, etc. Those items are on sf32000, which was released in the summer of 2002. For an overview of SF1, review Readme.html. Missouri, Illinois, Kansas only.
- sf1prof — Census Bureau demographic profile (short-form-only data)
- The data used in these files are based on data collected on the short form. So it is basic demographics: age, sex, race and ethnicity, as well as household composition and some basic housing data (occupancy rates and tenure, for example.) This is the 2000 census equivalent of the dpro2010 collection based on the 2010 census.
- pums2000 — Public use microsample files
- These files are the favorites of researchers who have access to a good statistical software package and know how to use it. With PUMS, you can use a tool such as SAS's proc tabulate to build tables almost any way you like it from these datasets, which contain actual microdata (census returns from individual persons and households). Geographic detail is limited (to special geographic areas called PUMAs. The Census Bureau releases these files in two product types, a 1% sample file and a 5% sample. The MCDC collection includes a complete collection of 5% sample files for all states and a smaller collection of the 1% sample files (Missouri, Illinois, and Kansas).
- pl942000 — Public law 94-171 ("redistricting") file
- This was the first data published based on the 2000 decennial census. Contains basic pop counts by race/Hispanic and voting age for a wide variety of geographic levels (including VTDs — voting tabulation districts), down to census block. MCDC has a complete national collection of these data. For an overview see the Readme file.
- cqr2000 — Count question resolution data
- "CQR" (count question resolution) was the Census Bureau's program to identify errors in the total population and housing unit counts in the 2000 census. This directory has datasets that capture those adjustments at both the census block level (one dataset and CSV file per state) and at the governmental unit (state, county, place, MCD) levels.
- daytmpop — Daytime population
- This special tabulation can be used to estimate the number of people who may be in an area on a typical work day, i.e. it tabulates based on place of work instead of place of residence as with almost all other census tabulations.
- mig2000 — Migration patterns, 1995-2000
- Datasets in this directory are related to migration in the U.S. between 1995 and 2000 as derived from the 2000 decennial census long form (sample) data. The Census Bureau released a number of different summary files in this category, but our collection contains only the basic counts of movers (data disclosure restrictions rendered many of the other tabulations almost useless). This is a national collection.
- eeo2000 — Equal employment opportunity special tabulation
- This loosely affiliated collection of tables (datasets) is named for the Equal Employment Opportunity Commission, one of four federal agencies that commissioned this special tabulation product. Here you will find detailed counts of persons by occupation categories by age, sex, race/ethnicity, education level, income level, and even sometimes by industry. Hard to describe, since it is a very complicated collection of 24 tables, each with its own geographic, demographic, and occupational dimensions. We tried very hard to make this collection simple to access, but the task proved to be impossible.
- workflow — County-to-county work flows
- Tells you where people who lived in one county worked in that or other counties. This special tabulation file is based on sample (long form) data from the 2000 census. It gives you the count of persons who live in County R and work in County W for all counties in the country. The MCDC has created some custom reports based on these files.
- sf42000 — Summary file 4
- See the Census Bureau's abstract for a description of this data collection. The key feature of SF4 is the ability to get detailed tables for a long list of race/ancestry groups. However, new threshold limitations (explained in the abstract) make using it for analytical purposes very problematic. The large number of tables combined with the large number of characteristic iterations makes these files huge. Because of the enormous size and complexity of this collection, we strongly recommend that users new to the collection begin by accessing the SF4 data using the American FactFinder application, the data sets option. It tends to be a lot easier than accessing via Dexter.
- hudcdbg — Community development block grant data
- These data are derived from 2000 Census data by the U.S. Dept of Housing and Urban Development (HUD). The datasets contain information regarding low and moderate income housing down to very small (block group) geographic areas. These numbers are used by local developers and planners wanting to qualify for special grant funding in neighborhoods. See the HUD web site where we went to download these data, and where you can download it in the form of Excel spreadsheets for any state in the country. Our collection is limited to Missouri, Illinois, and Kansas.
[Return to top of page]
1990 Decennial Census
- stf901 — Summary tape file 1
- This STF is the 100-percent file on population and housing. No social or economic indicators are reported on this file.
- stf901x2 — Summary tape file 1, standard extract (revised version)
- Specifically designed to be used as the 1990 equivalent of the data in filetype sf12000x. Many sets in this directory have been re-tabulated to 2000 census geography to allow for direct trend reports using comparable geography.
- stf902 — Summary tape file 2
- 1990 Summary tape file 2. We don't have a whole of data here. STF2 is like STF1, only more detailed.
- stf903 — Summary tape file 3
- Each dataset here contains over 3,300 cells of pre-tabulated data based on the 1990 census long-form questionnaires. Each observation contains data for a single geographic area. We have complete A files for Missouri, Illinois, and Kansas, plus a few other states; we also have the complete C file (national) with summaries for the country, states, counties, and larger cities. And, we have the B file — ZIP level summaries. This filetype has been made accessible at the table level from Dexter. As with any of the census summary file filetypes, you really need to have access to the technical documentation — available in the stf903/Docs subdirectory of this archive — before attempting to use these data. The stf903x and stf903x2 filetypes are derived from these files and are appropriate for quick overviews or access to frequently-used variables. This data collection was substantially restructured in early 2005.
- stf903x — Summary tape file 3, standard extract (original version)
- These datasets are faster and friendlier to access than the much larger stf903 sets from which they are derived. See also the stf903x2 alternative filetype, below.
- stf903x2 — 1990 Summary tape file 3, standard extract (revised version)
- Same basic idea as stf903x (above), but this filetype was created to be comparable to the SF3-based extract data for 2000 (filetype sf32000x). Data from these sets are used in the MCDC's very popular 1990-2000 demographic trend reports.
- stf904 — Summary tape file 4
- These are very large 1990 census summary files, featuring large multidimensional tables and separate files that summarize subpopulations based on race and/or hispanic origin. We have File A and File B for Missouri only and all of File C (the national file). We have B Table files for total pop, the five basic race groups (White, Black, American Indian+, Asian & PI, Other), the five basic groups/nonhispanic and Hispanic (12 chariters — characteristic iterations). We have added labels to the variables in these datasets (in December, 2005) and made them accessible at the table level via Dexter. The complete technical documentation is accessible from the Docs subdirectory.
- stf420 — Place of work destinations file
- Got its name because of Table 20 on summary tape file 4 contained these data. The comparable filetype in 2000 is Workflow.
- stf9s5 — Special tabulation file 5
- Commuting Patterns by county.
- stp154 — Special tabulation product 154
- Commuting patterns by place-tract ("daytime population" file).
- pl9490 — Public law 94 (redistricting) file
- A few variables about lots of geographic areas, down to census block.
- pl9490tx — Special extracts from pl9490
- Public law 94-171 (redistricting data) from the 1990 census.
[Return to top of page]
1980 Decennial Census
- stf803 — Summary tape file 3
- We have a complete national collection with A files for every state as well as a national ZIP code file and the C file with higher-level geographies for the entire U.S. These contain very detailed tables based on the 1980 long form.
- stf803x — Summary tape file 3, standard extract
- Needs work. We strongly suggest using stf803x2 instead.
- stf803x2 — Summary tape file 3, standard extract (revised)
- Same idea as the stf803x standard extract but this is by far the better collection. Content was chosen to be as compatible as possible with similar extracts from the 1990 and 2000 extracts. Complete national collection.
- stf801 — Summary tape file 1
- Detailed tables based on the 1980 short form.
- stf801x — Extracts derived from STF1 tables
- 1980 summary tape file 1 standard extracts.
[Return to top of page]
- popests — Population estimates
- More recent population estimates and projections from many different sources, for many different geographic areas and units. More for Missouri than for elsewhere, but some good state, metro area and county level data for the entire U.S. Some datasets with historical trends, some with components of change, some with demographic detail. These are all post-2000 estimates (with one key exception) along with a small number of projections (i.e. estimates of future populations). (Note: for the latest Missouri county level projections, see the moprojs filetype, just below.) In late March, 2014 we added the estimates with components of change data for 2013 (uscom and mocom data sets).
- nchsbri — NCHS special tab "bridged race" estimates
- This popests/nchsbri subdirectory (not technically a filetype, but rather an important sub-filetype) contains special estimates commissioned by the NCHS (National Center for Health Statistics) using bridged race categories, i.e. using race standards established by OMB in 1977 rather than the current ones established in 1997. The complete national collection has four datasets per state, two based on 1990 intercensal estimates and two based on post-2000 estimates. State- and county-level numbers with detail by single years of age, race (only four "bridged" values), sex, and hispanic origin. Important raw data resource for demographer types. These dataare used in the MCDC's Population estimates by age dynamic web application. Updated each summer with estimates through July 1 of the previous year. These data get refreshed (and overwritten) annually, between June and September.
- moprojs — Missouri population projections
- Missouri projected pops at the state and county level out to the year 2030. Done in 2008 by the state demographer in the Office of Administration in Jefferson City. These projections were done using the latest census results and estimates then available. These numbers represent the "preferred" series.
- popests/old — Older estimates of historic interest
- More estimates, but these are older and of interest mostly for for historical or trend analyses. Most were released by the Census Bureau during the 1990s and contain data estimated as of some year or years within that decade.
- saipe — Small area income and poverty estimates
- See the Economic Indicators section, below. Does include some population estimates as well, which is why we put this here. We also store the SAHIE data here — the estimates of health insurance coverage.
[Return to top of page]
- beareis — Bureau of Economic Analysis regional economic information system (REIS) data
- Time-series data on employment, income, farm income, transfer payments, and an overall economic profile for all states and counties in the U.S. Updated (with the files written over) each year, typically in the spring. There is usually a two-year lag in getting these data out. In September, 2012 we completely replaced the data collection with new data sets rebenchmarked and with new data for 1969 through 2010.
- cbp — County business patterns
- Counts of businesses, total and by industry, with establishments by size. Also has employment and payroll data. Data varies by geographic level, which includes nation, state, county, metro area (CBSA), and ZIP code. Annual data for theses levels (except ZIP code) available for the entire U.S. going back to 1999, with some earlier data for selected years back to 1990. New data are added to this collection each year.
- irstaxes — IRS tax return data
- These data sets provide summaries of personal income tax returns by year. Data include items such as number of total returns, total gross income reported, total taxes paid, total deductions of various types, etc. The data are tabulated at the county and ZIP code levels, along with state aggregates. Added to archive Feb., 2014. In July, 2014 the IRS released the data for tax year 2012. There were signficant changes to the data, especially at the ZIP level.
- bankdeps — Banking deposits data for Missouri
- Data are for individual branches with names, addresses, and estimated deposit info from the FDIC. The data have also been aggregated to Missouri counties and the state. Data are available for 1999 through (at least) 2015.
- bls_la — Employment statistics from the Bureau of Labor Statistics
- We have significantly restructured the data and have added badly needed FIPS state and county codes to make the data mergeable with other statistics. Monthly and annual average employment, unemployment, and unemployment rate data back to 1990 for all US states and counties (+PR). In 2015 we added a new collection of data sets containing annual average data for selected years going back to 1990. These are the usstateavgs and uscountyavgs data sets.
- empwage — Employment and wage data for Missouri
- Based on ES-202 files for the state. County-level summaries for various years. Missouri only. These data are for the decade of the 1990s and have not been updated in over 10 years. Of historic interest only.
- taxsales — Taxable sales for Missouri counties
- By SIC, by quarter, starting with year 2000. Data is from the Missouri Dept of Revenue. Lots of suppression here when you look for detailed SIC info, but the data for total sales without SIC detail is there. One dataset per year at the county/SIC level, and then a single summary set with just total sales by county by year (with state totals as well). Updated in June 2013 with complete 2012 data.
- govfin — State and local government finance
- Data based on the annual survey conducted by the Census Bureau. For each state and the nation, you get information regarding tax revenue sources and expenditures. The data are also reported by government level: state, local, and state + local. This filetype was added to the archive on an experimental basis in March, 2014. We currently just have data for the years 2006 and 2011.
[Return to top of page]
- georef — Geographic reference files
- Extensive collection of geographic reference data. A mixture of national and Missouri-specific files.
- corrlst — Geographic correlation lists and equivalency files
- These are our geographic correlation list (aka equivalency file) datasets. They deal with how various geographic layers correspond to each other — for example, how ZIP codes correlate with congressional districts. Included as a subdirectory is the MABLE database used in the Geocorr web application. Many of these datasets (and many more like them) can be generated using the Geocorr dynamic web application. Whereas many of these datasets may only be for Missouri and neighboring states, Geocorr works for the entire country.
- blkrel10 — Census 2000 / Census 2010 tabulation block relationship files
- These files show the relationship between the census blocks for these two decades. We enhance the files by combining them with data from our 2000 block-based mable09 data (see below), and by doing various value-added aggregations to generation other equivalency data sets. It includes a report directory and data sets that depict tract-to-tract equivalencies across the decade.
- hudusps — HUD/USPS ZIP to tract and tract to ZIP crosswalk files
- These files (for various points in time, currently just 1st quarter 2016 but many more available if needed) allow linking current ZIP codes (including non-residential) and 2010 census tracts.
- mable14 — Master area block level equivalency file, 2012
- This is the database constructed for use in the Geocorr 2014 web application. It is a distillation of the information contained in the geographic headers files from Summary File 1, 2010 census. augmented with data from more recent TIGER line files and some other miscellaneous sources. As usual, the Missouri datasets have some extra geocodes not available for the rest of the country. Most users will want to access these data using the Geocorr 2014 web application.
- mable09 — Master area block level equivalency file, 2009/2010
- This is the database constructed in 2010 for use in the Geocorr 2000 web application. It is a distillation of the information contained in the geographic headers files from Summary File 1, 2000 census, with some augmentations based on more recent TIGER line files and various other geographic sources such as CBSA codes for counties. The Missouri datasets have some extra codes not available for the rest of the country. Most users will want to access these data using the Geocorr 2000 web application.
- mable2k — Master area block level equivalency file, 2000
- This is the original 2000-geography MABLE database used in the Geocorr 2000 web application. We have since updated the datasets in mable09, but this directory contains the Tools directory where we have kept all the code used to generate the original and the updated datasets throughout the decade 2000-2009.
- mable98 — Master area block level equivalency file, 1998
- Similar to the mable2k/mable09 data collections, but this has older geographic codes. You can access these using the original 1990 version of Geocorr.
- gazetteer — Census Bureau gazetteers
- These data sets provide basic codes, names (if applicable) and geographic measures (internal point coordinates and various land/water area measurements) for a wide array of geographic types and time periods.
[Return to top of page]
- mosenior — Missouri Senior Report
- The data used to support the publication and web site of the same name. Contains county-level ranking of the state's elderly population (defined usually as aged 65+). OSEDA (the Office of Social and Economic Data Analaysis at the University of Missouri) was reponsible for the data and website development on this project. (See the Missouri Senior Report web site.)
- cntypage — The Missouri county summary of social and economic indicators
- This collection was originally developed by OSEDA in collaboration with University of Missouri Extension personnel originally in the fall of 2005. It includes key indicators used by Extension personnel, including key data from the 2000 and 2010 decennial censuses, recent American Community Survey data, the latest population estimates, current housing unit estimate, key employment and personal income categories from BEA, and a host of other items. The key datasets get refreshed at least once a year as new time series data become available. Geographic summary units are the state, its counties, and the UM Extension regions. See the Missouri County Social and Economic Indicators and Fact Sheets page for access to these data as formatted reports.
- indctrs — Missouri county key indicators database
- Of historic interest only. An assortment of datasets that have been created mostly by extraction of key data items from other sets in this archive. Emphasis is on data for Missouri (the state, its counties, and various regions), and most have data for at least two points in time. This collection of data was the basis of all reports and analyses published by OSEDA on their web site from 2002 to 2007. There have been no data added to this collection since April, 2007.
- kidscnt — Kids Count
- Kids Count is a national program sponsored by the Annie E. Casey foundation. The data for Missouri (all we have in the archive) comes from a myriad of sources, mostly within state government. They are all collected here and are used as the source of the tables, charts, reports, maps, etc. that can be accessed at the Missouri Kids Count Data web site. The mocomaster[yyyy] data sets (especially the most recent one) are the data sets to focus on initially.
- desex — DESE demographic data extract
- Demographic indicators extracted from the 1990 and 2000 censuses and other public sources created specifically for the Missouri Department of Elementary and Secondary Education (DESE). Summaries at various geographic levels, most of them within school district.
- modotx — MoDOT demographic data extract
- Of historic interest only. The MoDOT SEIR extract data were created for the Missouri Department of Transportation for use in their Socio-Economic Indicator Resource web-based system (developed for MoDOT by OSEDA circa 2002). Geography is geared toward MoDOT applications but includes Regional Planning Commisions, counties, and places. Data are heavily 2000 census (including CTPP) based.
[Return to top of page]
- irsmig — County-level migration data based on IRS tax returns
- We have data for the entire U.S. based on tax years as early as 1999/2000 and as recent as 2010/2011.
- ag2012 — Census of agriculture, 2012
- Summary Table 1 data only. Very similar to what we did with the 2007 data. We might do more later, if user demand indicates a need. Most users will want to go to the Ag Census web site.
- ag2007 — Census of agriculture, 2007
- Summary table 1 data only.
Note: If you know what filetype you want, you can explore the data by accessing the main data directory, sorted alphabetically by filetype.