A critically important aspect of using SF4 data is the dramatic degree to which the data are suppressed. The Abstract from the Tech Doc describes the situation as follows:
In Summary File 4, the sample data are presented in 213 population tables (matrices) and 110
housing tables, identified with ‘‘PCT’’ and ‘‘HCT,’’ respectively. Each table is iterated for 336
population groups: the total population, 132 race groups, 78 American Indian and Alaska Native
tribe categories (reflecting 39 individual tribes), 39 Hispanic or Latino groups, and 86 ancestry
groups. The presentation of SF 4 tables for any of the 336 population groups is subject to a
population threshold. That is, if there are fewer than 100 people (100 percent count) in a specific
population group in a specific geographic area, and there are fewer than 50 unweighted cases,
their population and housing characteristics data are not available for that geographic area in SF
4. For the ancestry iterations, only the 50 unweighted cases test can be performed. [This last note translates into an effective threshold for ancestry groups that varies according to the sampling factor in each geographic area, but since the overall sampling factor was about 1 in 6 the average population threshold for an ancestry group is about 300 persons. The smallest total population we found for any ancestry summary in the state of Missouri was over 200.]
.
The MCDC has created a set of index reports that can help you quickly determine if there is any data available for the subpopulation / geographic area(s) in which you are interested. For example, if you wanted SF4 data regarding the Hispanic/Latino population by county for Missouri you would want to know which counties met the 100-persons threshold. If you go to
http://mcdc2.missouri.edu/pub/data/sf42000/IndexReports/moChariterIndices/ and click on mohispanic you will get a report showing what geographic summary levels are available for the various Hispanic subgroups. The report would tell you, for example, that there are data for 57 counties in Missouri with summaries for the total Hispanic population. It would also tell you that if you wanted to get data for the detailed Hispanic subcategory Mexican that there are only two counties in Missouri with such data available.
Another index report is http://mcdc2.missouri.edu/pub/data/sf42000/IndexReports/mogeoindex.pdf. This one is sorted by geographic area and tells you which "chariters" (population subcategories) are available for each area. You could use this report, for example, if you were interested in doing an analysis of a specific city or county and wanted to know what kind of detailed you would be able to find.