The Census Bureau released a series of demographic profile products in May/early June of 2002. These products were in
the form of Adobe Acrobat format print reports and corresponding data files in comma delimited format.
We have downloaded those comma delimited ("csv") files and read them in to create the SAS data files found
in this directory. The data parallel the information in the four DP reports. Where the reports have a
percentage value we have also calculated and stored those percentage values as separate variables (which
were not included in the csv files as distributed.)
What you can expect to find here are good general overviews of the geographic areas summarized. Those areas are limited primarily to governmental units: states, counties, cities (places), MCDs, Congressional Districts and Metro Areas.
For more information and access to the nationwide collection of profile reports in pdf format see
the Census Bureau's Demographic Profiles web page. It includes a link to the technical documentation, also in pdf format.
Alternative Data Sources
For users needing more detail than what is contained in these extracts, the alternative source is the data in the
full sf3 directory, sf32000.
Not yet available, but coming soon (circa August, 2000 perhaps) will be the MCDC's sf32000x filetype,
which will be a comparable collection of key indicators extracted from the full sf32000 datasets. All we have at
the moment is a set of working specs for what we want to put on these files.
Data Files: What Goes Where
There are eight files per state. The first two characters of the file name are the state postal abbreviation. The next 3 characters of the file name indicate the profile: values are dp1 thru dp4. The file extension is either sas7bdat or sas7bvew, indicating either a regular SAS data set or a SAS data view. The former has variables that have been assigned mnemonic, descriptive names such as TotPop or Median_Age. The latter (the view files) use the Census Bureau's database naming conventions as documented in the technical documentation (see link, above). That technical documentation is a good place to start if are trying to get a handle on the contents of these datasets.
Mnemonic vs. Data Dictionary Variable Names
The MCDC allows you to use alternate versions of these data. You can access datasets with longer, more descriptive mnemonic names or, as mentioned in the previous paragraph, there are alternate views available that use the "data dictionary" names per the Census Bureau. To see the relationship of the mnemonic to the data dictionary names see the Variables.pdf report in the sf3prof directory.
This report also documents what variables were used as the denominators in calculating the pct_ variables. For example, it specifies that for the variable NoVehicles the corresponding percentage variable pct_NoVehicles was derived as a percentage of the variable HousingUnits. This is a good way to get a handle on what the universe is for each variable. In the NoVehicles example, it reminds us that that this variable is a count of housing units (not people) that have no vehicles available.
The MCDC has also created a series of Excel spreadsheets and corresponding pdf files containing some of the more frequently used data items for Missouri counties (and -- coming soon if not already here by the time you read this -- towns). These files are what the Bureau likes to call "Geographic Comparison" reports, meaning they contain a relatively small number of demographic items but for a large number of geographic entitities to facilitate comparison of those entities. The data in these reports were derived from the information in the comma-delimited files provided by the Census Bureau.
Go to explore the sf3prof data directory.