Data @ VirtualRDC
| Table of contents |
1 Overview of available data
The following datasets are available on the VirtualRDC, in either dummy (zero-obs) form, or with some content based on public-use data. Check the Data section for updated documentation on these datasets. Some documentation on the full dataset may be available on the CES website.
2 Types of datasets
The VirtualRDC does not host confidential data of any type. Here’s what types of data we have:
- public -use data: Public use data not (easily) available elsewhere, or because they have a use in the analysis of synthetic data.
- Synthetic data: replicates the exact structure and the statistical properties of the original (confidential) dataset, without using any of the original confidential data content. They are, of course, also public-use datasets…
- Zero-obs or dummy datasets: replicate the exact structure of the original dataset, but contain no data at all – no original data, no synthetic data, none.
- Public-use based replica: replicates the exact structure of the original (confidential) dataset, with the content of a similar publi-use dataset reformatted to resemble the original dataset
3 Available public-use datasets
- Download-only:
- Access on compute servers only:
- Public-use SIPP files
/demographic/sip/sipp_public_use_data
- American Housing Survey (AHS) public use files
/demographic/ahs/microdata/
- County Business Patterns (CBP)
/economic/cbp/
- Public-use SIPP files
4 Available synthetic datasets
- OnTheMap synthetic data (1 replicate from Version 3–the current version on the Census Bureau OnTheMap site). Please consult the VirtualRDC OnTheMap website
- Longitudinal Business Database (1 replicate, documentation). Available on compute nodes under
/economic/lbd/synthetic
- SIPP Synthetic Beta file
5 Available dummy datasets
On VirtualRDC nodes:
- PREVIOUSLY USED LEHDLITE FILES:
/mixed/lehdlite/brb /mixed/lehdlite/ecf /mixed/lehdlite/qwi
- LEHD Infrastructure files FROM 2004 SNAPSHOT (documentation)
/mixed/lehd/s2004/brb/us/brb_us_brlist.sas7bdat /mixed/lehd/s2004/brb/us/brb_us_ecflist.sas7bdat /mixed/lehd/s2004/brb/us/brb_us_xwalk.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_ca_sein_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_ca_seinunit_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_zz_sein_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_zz_seinunit_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/es202_ca_t26.sas7bdat /mixed/lehd/s2004/ecf/zz/ecf_zz_sein.sas7bdat /mixed/lehd/s2004/ecf/zz/ecf_zz_seinunit.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_controltotals.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_phf.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_sein_employment.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_shf.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_uhf.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_uniqpik.sas7bdat /mixed/lehd/s2004/galt26/zz/gal_zz_2003_br_2001.sas7bdat /mixed/lehd/s2004/galt26/zz/gal_zz_2003_t26flags.sas7bdat /mixed/lehd/s2004/galt26/zz/gal_zz_2003_t26.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_acspow_2001.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_ahs_2002.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_es202only.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_maf_2004.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_xwlk_2001.sas7bdat /mixed/lehd/s2004/icft26/zz/icf_zz_t26.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz_implicates_age_sex.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz_implicates_county.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz_implicates_education.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz.sas7bdat /mixed/lehd/s2004/qwi/zz/qwi_zz_seinunit.sas7bdat /mixed/lehd/s2004/u2w/zz/u2w_zz.sas7bdat
- Longitudinal Business Database (LBD) dummy datasets
/economic/lbd/
- Internal decennial census microdata dummy datasets
(list soon)