How to use NCI's SAS residential history generation programs

How to use NCI's SAS residential history generation programs

As part of the National Cancer Institute’s residential history pilot project, Westat created “ResHistGen,” a set of open-source SAS programs that will help researchers and others reconcile data from commercial vendors and generate residential histories of study participants. Here’s how to use the ResHistGen programs to create residential histories for a set of research subjects. These steps can be performed by staff at the cancer registry, members of the research team, or staff at a third-party contractor.

  1. Individual patient identifiers are needed for this process. It is essential that the researcher follow established procedures to protect the privacy of human subjects.
  2. Submit subject names and identifiers for relevant cases to the vendor.
  3. Geocode the addresses you have received from the commercial vendor. All U.S. cancer registries have access to the North American Association of Central Cancer Registries (NAACCR) geocoder, but any batch geocoder can be used.
  4. Run the first SAS program (01_MatchAddresses1.sas) to match common addresses. For a study with a small number of study subjects, possible matches can be reviewed manually in a 2-step process. For a study with a large number of subjects, this can be done automatically in a single step.
  5. If a manual review is desired, edit the “LN_matchcombos_review.xlsx” created by the first program by deleting rows that are not matches.
  6. Run the second SAS program (02_MatchAddresses2.sas) to add manual review results (if any) and combine matched addresses.
  7. Run the third SAS program (03_BuildResHistory.sas) to reconcile addresses and generate a derived residential history.

In the ResHistGen programs, local file locations are specified in the first few lines of each program to facilitate portability. The programs have been written to avoid any data conversion or divide by zero warning messages; if these occur, there’s an error. There are tests for unexpected conditions, and messages are generated with 3 asterisks (“***”) if any unexpected conditions are encountered.

The ResHistGen programs are released under the GNU General Public License. If you have questions, limited support is available by email at NCI.ResidentialHistory@westat.com. You can also use this email address to share any enhancements you make to these programs. If we think the enhancements are worthwhile, we will include them in a future release. By the terms of the license, you may distribute your changes on your own provided you include a prominent notice that you have modified the original.

If you publish results based on these programs, please include the following citation: ResHistGen Residential History Generation Programs, Version 1.0 - June 2016; Surveillance Research Program, National Cancer Institute.

Want to work with us?
You’ll be in great company.

About Us Careers

Two people looking at phone
Two people talking
AAA Foundation for Traffic Safety
Baltimore Metropolitan Council
Centers for Disease Control and Prevention
Centers for Medicare & Medicaid Services
Chicago Metropolitan Agency for Planning
ClearWay Minnesota
DC Public Schools
Georgia Department of Transportation
Internal Revenue Service
Maryland Cancer Registry
Michigan Department of Health and Human Services
National Science Foundation
NYC Mayor’s Office for Economic Opportunity
Organization for Economic Cooperation and Development
Robert Wood Johnson Foundation
SiriusXM
Social Security Administration
Substance Abuse and Mental Health Services Administration
Teach for America
Texas Education Agency
The Johns Hopkins University
The National Institutes of Health
The Verizon Foundation
Toyota
U.S. Department of Agriculture
U.S. Department of Education
U.S. Department of Health and Human Services
U.S. Department of Justice
U.S. Department of Transportation
U.S. Department of Veterans Affairs
University of Denver
University of Maryland Baltimore Campus
University of Michigan

Westat is an Equal Opportunity Employer and does not discriminate on the basis of race, color, religion, sex, national origin, age, veteran status, disability, marital status, sexual orientation, citizenship status, genetic information, gender identity, or any other protected status under applicable law. Notices to Employees & Applicants.