Tools for using commercial sources of residential histories for cancer research

How to use NCI's SAS residential history generation programs

As part of the National Cancer Institute’s residential history pilot project, Westat created “ResHistGen,” a set of open-source SAS programs that will help researchers and others reconcile data from commercial vendors and generate residential histories of study participants.

For more information on the residential history study and the development of the SAS programs, see NCI/SEER Residential History Project [1544kb PDF], the study’s technical report.

The steps to use the ResHistGen programs for the creation of residential histories of research subjects can be performed by staff at the cancer registry, members of the research team, or staff at a third-party contractor. To access the programs along with information on how to use them, please go to the GitHub repository.

  1. Individual patient identifiers are needed for this process. It is essential that the researcher follow established procedures to protect the privacy of human subjects.
  2. Submit subject names and identifiers for relevant cases to the vendor.
  3. Geocode the addresses received from the commercial vendor. All U.S. cancer registries have access to the North American Association of Central Cancer Registries (NAACCR) geocoder, but any batch geocoder can be used.
  4. Run the first SAS program ( [12kb text file]) to match common addresses. For a study with a small number of study subjects, possible matches can be reviewed manually in a two-step process. For a study with a large number of subjects, this can be done automatically in a single step.
  5. If a manual review is desired, edit the “LN_matchcombos_review.xlsx” file created by the first program, deleting any rows that are not true matches. This review can be guided by the NCI SEER Manual Address Comparison Guidelines [31kb Word file].
  6. Run the second SAS program ( [16kb text file]) to add any results from the manual review and combine matched addresses.
  7. Run the third SAS program ( [13kb text file]) to reconcile addresses and generate a derived residential history.
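
To make the idea behind steps 4 through 7 more concrete, the following is a minimal Python sketch of combining matched addresses and ordering them into a history. It is not the ResHistGen logic, which is implemented in SAS and documented in the technical report. The record layout (`subject_id`, `address`, `first_seen`, `last_seen`), the `normalize` rule, and the `build_history` function are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from datetime import date
import re


@dataclass
class AddressRecord:
    """One address line from a vendor file (hypothetical layout)."""
    subject_id: str
    address: str
    first_seen: date
    last_seen: date


def normalize(address: str) -> str:
    """Crude normalization: uppercase, drop punctuation, collapse spaces."""
    cleaned = re.sub(r"[^A-Z0-9 ]", "", address.upper())
    return " ".join(cleaned.split())


def build_history(records):
    """Combine records that share a subject and normalized address,
    then order each subject's addresses by the first date seen."""
    merged = {}
    for rec in records:
        key = (rec.subject_id, normalize(rec.address))
        if key in merged:
            start, end = merged[key]
            # Widen the span to cover both reports of the same address.
            merged[key] = (min(start, rec.first_seen), max(end, rec.last_seen))
        else:
            merged[key] = (rec.first_seen, rec.last_seen)

    history = {}
    for (subject_id, address), (start, end) in merged.items():
        history.setdefault(subject_id, []).append((address, start, end))
    for spans in history.values():
        spans.sort(key=lambda span: span[1])
    return history
```

In this sketch, two vendor lines such as "12 Main St." and "12 main st" collapse into one entry whose date range covers both. A real reconciliation also has to handle overlapping stays at different addresses, gaps, and geocoding quality, none of which is attempted here.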

The current release of these programs is Version 2.1. For a summary of changes since the previous release, see Version 2.1 Changes.txt [3kb text file].

In the ResHistGen programs, local file locations are specified in the first few lines of each program to facilitate portability. The programs have been written to avoid any data conversion or divide-by-zero warning messages; if either appears in the log, it indicates an error. The programs also test for unexpected conditions and, if any are encountered, generate messages marked with three asterisks (“***”).
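
Checking a saved log for these messages can be automated. The Python sketch below is not part of ResHistGen; it scans log lines for the “***” marker and for data conversion and divide-by-zero notes. The two note patterns are assumptions based on the wording SAS commonly uses in its log, so adjust them if your logs read differently. `PATTERNS` and `scan_log` are hypothetical names.

```python
import re

# "***" comes from the description above. The other two patterns are
# assumptions about how SAS phrases these notes in its log.
PATTERNS = {
    "unexpected_condition": re.compile(r"\*\*\*"),
    "data_conversion": re.compile(r"values have been converted to", re.IGNORECASE),
    "divide_by_zero": re.compile(r"division by zero", re.IGNORECASE),
}


def scan_log(lines):
    """Return (line_number, label, text) for every log line that matches
    one of the patterns. Line numbers start at 1."""
    findings = []
    for number, line in enumerate(lines, start=1):
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                findings.append((number, label, line.rstrip()))
    return findings
```

Because `scan_log` accepts any iterable of strings, an open file handle for a saved log can be passed to it directly. An empty result suggests that none of the three conditions was reported.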

The ResHistGen programs are released under the GNU General Public License [34kb text file]. Limited support for questions is available by email; enhancements may also be shared by email and, if found to be beneficial, will be included in a future release. Under the terms of the license, you may also distribute your changes on your own, provided you include a prominent notice that you have modified the original.

If you publish results based on these programs, please include the following citation: ResHistGen Residential History Generation Programs, Version 2.1 - October 2020; Surveillance Research Program, National Cancer Institute.
