Machine Learning: Bridging Statistics and IT to Solve Data Management Challenges

What do you do with millions of data points from multiple data sources that need to be categorized, coded, and analyzed? And in real time? And within a tight budget? Coding and categorizing it all could take years to complete. To say nothing of the cost. Machine learning provides the remedy.

With machine learning, we manually review and categorize subsets of available data. We then train the system using those subsets thru the latest machine learning techniques to automatically code and categorize raw data. We can recalibrate the process over time to deal with difficult data patterns and changing requirements.

Machine learning lets us set up an infrastructure that we can receive and review massive amounts of data, and quickly spot, analyze, and report on trends.

Westat uses advanced methods for solutions

Westat has harnessed the power of statistics and IT to solve data management challenges. We’ve developed a multipronged approach using natural language processing, machine learning methods, and statistical algorithms. Our toolkit draws on neural network and support vector machine methods, latent semantic indexing, and other advanced statistical methods.

Good prognosis for processing hospital survey data

Machine learning is a great tool to use when processing large-scale, longitudinal data. Take, for example, a survey that provides national data on inpatient hospital care. Westat collects millions of medical claims records each year for the survey. The data is sent to us via a secure site.

Using machine learning, we developed a system to automatically categorize payer type based on the payer name listed in the records:

  1. We built dictionaries to preprocess the raw data into usable inputs. 
  2. We trained the system with that preprocessed data and used the resulting “models” to code new data. 
  3. We set up an infrastructure for data management to review, check quality, annotate, and update results.

Our system has processed tens of millions of records, something that previously required intensive manual labor. We also developed a system to streamline data quality control so that manual review is reduced by 80%. This allows data management staff to focus on resolving more difficult data issues.

80%
Reduction in manual review needed
System streamlines data quality control to reduce manual review needed

Want to work with us?
You’ll be in great company.

About Us Careers

Westat Employees.
Westat Employee.
Client List
Centers for Disease Control and Prevention
Centers for Medicare & Medicaid Services
Substance Abuse and Mental Health Services Administration
The National Institutes of Health
AAA Foundation for Traffic Safety
The Johns Hopkins University
University of Maryland Baltimore Campus
University of Denver
U.S. Department of Veterans Affairs
U.S. Department of Transportation
U.S. Department of Justice
U.S. Department of Health and Human Services
U.S. Department of Education
U.S. Department of Agriculture
Toyota
The Verizon Foundation
Texas Education Agency
Baltimore Metropolitan Council
Teach for America
Social Security Administration
SiriusXM
Robert Wood Johnson Foundation
Organization for Economic Cooperation and Development
NYC Mayor’s Office for Economic Opportunity
National Science Foundation
Michigan Department of Health and Human Services
Maryland Cancer Registry
Internal Revenue Service
Georgia Department of Transportation
DC Public Schools
ClearWay Minnesota
Chicago Metropolitan Agency for Planning
University of Michigan

Westat is an Equal Opportunity Employer and does not discriminate on the basis of race, color, religion, sex, national origin, age, veteran status, disability, marital status, sexual orientation, citizenship status, genetic information, gender identity, or any other protected status under applicable law. Notices to Employees & Applicants.