Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.

Research Data

A guide for locating, managing, and sharing research data.


Below is a list of data and dataset sources at UTHSC available for use in your research project. Contact the individual listed with each source to obtain permission and access to the data.

1. Research Enterprise Data Warehouse (rEDW): The rEDW is an informatics tool available to researchers at UTHSC and Methodist Health System to assist with generating strong, data-driven hypotheses. Via controlled searches of the rEDW, UTHSC researchers will have the ability to run cohort queries, perform aggregated analyses and develop evidenced preparatory to research study plans.

  • Contact: Center for Biomedical Informatics, 901-287-5834,

2. Diabetes Wellness and Prevention Coalition (DWPC) Registry: Data on more than 480,000 patients is gathered from 80 clinics to track processes and outcomes of care and improve care for people living with diabetes or at risk for diabetes in the Mid-South.

  • Contact: Cardella Leak, PhD, MPH, 901-448-2476,

3. Center for Medicare and Medicaid Service (CMS) Datasets: Medicare and Medicaid claims data are released by Center for Medicare and Medicaid Service (CMS) annually. Medicare data include claims for the use of inpatient, outpatient, and physician/supplier services for the elderly population, and Medicaid data include such claims for low income or disabled population. Claims contain Information on patient demographic characteristics, diagnoses, procedures, and cost for each use of health service. Part D drug data are also available since 2007 and can be linked to the claims data. Researchers can use these data to conduct studies on health outcomes, health care access, and health disparities. The Research Data Assistance Center (ResDAC) provides free technical support on how to use these data.

  • Contact: Xinhua Yu, MD, PhD, 901-678-3433,

4. Medical Expenditure Panel Survey (MEPS): A set of large-scale surveys of families and individuals, their medical providers (doctors, hospitals, pharmacies, etc.), and employers across the United States. MEPS collects data on the specific health services that Americans use, how frequently they use them, the cost of these services, and how they are paid for, as well as data on the cost, scope, and breadth of health insurance held by and available to U.S. workers.

  • Contact: Satya Surbhi, PhD, 901-448-4166,

5. CERNER Health Facts® (HF) database: Since 2000, CER NER Health Facts® (HF) database has captured and stored de-identified, longitudinal electronic health record (EHR) patient data, aggregated and organized to facilitate analyses and reporting. It currently contains data on almost 50 million patients and almost 300 million encounters. Researchers may request access through the UTHSC Center for Biomedical Informatics.

  • Contact: Charisse Madlock-Brown, PhD, 901-448-6486,

6.  Physician Compare datasets: The Centers for Medicacre & Medicaid Services (CMS) provide official datasets for the Physical Compare website to give you useful information about groups and clinicians listed on Physician Compare.

  • Contact: Satya Surbhi, PhD, 901-448-4166,

7.  Conditions Affecting Neurocognitive Development and Learning in Early Childhood (CANDLE) Study: CANDLE is an observational, longitudinal cohort research study of the development and ability to learn in children from birth to age four. The Candle study has enrolled 1503 healthy pregnant women in Shelby County.


Have a data set you would like to share or think would be a good addition to our list? Please contact Jess Newman at