Add like
Add dislike
Add to saved papers

Validation of electronic medical data: Identifying diabetes prevalence in general practice.

BACKGROUND: Electronic medical records are increasingly used for research with limited external validation of their data.

OBJECTIVE: This study investigates the validity of electronic medical data (EMD) for estimating diabetes prevalence in general practitioner (GP) patients by comparing EMD with national Bettering the Evaluation and Care of Health (BEACH) data.

METHOD: A "decision tree" was created using inclusion/exclusion of pre-agreed variables to determine the probability of diabetes in absence of diagnostic label, including diagnoses (coded/free-text diabetes, polycystic ovarian syndrome, impaired glucose tolerance, impaired fasting glucose), diabetic annual cycle of care (DACC), hemoglobin (HbA1c) > 6.5%, and prescription (metformin, other diabetes medications). Via SQL query, cases were identified in EMD of five Illawarra and Southern Practice Network practices (30,007 active patients; from 2 years to January 2015). Patient-based Supplementary Analysis of Nominated Data (SAND) sub-studies from BEACH investigating diabetes prevalence (1172 GPs; 35,162 patients; November 2012 to February 2015) were comparison data. SAND results were adjusted for number of GP encounters per year, per patient, and then age-sex standardised to match age-sex distribution of EMD patients. Cluster-adjusted 95% confidence intervals (CIs) were calculated for both datasets.

RESULTS: EMD diabetes prevalence (T1 and/or T2) was 6.5% (95% CI: 4.1-8.9). Following age-sex standardisation, SAND prevalence, not significantly different, was 6.7% (95% CI: 6.3-7.1). Extracting only coded diagnosis missed 13.0% of probable cases, subsequently identified through the presence of metformin/other diabetes medications medications (*without other indicator variables; 6.1%), free-text diabetes label (3.8%), HbA1c result* (1.6%), DACC* (1.3%), and diabetes medications* (0.2%).

DISCUSSION: While complex, proxy variables can improve usefulness of EMD for research. Without their consideration, EMD results should be interpreted with caution.

CONCLUSION: Enforceable, transparent data linkages in EMRs would resolve many problems with identification of diagnoses. Ongoing data quality improvement remains essential.

Full text links

We have located links that may give you full text access.
Can't access the paper?
Try logging in through your university/institutional subscription. For a smoother one-click institutional access experience, please use our mobile app.

For the best experience, use the Read mobile app

Mobile app image

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app

All material on this website is protected by copyright, Copyright © 1994-2024 by WebMD LLC.
This website also contains material copyrighted by 3rd parties.

By using this service, you agree to our terms of use and privacy policy.

Your Privacy Choices Toggle icon

You can now claim free CME credits for this literature searchClaim now

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app