Treffer: Searching the PDF Haystack: Automated Knowledge Discovery in Scanned EHR Documents.

Title:
Searching the PDF Haystack: Automated Knowledge Discovery in Scanned EHR Documents.
Authors:
Kostrinsky-Thomas AL; College of Osteopathic Medicine, Pacific Northwest University of Health Sciences, 200 University Pkwy Yakima, Washington, United States., Hisama FM; Division of Medical Genetics, Department of Medicine, University of Washington School of Medicine, Seattle, Washington, United States., Payne TH; Department of Medicine, University of Washington School of Medicine, Seattle, Washington, United States.
Source:
Applied clinical informatics [Appl Clin Inform] 2021 Mar; Vol. 12 (2), pp. 245-250. Date of Electronic Publication: 2021 Mar 24.
Publication Type:
Journal Article; Research Support, Non-U.S. Gov't
Language:
English
Journal Info:
Publisher: Thieme Country of Publication: Germany NLM ID: 101537732 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1869-0327 (Electronic) Linking ISSN: 18690327 NLM ISO Abbreviation: Appl Clin Inform Subsets: MEDLINE
Imprint Name(s):
Publication: 2018- : Stuttgart, Germany : Thieme
Original Publication: Hölderlinstr, Germany : Schattauer
References:
J Healthc Inform Res. 2019 Jan 28;3(2):200-219. (PMID: 35415427)
J Biomed Inform. 2001 Oct;34(5):301-10. (PMID: 12123149)
JCO Clin Cancer Inform. 2019 Mar;3:1-8. (PMID: 30869999)
AMIA Annu Symp Proc. 2012;2012:1211-20. (PMID: 23304398)
AMIA Jt Summits Transl Sci Proc. 2019 May 06;2019:173-181. (PMID: 31258969)
J Am Med Inform Assoc. 2020 Jul 1;27(9):1443-1449. (PMID: 32940694)
J Am Med Inform Assoc. 2012 Jun;19(e1):e90-5. (PMID: 21890871)
J Am Med Inform Assoc. 2018 Mar 1;25(3):331-336. (PMID: 29186491)
Appl Clin Inform. 2011 Jan 1;2(3):250-262. (PMID: 22180762)
EGEMS (Wash DC). 2016 Jun 01;4(1):1217. (PMID: 27376095)
Int J Med Inform. 2020 Dec;144:104302. (PMID: 33091829)
Entry Date(s):
Date Created: 20210325 Date Completed: 20211112 Latest Revision: 20240331
Update Code:
20250114
PubMed Central ID:
PMC7990572
DOI:
10.1055/s-0041-1726103
PMID:
33763846
Database:
MEDLINE

Weitere Informationen

Background: Clinicians express concern that they may be unaware of important information contained in voluminous scanned and other outside documents contained in electronic health records (EHRs). An example is "unrecognized EHR risk factor information," defined as risk factors for heritable cancer that exist within a patient's EHR but are not known by current treating providers. In a related study using manual EHR chart review, we found that half of the women whose EHR contained risk factor information meet criteria for further genetic risk evaluation for heritable forms of breast and ovarian cancer. They were not referred for genetic counseling.
Objectives: The purpose of this study was to compare the use of automated methods (optical character recognition with natural language processing) versus human review in their ability to identify risk factors for heritable breast and ovarian cancer within EHR scanned documents.
Methods: We evaluated the accuracy of the chart review by comparing our criterion standard (physician chart review) versus an automated method involving Amazon's Textract service (Amazon.com, Seattle, Washington, United States), a clinical language annotation modeling and processing toolkit (CLAMP) (Center for Computational Biomedicine at The University of Texas Health Science, Houston, Texas, United States), and a custom-written Java application.
Results: We found that automated methods identified most cancer risk factor information that would otherwise require clinician manual review and therefore is at risk of being missed.
Conclusion: The use of automated methods for identification of heritable risk factors within EHRs may provide an accurate yet rapid review of patients' past medical histories. These methods could be further strengthened via improved analysis of handwritten notes, tables, and colloquial phrases.
(Thieme. All rights reserved.)

None declared.