A Privacy-Preserving OCR-LLM System for Coronary Syndrome Subtyping From Admission HPI: Multicenter Validation in China and the US
NCT07449429 · Status: NOT_YET_RECRUITING · Type: OBSERVATIONAL · Enrollment: 10
Last updated 2026-03-04
Summary
This study develops and validates a privacy-preserving OCR-LLM pipeline that converts admission history of present illness (HPI) records into structured coronary syndrome subtypes (STEMI, NSTEMI, unstable angina, and chronic coronary syndrome). The system first extracts text from de-identified HPI images using locally deployed OCR, then applies large language models with a fixed diagnostic prompt to generate subtype classification and evidence. Performance is evaluated in an internal validation cohort and multiple external datasets covering heterogeneous EHR templates, emergency department cases, and an English dataset from MIMIC-IV. A clinician usability study assesses changes in diagnostic accuracy and time with and without tool assistance.
Conditions
- Coronary Artery Disease (CAD) (E.G., Angina, Myocardial Infarction, and Atherosclerotic Heart Disease (ASHD))
- Acute Coronary Syndromes
- ST-segment Elevation Myocardial Infarction (STEMI)
- Non-ST-Segment Elevation Myocardial Infarction (NSTEMI)
Interventions
- DEVICE
-
OCR-Prompt-LLM Information Extraction and Classification Workflow (OCR-Prompt-LLM)
An automated clinical data management workflow integrating Optical Character Recognition (OCR), optimized prompt engineering, and large language models (LLMs). The system processes unstructured inpatient/ED records (primarily admission history of present illness and related narrative text) to extract prespecified key clinical indicators (e.g., left ventricular ejection fraction, coronary syndrome subtype, medications) and to classify cases into prespecified coronary artery disease categories (e.g., unstable angina, STEMI, NSTEMI, chronic coronary syndrome). The workflow outputs structured fields and a classification result with supporting evidence excerpts.
- DEVICE
-
Manual Clinical Data Review
Standard manual process in which experienced clinicians review patient medical records and extract the same prespecified clinical indicators and coronary artery disease categories using routine clinical judgment and documentation review. This manual abstraction serves as the human benchmark for comparing diagnostic accuracy, completeness, and operational efficiency against the automated OCR-Prompt-LLM workflow.
Sponsors & Collaborators
-
China National Center for Cardiovascular Diseases
lead OTHER_GOV
Eligibility
- Min Age
- 18 Years
- Sex
- ALL
- Healthy Volunteers
- No
Timeline & Regulatory
- Start
- 2026-02-28
- Primary Completion
- 2026-03-08
- Completion
- 2026-03-08
More Related Trials
-
Comparison of Different Feature Engineering Methods for Automated ICD Coding
NCT04849195 ·Status: UNKNOWN
-
The Influence of Explainability and Integrability of AI-CDSS on Usage Behavior Among Primary Care Physicians
NCT07401979 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Large Language Model-Generated Messages to Improve Guideline-Directed Medical Therapy in Heart Failure
NCT07337577 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Build-up Computed Assisted History Taking, Physical Examination and Diagnosis System of Emergency Patient Through Machine Learning (II)
NCT05596929 ·Status: UNKNOWN ·Phase: NA
-
Development of an AI-based Emergency Imaging Multi-Disease Rapid Joint Screening System
NCT05974163 ·Status: UNKNOWN
-
Scalable Clinical Oversight of Large Language Models Via Uncertainty Triangulation
NCT07414966 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study
NCT07378358 ·Status: RECRUITING
-
AI-Assisted Interpretation of Cardiac CT in the Emergency Department
NCT07235657 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Clinical Intelligent Management System - Multilingual Exploration
NCT06923410 ·Status: RECRUITING ·Phase: NA
-
Evaluating the Real World Performance of an AI Based Lung Nodule Detection Tool
NCT06597968 ·Status: RECRUITING
-
Physician Response Evaluation With Contextual Insights vs. Standard Engines - Artificial Intelligence RAG vs LLM Clinical Decision Support
NCT07037940 ·Status: COMPLETED ·Phase: NA
-
AI-Agent for Automated Diagnosis and Predicting Using EHR and Multimodal Data
NCT06791499 ·Status: RECRUITING
-
"Smart Family Doctor" Assisted Comprehensive Management of Secondary Prevention Among Post Revascularization Patients
NCT07273513 ·Status: NOT_YET_RECRUITING ·Phase: NA
-
Natural Language Processing (NLP) Analysis of Free Text Notes to Investigate Coronavirus (COVID-19)
NCT04432961 ·Status: COMPLETED
-
Improving the Quality of Patient Care by Using a Clinical Expert System.
NCT00430755 ·Status: COMPLETED ·Phase: NA
-
Chest X-Ray Image Diagnosis and Report Generation Dedicated Model Based on Deepseek
NCT06874647 ·Status: COMPLETED
-
Large Language Model for Understanding and Monitoring Elderly Neurocognition
NCT07347431 ·Status: RECRUITING
-
Retrospective Study of Carebot AI CXR Performance in Preclinical Practice
NCT05594485 ·Status: COMPLETED
-
Exploring the Application Efficacy of Artificial Intelligence (AI) Diagnostic Tools in Medical Imaging (MI) of Respiratory(R) Infectious (I) Disease (D)
NCT06553911 ·Status: RECRUITING ·Phase: NA
-
Speech-to-text Tool for Heart Failure Patient-Reported Data
NCT07217366 ·Status: ENROLLING_BY_INVITATION ·Phase: NA
-
Dragon Ambient eXperience (DAX) Copilot Evaluation
NCT06329427 ·Status: COMPLETED ·Phase: NA
-
Multi-Reader Retrospective Study Examining Carebot AI CXR 2.0.21-v2.01 Implementation in Everyday Radiology Clinical Practice
NCT05963945 ·Status: COMPLETED
-
ChatBot and Activity Monitoring in Patients Undergoing Chemoradiotherapy
NCT05318027 ·Status: ACTIVE_NOT_RECRUITING ·Phase: PHASE2
-
A Platform for Multidisciplinary Medical Artificial Intelligence Development
NCT04890847 ·Status: UNKNOWN
-
An AI Educational Agent for Medical Machine Learning Courses
NCT07449182 ·Status: RECRUITING ·Phase: NA