Summary of Post
The post is based in Dublin at the Cardiovascular Research Institute (CVRI), Mater Private Hospital, and RCSI.
This role is central to the CroíValve (DUO MAX) project, which aims to develop and validate a precision risk stratification model for patients undergoing tricuspid valve treatment.
The Royal College of Surgeons in Ireland (RCSI) focuses on leveraging advanced AI to overcome the limitations of traditional cardiovascular imaging, such as variability and radiation exposure.
By integrating NLP and data analytics, the project seeks to automate the extraction of critical data from Electronic Health Records (EHRs) and clinical reports to power a Clinical Decision Support System (CDSS) that standardizes workflows and enhances patient selection.
The successful candidate will contribute to high-impact scientific objectives, including the development of deep learning algorithms for automated leak quantification and the non-invasive assessment of Right Ventricular (RV) function.
Specifically, the duties of the post are:
The applicant will work as part of Prof. Soliman's lab at both RCSI and Mater Private Hospital.
Key Responsibilities
• Extract and structure critical data from non-imaging sources, specifically Electronic Health Records (EHRs) and clinical reports.
• Create multi-modal datasets that integrate extracted text data with other clinical sources to power advanced patient selection features.
• Develop NLP pipelines to automate the processing of unstructured medical narratives and clinical documentation.
• Design and implement data analytics frameworks to support outcome prediction features within a Clinical Decision Support System (CDSS).
• Collaborate with clinical and imaging teams to ensure data harmonisation and the accuracy of extracted clinical features.
• Prepare scientific reports, manuscripts, and presentations detailing the methodology and results of NLP and data analytics workflows.
• Support the PI in completing the required technical reports.
Requirements
Essential
• PhD or MSc in Computer Science, Data Science, Computational Linguistics, or a related field with a focus on NLP.
• Research or industrial experience in Natural Language Processing (NLP).
• Demonstrated expertise in extracting and structuring information from complex, non-imaging medical sources (e.g., EHRs, clinical reports).
• Proficiency in Python and/or R, with deep knowledge of NLP libraries and frameworks.
• Proven track record in applying machine learning or AI to large-scale clinical datasets.
Desirable
• Experience with Clinical Decision Support Systems (CDSS) and outcome prediction modeling.
• Knowledge of time-series analysis and of how longitudinal data can be combined with text-based clinical insights.
• Record of publications in high-impact journals or top-tier AI/NLP conferences.
• Familiarity with data privacy and ethical considerations regarding the use of sensitive patient EHR data.
We are all too aware that imposter syndrome and the confidence gap can sometimes stop fantastic candidates putting themselves forward, so please do submit an application — we'd love to hear from you.