CL4Health

CL4Health @ NAACL 2025

Patient-oriented language processing

Important Dates
Invited Talk
Program
Submissions
Scope
Shared Task
Meeting
Program Committee
Organizers

Important Dates

Workshop Paper Due Date️:	February 4, 2025
Notification of acceptance:	March 1, 2025
Camera-ready papers due:	March 13, 2025
Pre-recorded video due (hard deadline):	April 8, 2025
Workshop:	May 4, 2025

Invited Talk

Bridging the Gap: Inclusive Artificial Intelligence for Patient-Oriented Language Processing in Conversational Agents in Healthcare

Kerstin Denecke

Kerstin Denecke is Professor of Medical Informatics at the Department of Technology & Informatics, Bern University of Applied Sciences. She researches and teaches at the Institute of Medical Informatics on text mining in the clinical context and mobile health applications including dialogue-oriented user interfaces. One of her research directions is inclusive design of digital health solutions for older adults. The project on digital health solutions utilizes evidence-based approaches for prevention, treatment, and health promotion.

CL4Health 2025: Program

Time Zone in Albuquerque, New Mexico, USA in May is MDT — Mountain Daylight Time

Sunday, May 4, 2025

08:15 - 08:30

Opening remarks

08:30 - 10:30

Session 1: Oral Presentations

08:30 - 08:50

Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient's Point of View
Mathilde Aguiar¹, Pierre Zweigenbaum², Nona Naderi³

08:50 - 09:10

When Multilingual Models Compete with Monolingual Domain-Specific Models in Clinical Question Answering
Vojtech Lanz and Pavel Pecina

09:10 - 09:30

Comparing representations of long clinical texts for the task of patient-note identification
Safa Alsaidi¹, Marc Vincent², Olivia Boyer³, Nicolas Garcelon², Miguel Couceiro⁴, Adrien Coulet¹

09:30 - 09:50

Towards Knowledge-Guided Biomedical Lay Summarization using Large Language Models
Shufan Ming, Yue Guo, Halil Kilicoglu

09:50 - 10:10

Benchmarking IsiXhosa Automatic Speech Recognition and Machine Translation for Digital Health Provision
Abby Blocker¹, Francois Meyer¹, Ahmed Biyabani², Joyce Mwangama¹, Mohammed Ishaaq Datay¹, Bessie Malila¹

10:10 - 10:30

Mining Social Media for Barriers to Opioid Recovery with LLMs
Vinu Ekanayake, Md Sultan Al Nahian, Ramakanth Kavuluru

10:30 - 11:00

Coffee Break

11:00 - 12:40

Session 2: Oral Presentations

11:00 - 11:15

Towards Understanding LLM-Generated Biomedical Lay Summaries
Rohan Charudatt Salvi¹, Swapnil Panigrahi², Dhruv Jain², Shweta Yadav³, Md. Shad Akhtar²

11:15 - 11:30

Prefix-Enhanced Large Language Models with Reused Training Data in Multi-Turn Medical Dialogue
Suxue Ma¹, Zhicheng Yang², Ruei-Sung Lin², Youbao Tang², Ning Zhang³, Zhenjie Cao⁴, yuan ni⁵, Jing Xiao⁶, Jieke Hou⁷, Peng Chang²

11:30 - 11:50

MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters
Amin Dada¹, Osman Alperen Koras², Marie Bauer², Amanda Butler³, Kaleb E Smith³, Jens Kleesiek², Julian Friedrich²

11:50 - 12:05

Bias in Danish Medical Notes: Infection Classification of Long Texts Using Transformer and LSTM Architectures Coupled with BERT
Mehdi Parviz¹, Rudi Agius², Carsten Utoft Niemann², Rob van der Goot³

12:05 - 12:25

Bridging the Gap in Health Literacy: Harnessing the Power of Large Language Models to Generate Plain Language Summaries from Biomedical Texts
Andrés Felipe Arias-Russi, Carolina Salazar-Lara, Rubén Francisco Manrique

12:25 - 12:40

Preliminary Evaluation of an Open-Source LLM for Lay Translation of German Clinical Documents
Tabea Margareta Grace Pakull¹, Amin Dada², Hendrik Damm³, Anke Fleischhauer⁴, Sven Benson⁵, Noëlle Bender⁶, Nicola Prasuhn⁷, Katharina Kaminski⁴, Christoph M. Friedrich⁸, Peter A. Horn¹, Jens Kleesiek², Dirk Schadendorf⁴, Ina Pretzell⁴

12:40 - 14:00

Lunch

14:00 - 14:40

Invited Talk
Bridging the Gap: Inclusive Artificial Intelligence for Patient-Oriented Language Processing in Conversational Agents in Healthcare
Kerstin Denecke

14:40 - 15:30

Sesion3: Shared Task

14:40 - 15:10

Overview of the PerAnsSumm 2025 Shared Task on Perspective-aware Healthcare Answer Summarization
Siddhant Agarwal¹, Md. Shad Akhtar², Shweta Yadav¹

15:10 - 15:20

WisPerMed @ PerAnsSumm 2025: Strong Reasoning Through Structured Prompting and Careful Answer Selection Enhances Perspective Extraction and Summarization of Healthcare Forum Threads
Tabea Margareta Grace Pakull¹, Hendrik Damm², Henning Schäfer¹, Peter A. Horn¹, Christoph Matthias Friedrich²

15:20 - 15:30

YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization
Dongsuk Jang¹, Haoxin Li², Arman Cohan¹

15:30 - 16:00

Coffee Break

16:00 - 17:30

Poster Session

On Large Foundation Models and Alzheimer's Disease Detection
Chuyuan Li¹, Giuseppe Carenini², Thalia S Field²

Synthetic Documents for Medical Tasks: Bridging Privacy with Knowledge Injection and Reward Mechanism
Simon Meoni¹, Éric de la Clergerie², Théo Ryffel³

A Preliminary Study on NLP-Based Personalized Support for Type 1 Diabetes Management
Sandra Mitrović¹, Federico Fontana², Andrea Zignoli², Felipe Mattioni Maturana², Christian Berchtold¹, Daniele Malpetti¹, Sam Scott², Laura Azzimonti¹

Multimodal Transformers for Clinical Time Series Forecasting and Early Sepsis Prediction
Jinghua Xu¹ and Michael Staniek²

LT3: Generating Medication Prescriptions with Conditional Transformer
Samuel Belkadi¹, Nicolo Micheletti², Lifeng Han³, Warren Del-Pinto², Goran Nenadic²

Explainable ICD Coding via Entity Linking
Leonor Maria Machado Barreiros¹, Isabel Coutinho², Gonçalo M Correia¹, Bruno Martins²

Predicting Chronic Kidney Disease Progression from Stage III to Stage V using Language Models
Zainab K Awan¹, Rafael Henkin¹, Nick Reynolds², Michael R. Barnes¹

Capturing Patients' Lived Experiences with Chronic Pain through Motivational Interviewing and Information Extraction
Hadeel R A Elyazori¹, Rusul Abdulrazzaq¹, Hana Al Shawi¹, Isaac Paki Elom Amouzou¹, Patrick King¹, Syleah Manns¹, Mahdia Popal¹, Zarna Narsihbhai Patel¹, Secili DeStefano¹, Jay Shah², Naomi Lynn Gerber¹, Siddhartha Sikdar¹, Seiyon Lee¹, Samuel Acuna¹, Kevin Lybarger¹

Explainability for NLP in Pharmacovigilance: A Study on Adverse Event Report Triage in Swedish
Luise DÃ¼rlich¹, Erik Bergman², Maria Larsson², Hercules Dalianis³, Seamus Doyle², Gabriel Westman², Joakim Nivre⁴

PatientDx: Merging Large Language Models for Protecting Data-Privacy in Healthcare
Jose G. Moreno¹, Jesus Lovon-Melgarejo², M'Rick Robin-Charlet³, Christine Damase-Michel³, Lynda Tamine²

Beyond Reconstruction: Generating Privacy-Preserving Clinical Letters
Libo Ren¹, Samuel Belkadi², Lifeng Han², Warren Del-Pinto², Goran Nenadic²

Medication Extraction and Entity Linking using Stacked and Voted Ensembles on LLMs
Pablo Romero¹, Lifeng Han², Goran Nenadic³

SpecialtyScribe: Enhancing SOAP note Scribing for Medical Specialties using LLM's
Sagar Goyal¹, Eti Rastogi², Fen Zhao¹, Dong Yuan³, Andrew Beinstein¹

Using LLMs to improve RL policies in personalized health adaptive interventions
Karine Karine and Benjamin M Marlin

Leveraging External Knowledge Bases: Analyzing Presentation Methods and Their Impact on Model Performance
Hui-Syuan Yeh¹, Thomas Lavergne¹, Pierre Zweigenbaum²

Will Gen Z users look for evidence to verify QA System-generated answers?
SOUMYA GAYEN¹, Deepak Gupta², Dina Demner-Fushman³

LLM Based Efficient CSR Summarization using Structured Fact Extraction and Feedback
Kunwar Zaid, Amit Sangroya, Lovekesh Vig

Shared Task Posters

MNLP at PerAnsSumm: A Classifier-Refiner Architecture for Improving the Classification of Consumer Health User Responses
Jooyeon Lee, Luan Huy Pham, Özlem Uzuner

Team_UMB at PerAnsSumm 2025: Enhancing Perspective-Aware Summarization with Prompt Optimization and Supervised Fine-Tuning
Kristin Qi, Youxiang Zhu, Xiaohui Liang

Medifact at PerAnsSumm 2025: Leveraging Lightweight Models for Perspective-Specific Summarization of Clinical Q&A Forums
Nadia Saeed

Roux-lette @ PerAnsSumm Shared Task
Anson Antony¹, Peter Vickers², Suzanne Wendelken³

Abdelmalak at PerAnsSumm 2025: Leveraging a Domain-Specific BERT and LLaMA for Perspective-Aware Healthcare Answer Summarization
Abanoub Medhat Saeed Abdelmalak

AICOE at PerAnsSumm 2025: An Ensemble of Large Language Models for Perspective-Aware Healthcare Answer Summarization
Rakshith R¹, Mohammed Sameer Khan¹, Ankush Chopra²

LTRC-IIITH at PerAnsSumm 2025: SpanSense - Perspective-specific span identification and Summarization
Sushvin Marimuthu¹ and Parameswari Krishnamurthy²

DataHacks at PerAnsSumm 2025: LoRA-Driven Prompt Engineering for Perspective Aware Span Identification and Summarization
Vansh Nawander and Chaithra Reddy Nerella

LMU at PerAnsSumm 2025: LlaMA-in-the-loop at Perspective-Aware Healthcare Answer Summarization Task 2.2 Factuality
Tanalp Ağustoslu

The Manchester Bees at PerAnsSumm 2025: Iterative Self-Prompting with Claude and o1 for Perspective-aware Healthcare Answer Summarisation
Pablo Romero¹, Libo Ren², Lifeng Han³, Goran Nenadic⁴

17:30 - 17:45

Closing remarks

Submissions

Two types of submissions are invited: full papers and short papers.

Full papers should not exceed eight (8) pages of text, plus unlimited references. These are intended to be reports of original research.
Short papers may consist of up to four (4) pages of content, plus unlimited references. Appropriate short paper topics include preliminary results, application notes, descriptions of work in progress, etc.

Electronic Submission: Submissions must be electronic and in PDF format, using the Softconf START conference management system. Submissions must be anonymous.

The papers should follow ACL formatting.
ACL provides style files for LaTeX and Microsoft Word at https://github.com/acl-org/acl-style-files.
ACL pubcheck tool (https://github.com/acl-org/aclpubcheck) is available to check for common formatting problems.
ACL official Overleaf template (https://www.overleaf.com/latex/templates/association-for-computational-linguistics-acl-conference/jvxskxpnznfj) has a [review] setting that must be on for the submission. Please do not forget to turn it off for the final submission. The optional limitations and ethical considerations sections, references, and appendices should be included in the pdf for the paper (not counting towards the page limit), and not be submitted as a separate PDF.

The submission site is: https://softconf.com/naacl2025/cl4health2025

Dual submission policy: papers may NOT be submitted to the workshop if they are or will be concurrently submitted to another meeting or publication.

Scope

CL4Health fills the gap among the different biomedical language processing workshops by providing a general venue for a broad spectrum of patient-oriented language processing research. The second workshop on patient-oriented language processing follows the successful inaugural CL4Health workshop (collocated with LREC-COLING 2024), which clearly demonstrated the need for a computational linguistics venue that focuses on language related to health of the public.

CL4Health is concerned with the resources, computational approaches, and behavioral and socio-economic aspects of the public interactions with digital resources in search of health-related information that satisfies their information needs and guides their actions. The workshop invites papers concerning all areas of language processing focused on patients' health and health-related issues concerning the public. The issues include, but are not limited to accessibility and trustworthiness of health information provided to the public; explainable and evidence-supported answers to consumer-health questions; accurate summarization of patients' health records at their health-literacy level; understanding patients' non-informational needs through their language, and accurate and accessible interpretations of biomedical research. The topics of interest for the workshop include but are not limited to the following:

Health-related information needs and online behaviors of the public;
Quality assurance and ethics considerations in language technologies and approaches applied to text and other modalities for public consumption;
Summarization of data from electronic health records for patients;
Detection of misinformation in consumer health-related resources and mitigation of potential harms;
Consumer health question answering (Community Question Answering)(CQA);
Biomedical text simplification/adaptation;
Dialogue systems to support patients' interactions with clinicians, healthcare systems, and online resources;
Linguistic resources, data and tools for language technologies focusing on consumer health;
Infrastructures and pre-trained language models for consumer health;

Shared Task

Perspective-aware Healthcare Answer Summarization (PerAnsSumm) will be co-located with the workshop. See details at https://peranssumm.github.io/.
In community/consumer health question answering, several aspects, such as question understanding and answer generation, have been studied for over a decade. A new and important question posed by this task is the different perspectives provided in the answers to questions posted to online forums. The responses to the questions offer different answer perspectives, e.g., personal experiences, factual information, and suggestions. Traditionally, the CQA answer summarization task has focused on a single best-voted answer as a reference summary. A single answer does not capture all the perspectives. Providing the answers in structured, perspective-specific summaries could better serve the information needs of end users. To address these gaps, this challenge introduces a novel perspective-specific answer summarization task within a CQA setup. The task will use the Perspective-aware healthcare Answer SuMmarizAtion (PUMA) dataset, a corpus of medical question-answer pairs created by the task organizers. The PUMA dataset consists of 3,167 CQA threads with approximately 10K answers filtered from the Yahoo! L6 corpus. Each answer in PUMA is annotated with five perspective spans: ‘cause’, ‘suggestion’, ‘experience’, ‘question’, and ‘information’.

Meeting

The workshop will be hybrid. Virtual attendees must be registered for the workshop to access the online environment.

Accepted papers will be presented as posters or oral presentations based on the reviewers’ recommendations.

Program Committee

Mathilde Aguiar, Université Paris-Saclay, CNRS, France
Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK
Zainab Awan, UTS, Australia
Leonor Barreiros, University of Lisbon, Portugal
Abby Blocker, University of Cape Town, South Africa
Leonardo Campillos-Llanos, Spanish National Research Council, Spain
Amin Dada, Institute for AI in Medicine (IKIM), University Hospital Essen, Germany
Luise Dürlich, Uppsala University, Sweden
Manas Gaur, University of Maryland, Baltimore County, USA
Sagar Goyal, DeepScribe Inc., USA
Natalia Grabar, Université de Lille, France
Tudor Groza, SingHealth Duke-NUS Institute of Precision Medicine, Singapore
Deepak Gupta, National Library of Medicine, USA
Ramakanth Kavuluru, University of Kentucky, USA
Oren Kobo, Intel, Israel
Vojtech Lanz, Charles University, Czech Republic
Chuyuan Li, The University of British Columbia, Canada
Zhiwei Liu, University of Manchester, UK
Simon Meoni, Arkhn, France
Shufan Ming, University of Illinois Urbana-Champaign, USA
Jose G. Moreno, Paul Sabatier University - IRIT, France
Aurélie Névéol, Université de Paris-Saclay, CNRS, LISN, Orsay, France
Brian Ondov, Yale University, USA
Tabea Pakull, Institute for Transfusion Medicine, University Hospital Essen, Germany
Mehdi Parviz, University of Copenhagen, Denmark
Anthony Rios, University of Texas at San Antonio, USA
Miguel Rocha, University of Minho, Portugal
Roland Roller, German Research Center for Artificial Intelligence, DFKI, Germany
Abeed Sarker, Emory School of Medicine, USA
Sarvesh Soni, National Library of Medicine, USA
Paul Thompson, National Centre for Text Mining and University of Manchester, UK
Grigorios Tsoumakas, Aristotle University of Thessaloniki, Greece
Aswathy Velutharambath, University of Stuttgart, Germany
Jinghua Xu, Heidelberg University, Germany
Zhicheng Yang, PAII Inc., USA
Dong Yuan, Google, USA
Tianlin Zhang, The University of Manchester, UK
Pierre Zweigenbaum, Université de Paris-Saclay, CNRS, LISN, Orsay, France

Organizers

Dina Demner-Fushman, US National Library of Medicine
Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK
Paul Thompson, National Centre for Text Mining and University of Manchester, UK
Deepak Gupta, US National Library of Medicine