PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Esuli A., Marcheggiani D., Sebastiani F. An enhanced CRFs-based system for information extraction from radiology reports. In: Journal of Biomedical Informatics, vol. 46 (3) pp. 425 - 435. Elsevier, 2013.
 
 
Abstract
(English)
We discuss the problem of performing information extraction from free-text radiology reports via supervised learning. In this task, segments of text (not necessarily coinciding with entire sentences, and possibly crossing sentence boundaries) need to be annotated with tags representing concepts of interest in the radiological domain. In this paper we present two novel approaches to IE for radiology reports: (i) a cascaded, two-stage method based on pipelining two taggers generated via the well known linear-chain conditional random fields (LC-CRFs) learner and (ii) a confidence-weighted ensemble method that combines standard LC-CRFs and the proposed two-stage method. We also report on the use of "positional features", a novel type of feature intended to aid in the automatic annotation of texts in which the instances of a given concept may be hypothesized to systematically occur in specific areas of the text. We present experiments on a dataset of mammography reports in which the proposed ensemble is shown to outperform a traditional, single-stage CRFs system in two different, applicatively interesting scenarios.
URL: http://www.sciencedirect.com/science/article/pii/S1532046413000191
DOI: http://dx.doi.org/10.1016/j.jbi.2013.01.006
Subject Information extraction
Clinical text
Conditional random fields
I.2.6 Learning


Icona documento 1) Download Document PDF


Icona documento Open access Icona documento Restricted Icona documento Private

 


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional