Journal Home
Search for

Volume 78, Issue 12, Pages e7-e12 (December 2009)


View previous. 10 of 22 View next.

Towards automated processing of clinical Finnish: Sublanguage analysis and a rule-based parser

Veronika LaippalaacCorresponding Author Informationemail address, Filip Gintera, Sampo Pyysalob, Tapio Salakoskiab

Received 31 October 2008; received in revised form 28 January 2009; accepted 10 February 2009. published online 20 March 2009.

Abstract 

Introduction

In this paper, we present steps taken towards more efficient automated processing of clinical Finnish, focusing on daily nursing notes in a Finnish Intensive Care Unit (ICU). First, we analyze ICU Finnish as a sublanguage, identifying its specific features facilitating, for example, the development of a specialized syntactic analyser. The identified features include frequent omission of finite verbs, limitations in allowed syntactic structures, and domain-specific vocabulary. Second, we develop a formal grammar and a parser for ICU Finnish, thus providing better tools for the development of further applications in the clinical domain.

Methods

The grammar is implemented in the LKB system in a typed feature structure formalism. The lexicon is automatically generated based on the output of the FinTWOL morphological analyzer adapted to the clinical domain. As an additional experiment, we study the effect of using Finnish constraint grammar to reduce the size of the lexicon. The parser construction thus makes efficient use of existing resources for Finnish.

Results

The grammar currently covers 76.6% of ICU Finnish sentences, producing highly accurate best-parse analyzes with F-score of 91.1%. We find that building a parser for the highly specialized domain sublanguage is not only feasible, but also surprisingly efficient, given an existing morphological analyzer with broad vocabulary coverage. The resulting parser enables a deeper analysis of the text than was previously possible.

a Department of Information Technology,University of Turku, Joukahaisenkatu 3-5, 20520 Turku, Finland

b Turku Centre for Computer Science (TUCS), University of Turku, Joukahaisenkatu 3-5, 20520 Turku, Finland

c Department of French Studies, University of Turku, Henrikink. 2, 20014 Turku, Finland

Corresponding Author InformationCorresponding author at: Department of French Studies, University of Turku, Henrikink. 2, 20014 Turku, Finland. Tel.: +358 407782814.

PII: S1386-5056(09)00020-3

doi:10.1016/j.ijmedinf.2009.02.005


View previous. 10 of 22 View next.