Automated Biomedical Text Fragmentation In Support Of Biomedical Sentence Fragment Classification
Abstract
The past decade has seen a tremendous growth in the amount of biomedical literature, specifically in the area of bioinformatics. As a result, biomedical text categorization has become a central task for providing researchers with literature appropriate for their specific information needs.
Pan et al. have explored a method that automatically identifies information-bearing sentence fragments within scientific text. Their proposed method aims to automatically classify sentence fragments into certain sets of categories defined to satisfy specific types of information needs. The categories are grouped into five different dimensions known as Focus, Polarity, Certainty, Evidence, and Trend. The reason that fragments are used as the unit of classification is that the class value along each of these dimensions can change mid-sentence.
In order to automatically annotate sentence fragments along the five dimensions, automatically breaking sentences into fragments is a necessary step. The performance of the classifier depends on the sentence fragments. In this study, we investigate the problem of automatic fragmentation of biomedical sentences, which is a fundamental layer in the multi-dimensional fragment classification. In addition, we believe that our proposed fragmentation algorithm can be used in other domains such as sentiment analysis. The goal of sentiment analysis is often to classify the polarity (positive or negative) of a given text. Sentiment classification can be conducted at different levels such as document, sentence, or phrase (fragment) level. Our proposed fragmentation algorithm can be used as a prerequisite for phrase-level sentiment categorization which aims to automatically capture multiple sentiments within a sentence.
URI for this record
http://hdl.handle.net/1974/5251Request an alternative format
If you require this document in an alternate, accessible format, please contact the Queen's Adaptive Technology CentreRelated items
Showing items related by title, author, creator and subject.
-
Elucidating the interaction between the Fragment 2 domain of Prothrombin and Factor Va
Berridge, Joanne (2012-08-03)The prothrombinase (IIase) complex is an essential component of the coagulation cascade and is composed of a serine protease, Factor Xa (FXa), its non-enzymatic cofactor, Factor Va (FVa), calcium and a phospholipid membrane ... -
Overcoming Fragmentation? Labour-Community Alliances and The Complexity of Movement Building in Cape Town
Murray, Adrian (2013-08-21)This thesis explores processes of social movement organizing in response to the neoliberal restructuring of public services in South Africa. Through a case study of an alliance of municipal workers and community activists ... -
Comparative landscape genetics of two sympatric snake species in a fragmented southwestern Ontario habitat
DiLeo, Michelle (2009-04-27)In this study I investigate the effects of a fragmented southwestern Ontario landscape on the genetic population structure of two sympatric snake species that differ in habitat preference. I was most interested in comparing ...