2.2 Evaluation measures

Juan Miguel Cejuela; Aleksandar Bojchevski; Carsten Uhlig; Rustem Bekmukhametov; Sanjeev Kumar Karn; Shpend Mahmuti; Ashish Baghudana; Ankit Dubey; Venkata P Satagopam; Burkhard Rost

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

2.2 Evaluation measures

JC Juan Miguel Cejuela

AB Aleksandar Bojchevski

CU Carsten Uhlig

RB Rustem Bekmukhametov

SK Sanjeev Kumar Karn

SM Shpend Mahmuti

AB Ashish Baghudana

AD Ankit Dubey

VS Venkata P Satagopam

BR Burkhard Rost

This method is extracted from research article: Bioinformatics, Feb 2017

nala: text mining natural language mutation mentions

DOI: 10.1093/bioinformatics/btx083

Request a Protocol

Ask a question

Favorite

We considered a named entity as successfully extracted if its text offsets (character positions in a text-string) were correctly identified (tp: true positive). We considered two modes for tp: exact matching (two entities match if their text offsets are identical) and partial matching (text offsets overlap). Any other prediction was considered as a false positive (fp) and any missed entity as a false negative (fn). Partial matching is more suitable to evaluate NL mentions lacking well-defined boundaries. For instance, in finding ‘[changed conserved] glutamine at 115 to proline’, we did not distinguish solutions with and without the words in brackets, because we focused on the extraction of the mention not on that of additional annotations (here ‘conserved’). We computed performance for all cases and for the subclasses (ST, SST and NL). A test entity of subclass X was considered as correctly identified if any predicted entity matched. We then used the standard evaluation measures for named-entity recognition, namely, precision (P: tp/tp + fp), recall (R: tp/tp + fn) and F-Measure (F: 2*(P*R)/(P + R)). Within a corpus, we computed the StdErr by randomly selecting 15% of the test data without replacement in 1000 (n) bootstrap samples. With <x> as the overall performance for the entire test set and x_i for subset i, we computed:

Across corpora, we did not merge documents. Rather, we computed the mean of P, R and F between the considered corpora, and computed the StdErr of the mean without subsampling.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol