International Journal of Medical Informatics
Volume 67, Issue 1 , Pages 49-61 , 4 December 2002

Protein names and how to find them

  • Kristofer Franzén

      Affiliations

    • Swedish Institute of Computer Science, Box 1263, SE-164 29 Kista, Sweden
    • Corresponding Author InformationCorresponding author. Tel.: +46-8-633-1537; fax: +46-8-751-7230
  • ,
  • Gunnar Eriksson

      Affiliations

    • Swedish Institute of Computer Science, Box 1263, SE-164 29 Kista, Sweden
  • ,
  • Fredrik Olsson

      Affiliations

    • Swedish Institute of Computer Science, Box 1263, SE-164 29 Kista, Sweden
  • ,
  • Lars Asker

      Affiliations

    • Virtual Genetics Laboratory AB, SE-171 77 Stockholm, Sweden
  • ,
  • Per Lidén

      Affiliations

    • Virtual Genetics Laboratory AB, SE-171 77 Stockholm, Sweden
  • ,
  • Joakim Cöster

      Affiliations

    • Virtual Genetics Laboratory AB, SE-171 77 Stockholm, Sweden

References 

  1. F. Olsson, P. Hansen, K. Franzén, J. Karlgren. Information access and refinement — a research theme, Ercim News 46 (2001).
  2. Grishman R. Information extraction: techniques and challenges. In:  Pazienza MT editors. Information Extraction — a Multidisciplinary Approach to an Emerging Information Technology. Springer; 1997;p. 10–27
  3. Proceedings of the Seventh Message Understanding Conference (MUC-7), Morgan Kaufmann, Virginia USA, April–May 1998.
  4. Proceedings of the Sixth Message Understanding Conference (MUC-6), Morgan Kaufmann, Columbia, MD USA, November 1995.
  5. Proceedings of the Fifth Message Understanding Conference (MUC-5), Morgan Kaufmann, Baltimore, MD, USA, August 1993.
  6. Proceedings of the Fourth Message Understanding Conference (MUC-4), Morgan Kaufmann, June 1992.
  7. Proceedings of the Third Message Understanding Conference (MUC-3), Morgan Kaufmann, May 1991.
  8. A. Borthwick, J. Sterling, E. Agichtein, R. Grishman, Exploiting diverse knowledge sources via maximum entropy in named entity recognition, in: Proceedings of the Sixth Workshop on Very Large Corpora, Montreal, Canada, August 1998.
  9. C. Nobata, N. Collier, J. Tsujii, Automatic term identification and classification in biology texts, in: Proceedings of the Natural Language Pacific Rim Symposium (NLPRS'2000), November 1999, pp. 369–374.
  10. N. Collier, C. Nobata, J. Tsujii, Extracting the name of genes and gene products with a Hidden Markov Model, in: Proceedings of the 18th International Conference on Computational Linguistics (COLING-2000), August 2000, pp. 201–207.
  11. K. Fukuda, T. Tsunoda, A. Tamura, T. Takagi, Toward Information extraction: identifying protein names from biological papers, in: Proceedings of the Pacific Symposium on Biocamputing (PSB'98), Maui, Hawaii, 4–9 January 1998, pp. 705–716.
  12. R. Gaizauskas, K. Humphreys, G. Demetriou, Information extraction from biological science journal articles: enzyme interactions and protein structures, in: M.G. Hicks (Ed.), Proceedings of the Workshop Chemical Data Analysis in the Large: the Challenge of the Automation Age, 2001.
  13. Bairoch A, Apweiler R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucl. Acids Res. 2000;28:45–48
  14. P. Tapanainen, T. Järvinen, A non-projective dependency parser, In: Proceedings of the fifth Conference on Applied Natural Language Processing, Association for Computational Linguistics, Washington DC, April 1997, pp. 64–71.
  15. B. de Bruijn, J. Martin, Protein name tagging, Presented as a poster at the eighth International Conference on Intelligent Systems for Molecular Biology (ISMB'00), 2000.
  16. N. Collier, H.S. Park, N. Ogata, Y. Tateishi, C. Nobata, T. Ohta, T. Sekimizu, H. Imai, K. Ibushi, J. Tsujii, The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers, In: Proceedings of the ninth Conference of the European Chapter of the Association for Computational Linguistics (EACL), June 1999, pp. 271–272.

PII: S1386-5056(02)00052-7

International Journal of Medical Informatics
Volume 67, Issue 1 , Pages 49-61 , 4 December 2002