TextAnnotator

About
In different disciplines, scholars are supported in their research by the use of digital methods to process increasingly large amounts of data. For the necessary annotation, tools are required which should meet at least the following general requirements: they can handle diverse data and annotation levels within one tool, and they support the annotation process with automatic (pre-)processing outcomes as much as possible. We developed a framework that meets these general requirements and that enables versatile and browser-based annotations of texts, the TextAnnotator. It combines NLP methods of pre-processing with methods of flexible post-processing. Infact, machine learning (ML) requires a lot of training and test data, but is usually far from achieving perfect results. Producing high-level annotations for ML and post-correcting its results are therefore necessary. This is the purpose of TextAnnotator, which is entirely implemented in ExtJS and provides a range of interactive visualizations of annotations. In addition, it allows for flexibly integrating knowledge resources, e.g. in the course of post-processing named entity recognition. The paper describes TextAnnotator‘s architecture together with different use cases: annotating temporal structures, argument structures, propositional structures, rhetorical structures and named entity linking.

  • [PDF] G. Abrami, A. Mehler, P. Helfrich, and E. Rieb, “TextAnnotator: A Browser-based Framework for Annotating Textual Data in Digital Humanities,” in Proceedings of the Digital Humanities Austria 2018, 2018.
    [Bibtex]
    @InProceedings{Abrami:et:al:2018,
    Author = {Giuseppe Abrami and Alexander Mehler and Philipp Helfrich and Elias Rieb},
    Title = {{TextAnnotator}: A Browser-based Framework for Annotating Textual Data in Digital Humanities},
    BookTitle = {Proceedings of the Digital Humanities Austria 2018},
    pdf = {https://www.texttechnologylab.org/wp-content/uploads/2019/04/TA__A_Browser_based_Framework_for_Annotating_Textual_Data_in_Digital_Humanities.pdf},
    location = {Salzburg, Austria},
    year = 2018
    }
  • [PDF] G. Abrami, A. Mehler, A. Lücking, E. Rieb, and P. Helfrich, “TextAnnotator: A flexible framework for semantic annotations,” in Proceedings of the Fifteenth Joint ACL – ISO Workshop on Interoperable Semantic Annotation, (ISA-15), 2019.
    [Bibtex]
    @InProceedings{Abrami:et:al:2019,
      Author         = {Abrami, Giuseppe and Mehler, Alexander and Lücking, Andy and Rieb, Elias and Helfrich, Philipp},
      Title          = {{TextAnnotator}: A flexible framework for semantic annotations},
      BookTitle      = {Proceedings of the Fifteenth Joint ACL - ISO Workshop on Interoperable Semantic Annotation, (ISA-15)},
      Series         = {ISA-15},
      location       = {Gothenburg, Sweden},
      month     = {May},
      pdf      = {https://www.texttechnologylab.org/wp-content/uploads/2019/04/TextAnnotator_IWCS_Göteborg.pdf},
      year           = 2019,
      abstract   ="Modern annotation tools should meet at least the following general requirements: they can handle diverse data and annotation levels within one tool, and they support the annotation process with automatic (pre-)processing outcomes as much as possible. We developed a framework that meets these general requirements and that enables versatile and browser-based annotations of texts, the TextAnnotator. It combines NLP methods of pre-processing with methods of flexible post-processing. Infact, machine learning (ML) requires a lot of training and test data, but is usually far from achieving perfect results. Producing high-level annotations for ML and post-correcting its results are therefore necessary. This is the purpose of TextAnnotator, which is entirely implemented in ExtJS and provides a range of interactive visualizations of annotations. In addition, it allows for flexibly integrating knowledge resources, e.g. in the course of post-processing named entity recognition. The paper describes TextAnnotator’s architecture together with three use cases: annotating temporal structures, argument structures and named entity linking."
    }
  • [PDF] P. Helfrich, E. Rieb, G. Abrami, A. Lücking, and A. Mehler, “TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations,” in Proceedings of the 11th edition of the Language Resources and Evaluation Conference, May 7 – 12, Miyazaki, Japan, 2018.
    [Bibtex]
    @InProceedings{Helfrich:et:al:2018,
      Author         = {Philipp Helfrich and Elias Rieb and Giuseppe Abrami
                       and Andy L{\"u}cking and Alexander Mehler},
      Title          = {TreeAnnotator: Versatile Visual Annotation of
                       Hierarchical Text Relations},
      BookTitle      = {Proceedings of the 11th edition of the Language
                       Resources and Evaluation Conference, May 7 - 12},
      Series         = {LREC 2018},
      Address        = {Miyazaki, Japan},
      pdf            = {https://www.texttechnologylab.org/wp-content/uploads/2018/03/TreeAnnotator.pdf},
      year           = 2018
    }