Augmenting and Automating Corpus Enrichment

Felix Kuhr, Tanya Braun, Magnus Bender, Ralf Möller

Abstract

An agent in pursuit of a task may work with a reference library containing documents associated with additional data that provide location-specific explanations about the content. Faced with a new document, an agent has to decide whether to include the new document in its reference library. Basing the decision on words, topics, or entities has shown not to lead to a balanced performance for varying documents. In this paper, we present an approach for automatically enriching new documents with data associated to documents in a reference library. Additionally, we analyze these data to classify new documents into categories to help an agent in deciding whether to include the new document in its reference library.

Original languageEnglish
JournalInternational Journal of Semantic Computing
Volume14
Issue number2
Pages (from-to)173-197
Number of pages25
ISSN1793-351X
DOIs
Publication statusPublished - 01.06.2020

Research Areas and Centers

  • Centers: Center for Artificial Intelligence Luebeck (ZKIL)
  • Research Area: Intelligent Systems

DFG Research Classification Scheme

  • 409-06 Information Systems, Process and Knowledge Management

Cite this