Augmenting and Automating Corpus Enrichment

Felix Kuhr, Tanya Braun, Magnus Bender, Ralf Möller

Abstract

An agent in pursuit of a task may work with a reference library containing documents associated with additional data that provide location-specific explanations about the content. Faced with a new document, an agent has to decide whether to include the new document in its reference library. Basing the decision on words, topics, or entities has shown not to lead to a balanced performance for varying documents. In this paper, we present an approach for automatically enriching new documents with data associated to documents in a reference library. Additionally, we analyze these data to classify new documents into categories to help an agent in deciding whether to include the new document in its reference library.

Original languageEnglish
JournalInternational Journal of Semantic Computing
Volume14
Issue number2
Pages (from-to)173-197
Number of pages25
ISSN1793-351X
DOIs
Publication statusPublished - 01.06.2020

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 4 - Quality Education
    SDG 4 Quality Education
  3. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure
  4. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities
  5. SDG 12 - Responsible Consumption and Production
    SDG 12 Responsible Consumption and Production
  6. SDG 14 - Life Below Water
    SDG 14 Life Below Water
  7. SDG 15 - Life on Land
    SDG 15 Life on Land

Research Areas and Centers

  • Centers: Center for Artificial Intelligence Luebeck (ZKIL)
  • Research Area: Intelligent Systems

DFG Research Classification Scheme

  • 409-06 Information Systems, Process and Knowledge Management

Fingerprint

Dive into the research topics of 'Augmenting and Automating Corpus Enrichment'. Together they form a unique fingerprint.

Cite this