Ontology-based Information Extraction (OBIE) Task

From GM-RKB
Jump to navigation Jump to search

An Ontology-based Information Extraction (OBIE) Task is an information extraction from text task that requires an ontology.



References

2024

  • Perplexity
    • Ontology-based Information Extraction (OBIE) is a process of automatically extracting information from unstructured or semi-structured text sources, guided by an ontology that formally represents the domain knowledge.[1][2][3]
      1. **Ontology-Driven**: The information extraction process is guided by an ontology, which defines the concepts, relationships, and constraints within a specific domain. The ontology serves as a knowledge base, providing a structured representation of the domain.[1][2]
      2. **Semantic Annotation**: OBIE systems aim to semantically annotate text by identifying instances of concepts, relations, and properties defined in the ontology. This involves mapping textual mentions to their corresponding ontological entities.[1][3]
      3. **Knowledge Base Population**: The extracted information is typically used to populate a knowledge base or ontology with instances and relationships derived from the text.[1][2]
      4. **Domain-Specific**: OBIE systems are often tailored to specific domains, as the ontologies used are designed to capture the relevant concepts and relationships within that domain.[2][4]
      5. **Hybrid Approaches**: OBIE systems may employ a combination of techniques, such as rule-based methods, machine learning, and linguistic analysis, to leverage the ontological knowledge and extract relevant information from text.[1][3]
    • Here are three examples of Ontology-based Information Extraction (OBIE) tasks:
      1. Biomedical Domain
        • **Task**: Extract information about genes, proteins, diseases, and their relationships from scientific literature and research papers.
        • **Ontology**: Gene Ontology (GO), Unified Medical Language System (UMLS), or domain-specific ontologies developed for biomedical research.
        • **Example**: Given a research paper on cancer, an OBIE system could identify mentions of genes (e.g., BRCA1, TP53), proteins (e.g., p53), diseases (e.g., breast cancer, lung cancer), and extract relationships such as "BRCA1 gene is associated with breast cancer" or "p53 protein regulates cell cycle."
      2. 2. Business Intelligence
        • **Task**: Extract information about companies, products, financial data, and market trends from news articles, reports, and websites.
        • **Ontology**: Domain ontologies for business entities, products, financial concepts, and market sectors.
        • **Example**: Given a news article about a company's quarterly earnings report, an OBIE system could identify the company name, financial figures (revenue, profit, earnings per share), and extract relationships such as "Company X reported a revenue of $Y billion" or "Product Z contributed to X% of total sales."
      3. Legal Domain
        • **Task**: Extract information about legal entities, regulations, court cases, and their relationships from legal documents and case law.
        • **Ontology**: Legal ontologies covering concepts like legal entities, jurisdictions, laws, court cases, and legal relationships.
        • **Example**: Given a court case document, an OBIE system could identify the parties involved (e.g., plaintiff, defendant), legal concepts (e.g., contract breach, intellectual property), and extract relationships such as "Company A sued Company B for patent infringement" or "The court ruled in favor of the defendant."
    • Citations:
[1] https://citeseerx.ist.psu.edu/document?doi=f53373c8eba5a75f7f5eb5ba61f986accef6bee9&repid=rep1&type=pdf
[2] https://ceur-ws.org/Vol-2721/paper563.pdf
[3] https://www.researchgate.net/publication/220195792_Ontology-based_information_extraction_An_Introduction_and_a_survey_of_current_approaches
[4] https://www.gabormelli.com/RKB/Ontology-based_Information_Extraction_Task
[5] https://www.semanticscholar.org/paper/Ontology-Based-Information-Extraction-Wimalasuriya/3c5ce0b7e541c8dd53bb5c8ce10fdbab6d85211d

2013

2010a

2010b

2009

2008

2006