Description |
1 online resource. |
Physical Medium |
polychrome |
Description |
text file |
Series |
Studies on the semantic web ; volume 043
|
|
Studies on the Semantic Web ; v. 043.
|
Contents |
Intro -- Title Page -- Contents -- Acronyms -- 1 Introduction -- 1.1 Background: Identity in the digital era -- 1.2 Challenge: Entity Linking in the long tail -- 1.3 Research questions -- 1.4 Approach and structure of the thesis -- 1.4.1 Describing and observing the head and the tail -- 1.4.2 Analyzing the evaluation bias on the long tail -- 1.4.3 Improving the evaluation bias on the long tail -- 1.4.4 Enabling access to knowledge about long-tail entities beyond DBpedia -- 1.4.5 The role of knowledge in establishing identity of long-tail entities -- 1.5 Summary of findings |
|
1.6 Software and data -- 2 Describing and Observing the Head and the Tail of Entity Linking -- 2.1 Introduction -- 2.2 Related work -- 2.3 Approach -- 2.3.1 The head-tail phenomena of the entity linking task -- 2.3.2 Hypotheses on the head-tail phenomena of the entity linking task -- 2.3.3 Datasets and systems -- 2.3.4 Evaluation -- 2.4 Analysis of data properties -- 2.4.1 Frequency distribution of forms and instances in datasets -- 2.4.2 PageRank distribution of instances in datasets -- 2.4.3 Ambiguity distribution of forms -- 2.4.4 Variance distribution of instances |
|
2.4.5 Interaction between frequency, PageRank, and ambiguity/variance -- 2.4.6 Frequency distribution for a single form or an instance -- 2.5 Analysis of system performance and data properties -- 2.5.1 Correlating system performance with form ambiguity -- 2.5.2 Correlating system performance with form frequency, instance frequency, and PageRank -- 2.5.3 Correlating system performance with ambiguity and frequency of forms jointly -- 2.5.4 Correlating system performance with frequency of instances for ambiguous forms -- 2.6 Summary of findings -- 2.7 Recommended actions -- 2.8 Conclusions |
|
3 Analyzing the Evaluation bias on the Long Tail of Disambiguation & Reference -- 3.1 Introduction -- 3.2 Temporal aspect of the disambiguation task -- 3.3 Related work -- 3.4 Preliminary study of EL evaluation datasets -- 3.4.1 Datasets -- 3.4.2 Dataset characteristics -- 3.4.3 Distributions of instances and surface forms -- 3.4.4 Discussion and roadmap -- 3.5 Semiotic generation and context model -- 3.6 Methodology -- 3.6.1 Metrics -- 3.6.2 Tasks -- 3.6.3 Datasets -- 3.7 Analysis -- 3.8 Proposal for improving evaluation -- 3.9 Conclusions |
|
4 Improving the Evaluation bias on the Long Tail of Disambiguation & Reference -- 4.1 Introduction -- 4.2 Motivation & target communities -- 4.2.1 Disambiguation & reference -- 4.2.2 Reading Comprehension & Question Answering -- 4.2.3 Moving away from semantic overfitting -- 4.3 Task requirements -- 4.4 Methods for creating an event-based task -- 4.4.1 State of text-to-data datasets -- 4.4.2 From data to text -- 4.5 Data & resources -- 4.5.1 Structured data -- 4.5.2 Example document -- 4.5.3 Licensing & availability -- 4.6 Task design -- 4.6.1 Subtasks -- 4.6.2 Question template |
Bibliography |
Includes bibliographical references. |
Local Note |
eBooks on EBSCOhost EBSCO eBook Subscription Academic Collection - North America |
Subject |
Natural language processing (Computer science)
|
|
Natural language processing (Computer science) |
Genre/Form |
Electronic books.
|
|
Electronic books.
|
Other Form: |
Original 1643680420 9781643680422 (OCoLC)1129988118 |
ISBN |
9781643680439 (electronic book) |
|
1643680439 (electronic book) |
|
9781643680422 |
|
1643680420 |
|