Analysing entity context in multilingual Wikipedia to support entity-centric retrieval applications

[thumbnail of WRAP_ikc_entity_search.pdf]
Preview
PDF
WRAP_ikc_entity_search.pdf - Accepted Version - Requires a PDF viewer.

Download (613kB) | Preview

Request Changes to record.

Abstract

Representation of influential entities, such as famous people and multinational corporations, on the Web can vary across languages, reflecting language-specific entity aspects as well as divergent views on these entities in different communities. A systematic analysis of language specific entity contexts can provide a better overview of the existing aspects and support entity-centric retrieval applications over multilingual Web data. An important source of cross-lingual information about influential entities is Wikipedia — an online community-created encyclopaedia — containing more than 280 language editions. In this paper we focus on the extraction and analysis of the language-specific entity contexts from different Wikipedia language editions over multilingual data. We discuss alternative ways such contexts can be built, including graph-based and article-based contexts. Furthermore, we analyse the similarities and the differences in these contexts in a case study including 80 entities and five Wikipedia language editions.

Item Type: Book Item
Subjects: A General Works > AC Collections. Series. Collected works
P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software
Z Bibliography. Library Science. Information Resources > ZA Information resources
Divisions: Faculty of Science, Engineering and Medicine > Science > Computer Science
Library of Congress Subject Headings (LCSH): Wikis (Computer science), Encyclopedias and dictionaries, Multilingual computing, Internet, Celebrities -- Press coverage , Corporations -- Press coverage , Politicians -- Press coverage
Series Name: Lecture Notes in Computer Science
Journal or Publication Title: Semantic Keyword-based Search on Structured Data Sources
Publisher: Springer International Publishing
ISBN: 9783319279312
ISSN: 0302-9743
Book Title: Semantic Keyword-based Search on Structured Data Sources : First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers
Official Date: 7 January 2016
Dates:
Date
Event
7 January 2016
Published
6 July 2015
Accepted
Volume: 9398
Page Range: pp. 197-208
Status: Peer Reviewed
Publication Status: Published
Access rights to Published version: Restricted or Subscription Access
Date of first compliant deposit: 21 April 2016
Date of first compliant Open Access: 3 March 2017
Funder: European Cooperation in the Field of Scientific and Technical Research (Organization) (COST), European Research Council (ERC)
Grant number: Action IC1302 KEYSTONE (COST), ALEXANDRIA ERC 339233 (ERC)
Conference Paper Type: Paper
Title of Event: IKC 2015
Type of Event: Conference
Location of Event: Coimbra, Portugal
Date(s) of Event: 8-9 Sep 2015
Related URLs:
URI: https://wrap.warwick.ac.uk/78613/

Export / Share Citation


Request changes or add full text files to a record

Repository staff actions (login required)

View Item View Item