Login | Register

Ideas Challenge 2020/2021 : WikiData Integration with Repository Contents

Title:

Ideas Challenge 2020/2021 : WikiData Integration with Repository Contents

Neugebauer, Tomasz ORCID: https://orcid.org/0000-0002-9743-5910, Kuchma, Iryna, Selematsela, Daisy, Biziwe, Tembe, Zimmer, Niklas, Zodwa, Thomas, Havemann, Jo, Staines, Heather, Berrizbeitia, Francisco ORCID: https://orcid.org/0000-0002-1542-8435, Stacey, Phil, Shearer, Kathleen, Venema, Victor and Tykhonov, Slava (2021) Ideas Challenge 2020/2021 : WikiData Integration with Repository Contents. In: International Conference on Open Repositories, June 7th-10th, 2021, Virtual Conference.

[thumbnail of IdeasChallenge-OR2021-Opening-Presentation.pptx]
Slideshow (application/vnd.openxmlformats-officedocument.presentationml.presentation)
IdeasChallenge-OR2021-Opening-Presentation.pptx - Published Version
Available under License Spectrum Terms of Access.
9MB

Official URL: https://zenodo.org/record/4914112

Abstract

Presentation of the work of an Ideas Challenge Team from Open Repositories Conference 2020. The challenge presented is that of WikiData integration with repositories as a way of improving multilingual access to repository contents. Multilingual indexing by search engines and aggregators, and the overall importance of linguistic diversity in scholarly publishing and access is discussed. The results presented include a detailed overview of various metadata standards relevant for representing multilingual WikiData concepts in repositories: HTML5, Dublin Core, DataCite, JATS XML, Schema.org. Two scripts that were written in Python for enriching Repository Metadata with WikiData Concepts and their use on EPrints JSON-LD metadata and a test dataset of publications in information visualization is presented. These scripts use DBPedia Spotlight API to annotate scholarly metadata with DBPedia concepts, and these in turn are used to extract translated labels from WikiData. A resource list of relevant projects is included, as well as some additional examples and notes.

Divisions:Concordia University > Library
Item Type:Conference or Workshop Item (Other)
Refereed:No
Authors:Neugebauer, Tomasz and Kuchma, Iryna and Selematsela, Daisy and Biziwe, Tembe and Zimmer, Niklas and Zodwa, Thomas and Havemann, Jo and Staines, Heather and Berrizbeitia, Francisco and Stacey, Phil and Shearer, Kathleen and Venema, Victor and Tykhonov, Slava
Date:8 June 2021
Digital Object Identifier (DOI):10.5281/ZENODO.4914112
Keywords:linguistic diversity, wikidata, metadata, linked open data, OR2021, HTML5, Dublin Core, DataCite, JATS XML, Schema.org, OpenAIRE, DBPedia, DBPedia Spotlight
ID Code:990887
Deposited By: Tomasz Neugebauer
Deposited On:24 Aug 2022 17:48
Last Modified:21 Oct 2022 20:41
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top