Neugebauer, Tomasz ORCID: https://orcid.org/0000-0002-9743-5910, Kuchma, Iryna, Selematsela, Daisy, Biziwe, Tembe, Zimmer, Niklas, Zodwa, Thomas, Havemann, Jo, Staines, Heather, Berrizbeitia, Francisco ORCID: https://orcid.org/0000-0002-1542-8435, Stacey, Phil, Shearer, Kathleen, Venema, Victor and Tykhonov, Slava (2021) Ideas Challenge 2020/2021 : WikiData Integration with Repository Contents. In: International Conference on Open Repositories, June 7th-10th, 2021, Virtual Conference.
Slideshow (application/vnd.openxmlformats-officedocument.presentationml.presentation)
9MBIdeasChallenge-OR2021-Opening-Presentation.pptx - Published Version Available under License Spectrum Terms of Access. |
Official URL: https://zenodo.org/record/4914112
Abstract
Presentation of the work of an Ideas Challenge Team from Open Repositories Conference 2020. The challenge presented is that of WikiData integration with repositories as a way of improving multilingual access to repository contents. Multilingual indexing by search engines and aggregators, and the overall importance of linguistic diversity in scholarly publishing and access is discussed. The results presented include a detailed overview of various metadata standards relevant for representing multilingual WikiData concepts in repositories: HTML5, Dublin Core, DataCite, JATS XML, Schema.org. Two scripts that were written in Python for enriching Repository Metadata with WikiData Concepts and their use on EPrints JSON-LD metadata and a test dataset of publications in information visualization is presented. These scripts use DBPedia Spotlight API to annotate scholarly metadata with DBPedia concepts, and these in turn are used to extract translated labels from WikiData. A resource list of relevant projects is included, as well as some additional examples and notes.
Divisions: | Concordia University > Library |
---|---|
Item Type: | Conference or Workshop Item (Other) |
Refereed: | No |
Authors: | Neugebauer, Tomasz and Kuchma, Iryna and Selematsela, Daisy and Biziwe, Tembe and Zimmer, Niklas and Zodwa, Thomas and Havemann, Jo and Staines, Heather and Berrizbeitia, Francisco and Stacey, Phil and Shearer, Kathleen and Venema, Victor and Tykhonov, Slava |
Date: | 8 June 2021 |
Digital Object Identifier (DOI): | 10.5281/ZENODO.4914112 |
Keywords: | linguistic diversity, wikidata, metadata, linked open data, OR2021, HTML5, Dublin Core, DataCite, JATS XML, Schema.org, OpenAIRE, DBPedia, DBPedia Spotlight |
ID Code: | 990887 |
Deposited By: | Tomasz Neugebauer |
Deposited On: | 24 Aug 2022 17:48 |
Last Modified: | 21 Oct 2022 20:41 |
Repository Staff Only: item control page