-
Notifications
You must be signed in to change notification settings - Fork 0
Metadata extraction
Kevina-Zeni edited this page May 13, 2022
·
5 revisions
We host research manuscripts on currently six (6) partner repositories, namely Open Science Framework (OSF), Zenodo, ScienceOpen, PubPub, Qeios, Figshare. Here is how we extract the data:
AfricArXiv submissions on OSF: osf.io/preprints/africarxiv/
- …
- …
AfricArXiv submissions on Zenodo: zenodo.org/communities/africarxiv/
This workflow extracts metadata from our AfricArXiv.org Zenodo community: https://zenodo.org/communities/africarxiv/ for the joint project with Masakhane.io called Decolonise Science: https://www.masakhane.io/ongoing-projects/masakhane-mt-decolonise-science.
- navigate to the Google Colab notebook and run the three code blocks (hover over and hit the 'play' button)
- click the files tab (folder icon on the left-hand side)
- download the 'decolsci_zenodo-extract_...' CSV file by clicking the 'three-dot' icon on the right and selecting 'Download'
- download the decolsci_zenodo-stats.sh shell script to a linux machine or virtual environment
- run the script:
bash zenodo-community-stats.sh
- upload the generated .csv file to the GitHub repository
- for JSON responses to parse: https://developers.zenodo.org/#representation
- to understand the metadata fields: https://help.zenodo.org/guides/search
- original bash script: https://gist.github.com/slint/eb4bcb8bc572a37b9650b8c55e759fc9
- Colab entry https://github.com/AfricArxiv/decolonise-science/blob/master/AfricArxiv_stats_on_Zenodo.ipynb
AfricArXiv submissions on ScienceOpen: https://www.scienceopen.com/collection/africarxiv