Skip to content

Latest commit

 

History

History
7 lines (6 loc) · 821 Bytes

README.md

File metadata and controls

7 lines (6 loc) · 821 Bytes

OpenML-Paper-Impact-Analysis

This repository contains the dataset and code used to analyze the impact of OpenML. The results are included in OpenML cells paper. The analysis focuses on research papers citing the core OpenML paper, Python and R connectors, and benchmarking suite papers.

Contents

  1. Data: data/collected_papers.csv: Contains the originally collected data on 1719 papers from Google Scholar. data/Final_survey_data.csv: The cleaned dataset (after filtering papers based on availability, language, and other criteria) with review results.
  2. Code: scripts/analysis.py: Python scripts used to clean the data, run statistical analyses, and generate figures/tables for the paper.
  3. Documentation: docs/methodology.md Details of the review methodology and questionnaire used for the analysis.