Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pull] develop from openml:develop #38

Open
wants to merge 126 commits into
base: develop
Choose a base branch
from

Conversation

pull[bot]
Copy link

@pull pull bot commented Feb 21, 2023

See Commits and Changes for more details.


Created by pull[bot]

Can you help keep this open source service alive? 💖 Please sponsor : )

mfeurer and others added 6 commits October 25, 2020 20:00
In #1188 we changed the standard cache file convention from
dataset.pq to dataset_{did}.pq. See also #1188.
Bumps [actions/checkout](https://github.com/actions/checkout) from 2 to 3.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](actions/checkout@v2...v3)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
@pull pull bot added the ⤵️ pull label Feb 21, 2023
mfeurer and others added 23 commits February 22, 2023 17:18
* Update docker actions

* Fix context

* Specify tag for docker container to use strict python version (3.10)

* Load OpenML in Docker file

* load correct image

* load correct image

* Remove loading python again
* Drop upper bound on numpy version

* Update changelog
* Allow unknown task types on the server

* Applied black to openml/tasks/functions.py

* Some more fixes
* Add sklearn marker

* Mark tests that use scikit-learn

* Only run scikit-learn tests multiple times

The generic tests that don't use scikit-learn should only be tested once
(per platform).

* Rename for correct variable

* Add sklearn mark for filesystem test

* Remove quotes around sklearn

* Instead include sklearn in the matrix definition

* Update jobnames

* Add explicit false to jobname

* Remove space

* Add function inside of expression?

* Do string testing instead

* Add missing ${{

* Add explicit true to old sklearn tests

* Add instruction to add pytest marker for sklearn tests
Bumps [actions/setup-python](https://github.com/actions/setup-python) from 2 to 4.
- [Release notes](https://github.com/actions/setup-python/releases)
- [Commits](actions/setup-python@v2...v4)

---
updated-dependencies:
- dependency-name: actions/setup-python
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
It provides a better repr and is less verbose.
…n run.data_content (#1209)

* add test and fix for switch of ground truth and predictions

* undo import optimization

* fix bug with model passing to function

* fix order in other tests

* update progress.rst

* new unit test for run consistency and bug fixed

* clarify new assert

* minor loop refactor

* refactor default to None

* directly test prediction data equal

* Update tests/test_runs/test_run.py

Co-authored-by: Pieter Gijsbers <[email protected]>

* Mark sklearn tests (#1202)

* Add sklearn marker

* Mark tests that use scikit-learn

* Only run scikit-learn tests multiple times

The generic tests that don't use scikit-learn should only be tested once
(per platform).

* Rename for correct variable

* Add sklearn mark for filesystem test

* Remove quotes around sklearn

* Instead include sklearn in the matrix definition

* Update jobnames

* Add explicit false to jobname

* Remove space

* Add function inside of expression?

* Do string testing instead

* Add missing ${{

* Add explicit true to old sklearn tests

* Add instruction to add pytest marker for sklearn tests

* add test and fix for switch of ground truth and predictions

* undo import optimization

* fix mask error resulting from rebase

* make dummy classifier strategy consistent to avoid problems as a result of the random state problems for sklearn < 0.24

---------

Co-authored-by: Pieter Gijsbers <[email protected]>
* Fix documentation building

* Fix numpy version

* Fix two links
* Try Ubunte 20.04 for Python 3.6

* use old ubuntu for python 3.6
Bumps [docker/setup-buildx-action](https://github.com/docker/setup-buildx-action) from 1 to 2.
- [Release notes](https://github.com/docker/setup-buildx-action/releases)
- [Commits](docker/setup-buildx-action@v1...v2)

---
updated-dependencies:
- dependency-name: docker/setup-buildx-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update run.py

* Update run.py

updated description to not contain duplicate information.

* Update run.py
* Refactor if-statements

* Add explicit names to conditional expression

* Add 'dependencies' to better mimic OpenMLFlow
* Install custom numpy version for specific combination of Python3.8 and numpy

* Debug output

* Change syntax

* move to coverage action v3

* Remove test output
* added additional task agnostic local result to print of run

* add PR to progress.rst

* fix comment typo

* Update openml/runs/run.py

Co-authored-by: Matthias Feurer <[email protected]>

* add a function to list available estimation procedures

* refactor print to only work for supported task types and local measures

* add test for pint out and update progress

* added additional task agnostic local result to print of run

* add PR to progress.rst

* fix comment typo

* Update openml/runs/run.py

Co-authored-by: Matthias Feurer <[email protected]>

* add a function to list available estimation procedures

* refactor print to only work for supported task types and local measures

* add test for pint out and update progress

* Fix CI Python 3.6 (#1218)

* Try Ubunte 20.04 for Python 3.6

* use old ubuntu for python 3.6

* Bump docker/setup-buildx-action from 1 to 2 (#1221)

Bumps [docker/setup-buildx-action](https://github.com/docker/setup-buildx-action) from 1 to 2.
- [Release notes](https://github.com/docker/setup-buildx-action/releases)
- [Commits](docker/setup-buildx-action@v1...v2)

---
updated-dependencies:
- dependency-name: docker/setup-buildx-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Update run.py (#1194)

* Update run.py

* Update run.py

updated description to not contain duplicate information.

* Update run.py

* add type hint for new function

* update add description

* Refactor if-statements (#1219)

* Refactor if-statements

* Add explicit names to conditional expression

* Add 'dependencies' to better mimic OpenMLFlow

* Ci python 38 (#1220)

* Install custom numpy version for specific combination of Python3.8 and numpy

* Debug output

* Change syntax

* move to coverage action v3

* Remove test output

* added additional task agnostic local result to print of run

* add PR to progress.rst

* fix comment typo

* Update openml/runs/run.py

Co-authored-by: Matthias Feurer <[email protected]>

* add a function to list available estimation procedures

* refactor print to only work for supported task types and local measures

* add test for pint out and update progress

* added additional task agnostic local result to print of run

* add PR to progress.rst

* add type hint for new function

* update add description

* fix run doc string

---------

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: Matthias Feurer <[email protected]>
Co-authored-by: Matthias Feurer <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Vishal Parmar <[email protected]>
Co-authored-by: Pieter Gijsbers <[email protected]>
* add better error handling for checksum when downloading a file

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update usage of __is_checksum_equal

* Update openml/_api_calls.py

Co-authored-by: Pieter Gijsbers <[email protected]>

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Pieter Gijsbers <[email protected]>
* Correctly only clean up tests/files/

* Log to console for pytest invocation
… Homepage; More documentation for `components` (#1243)

* fix task ID for Iris task

* update comment on homepage

* added additional documentation specific to the `components` parameter.

* add change to progress.rst

* Fix dataframe append being deprecated by replacing it with (backwards-compatible) pd.concat

* fix logging example and add new changes to progress.rst

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix comment too long

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [pre-commit.ci] pre-commit autoupdate

updates:
- [github.com/psf/black: 22.6.0 → 23.3.0](psf/black@22.6.0...23.3.0)
- [github.com/pre-commit/mirrors-mypy: v0.961 → v1.2.0](pre-commit/mirrors-mypy@v0.961...v1.2.0)
- [github.com/pycqa/flake8: 4.0.1 → 6.0.0](PyCQA/flake8@4.0.1...6.0.0)

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix mypy errors: made implicit optional typing to be explicit

* Drop duplicate flake8 config

* Fix a few flake8 issues

* Update python version for pre-commit workflow

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Lennart Purucker <[email protected]>
Co-authored-by: Matthias Feurer <[email protected]>
updates:
- [github.com/pre-commit/mirrors-mypy: v1.2.0 → v1.3.0](pre-commit/mirrors-mypy@v1.2.0...v1.3.0)

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
PGijsbers and others added 30 commits September 29, 2024 14:44
* Remove archive after it is extracted to save disk space

* Leave a marker after removing archive to avoid redownload

* Automatic refresh if expected marker is absent

* Be consistent about syntax use for path construction
* Pass kwargs through task to ```get_dataset```

Allows to follow the directions in the warning ```Starting from Version 0.15 `download_data`, `download_qualities`, and `download_features_meta_data` will all be ``False`` instead of ``True`` by default to enable lazy loading.```

* docs: explain that ```task.get_dataset``` passes kwargs

* Update openml/tasks/task.py

Remove Py3.8+ feature for backwards compatibility

---------

Co-authored-by: Pieter Gijsbers <[email protected]>
* Change defaults for `get_task`

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fix linting errors

* Add missing type annotation

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Expand 0.15.0 changelog with other PRs not yet added

* Bump version number

* Add newer Python versions since we are compatible

* Revert "Add newer Python versions since we are compatible"

This reverts commit 5088c80.

* Add newer compatible versions of Python
* [pre-commit.ci] pre-commit autoupdate

updates:
- [github.com/astral-sh/ruff-pre-commit: v0.1.14 → v0.6.9](astral-sh/ruff-pre-commit@v0.1.14...v0.6.9)
- [github.com/pre-commit/mirrors-mypy: v1.8.0 → v1.11.2](pre-commit/mirrors-mypy@v1.8.0...v1.11.2)
- [github.com/python-jsonschema/check-jsonschema: 0.27.3 → 0.29.3](python-jsonschema/check-jsonschema@0.27.3...0.29.3)
- [github.com/pre-commit/pre-commit-hooks: v4.5.0 → v5.0.0](pre-commit/pre-commit-hooks@v4.5.0...v5.0.0)

* fix(pre-commit): Minor fixes

* maint: Update to 3.8 min

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: eddiebergman <[email protected]>
Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 3 to 4.
- [Release notes](https://github.com/codecov/codecov-action/releases)
- [Changelog](https://github.com/codecov/codecov-action/blob/main/CHANGELOG.md)
- [Commits](codecov/codecov-action@v3...v4)

---
updated-dependencies:
- dependency-name: codecov/codecov-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…#1356)

* fix(datasets): Add code `111` for dataset description not found error

* test(dataset): Test the error raised

* test: Make error tested for tighter
…1359)

* fix(config): Fix XDG_X_HOME env vars, add OPENML_CACHE_DIR env var

* fix(config): Check correct backwards compat location

* test: Add safe context manager for environ vriable
* fix: make FakeObject be the correct standard and robustify usage

* fix: get none valued study object from server

* add: test for minio download failures

* fix: skip test for WSL as it is not supported

* maint: rework if/else case workflow

* maint: ruff fix

* add/fix: log messages for no premission

* fix: make flow name unique and enable testing of avoiding duplicates
* fix/maint: deprecate outdated examples, discounting progress.rst, and minor fixes to the tests.

* doc: update wording to reflect new state
* fix: minimal invasive change to avoid stripping whitespaces for feature names

Co-authored-by: amastruserio <[email protected]>

* fix: roll back change to work with older and newer xmltodict versions

* add: test for whitespaces in features xml

---------

Co-authored-by: amastruserio <[email protected]>
* fix: if docs do not change, do not fail ci

* fix: roll back change that is not 3.8 specific

* fix: delete non cleaned up test dirs
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.6.9 → v0.7.2](astral-sh/ruff-pre-commit@v0.6.9...v0.7.2)
- [github.com/pre-commit/mirrors-mypy: v1.11.2 → v1.13.0](pre-commit/mirrors-mypy@v1.11.2...v1.13.0)

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [pre-commit.ci] pre-commit autoupdate

updates:
- [github.com/astral-sh/ruff-pre-commit: v0.7.2 → v0.7.3](astral-sh/ruff-pre-commit@v0.7.2...v0.7.3)

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update openml/runs/functions.py

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Pieter Gijsbers <[email protected]>
* Allow skipping parquet download through environment variable

* Allow skip of parquet file, fix bug if no pq file is returned

* Declare the environment file in config.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

10 participants