Releases: ilri/csv-metadata-quality
Releases · ilri/csv-metadata-quality
Version 0.4.0
Added
- Unicode normalization (enable with
--unsafe-fixes
, see README.md)
Updated
- Update python dependencies to latest versions, including numpy 1.18.1, pandas 1.0.0rc0, flake8 3.7.9, pytest 5.3.2, and black 19.10b0
- Regenerate
requirements.txt
andrequirements-dev.txt
Changed
- Use Python 3.8.0 for pipenv
- Use Ubuntu 18.04 "Bionic" for TravisCI builds
- Test Python 3.8 in TravisCI builds
Version 0.3.1
Changed
- Replace non-breaking spaces (U+00A0) with space instead of removing them
- Harmonize language of script output when fixing various issues
Version 0.3.0
Updated
- Update python dependencies to latest versions, including numpy 1.17.2, pandas 0.25.1, pytest 5.1.3, and requests-cache 0.5.2
Added
- csvkit to dev requirements (csvcut etc are useful during development)
- Experimental language validation using the Python
langid
library (enable with-e
, see README.md)
Changed
- Re-formatted code with black and isort
Version 0.2.2
Changed
- Output of date checks to include column names (helps debugging in case there are multiple date fields)
Added
- Ability to exclude certain fields using
--exclude-fields
- Fix for missing space after a comma, ie "Orth,Alan S."
Improved
- AGROVOC lookup code
Version 0.2.1
Added
- Check for uncommon filename extensions
- Replacement of unneccessary Unicode characters like soft hyphens (U+00AD)
Version 0.2.0
Added
- Handle Ctrl-C interrupt gracefully
- Make output in suspicious character check more user friendly
- Add pytest-clarity to dev packages for more user friendly pytest output
Version 0.1.0
Changed
- AGROVOC validation is now turned off by default
Added
- Ability to enable AGROVOC validation on a field-by-field basis using the
--agrovoc-fields
option - Option to print the version (
--version
or-V
)