mirror of
https://github.com/ilri/csv-metadata-quality.git
synced 2024-11-29 17:18:19 +01:00
Compare commits
No commits in common. "27b2d81ca867a8dfe7ac368b5bd7799dbeab8608" and "d76e72532a3e2c07480cba1ce72b3f3588a1cf3f" have entirely different histories.
27b2d81ca8
...
d76e72532a
@ -4,11 +4,6 @@ All notable changes to this project will be documented in this file.
|
|||||||
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
|
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
|
||||||
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
|
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
|
||||||
|
|
||||||
## Unreleased changes
|
|
||||||
### Added
|
|
||||||
- Check dates in dcterms.issued field as well, not just fields that have the
|
|
||||||
word "date" in them
|
|
||||||
|
|
||||||
## [0.4.4] - 2021-02-21
|
## [0.4.4] - 2021-02-21
|
||||||
### Added
|
### Added
|
||||||
- Accept dates formatted in ISO 8601 extended with combined date and time, for
|
- Accept dates formatted in ISO 8601 extended with combined date and time, for
|
||||||
|
@ -109,6 +109,8 @@ This currently uses the [Python langid](https://github.com/saffsd/langid.py) lib
|
|||||||
- Add an option to drop invalid AGROVOC subjects?
|
- Add an option to drop invalid AGROVOC subjects?
|
||||||
- Add tests for application invocation, ie `tests/test_app.py`?
|
- Add tests for application invocation, ie `tests/test_app.py`?
|
||||||
- Validate ISSNs or journal titles against CrossRef API?
|
- Validate ISSNs or journal titles against CrossRef API?
|
||||||
|
- Better ISO 8601 date parsing (currently only supports simple dates, perhaps we need to use dateutil.parser.parseiso())
|
||||||
|
- Fix lazy date check (assumes field name has "date" but could be dcterms.issued etc!)
|
||||||
|
|
||||||
## License
|
## License
|
||||||
This work is licensed under the [GPLv3](https://www.gnu.org/licenses/gpl-3.0.en.html).
|
This work is licensed under the [GPLv3](https://www.gnu.org/licenses/gpl-3.0.en.html).
|
||||||
|
@ -142,7 +142,7 @@ def run(argv):
|
|||||||
df[column] = df[column].apply(check.isbn)
|
df[column] = df[column].apply(check.isbn)
|
||||||
|
|
||||||
# Check: invalid date
|
# Check: invalid date
|
||||||
match = re.match(r"^.*?(date|dcterms\.issued).*$", column)
|
match = re.match(r"^.*?date.*$", column)
|
||||||
if match is not None:
|
if match is not None:
|
||||||
df[column] = df[column].apply(check.date, field_name=column)
|
df[column] = df[column].apply(check.date, field_name=column)
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user