Update notes for 2019-07-29

This commit is contained in:
2019-07-29 17:17:42 +03:00
parent cbbb9b4e77
commit f69530c005
3 changed files with 10 additions and 8 deletions

View File

@ -392,6 +392,7 @@ value.replace(/\s+\|\|/,"||").replace(/\|\|\s+/,"||")
- I turned the Pandas script into a proper Python package called [csv-metadata-quality](https://git.sr.ht/~alanorth/csv-metadata-quality)
- It supports CSV and Excel files
- It fixes whitespace errors and erroneous multi-value separators ("|") and validates ISSN, ISBNs, and dates
- Also I added a bunch of other checks/fixes for unnecessary and "suspicious" Unicode characters
- Inform Bioversity that there is an error in their CSV, seemingly caused by quotes in the citation field
<!-- vim: set sw=2 ts=2: -->