diff --git a/content/posts/2019-07.md b/content/posts/2019-07.md index ccdcdcd1c..19bed3af7 100644 --- a/content/posts/2019-07.md +++ b/content/posts/2019-07.md @@ -387,4 +387,11 @@ value.replace(/\s+\|\|/,"||").replace(/\|\|\s+/,"||") - I whipped up a quick script using Python Pandas to do whitespace cleanup +## 2019-07-29 + +- I turned the Pandas script into a proper Python package called [csv-metadata-quality](https://git.sr.ht/~alanorth/csv-metadata-quality) + - It supports CSV and Excel files + - It fixes whitespace errors and erroneous multi-value separators ("|") and validates ISSN, ISBNs, and dates +- Inform Bioversity that there is an error in their CSV, seemingly caused by quotes in the citation field + diff --git a/docs/2019-07/index.html b/docs/2019-07/index.html index aa2fe486c..2d5847171 100644 --- a/docs/2019-07/index.html +++ b/docs/2019-07/index.html @@ -21,7 +21,7 @@ Abenet had another similar issue a few days ago when trying to find the stats fo - + @@ -47,9 +47,9 @@ Abenet had another similar issue a few days ago when trying to find the stats fo "@type": "BlogPosting", "headline": "July, 2019", "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-07\/", - "wordCount": "2191", + "wordCount": "2243", "datePublished": "2019-07-01T12:13:51\x2b03:00", - "dateModified": "2019-07-25T16:58:23\x2b03:00", + "dateModified": "2019-07-26T18:49:38\x2b03:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -580,6 +580,18 @@ issn.validate('1020-3362')
  • I whipped up a quick script using Python Pandas to do whitespace cleanup

  • +

    2019-07-29

    + + + diff --git a/docs/sitemap.xml b/docs/sitemap.xml index 1ba2b6d77..2abec0c7b 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -4,30 +4,30 @@ https://alanorth.github.io/cgspace-notes/ - 2019-07-25T16:58:23+03:00 + 2019-07-26T18:49:38+03:00 0 https://alanorth.github.io/cgspace-notes/2019-07/ - 2019-07-25T16:58:23+03:00 + 2019-07-26T18:49:38+03:00 https://alanorth.github.io/cgspace-notes/tags/notes/ - 2019-07-25T16:58:23+03:00 + 2019-07-26T18:49:38+03:00 0 https://alanorth.github.io/cgspace-notes/posts/ - 2019-07-25T16:58:23+03:00 + 2019-07-26T18:49:38+03:00 0 https://alanorth.github.io/cgspace-notes/tags/ - 2019-07-25T16:58:23+03:00 + 2019-07-26T18:49:38+03:00 0