From cc498e12a7591f0cf99e1476a7b34a40695b0d6f Mon Sep 17 00:00:00 2001 From: Alan Orth Date: Sat, 6 Feb 2021 14:00:36 +0200 Subject: [PATCH] Add notes for 2021-02-04 --- content/posts/2021-02.md | 7 +++++++ docs/2021-02/index.html | 11 ++++++++--- docs/categories/index.html | 2 +- docs/categories/notes/index.html | 2 +- docs/categories/notes/page/2/index.html | 2 +- docs/categories/notes/page/3/index.html | 2 +- docs/categories/notes/page/4/index.html | 2 +- docs/categories/notes/page/5/index.html | 2 +- docs/index.html | 2 +- docs/page/2/index.html | 2 +- docs/page/3/index.html | 2 +- docs/page/4/index.html | 2 +- docs/page/5/index.html | 2 +- docs/page/6/index.html | 2 +- docs/page/7/index.html | 2 +- docs/posts/index.html | 2 +- docs/posts/page/2/index.html | 2 +- docs/posts/page/3/index.html | 2 +- docs/posts/page/4/index.html | 2 +- docs/posts/page/5/index.html | 2 +- docs/posts/page/6/index.html | 2 +- docs/posts/page/7/index.html | 2 +- docs/sitemap.xml | 10 +++++----- 23 files changed, 40 insertions(+), 28 deletions(-) diff --git a/content/posts/2021-02.md b/content/posts/2021-02.md index b02505879..60b7ffd15 100644 --- a/content/posts/2021-02.md +++ b/content/posts/2021-02.md @@ -201,6 +201,13 @@ $ ./ilri/delete-metadata-values.py -i /tmp/2020-10-28-Series-PB.csv -db dspace - - Sistematización de experiencias Proyecto ACORDAR - Strüngmann Forum - Unité de Recherche +- I ended up using [python-ftfy](https://github.com/LuminosoInsight/python-ftfy) to fix those very easily, then replaced them in the CSV +- Then I trimmed whitespace at the beginning, end, and around the ";", and applied the 1,600 fixes using `fix-metadata-values.py`: + +```console +$ ./ilri/fix-metadata-values.py -i /tmp/2020-10-28-Series-PB.csv -db dspace -u dspace -p 'fuuu' -f dc.relation.ispartofseries -t 'correct' -m 43 +``` + - Help Peter debug an issue with one of Alan Duncan's new FEAST Data reports on CGSpace - For some reason the default policy for the item was "COLLECTION_492_DEFAULT_READ" group, which had zero members - I changed them all to Anonymous and the item was accessible diff --git a/docs/2021-02/index.html b/docs/2021-02/index.html index f7e1ba54f..91cba332d 100644 --- a/docs/2021-02/index.html +++ b/docs/2021-02/index.html @@ -32,7 +32,7 @@ $ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty - + @@ -70,9 +70,9 @@ $ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty "@type": "BlogPosting", "headline": "February, 2021", "url": "https://alanorth.github.io/cgspace-notes/2021-02/", - "wordCount": "1355", + "wordCount": "1406", "datePublished": "2021-02-01T10:13:54+02:00", - "dateModified": "2021-02-01T12:28:54+02:00", + "dateModified": "2021-02-04T17:28:20+02:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -357,6 +357,11 @@ $ dspace oai import -c
  • Unité de Recherche
  • +
  • I ended up using python-ftfy to fix those very easily, then replaced them in the CSV
  • +
  • Then I trimmed whitespace at the beginning, end, and around the “;”, and applied the 1,600 fixes using fix-metadata-values.py:
  • + +
    $ ./ilri/fix-metadata-values.py -i /tmp/2020-10-28-Series-PB.csv -db dspace -u dspace -p 'fuuu' -f dc.relation.ispartofseries -t 'correct' -m 43
    +