diff --git a/content/posts/2019-08.md b/content/posts/2019-08.md
index 8dfa573e7..6522c3f4c 100644
--- a/content/posts/2019-08.md
+++ b/content/posts/2019-08.md
@@ -358,5 +358,12 @@ sys 2m27.496s
 - After reading the code I see that XSLT is reading the community titles from the DIM representation (stored in the `$dim` variable) created from METS
 - I modified the patterns in my sed script so that those lines are not replaced and then the community list works again
 - This is actually not a problem at all because this metadata is only used in the HTML meta tags in XMLUI community lists and has nothing to do with item metadata
+- Get a list of institutions from CCAFS's Clarisa API and try to parse it with `jq` and pass it through `csvcut` to add line numbers:
+
+```
+$ cat ~/Downloads/institutions.json| jq '.[] | {name: .name}' | grep name | awk -F: '{print $2}' | sed 's/"//g' | csvcut -l > /tmp/investors.csv
+```
+
+- I could potentially use this with reconcile-csv and OpenRefine as a source to validate our institutional authors against...
diff --git a/content/posts/2019-09.md b/content/posts/2019-09.md
index 27e2f7917..f85ccf356 100644
--- a/content/posts/2019-09.md
+++ b/content/posts/2019-09.md
@@ -316,5 +316,8 @@ $ dspace import -a me@cgiar.org -m 2019-09-20-bioversity2.map -s /home/aorth/Bio
 - Release [version 0.3.0 of the csv-metadata-quality](https://github.com/ilri/csv-metadata-quality/releases/tag/v0.3.0) tool
 - This version includes the experimental validation of languages using the Python `langid` library
 - I also included updated pytest tests and test files that specifically test this functionality
+- Give more feedback to Bosede about the [IITA Sept 6 (20196th.xls) records on DSpace Test](https://dspacetest.cgiar.org/handle/10568/105116)
+ - I told her to delete one item that appears to be a duplicate, or to fix its citation to be correct if she thinks it is not a duplicate
+ - I deleted another item that I had previously identified as a duplicate that she had fixed by incorrectly deleting the original (ugh)
diff --git a/docs/2019-08/index.html b/docs/2019-08/index.html
index 141532d12..c1bc10d59 100644
--- a/docs/2019-08/index.html
+++ b/docs/2019-08/index.html
@@ -59,7 +59,7 @@ Run system updates on DSpace Test (linode19) and reboot it
  "@type": "BlogPosting",
  "headline": "August, 2019",
  "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-08\/",
- "wordCount": "2703",
+ "wordCount": "2770",
  "datePublished": "2019-08-03T12:39:51\x2b03:00",
  "dateModified": "2019-09-01T01:54:55\x2b03:00",
  "author": {
@@ -603,6 +603,13 @@ sys 2m27.496s
  • I modified the patterns in my sed script so that those lines are not replaced and then the community list works again
  • This is actually not a problem at all because this metadata is only used in the HTML meta tags in XMLUI community lists and has nothing to do with item metadata
  • Get a list of institutions from CCAFS’s Clarisa API and try to parse it with jq and pass it through csvcut to add line numbers:

    $ cat ~/Downloads/institutions.json| jq '.[] | {name: .name}' | grep name | awk -F: '{print $2}' | sed 's/"//g' | csvcut -l > /tmp/investors.csv
  • I could potentially use this with reconcile-csv and OpenRefine as a source to validate our institutional authors against…
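
A minimal sketch of the same extraction in Python, assuming (as the `jq` filter above implies) that `institutions.json` is a JSON array of objects that each carry a `name` key. The function name `institutions_to_csv`, the `line_number` and `name` column headers, and the sample records are hypothetical and only for illustration. Going through the `csv` module means names containing commas, colons, or quotes are quoted rather than split, which the `grep`/`awk`/`sed` chain cannot guarantee.

```python
import csv
import io
import json


def institutions_to_csv(json_text: str) -> str:
    """Return CSV text with a line number and name for each institution.

    Assumes json_text is a JSON array of objects with a "name" key.
    """
    institutions = json.loads(json_text)
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    # Explicit header row so the first institution is not mistaken for one
    writer.writerow(["line_number", "name"])
    for number, institution in enumerate(institutions, start=1):
        writer.writerow([number, institution["name"]])
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical sample data, not real Clarisa records
    sample = '[{"name": "Example University"}, {"name": "Institute of Foo, Bar"}]'
    print(institutions_to_csv(sample), end="")
```

To run it against the downloaded file, the text could be read with `open(path, encoding="utf-8").read()` and the result written to a CSV for use with reconcile-csv.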

diff --git a/docs/2019-09/index.html b/docs/2019-09/index.html
index 9fbe00add..1b443a6dc 100644
--- a/docs/2019-09/index.html
+++ b/docs/2019-09/index.html
@@ -40,7 +40,7 @@ Here are the top ten IPs in the nginx XMLUI and REST/OAI logs this morning:
-
+
@@ -85,9 +85,9 @@ Here are the top ten IPs in the nginx XMLUI and REST/OAI logs this morning:
  "@type": "BlogPosting",
  "headline": "September, 2019",
  "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-09\/",
- "wordCount": "2417",
+ "wordCount": "2497",
  "datePublished": "2019-09-01T10:17:51\x2b03:00",
- "dateModified": "2019-09-24T20:13:20\x2b03:00",
+ "dateModified": "2019-09-26T14:21:41\x2b03:00",
  "author": {
  "@type": "Person",
  "name": "Alan Orth"
@@ -552,11 +552,19 @@ $ dspace import -a me@cgiar.org -m 2019-09-20-bioversity2.map -s /home/aorth/Bio
  • Release version 0.3.0 of the csv-metadata-quality tool
  • Give more feedback to Bosede about the IITA Sept 6 (20196th.xls) records on DSpace Test
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index ddf533ecf..cb676705f 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -4,27 +4,27 @@
  https://alanorth.github.io/cgspace-notes/
- 2019-09-24T20:13:20+03:00
+ 2019-09-26T14:21:41+03:00
  https://alanorth.github.io/cgspace-notes/tags/notes/
- 2019-09-24T20:13:20+03:00
+ 2019-09-26T14:21:41+03:00
  https://alanorth.github.io/cgspace-notes/posts/
- 2019-09-24T20:13:20+03:00
+ 2019-09-26T14:21:41+03:00
  https://alanorth.github.io/cgspace-notes/2019-09/
- 2019-09-24T20:13:20+03:00
+ 2019-09-26T14:21:41+03:00
  https://alanorth.github.io/cgspace-notes/tags/
- 2019-09-24T20:13:20+03:00
+ 2019-09-26T14:21:41+03:00