diff --git a/content/posts/2019-03.md b/content/posts/2019-03.md index a49f780f2..43ccdfa91 100644 --- a/content/posts/2019-03.md +++ b/content/posts/2019-03.md @@ -356,15 +356,15 @@ $ ./agrovoc-lookup.py -l es -i 2019-03-18-top-1500-subject.csv -om /tmp/es-subje $ ./agrovoc-lookup.py -l fr -i 2019-03-18-top-1500-subject.csv -om /tmp/fr-subjects-matched.txt -or /tmp/fr-subjects-unmatched.txt $ cat /tmp/*-subjects-matched.txt | sort -u > /tmp/subjects-matched-sorted.txt $ wc -l /tmp/subjects-matched-sorted.txt -1317 /tmp/subjects-matched-sorted.txt +1318 /tmp/subjects-matched-sorted.txt $ sort -u 2019-03-18-top-1500-subject.csv > /tmp/1500-subjects-sorted.txt -$ comm -13 /tmp/subjects-matched-sorted.txt /tmp/1500-subjects-sorted.txt > 2019-03-18-subjects-unmatch -ed.txt +$ comm -13 /tmp/subjects-matched-sorted.txt /tmp/1500-subjects-sorted.txt > 2019-03-18-subjects-unmatched.txt $ wc -l 2019-03-18-subjects-unmatched.txt -183 2019-03-18-subjects-unmatched.txt +182 2019-03-18-subjects-unmatched.txt ``` - So the new total of matched terms with the updated regex is 1317 and unmatched is 183 (previous number of matched terms was 1187) +- Create and merge a pull request to update the controlled vocabulary for AGROVOC terms ([#416](https://github.com/ilri/DSpace/pull/416)) - We are getting the blank page issue on CGSpace again today and I see a ~~large number~~ of the "SQL QueryTable Error" in the DSpace log again (last time was 2019-03-15): ``` diff --git a/docs/2019-03/index.html b/docs/2019-03/index.html index e002695f9..6f1c8b33b 100644 --- a/docs/2019-03/index.html +++ b/docs/2019-03/index.html @@ -25,7 +25,7 @@ I think I will need to ask Udana to re-copy and paste the abstracts with more ca - + @@ -55,9 +55,9 @@ I think I will need to ask Udana to re-copy and paste the abstracts with more ca "@type": "BlogPosting", "headline": "March, 2019", "url": "https://alanorth.github.io/cgspace-notes/2019-03/", - "wordCount": "2959", + "wordCount": "2973", "datePublished": "2019-03-01T12:16:30+01:00", - "dateModified": "2019-03-17T22:24:02+02:00", + "dateModified": "2019-03-18T15:32:22+02:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -547,16 +547,16 @@ $ ./agrovoc-lookup.py -l es -i 2019-03-18-top-1500-subject.csv -om /tmp/es-subje $ ./agrovoc-lookup.py -l fr -i 2019-03-18-top-1500-subject.csv -om /tmp/fr-subjects-matched.txt -or /tmp/fr-subjects-unmatched.txt $ cat /tmp/*-subjects-matched.txt | sort -u > /tmp/subjects-matched-sorted.txt $ wc -l /tmp/subjects-matched-sorted.txt -1317 /tmp/subjects-matched-sorted.txt +1318 /tmp/subjects-matched-sorted.txt $ sort -u 2019-03-18-top-1500-subject.csv > /tmp/1500-subjects-sorted.txt -$ comm -13 /tmp/subjects-matched-sorted.txt /tmp/1500-subjects-sorted.txt > 2019-03-18-subjects-unmatch -ed.txt +$ comm -13 /tmp/subjects-matched-sorted.txt /tmp/1500-subjects-sorted.txt > 2019-03-18-subjects-unmatched.txt $ wc -l 2019-03-18-subjects-unmatched.txt -183 2019-03-18-subjects-unmatched.txt +182 2019-03-18-subjects-unmatched.txt