From a1b6171b48c2f16689d278f9e947e5d53eed7e75 Mon Sep 17 00:00:00 2001 From: Alan Orth Date: Wed, 8 Jan 2020 15:33:56 +0200 Subject: [PATCH] Add notes for 2020-01-08 --- content/posts/2020-01.md | 46 +++++++++++++++++++++++++ docs/categories/notes/index.xml | 21 ++++++++++- docs/categories/notes/page/2/index.html | 21 ++++++++++- docs/categories/page/2/index.html | 21 ++++++++++- docs/index.xml | 21 ++++++++++- docs/page/2/index.html | 21 ++++++++++- docs/posts/index.xml | 21 ++++++++++- docs/posts/page/2/index.html | 21 ++++++++++- docs/sitemap.xml | 10 +++--- docs/tags/page/2/index.html | 21 ++++++++++- 10 files changed, 211 insertions(+), 13 deletions(-) diff --git a/content/posts/2020-01.md b/content/posts/2020-01.md index dbf7b31fa..89baa28d6 100644 --- a/content/posts/2020-01.md +++ b/content/posts/2020-01.md @@ -19,4 +19,50 @@ categories: ["Notes"] - The DOI has a score of 259, but the Handle has no score at all - I [tweeted](https://twitter.com/mralanorth/status/1214471427157626881) the CGSpace repository link + + +## 2020-01-08 + +- Export a list of authors from CGSpace for Peter Ballantyne to look through and correct: + +``` +dspace=# \COPY (SELECT DISTINCT text_value as "dc.contributor.author", count(*) FROM metadatavalue WHERE resource_type_id = 2 AND metadata_field_id = 3 GROUP BY text_value ORDER BY count DESC) to /tmp/2020-01-08-authors.csv WITH CSV HEADER; +COPY 68790 +``` + +- As I always have encoding issues with files Peter sends, I tried to convert it to some Windows encoding, but got an error: + +``` +$ iconv -f utf-8 -t windows-1252 /tmp/2020-01-08-authors.csv -o /tmp/2020-01-08-authors-windows.csv +iconv: illegal input sequence at position 104779 +``` + +- According to [this trick](https://www.datafix.com.au/BASHing/2018-09-13.html) the troublesome character is on line 5227: + +``` +$ awk 'END {print NR": "$0}' /tmp/2020-01-08-authors-windows.csv +5227: "Oue +$ sed -n '5227p' /tmp/2020-01-08-authors.csv | xxd -c1 +00000000: 22 " +00000001: 4f O +00000002: 75 u +00000003: 65 e +00000004: cc . +00000005: 81 . +00000006: 64 d +00000007: 72 r +``` + +- According to the blog post linked above the troublesome character is probably the "High Octect Preset" (81), which vim identifies (using `ga` on the character) as: + +``` + 101, Hex 65, Octal 145 < ́> 769, Hex 0301, Octal 1401 +``` + +- If I understand the situation correctly it sounds like this means that the character is not actually encoded as UTF-8, so it's stored incorrectly in the database... +- Other encodings like `windows-1251` and `windows-1257` also fail on different characters like "ž" and "é" that _are_ legitimate UTF-8 characters +- Then there is the issue of Russian, Chinese, etc characters, which are simply not representable in any of those encodings +- I think the solution is to upload it to Google Docs, or just send it to him and deal with each case manually in the corrections he sends me +- Re-deploy DSpace Test (linode19) with a fresh snapshot of the CGSpace database and assetstore, and using the `5_x-prod` (no CG Core v2) branch + diff --git a/docs/categories/notes/index.xml b/docs/categories/notes/index.xml index 890e9b815..fa09735dd 100644 --- a/docs/categories/notes/index.xml +++ b/docs/categories/notes/index.xml @@ -298,7 +298,26 @@ sys 0m1.979s Sun, 06 Jan 2019 10:48:30 +0200 https://alanorth.github.io/cgspace-notes/2019-01/ - 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link + <h2 id="2020-01-06">2020-01-06</h2> +<ul> +<li>Open <a href="https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=706">a ticket</a> with Atmire to request a quote for the upgrade to DSpace 6</li> +<li>Last week Altmetric responded about the <a href="https://hdl.handle.net/10568/97087">item</a> that had a lower score than than its DOI +<ul> +<li>The score is now linked to the DOI</li> +<li>Another <a href="https://handle.hdl.net/10568/91278">item</a> that had the same problem in 2019 has now also linked to the score for its DOI</li> +<li>Another <a href="https://hdl.handle.net/10568/81236">item</a> that had the same problem in 2019 has also been fixed</li> +</ul> +</li> +</ul> +<h2 id="2020-01-07">2020-01-07</h2> +<ul> +<li>Peter Ballantyne highlighted one more WLE <a href="https://hdl.handle.net/10568/101286">item</a> that is missing the Altmetric score that its DOI has +<ul> +<li>The DOI has a score of 259, but the Handle has no score at all</li> +<li>I <a href="https://twitter.com/mralanorth/status/1214471427157626881">tweeted</a> the CGSpace repository link</li> +</ul> +</li> +</ul> diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html index 8480e711f..beb665b68 100644 --- a/docs/categories/notes/page/2/index.html +++ b/docs/categories/notes/page/2/index.html @@ -161,7 +161,26 @@ sys 0m1.979s

- 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link +

2020-01-06

+
    +
  • Open a ticket with Atmire to request a quote for the upgrade to DSpace 6
  • +
  • Last week Altmetric responded about the item that had a lower score than than its DOI +
      +
    • The score is now linked to the DOI
    • +
    • Another item that had the same problem in 2019 has now also linked to the score for its DOI
    • +
    • Another item that had the same problem in 2019 has also been fixed
    • +
    +
  • +
+

2020-01-07

+
    +
  • Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has +
      +
    • The DOI has a score of 259, but the Handle has no score at all
    • +
    • I tweeted the CGSpace repository link
    • +
    +
  • +
Read more → diff --git a/docs/categories/page/2/index.html b/docs/categories/page/2/index.html index aae11fe28..85dc8471e 100644 --- a/docs/categories/page/2/index.html +++ b/docs/categories/page/2/index.html @@ -176,7 +176,26 @@ sys 0m1.979s

- 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link +

2020-01-06

+
    +
  • Open a ticket with Atmire to request a quote for the upgrade to DSpace 6
  • +
  • Last week Altmetric responded about the item that had a lower score than than its DOI +
      +
    • The score is now linked to the DOI
    • +
    • Another item that had the same problem in 2019 has now also linked to the score for its DOI
    • +
    • Another item that had the same problem in 2019 has also been fixed
    • +
    +
  • +
+

2020-01-07

+
    +
  • Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has +
      +
    • The DOI has a score of 259, but the Handle has no score at all
    • +
    • I tweeted the CGSpace repository link
    • +
    +
  • +
Read more → diff --git a/docs/index.xml b/docs/index.xml index 702eeaa8b..bb87e478a 100644 --- a/docs/index.xml +++ b/docs/index.xml @@ -298,7 +298,26 @@ sys 0m1.979s Sun, 06 Jan 2019 10:48:30 +0200 https://alanorth.github.io/cgspace-notes/2019-01/ - 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link + <h2 id="2020-01-06">2020-01-06</h2> +<ul> +<li>Open <a href="https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=706">a ticket</a> with Atmire to request a quote for the upgrade to DSpace 6</li> +<li>Last week Altmetric responded about the <a href="https://hdl.handle.net/10568/97087">item</a> that had a lower score than than its DOI +<ul> +<li>The score is now linked to the DOI</li> +<li>Another <a href="https://handle.hdl.net/10568/91278">item</a> that had the same problem in 2019 has now also linked to the score for its DOI</li> +<li>Another <a href="https://hdl.handle.net/10568/81236">item</a> that had the same problem in 2019 has also been fixed</li> +</ul> +</li> +</ul> +<h2 id="2020-01-07">2020-01-07</h2> +<ul> +<li>Peter Ballantyne highlighted one more WLE <a href="https://hdl.handle.net/10568/101286">item</a> that is missing the Altmetric score that its DOI has +<ul> +<li>The DOI has a score of 259, but the Handle has no score at all</li> +<li>I <a href="https://twitter.com/mralanorth/status/1214471427157626881">tweeted</a> the CGSpace repository link</li> +</ul> +</li> +</ul>
diff --git a/docs/page/2/index.html b/docs/page/2/index.html index 5eba0ca5a..22f7b1327 100644 --- a/docs/page/2/index.html +++ b/docs/page/2/index.html @@ -176,7 +176,26 @@ sys 0m1.979s

- 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link +

2020-01-06

+
    +
  • Open a ticket with Atmire to request a quote for the upgrade to DSpace 6
  • +
  • Last week Altmetric responded about the item that had a lower score than than its DOI +
      +
    • The score is now linked to the DOI
    • +
    • Another item that had the same problem in 2019 has now also linked to the score for its DOI
    • +
    • Another item that had the same problem in 2019 has also been fixed
    • +
    +
  • +
+

2020-01-07

+
    +
  • Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has +
      +
    • The DOI has a score of 259, but the Handle has no score at all
    • +
    • I tweeted the CGSpace repository link
    • +
    +
  • +
Read more → diff --git a/docs/posts/index.xml b/docs/posts/index.xml index c27a5b650..5c2b9e00d 100644 --- a/docs/posts/index.xml +++ b/docs/posts/index.xml @@ -298,7 +298,26 @@ sys 0m1.979s Sun, 06 Jan 2019 10:48:30 +0200 https://alanorth.github.io/cgspace-notes/2019-01/ - 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link + <h2 id="2020-01-06">2020-01-06</h2> +<ul> +<li>Open <a href="https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=706">a ticket</a> with Atmire to request a quote for the upgrade to DSpace 6</li> +<li>Last week Altmetric responded about the <a href="https://hdl.handle.net/10568/97087">item</a> that had a lower score than than its DOI +<ul> +<li>The score is now linked to the DOI</li> +<li>Another <a href="https://handle.hdl.net/10568/91278">item</a> that had the same problem in 2019 has now also linked to the score for its DOI</li> +<li>Another <a href="https://hdl.handle.net/10568/81236">item</a> that had the same problem in 2019 has also been fixed</li> +</ul> +</li> +</ul> +<h2 id="2020-01-07">2020-01-07</h2> +<ul> +<li>Peter Ballantyne highlighted one more WLE <a href="https://hdl.handle.net/10568/101286">item</a> that is missing the Altmetric score that its DOI has +<ul> +<li>The DOI has a score of 259, but the Handle has no score at all</li> +<li>I <a href="https://twitter.com/mralanorth/status/1214471427157626881">tweeted</a> the CGSpace repository link</li> +</ul> +</li> +</ul>
diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html index b640eba8b..18be19e14 100644 --- a/docs/posts/page/2/index.html +++ b/docs/posts/page/2/index.html @@ -176,7 +176,26 @@ sys 0m1.979s

- 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link +

2020-01-06

+
    +
  • Open a ticket with Atmire to request a quote for the upgrade to DSpace 6
  • +
  • Last week Altmetric responded about the item that had a lower score than than its DOI +
      +
    • The score is now linked to the DOI
    • +
    • Another item that had the same problem in 2019 has now also linked to the score for its DOI
    • +
    • Another item that had the same problem in 2019 has also been fixed
    • +
    +
  • +
+

2020-01-07

+
    +
  • Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has +
      +
    • The DOI has a score of 259, but the Handle has no score at all
    • +
    • I tweeted the CGSpace repository link
    • +
    +
  • +
Read more → diff --git a/docs/sitemap.xml b/docs/sitemap.xml index 2862da38d..3222e1ed0 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -4,12 +4,12 @@ https://alanorth.github.io/cgspace-notes/categories/ - 2020-01-06T11:14:44+02:00 + 2020-01-07T12:24:29+02:00 https://alanorth.github.io/cgspace-notes/ - 2020-01-06T11:14:44+02:00 + 2020-01-07T12:24:29+02:00 @@ -19,12 +19,12 @@ https://alanorth.github.io/cgspace-notes/categories/notes/ - 2020-01-06T11:14:44+02:00 + 2020-01-07T12:24:29+02:00 https://alanorth.github.io/cgspace-notes/posts/ - 2020-01-06T11:14:44+02:00 + 2020-01-07T12:24:29+02:00 @@ -94,7 +94,7 @@ https://alanorth.github.io/cgspace-notes/2019-01/ - 2020-01-06T11:14:44+02:00 + 2020-01-07T12:24:29+02:00 diff --git a/docs/tags/page/2/index.html b/docs/tags/page/2/index.html index 10b46e995..53e48f9c5 100644 --- a/docs/tags/page/2/index.html +++ b/docs/tags/page/2/index.html @@ -176,7 +176,26 @@ sys 0m1.979s

- 2020-01-06 Open a ticket with Atmire to request a quote for the upgrade to DSpace 6 Last week Altmetric responded about the item that had a lower score than than its DOI The score is now linked to the DOI Another item that had the same problem in 2019 has now also linked to the score for its DOI Another item that had the same problem in 2019 has also been fixed 2020-01-07 Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has The DOI has a score of 259, but the Handle has no score at all I tweeted the CGSpace repository link +

2020-01-06

+
    +
  • Open a ticket with Atmire to request a quote for the upgrade to DSpace 6
  • +
  • Last week Altmetric responded about the item that had a lower score than than its DOI +
      +
    • The score is now linked to the DOI
    • +
    • Another item that had the same problem in 2019 has now also linked to the score for its DOI
    • +
    • Another item that had the same problem in 2019 has also been fixed
    • +
    +
  • +
+

2020-01-07

+
    +
  • Peter Ballantyne highlighted one more WLE item that is missing the Altmetric score that its DOI has +
      +
    • The DOI has a score of 259, but the Handle has no score at all
    • +
    • I tweeted the CGSpace repository link
    • +
    +
  • +
Read more →