diff --git a/content/posts/2019-08.md b/content/posts/2019-08.md index d973ac1db..b590177fd 100644 --- a/content/posts/2019-08.md +++ b/content/posts/2019-08.md @@ -34,6 +34,7 @@ tags: ["Notes"] - The following PDFs are used by several items incorrectly: - `Report_of_a_Working_Group_on_Allium_7.pdf` - `Report_of_a_Working_Group_on_Allium_Fourth_meeting_1696.pdf` + - I checked the SHA1 hashes of each PDF and found that some appear more than once... - The following items use the same PDF with a different name, but seem to be duplicates (pick one?): - https://www.bioversityinternational.org/index.php?id=244&tx_news_pi1[news]=433 - https://www.bioversityinternational.org/index.php?id=244&tx_news_pi1[news]=10189 @@ -61,4 +62,12 @@ or( - I tried to extract the filenames and construct a URL to download the PDFs with my `generate-thumbnails.py` script, but there seem to be several paths for PDFs so I can't guess it properly - I will have to wait for Francesco to respond about the PDFs, or perhaps proceed with a metadata-only upload so we can do other checks on DSpace Test +## 2019-08-06 + +- Francesca responded to address my feedback yesterday + - I made some changes to the CSV based on her feedback (remove two duplicates, change one PDF file name, change two titles) + - Then I found some items that have PDFs in multiple languages that only list one language in `dc.language.iso` so I changed them + - Strangley, one item was referring to a 7zip file... + - After removing the two duplicates there are now 1427 records + diff --git a/docs/2019-08/index.html b/docs/2019-08/index.html index de6938e61..cf5ea479a 100644 --- a/docs/2019-08/index.html +++ b/docs/2019-08/index.html @@ -27,7 +27,7 @@ Run system updates on DSpace Test (linode19) and reboot it - + @@ -59,9 +59,9 @@ Run system updates on DSpace Test (linode19) and reboot it "@type": "BlogPosting", "headline": "August, 2019", "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-08\/", - "wordCount": "341", + "wordCount": "428", "datePublished": "2019-08-03T12:39:51\x2b03:00", - "dateModified": "2019-08-04T22:49:04\x2b03:00", + "dateModified": "2019-08-05T16:49:31\x2b03:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -166,6 +166,7 @@ Run system updates on DSpace Test (linode19) and reboot it
  • The following PDFs are used by several items incorrectly:
  • Report_of_a_Working_Group_on_Allium_7.pdf
  • Report_of_a_Working_Group_on_Allium_Fourth_meeting_1696.pdf
  • +
  • I checked the SHA1 hashes of each PDF and found that some appear more than once…
  • The following items use the same PDF with a different name, but seem to be duplicates (pick one?):
  • https://www.bioversityinternational.org/index.php?id=244&tx_news_pi1[news]=433
  • https://www.bioversityinternational.org/index.php?id=244&tx_news_pi1[news]=10189
  • @@ -196,6 +197,19 @@ isNotNull(value.match(/^.*รป.*$/))
  • I will have to wait for Francesco to respond about the PDFs, or perhaps proceed with a metadata-only upload so we can do other checks on DSpace Test

  • +

    2019-08-06

    + + + diff --git a/docs/sitemap.xml b/docs/sitemap.xml index 21dccf28e..c0c5cebb9 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -4,30 +4,30 @@ https://alanorth.github.io/cgspace-notes/2019-08/ - 2019-08-04T22:49:04+03:00 + 2019-08-05T16:49:31+03:00 https://alanorth.github.io/cgspace-notes/ - 2019-08-04T22:49:04+03:00 + 2019-08-05T16:49:31+03:00 0 https://alanorth.github.io/cgspace-notes/tags/notes/ - 2019-08-04T22:49:04+03:00 + 2019-08-05T16:49:31+03:00 0 https://alanorth.github.io/cgspace-notes/posts/ - 2019-08-04T22:49:04+03:00 + 2019-08-05T16:49:31+03:00 0 https://alanorth.github.io/cgspace-notes/tags/ - 2019-08-04T22:49:04+03:00 + 2019-08-05T16:49:31+03:00 0