diff --git a/content/posts/2024-05.md b/content/posts/2024-05.md
new file mode 100644
index 000000000..f53a6f161
--- /dev/null
+++ b/content/posts/2024-05.md
@@ -0,0 +1,15 @@
+---
+title: "May, 2024"
+date: 2024-05-01T10:39:00+03:00
+author: "Alan Orth"
+categories: ["Notes"]
+---
+
+## 2024-05-01
+
+- I dumped all the CGSpace DOIs and resolved them with my `crossref_doi_lookup.py` script
+ - Then I did some work to add missing abstracts (about 900!), volumes, issues, licenses, publishers, types, etc.
+
+
+
+
diff --git a/docs/2015-11/index.html b/docs/2015-11/index.html
index 5f4204690..6f9138559 100644
--- a/docs/2015-11/index.html
+++ b/docs/2015-11/index.html
@@ -242,6 +242,8 @@ db.statementpool = true
-
diff --git a/docs/2016-03/index.html b/docs/2016-03/index.html
index 71433646d..c7d92a88c 100644
--- a/docs/2016-03/index.html
+++ b/docs/2016-03/index.html
@@ -316,6 +316,8 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
+
-
diff --git a/docs/2017-07/index.html b/docs/2017-07/index.html
index f0f29c56a..75629dadd 100644
--- a/docs/2017-07/index.html
+++ b/docs/2017-07/index.html
@@ -275,6 +275,8 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
+
-
diff --git a/docs/2018-09/index.html b/docs/2018-09/index.html
index b67af60ed..13220b21f 100644
--- a/docs/2018-09/index.html
+++ b/docs/2018-09/index.html
@@ -748,6 +748,8 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND me
+
-
diff --git a/docs/2021-01/index.html b/docs/2021-01/index.html
index 563d64d30..f744db983 100644
--- a/docs/2021-01/index.html
+++ b/docs/2021-01/index.html
@@ -688,6 +688,8 @@ java.lang.IllegalArgumentException: Invalid character found in the request targe
+
-
diff --git a/docs/2021-03/index.html b/docs/2021-03/index.html
index 16da8b5d2..296a02e7f 100644
--- a/docs/2021-03/index.html
+++ b/docs/2021-03/index.html
@@ -875,6 +875,8 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
+
-
diff --git a/docs/2021-06/index.html b/docs/2021-06/index.html
index caf64af7d..935d173fa 100644
--- a/docs/2021-06/index.html
+++ b/docs/2021-06/index.html
@@ -693,6 +693,8 @@ I simply started it and AReS was running again:
+
-
diff --git a/docs/2021-10/index.html b/docs/2021-10/index.html
index 6882182a7..ff8578033 100644
--- a/docs/2021-10/index.html
+++ b/docs/2021-10/index.html
@@ -791,6 +791,8 @@ Try doing it in two imports. In first import, remove all authors. In second impo
+
-
diff --git a/docs/2022-05/index.html b/docs/2022-05/index.html
index 2eddb958c..71da16fd7 100644
--- a/docs/2022-05/index.html
+++ b/docs/2022-05/index.html
@@ -445,6 +445,8 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
+
-
diff --git a/docs/2022-06/index.html b/docs/2022-06/index.html
index bdc31bfe7..b39d924a5 100644
--- a/docs/2022-06/index.html
+++ b/docs/2022-06/index.html
@@ -458,6 +458,8 @@ There seem to be many more of these:
+
-
diff --git a/docs/2022-07/index.html b/docs/2022-07/index.html
index 1bbfc60be..57a01e46f 100644
--- a/docs/2022-07/index.html
+++ b/docs/2022-07/index.html
@@ -736,6 +736,8 @@ Also, the trgm functions I’ve used before are case insensitive, but Levens
+
-
diff --git a/docs/2022-08/index.html b/docs/2022-08/index.html
index b4f4a3a10..0d5ac4365 100644
--- a/docs/2022-08/index.html
+++ b/docs/2022-08/index.html
@@ -522,6 +522,8 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
+
-
diff --git a/docs/2022-09/index.html b/docs/2022-09/index.html
index 583b17b7f..7ec0eff16 100644
--- a/docs/2022-09/index.html
+++ b/docs/2022-09/index.html
@@ -783,6 +783,8 @@ harvesting of meat from wildlife and not from livestock.
+
-
diff --git a/docs/2022-10/index.html b/docs/2022-10/index.html
index ea67f91fb..8ca6d163a 100644
--- a/docs/2022-10/index.html
+++ b/docs/2022-10/index.html
@@ -978,6 +978,8 @@ I filed an issue to ask about Java 11+ support
+
-
diff --git a/docs/2022-11/index.html b/docs/2022-11/index.html
index 6be72a285..bd66f5a88 100644
--- a/docs/2022-11/index.html
+++ b/docs/2022-11/index.html
@@ -757,6 +757,8 @@ I reverted the Cocoon autosave change because it was more of a nuissance that Pe
+
-
diff --git a/docs/2023-01/index.html b/docs/2023-01/index.html
index ace5dafa1..d2c0fc2d3 100644
--- a/docs/2023-01/index.html
+++ b/docs/2023-01/index.html
@@ -827,6 +827,8 @@ I see we have some new ones that aren’t in our list if I combine with this
+
-
diff --git a/docs/2023-02/index.html b/docs/2023-02/index.html
index d3fd8bfd6..a59c41442 100644
--- a/docs/2023-02/index.html
+++ b/docs/2023-02/index.html
@@ -647,6 +647,8 @@ I want to try to expand my use of their data to journals, publishers, volumes, i
+
-
diff --git a/docs/2023-05/index.html b/docs/2023-05/index.html
index 193dd95c8..eb3dac6dd 100644
--- a/docs/2023-05/index.html
+++ b/docs/2023-05/index.html
@@ -374,6 +374,8 @@ Work on cleaning, proofing, and uploading twenty-seven records for IFPRI to CGSp
+
-
diff --git a/docs/2023-06/index.html b/docs/2023-06/index.html
index c796f7fd2..fb2cfa697 100644
--- a/docs/2023-06/index.html
+++ b/docs/2023-06/index.html
@@ -446,6 +446,8 @@ From what I can see we need to upgrade the MODS schema from 3.1 to 3.7 and then
+
-
diff --git a/docs/2023-10/index.html b/docs/2023-10/index.html
index a2d9a6d8b..2a05b3269 100644
--- a/docs/2023-10/index.html
+++ b/docs/2023-10/index.html
@@ -345,6 +345,8 @@ We can be on the safe side by using only abstracts for items that are licensed u
+
-
diff --git a/docs/2024-02/index.html b/docs/2024-02/index.html
index d3b044570..3f4df0371 100644
--- a/docs/2024-02/index.html
+++ b/docs/2024-02/index.html
@@ -247,6 +247,8 @@ Lower case all the AGROVOC subjects on CGSpace
+
-
- 2023-07-01 Export CGSpace to check for missing Initiative collection mappings Start harvesting on AReS 2023-07-02 Minor edits to the crossref_doi_lookup.py script while running some checks from 22,000 CGSpace DOIs 2023-07-03 I analyzed the licenses declared by Crossref and found with high confidence that ~400 of ours were incorrect I took the more accurate ones from Crossref and updated the items on CGSpace I took a few hundred ISBNs as well for where we were missing them I also tagged ~4,700 items with missing licenses as “Copyrighted; all rights reserved” based on their Crossref license status being TDM, mostly from Elsevier, Wiley, and Springer Checking a dozen or so manually, I confirmed that if Crossref only has a TDM license then it’s usually copyrighted (could still be open access, but we can’t tell via Crossref) I would be curious to write a script to check the Unpaywall API for open access status… In the past I found that their license status was not very accurate, but the open access status might be more reliable More minor work on the DSpace 7 item views I learned some new Angular template syntax I created a custom component to show Creative Commons licenses on the simple item page I also decided that I don’t like the Impact Area icons as a component because they don’t have any visual meaning 2023-07-04 Focus group meeting with CGSpace partners about DSpace 7 I added a themed file selection component to the CGSpace theme It displays the bistream description instead of the file name, just like we did in DSpace 6 XMLUI I added a custom component to show share icons 2023-07-05 I spent some time trying to update OpenRXV from Angular 9 to 10 to 11 to 12 to 13 Most things work but there are some minor bugs it seems Mishell from CIP emailed me to say she was having problems approving an item on CGSpace Looking at PostgreSQL I saw there were a dozen or so locks that were several hours and even over one day old so I killed those processes and told her to try 
again 2023-07-06 Types meeting I wrote a Python script to check Unpaywall for some information about DOIs 2023-07-7 Continue exploring Unpaywall data for some of our DOIs In the past I’ve found their licensing information to not be very reliable (preferring Crossref), but I think their open access status is more reliable, especially when the provider is listed as being the publisher Even so, sometimes the version can be “acceptedVersion”, which is presumably the author’s version, as opposed to the “publishedVersion”, which means it’s available as open access on the publisher’s website I did some quality assurance and found ~100 that were marked as Limited Access, but should have been Open Access, and fixed a handful of licenses Delete duplicate metadata as described in my DSpace issue from last year: https://github.
- Read more →
-
-
-
-
-
-