diff --git a/content/posts/2019-04.md b/content/posts/2019-04.md index 5a9b00408..419bf46f7 100644 --- a/content/posts/2019-04.md +++ b/content/posts/2019-04.md @@ -553,5 +553,20 @@ $ psql -c 'select * from pg_stat_activity' | grep -o -E '(dspaceWeb|dspaceApi|ds ``` - Ironically I do still see some 2 to 10% of CPU steal in `iostat 1 10` +- Leroy from CIAT contacted me to say he knows the team who is making all those requests to CGSpace + - I told them how to use the REST API to get the CIAT Datasets collection and enumerate its items +- In other news, Linode staff identified a noisy neighbor sharing our host and migrated it elsewhere last night + +## 2019-04-10 + +- Abenet pointed out a possibility of validating funders against the [CrossRef API](https://support.crossref.org/hc/en-us/articles/215788143-Funder-data-via-the-API) +- Note that if you use HTTPS and specify a contact address in the API request you have less likelihood of being blocked + +``` +$ http 'https://api.crossref.org/funders?query=mercator&mailto=me@cgiar.org' +``` + +- Otherwise, they provide the funder data in [CSV and RDF format](https://www.crossref.org/services/funder-registry/) +- I did a quick test with the recent IITA records against reconcile-csv in OpenRefine and it matched a few, but the ones that didn't match will need a human to go and do some manual checking and informed decision making... diff --git a/docs/2019-04/index.html b/docs/2019-04/index.html index 25d8fba8b..5ab2866b8 100644 --- a/docs/2019-04/index.html +++ b/docs/2019-04/index.html @@ -38,7 +38,7 @@ $ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-1-region.csv -db dspace - + @@ -81,9 +81,9 @@ $ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-1-region.csv -db dspace "@type": "BlogPosting", "headline": "April, 2019", "url": "https://alanorth.github.io/cgspace-notes/2019-04/", - "wordCount": "3073", + "wordCount": "3218", "datePublished": "2019-04-01T09:00:43+03:00", - "dateModified": "2019-04-08T20:22:40+03:00", + "dateModified": "2019-04-09T09:33:52+03:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -800,6 +800,27 @@ org.apache.tomcat.jdbc.pool.PoolExhaustedException: [http-bio-127.0.0.1-8443-exe + +

2019-04-10

+ + + +
$ http 'https://api.crossref.org/funders?query=mercator&mailto=me@cgiar.org'
+
+ + diff --git a/docs/sitemap.xml b/docs/sitemap.xml index f95d8b2f2..988928486 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -4,7 +4,7 @@ https://alanorth.github.io/cgspace-notes/2019-04/ - 2019-04-08T20:22:40+03:00 + 2019-04-09T09:33:52+03:00 @@ -219,7 +219,7 @@ https://alanorth.github.io/cgspace-notes/ - 2019-04-08T20:22:40+03:00 + 2019-04-09T09:33:52+03:00 0 @@ -230,7 +230,7 @@ https://alanorth.github.io/cgspace-notes/tags/notes/ - 2019-04-08T20:22:40+03:00 + 2019-04-09T09:33:52+03:00 0 @@ -242,13 +242,13 @@ https://alanorth.github.io/cgspace-notes/posts/ - 2019-04-08T20:22:40+03:00 + 2019-04-09T09:33:52+03:00 0 https://alanorth.github.io/cgspace-notes/tags/ - 2019-04-08T20:22:40+03:00 + 2019-04-09T09:33:52+03:00 0