2021-05-01
- I looked at the top user agents and IPs in the Solr statistics for last month and I see these user agents:
- “RI/1.0”, 1337
- “Microsoft Office Word 2014”, 941
- I will add the RI/1.0 pattern to our DSpace agents overload and purge them from Solr (we had previously seen this agent with 9,000 hits or so in 2020-09), but I think I will leave the Microsoft Word one… as that’s an actual user…
Read more →
2021-04-01
- I wrote a script to query Sherpa’s API for our ISSNs:
sherpa-issn-lookup.py
- I’m curious to see how the results compare with the results from Crossref yesterday
- AReS Explorer was down since this morning, I didn’t see anything in the systemd journal
- I simply took everything down with docker-compose and then back up, and then it was OK
- Perhaps one of the containers crashed, I should have looked closer but I was in a hurry
Read more →
2021-03-01
- Discuss some OpenRXV issues with Abdullah from CodeObia
- He’s trying to work on the DSpace 6+ metadata schema autoimport using the DSpace 6+ REST API
- Also, we found some issues building and running OpenRXV currently due to ecosystem shift in the Node.js dependencies
Read more →
Changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2. Implemented on 2021-02-21.
With reference to CG Core v2 draft standard by Marie-Angélique as well as DCMI DCTERMS.
Read more →
2021-02-01
- Abenet said that CIP found more duplicate records in their export from AReS
- I had a call with CodeObia to discuss the work on OpenRXV
- Check the results of the AReS harvesting from last night:
$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty'
{
"count" : 100875,
"_shards" : {
"total" : 1,
"successful" : 1,
"skipped" : 0,
"failed" : 0
}
}
Read more →
2021-01-03
- Peter notified me that some filters on AReS were broken again
- It’s the same issue with the field names getting
.keyword
appended to the end that I already filed an issue on OpenRXV about last month
- I fixed the broken filters (careful to not edit any others, lest they break too!)
- Fix an issue with start page number for the DSpace REST API and statistics API in OpenRXV
- The start page had been “1” in the UI, but in the backend they were doing some gymnastics to adjust to the zero-based offset/limit/page of the DSpace REST API and the statistics API
- I adjusted it to default to 0 and added a note to the admin screen
- I realized that this issue was actually causing the first page of 100 statistics to be missing…
- For example, this item has 51 views on CGSpace, but 0 on AReS
Read more →
2020-12-01
- Atmire responded about the issue with duplicate data in our Solr statistics
- They noticed that some records in the statistics-2015 core haven’t been migrated with the AtomicStatisticsUpdateCLI tool yet and assumed that I haven’t migrated any of the records yet
- That’s strange, as I checked all ten cores and 2015 is the only one with some unmigrated documents, as according to the
cua_version
field
- I started processing those (about 411,000 records):
Read more →
Notes about the DSpace 6 upgrade on CGSpace in 2020-11.
Read more →
2020-11-01
- Continue with processing the statistics-2019 Solr core with the AtomicStatisticsUpdateCLI tool on DSpace Test
- So far we’ve spent at least fifty hours to process the statistics and statistics-2019 core… wow.
Read more →
2020-10-06
- Add tests for the new
/items
POST handlers to the DSpace 6.x branch of my dspace-statistics-api
- Trying to test the changes Atmire sent last week but I had to re-create my local database from a recent CGSpace dump
- During the FlywayDB migration I got an error:
Read more →