CGSpace Notes

Documenting day-to-day work on the CGSpace repository.

November, 2024

2024-11-11

  • Some IP in India is making tons of requests this morning with a normal user agent:
# awk '{print $1}' /var/log/nginx/api-access.log | sort | uniq -c | sort -h | tail -n 40
...
513743 49.207.196.249
Read more →

August, 2024

2024-08-08

  • While working on the CGIAR Climate Change Synthesis I learned some new tricks with OpenRefine
Read more →

July, 2024

2024-07-01

  • A bit of work to clean up duplicate DOIs on CGSpace
    • A handful of book chapters, working papers, and journal articles using the wrong DOI
  • I tried to delete all users who have been inactive since six years ago (July 1, 2018):
Read more →

May, 2024

2024-05-01

  • I dumped all the CGSpace DOIs and resolved them with my crossref_doi_lookup.py script
    • Then I did some work to add missing abstracts (about 900!), volumes, issues, licenses, publishers, and types, etc
Read more →