diff --git a/content/posts/2018-11.md b/content/posts/2018-11.md index 448f32fb2..b8e270da9 100644 --- a/content/posts/2018-11.md +++ b/content/posts/2018-11.md @@ -232,6 +232,9 @@ $ grep -c -E 'session_id=[A-Z0-9]{32}:ip_addr=2a03:2880:11ff' dspace.log.2018-11 - I added the "most-popular" pages to the list that return `X-Robots-Tag: none` to try to inform bots not to index or follow those pages - Also, I implemented an nginx rate limit of twelve requests per minute on all dynamic pages... I figure a human user might legitimately request one every five seconds + +## 2018-11-05 + - I wrote a small Python script [add-dc-rights.py](https://gist.github.com/alanorth/4ff81d5f65613814a66cb6f84fdf1fc5) to add usage rights (`dc.rights`) to CGSpace items based on the CSV Hector gave me from MARLO: ``` @@ -254,4 +257,8 @@ $ ./add-dc-rights.py -i /tmp/marlo.csv -db dspace -u dspace -p 'fuuu' - 29,000 requests from Facebook, 29,000 Tomcat sessions, and none of the requests are to the dynamic pages I rate limited yesterday! +## 2018-11-06 + +- I updated all the [DSpace helper Python scripts](https://github.com/ilri/DSpace/wiki/Scripts) to validate against PEP 8 using Flake8 + diff --git a/docs/2018-11/index.html b/docs/2018-11/index.html index e370258f0..4ee52fcc1 100644 --- a/docs/2018-11/index.html +++ b/docs/2018-11/index.html @@ -23,7 +23,7 @@ Today these are the top 10 IPs: " /> - + @@ -52,9 +52,9 @@ Today these are the top 10 IPs: "@type": "BlogPosting", "headline": "November, 2018", "url": "https://alanorth.github.io/cgspace-notes/2018-11/", - "wordCount": "1129", + "wordCount": "1146", "datePublished": "2018-11-01T16:41:30+02:00", - "dateModified": "2018-11-05T17:45:39+02:00", + "dateModified": "2018-11-06T00:04:18+02:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -368,6 +368,11 @@ Today these are the top 10 IPs:
X-Robots-Tag: none
to try to inform bots not to index or follow those pagesdc.rights
) to CGSpace items based on the CSV Hector gave me from MARLO: