Update notes for 2019-03-26

This commit is contained in:
2019-03-27 09:51:30 +02:00
parent 9f7556a803
commit 28116d091e
4 changed files with 44 additions and 14 deletions

View File

@ -917,5 +917,20 @@ $ grep -o -E 'session_id=[A-Z0-9]{32}:ip_addr=(18.195.78.144|18.196.196.108)' ds
```
- I will add their IPs to the list of bot IPs in nginx so I can tag them as bots to let Tomcat's Crawler Session Manager Valve to force them to re-use their session
- Another user agent behaving badly in Colombia is "GuzzleHttp/6.3.3 curl/7.47.0 PHP/7.0.30-0ubuntu0.16.04.1"
- I will add curl to the Tomcat Crawler Session Manager because anyone using curl is most likely an automated read-only request
- I will add GuzzleHttp to the nginx badbots rate limiting, because it is making requests to dynamic Discovery pages
```
# zcat --force /var/log/nginx/{access,error,library-access}.log /var/log/nginx/{access,error,library-access}.log.1 | grep 45.5.184.72 | grep -E "26/Mar/2019:" | grep -E '(discover|browse)' | wc -l
119
```
- What's strange is that I can't see any of their requests in the DSpace log...
```
$ grep -I -c 45.5.184.72 dspace.log.2019-03-26
0
```
<!-- vim: set sw=2 ts=2: -->