mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Update notes for 2018-10-03
This commit is contained in:
@ -53,5 +53,47 @@ Given Names Deactivated Family Name Deactivated: 0000-0001-7930-5752
|
||||
|
||||
- It appears to be Jim Lorenzen... I need to check that later!
|
||||
- I merged the changes to the `5_x-prod` branch ([#390](https://github.com/ilri/DSpace/pull/390))
|
||||
- Linode sent another alert about CPU usage on CGSpace (linode18) this evening
|
||||
- It seems that Moayad is making quite a lot of requests today:
|
||||
|
||||
```
|
||||
# zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E "03/Oct/2018" | awk '{print $1}' | sort | uniq -c | sort -n | tail -n 10
|
||||
1594 157.55.39.160
|
||||
1627 157.55.39.173
|
||||
1774 136.243.6.84
|
||||
4228 35.237.175.180
|
||||
4497 70.32.83.92
|
||||
4856 66.249.64.59
|
||||
7120 50.116.102.77
|
||||
12518 138.201.49.199
|
||||
87646 34.218.226.147
|
||||
111729 213.139.53.62
|
||||
```
|
||||
|
||||
- But in super positive news, he says they are using my new [dspace-statistics-api](https://github.com/alanorth/dspace-statistics-api) and it's MUCH faster than using Atmire CUA's internal "restlet" API
|
||||
- I don't recognize the `138.201.49.199` IP, but it is in Germany (Hetzner) and appears to be paginating over some browse pages and downloading bitstreams:
|
||||
|
||||
```
|
||||
# grep 138.201.49.199 /var/log/nginx/access.log | grep -o -E 'GET /[a-z]+' | sort | uniq -c
|
||||
8324 GET /bitstream
|
||||
4193 GET /handle
|
||||
```
|
||||
|
||||
- Suspiciously, it's only grabbing the CGIAR System Office community (handle prefix 10947):
|
||||
|
||||
```
|
||||
# grep 138.201.49.199 /var/log/nginx/access.log | grep -o -E 'GET /handle/[0-9]{5}' | sort | uniq -c
|
||||
7 GET /handle/10568
|
||||
4186 GET /handle/10947
|
||||
```
|
||||
|
||||
- The user agent is suspicious too:
|
||||
|
||||
```
|
||||
Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.0 Safari/537.36
|
||||
```
|
||||
|
||||
- It's clearly a bot and it's not re-using its Tomcat session, so I will add its IP to the nginx bad bot list
|
||||
- I looked in Solr's statistics core and these hits were actually all counted as `isBot:false` (of course)... hmmm
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
||||
|
Reference in New Issue
Block a user