mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2017-11-23
This commit is contained in:
@ -780,3 +780,49 @@ $ grep -c com.atmire.utils.UpdateSolrStatsMetadata dspace.log.2017-11-19
|
||||
- In other news, it looks like the JVM garbage collection pattern is back to its standard jigsaw pattern after switching back to CMS a few days ago:
|
||||
|
||||

|
||||
|
||||
## 2017-11-23
|
||||
|
||||
- Linode alerted again that CPU usage was high on CGSpace from 4:13 to 6:13 AM
|
||||
- I see a lot of Googlebot (66.249.66.90) in the XMLUI access logs
|
||||
|
||||
```
|
||||
# cat /var/log/nginx/access.log /var/log/nginx/access.log.1 | grep -E "23/Nov/2017:0[456]" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||
88 66.249.66.91
|
||||
140 68.180.229.254
|
||||
155 54.196.2.131
|
||||
182 54.224.164.166
|
||||
301 157.55.39.79
|
||||
315 207.46.13.36
|
||||
331 207.46.13.23
|
||||
358 207.46.13.137
|
||||
565 104.196.152.243
|
||||
1570 66.249.66.90
|
||||
```
|
||||
|
||||
- ... and the usual REST scrapers from CIAT (45.5.184.196) and CCAFS (70.32.83.92):
|
||||
|
||||
```
|
||||
# cat /var/log/nginx/rest.log /var/log/nginx/rest.log.1 | grep -E "23/Nov/2017:0[456]" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||
5 190.120.6.219
|
||||
6 104.198.9.108
|
||||
14 104.196.152.243
|
||||
21 112.134.150.6
|
||||
22 157.55.39.79
|
||||
22 207.46.13.137
|
||||
23 207.46.13.36
|
||||
26 207.46.13.23
|
||||
942 45.5.184.196
|
||||
3995 70.32.83.92
|
||||
```
|
||||
|
||||
- These IPs crawling the REST API don't specify user agents and I'd assume they are creating many Tomcat sessions
|
||||
- I would catch them in nginx to assign a "bot" user agent to them so that the Tomcat Crawler Session Manager valve could deal with them, but they seem to create any really — at least not in the dspace.log:
|
||||
|
||||
```
|
||||
$ grep 70.32.83.92 dspace.log.2017-11-23 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | uniq | wc -l
|
||||
2
|
||||
```
|
||||
|
||||
- I'm wondering if REST works differently, or just doesn't log these sessions?
|
||||
- I wonder if they are measurable via JMX MBeans?
|
||||
|
Reference in New Issue
Block a user