mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2020-08-13
This commit is contained in:
@ -385,4 +385,17 @@ dspace=# SELECT count(text_value) FROM metadatavalue WHERE metadata_field_id = 2
|
||||
- I noticed a bunch of user agents with "Crawl" in the Solr stats, which is strange because the DSpace spider agents file has had "crawl" for a long time (and it is case insensitive)
|
||||
- In any case I will purge them and add them to the Tomcat Crawler Session Manager Valve so that at least their sessions get re-used
|
||||
|
||||
## 2020-08-13
|
||||
|
||||
- Linode keeps sending mails that the load and outgoing bandwidth is above the threshold
|
||||
- I took a look briefly and found two IPs with the "Delphi 2009" user agent
|
||||
- Then there is 88.99.115.53 which made 82,000 requests in 2020 so far with no user agent
|
||||
- 64.62.202.73 has made 7,000 requests with this user agent `Mozilla/5.0 (compatible; +centuryb.o.t9[at]gmail.com)`
|
||||
- I had added it to the Tomcat Crawler Session Manager Valve last week but never purged the hits from Solr
|
||||
- 195.54.160.163 is making thousands of requests with user agents liket this:
|
||||
|
||||
`(CASE WHEN 2850=9474 THEN 2850 ELSE NULL END)`
|
||||
|
||||
- I purged 150,000 hits from 2020 and 2020 from these user agents and hosts
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
||||
|
Reference in New Issue
Block a user