From 41f6b356dd66a604f6205887b000ae2071fcb76d Mon Sep 17 00:00:00 2001
From: Alan Orth
Date: Mon, 30 Oct 2017 18:45:35 +0200
Subject: [PATCH] Update notes for 2017-10-30

---
 content/post/2017-10.md   |  7 +++++++
 public/2017-10/index.html | 14 +++++++++++---
 public/sitemap.xml        | 10 +++++-----
 3 files changed, 23 insertions(+), 8 deletions(-)

diff --git a/content/post/2017-10.md b/content/post/2017-10.md
index 618add04c..a1a51303a 100644
--- a/content/post/2017-10.md
+++ b/content/post/2017-10.md
@@ -300,6 +300,13 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
 - From looking at the requests, it appears these are from CIAT and CCAFS
 - I wonder if I could somehow instruct them to use a user agent so that we could apply a crawler session manager valve to them
 - Actually, according to the Tomcat docs, we could use an IP with `crawlerIps`: https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve
+- Ah, wait, it looks like `crawlerIps` only came in 2017-06, so probably isn't in Ubuntu 16.04's 7.0.68 build!
+- That would explain the errors I was getting when trying to set it:
+
+```
+WARNING: [SetPropertiesRule]{Server/Service/Engine/Host/Valve} Setting property 'crawlerIps' to '190\.19\.92\.5|104\.196\.152\.243' did not find a matching property.
+```
+
 - As for now, it actually seems the CORE bot coming from 137.108.70.6 and 137.108.70.7 is only using a few sessions per day, which is good:
 
 ```
diff --git a/public/2017-10/index.html b/public/2017-10/index.html
index 9df71442e..47c6f5b7c 100644
--- a/public/2017-10/index.html
+++ b/public/2017-10/index.html
@@ -28,7 +28,7 @@ Add Katherine Lutz to the groups for content sumission and edit steps of the CGI
-
+
@@ -66,9 +66,9 @@ Add Katherine Lutz to the groups for content sumission and edit steps of the CGI
 "@type": "BlogPosting",
 "headline": "October, 2017",
 "url": "https://alanorth.github.io/cgspace-notes/2017-10/",
- "wordCount": "2261",
+ "wordCount": "2305",
 "datePublished": "2017-10-01T08:07:54+03:00",
- "dateModified": "2017-10-29T10:02:34+02:00",
+ "dateModified": "2017-10-30T18:01:05+02:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -478,6 +478,14 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
 From looking at the requests, it appears these are from CIAT and CCAFS
 I wonder if I could somehow instruct them to use a user agent so that we could apply a crawler session manager valve to them
 Actually, according to the Tomcat docs, we could use an IP with crawlerIps: https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve
+Ah, wait, it looks like crawlerIps only came in 2017-06, so probably isn’t in Ubuntu 16.04’s 7.0.68 build!
+That would explain the errors I was getting when trying to set it:
+
+
+WARNING: [SetPropertiesRule]{Server/Service/Engine/Host/Valve} Setting property 'crawlerIps' to '190\.19\.92\.5|104\.196\.152\.243' did not find a matching property.
+
+
+
diff --git a/public/sitemap.xml b/public/sitemap.xml
index 2c40b65c1..f4e0e1b41 100644
--- a/public/sitemap.xml
+++ b/public/sitemap.xml
@@ -4,7 +4,7 @@
 https://alanorth.github.io/cgspace-notes/2017-10/
- 2017-10-29T10:02:34+02:00
+ 2017-10-30T18:01:05+02:00
@@ -129,7 +129,7 @@
 https://alanorth.github.io/cgspace-notes/
- 2017-10-29T10:02:34+02:00
+ 2017-10-30T18:01:05+02:00
 0
@@ -140,7 +140,7 @@
 https://alanorth.github.io/cgspace-notes/tags/notes/
- 2017-10-29T10:02:34+02:00
+ 2017-10-30T18:01:05+02:00
 0
@@ -152,13 +152,13 @@
 https://alanorth.github.io/cgspace-notes/post/
- 2017-10-29T10:02:34+02:00
+ 2017-10-30T18:01:05+02:00
 0
 https://alanorth.github.io/cgspace-notes/tags/
- 2017-10-29T10:02:34+02:00
+ 2017-10-30T18:01:05+02:00
 0
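
The `crawlerIps` value quoted in the WARNING above is a regular expression over client addresses. Since the installed Tomcat build apparently does not accept the attribute, the pattern can at least be sanity-checked outside Tomcat. What follows is a minimal sketch, not Tomcat's implementation: the helper name `is_crawler_ip` is hypothetical, and it assumes the valve matches the whole remote address against the pattern (as Java's `Matcher.matches()` would), which the notes do not confirm.

```python
import re

# Pattern copied verbatim from the WARNING line in the notes.
CRAWLER_IPS = r"190\.19\.92\.5|104\.196\.152\.243"


def is_crawler_ip(remote_addr: str, pattern: str = CRAWLER_IPS) -> bool:
    """Return True when the whole address matches the pattern.

    Assumption: whole-string matching, similar to Java's Matcher.matches().
    This is a hypothetical helper for checking the regex, not Tomcat code.
    """
    return re.fullmatch(pattern, remote_addr) is not None


if __name__ == "__main__":
    # The first two addresses come from the pattern itself; 137.108.70.6 is
    # the CORE bot address from the notes; the last is a near miss.
    for addr in ("190.19.92.5", "104.196.152.243", "137.108.70.6", "190.19.92.50"):
        print(addr, is_crawler_ip(addr))
```

Under the whole-string assumption, the escaped dots and the `|` alternation mean only the two listed addresses match. A longer address that merely starts with one of them, such as `190.19.92.50`, does not.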