mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-26 16:38:19 +01:00
Update notes for 2017-10-30
This commit is contained in:
parent
5646221467
commit
41f6b356dd
@ -300,6 +300,13 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
|
|||||||
- From looking at the requests, it appears these are from CIAT and CCAFS
|
- From looking at the requests, it appears these are from CIAT and CCAFS
|
||||||
- I wonder if I could somehow instruct them to use a user agent so that we could apply a crawler session manager valve to them
|
- I wonder if I could somehow instruct them to use a user agent so that we could apply a crawler session manager valve to them
|
||||||
- Actually, according to the Tomcat docs, we could use an IP with `crawlerIps`: https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve
|
- Actually, according to the Tomcat docs, we could use an IP with `crawlerIps`: https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve
|
||||||
|
- Ah, wait, it looks like `crawlerIps` only came in 2017-06, so probably isn't in Ubuntu 16.04's 7.0.68 build!
|
||||||
|
- That would explain the errors I was getting when trying to set it:
|
||||||
|
|
||||||
|
```
|
||||||
|
WARNING: [SetPropertiesRule]{Server/Service/Engine/Host/Valve} Setting property 'crawlerIps' to '190\.19\.92\.5|104\.196\.152\.243' did not find a matching property.
|
||||||
|
```
|
||||||
|
|
||||||
- As for now, it actually seems the CORE bot coming from 137.108.70.6 and 137.108.70.7 is only using a few sessions per day, which is good:
|
- As for now, it actually seems the CORE bot coming from 137.108.70.6 and 137.108.70.7 is only using a few sessions per day, which is good:
|
||||||
|
|
||||||
```
|
```
|
||||||
|
@ -28,7 +28,7 @@ Add Katherine Lutz to the groups for content sumission and edit steps of the CGI
|
|||||||
|
|
||||||
<meta property="article:published_time" content="2017-10-01T08:07:54+03:00"/>
|
<meta property="article:published_time" content="2017-10-01T08:07:54+03:00"/>
|
||||||
|
|
||||||
<meta property="article:modified_time" content="2017-10-29T10:02:34+02:00"/>
|
<meta property="article:modified_time" content="2017-10-30T18:01:05+02:00"/>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
@ -66,9 +66,9 @@ Add Katherine Lutz to the groups for content sumission and edit steps of the CGI
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "October, 2017",
|
"headline": "October, 2017",
|
||||||
"url": "https://alanorth.github.io/cgspace-notes/2017-10/",
|
"url": "https://alanorth.github.io/cgspace-notes/2017-10/",
|
||||||
"wordCount": "2261",
|
"wordCount": "2305",
|
||||||
"datePublished": "2017-10-01T08:07:54+03:00",
|
"datePublished": "2017-10-01T08:07:54+03:00",
|
||||||
"dateModified": "2017-10-29T10:02:34+02:00",
|
"dateModified": "2017-10-30T18:01:05+02:00",
|
||||||
"author": {
|
"author": {
|
||||||
"@type": "Person",
|
"@type": "Person",
|
||||||
"name": "Alan Orth"
|
"name": "Alan Orth"
|
||||||
@ -478,6 +478,14 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
|
|||||||
<li>From looking at the requests, it appears these are from CIAT and CCAFS</li>
|
<li>From looking at the requests, it appears these are from CIAT and CCAFS</li>
|
||||||
<li>I wonder if I could somehow instruct them to use a user agent so that we could apply a crawler session manager valve to them</li>
|
<li>I wonder if I could somehow instruct them to use a user agent so that we could apply a crawler session manager valve to them</li>
|
||||||
<li>Actually, according to the Tomcat docs, we could use an IP with <code>crawlerIps</code>: <a href="https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve">https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve</a></li>
|
<li>Actually, according to the Tomcat docs, we could use an IP with <code>crawlerIps</code>: <a href="https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve">https://tomcat.apache.org/tomcat-7.0-doc/config/valve.html#Crawler_Session_Manager_Valve</a></li>
|
||||||
|
<li>Ah, wait, it looks like <code>crawlerIps</code> only came in 2017-06, so probably isn’t in Ubuntu 16.04’s 7.0.68 build!</li>
|
||||||
|
<li>That would explain the errors I was getting when trying to set it:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code>WARNING: [SetPropertiesRule]{Server/Service/Engine/Host/Valve} Setting property 'crawlerIps' to '190\.19\.92\.5|104\.196\.152\.243' did not find a matching property.
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
<li>As for now, it actually seems the CORE bot coming from 137.108.70.6 and 137.108.70.7 is only using a few sessions per day, which is good:</li>
|
<li>As for now, it actually seems the CORE bot coming from 137.108.70.6 and 137.108.70.7 is only using a few sessions per day, which is good:</li>
|
||||||
</ul>
|
</ul>
|
||||||
|
|
||||||
|
@ -4,7 +4,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/2017-10/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/2017-10/</loc>
|
||||||
<lastmod>2017-10-29T10:02:34+02:00</lastmod>
|
<lastmod>2017-10-30T18:01:05+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
@ -129,7 +129,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||||
<lastmod>2017-10-29T10:02:34+02:00</lastmod>
|
<lastmod>2017-10-30T18:01:05+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -140,7 +140,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||||
<lastmod>2017-10-29T10:02:34+02:00</lastmod>
|
<lastmod>2017-10-30T18:01:05+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -152,13 +152,13 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
||||||
<lastmod>2017-10-29T10:02:34+02:00</lastmod>
|
<lastmod>2017-10-30T18:01:05+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||||
<lastmod>2017-10-29T10:02:34+02:00</lastmod>
|
<lastmod>2017-10-30T18:01:05+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user