Notes for 2020-05-03

Alan Orth 2020-05-03 16:10:21 +03:00
parent 8d99b8bd08
commit a569001149
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
3 changed files with 44 additions and 8 deletions


@@ -13,4 +13,19 @@ categories: ["Notes"]
<!--more-->
## 2020-05-03
- Purge a few remaining bots from CGSpace Solr statistics that I had identified a few months ago
  - `lua-resty-http/0.10 (Lua) ngx_lua/10000`
  - `omgili/0.5 +http://omgili.com`
  - `IZaBEE/IZaBEE-1.01 (Buzzing Abound The Web; https://izabee.com; info at izabee dot com)`
  - `Twurly v1.1 (https://twurly.org)`
  - `Pattern/2.6 +http://www.clips.ua.ac.be/pattern`
  - `CyotekWebCopy/1.7 CyotekHTTP/2.0`
- This is only about 2,500 hits total from the last ten years, and half of these bots no longer seem to exist, so I won't bother submitting them to the COUNTER-Robots project
- I noticed that our custom themes were incorrectly linking to the OpenSearch XML file
  - The bug [was fixed](https://jira.lyrasis.org/browse/DS-2592) for Mirage2 in 2015
  - Note that this did not prevent OpenSearch itself from working
  - I will patch this on our DSpace 5.x and 6.x branches
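- A minimal sketch of how a purge like the one above could be expressed as a Solr delete-by-query request body, assuming the statistics core stores the agent in a field named `userAgent` and that exact phrase matching is acceptable; the helper names `solr_phrase` and `build_delete_payload` are hypothetical, and this is not necessarily how the hits were actually purged:

```python
import json


def solr_phrase(value: str) -> str:
    """Quote a value as a Solr phrase, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def build_delete_payload(user_agents, field="userAgent"):
    """Build a JSON delete-by-query body matching any of the given user agents.

    The field name is an assumption about the statistics schema.
    """
    clauses = [f"{field}:{solr_phrase(ua)}" for ua in user_agents]
    return json.dumps({"delete": {"query": " OR ".join(clauses)}})


# Two of the user agents listed in the notes above
BOTS = [
    "omgili/0.5 +http://omgili.com",
    "Twurly v1.1 (https://twurly.org)",
]

if __name__ == "__main__":
    # Print the body only; nothing is sent to any server here
    print(build_delete_payload(BOTS))
```

- The sketch only builds the request body, so it can be reviewed before being posted to the core's update handler with whatever HTTP client is at hand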
<!-- vim: set sw=2 ts=2: -->


@@ -18,7 +18,7 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-05/" />
<meta property="article:published_time" content="2020-05-02T09:52:04+03:00" />
<meta property="article:modified_time" content="2020-05-02T09:52:04+03:00" />
<meta property="article:modified_time" content="2020-05-02T10:08:14+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="May, 2020"/>
@@ -41,9 +41,9 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
"@type": "BlogPosting",
"headline": "May, 2020",
"url": "https://alanorth.github.io/cgspace-notes/2020-05/",
"wordCount": "86",
"wordCount": "202",
"datePublished": "2020-05-02T09:52:04+03:00",
"dateModified": "2020-05-02T09:52:04+03:00",
"dateModified": "2020-05-02T10:08:14+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@@ -126,6 +126,27 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
</ul>
</li>
</ul>
<h2 id="2020-05-03">2020-05-03</h2>
<ul>
<li>Purge a few remaining bots from CGSpace Solr statistics that I had identified a few months ago
<ul>
<li><code>lua-resty-http/0.10 (Lua) ngx_lua/10000</code></li>
<li><code>omgili/0.5 +http://omgili.com</code></li>
<li><code>IZaBEE/IZaBEE-1.01 (Buzzing Abound The Web; https://izabee.com; info at izabee dot com)</code></li>
<li><code>Twurly v1.1 (https://twurly.org)</code></li>
<li><code>Pattern/2.6 +http://www.clips.ua.ac.be/pattern</code></li>
<li><code>CyotekWebCopy/1.7 CyotekHTTP/2.0</code></li>
</ul>
</li>
<li>This is only about 2,500 hits total from the last ten years, and half of these bots no longer seem to exist, so I won&rsquo;t bother submitting them to the COUNTER-Robots project</li>
<li>I noticed that our custom themes were incorrectly linking to the OpenSearch XML file
<ul>
<li>The bug <a href="https://jira.lyrasis.org/browse/DS-2592">was fixed</a> for Mirage2 in 2015</li>
<li>Note that this did not prevent OpenSearch itself from working</li>
<li>I will patch this on our DSpace 5.x and 6.x branches</li>
</ul>
</li>
</ul>
<!-- raw HTML omitted -->


@@ -4,27 +4,27 @@
<url>
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
<lastmod>2020-05-02T09:52:04+03:00</lastmod>
<lastmod>2020-05-02T10:08:14+03:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2020-05-02T09:52:04+03:00</lastmod>
<lastmod>2020-05-02T10:08:14+03:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/2020-05/</loc>
<lastmod>2020-05-02T09:52:04+03:00</lastmod>
<lastmod>2020-05-02T10:08:14+03:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2020-05-02T09:52:04+03:00</lastmod>
<lastmod>2020-05-02T10:08:14+03:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
<lastmod>2020-05-02T09:52:04+03:00</lastmod>
<lastmod>2020-05-02T10:08:14+03:00</lastmod>
</url>
<url>