Update notes for 2017-10-31

This commit is contained in:
Alan Orth 2017-10-31 15:38:27 +02:00
parent 8fb542f0ac
commit db726df881
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
3 changed files with 40 additions and 8 deletions

View File

@ -322,3 +322,19 @@ WARNING: [SetPropertiesRule]{Server/Service/Engine/Host/Valve} Setting property
- Very nice, Linode alerted that CGSpace had high CPU usage at 2AM again - Very nice, Linode alerted that CGSpace had high CPU usage at 2AM again
- Ask on the dspace-tech mailing list if it's possible to use an existing item as a template for a new item - Ask on the dspace-tech mailing list if it's possible to use an existing item as a template for a new item
- To follow up on the CORE bot traffic, there were almost 300,000 request yesterday:
```
# grep "CORE/0.6" /var/log/nginx/access.log.1 | awk '{print $1}' | sort -n | uniq -c | sort -h
139109 137.108.70.6
139253 137.108.70.7
```
- I've emailed the CORE people to ask if they can update the repository information from CGIAR Library to CGSpace
- Also, I asked if they could perhaps use the `sitemap.xml`, OAI-PMH, or REST APIs to index us more efficiently, because they mostly seem to be crawling the nearly endless Discovery facets
- I added [GoAccess](https://goaccess.io/) to the list of package to install in the DSpace role of the [Ansible infrastructure scripts](https://github.com/ilri/rmg-ansible-public)
- It makes it very easy to analyze nginx logs from the command line, to see where traffic is coming from:
```
# goaccess /var/log/nginx/access.log --log-format=COMBINED
```

View File

@ -28,7 +28,7 @@ Add Katherine Lutz to the groups for content sumission and edit steps of the CGI
<meta property="article:published_time" content="2017-10-01T08:07:54&#43;03:00"/> <meta property="article:published_time" content="2017-10-01T08:07:54&#43;03:00"/>
<meta property="article:modified_time" content="2017-10-31T11:35:24&#43;02:00"/> <meta property="article:modified_time" content="2017-10-31T13:35:56&#43;02:00"/>
@ -66,9 +66,9 @@ Add Katherine Lutz to the groups for content sumission and edit steps of the CGI
"@type": "BlogPosting", "@type": "BlogPosting",
"headline": "October, 2017", "headline": "October, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-10/", "url": "https://alanorth.github.io/cgspace-notes/2017-10/",
"wordCount": "2340", "wordCount": "2468",
"datePublished": "2017-10-01T08:07:54&#43;03:00", "datePublished": "2017-10-01T08:07:54&#43;03:00",
"dateModified": "2017-10-31T11:35:24&#43;02:00", "dateModified": "2017-10-31T13:35:56&#43;02:00",
"author": { "author": {
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
@ -504,8 +504,24 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
<ul> <ul>
<li>Very nice, Linode alerted that CGSpace had high CPU usage at 2AM again</li> <li>Very nice, Linode alerted that CGSpace had high CPU usage at 2AM again</li>
<li>Ask on the dspace-tech mailing list if it&rsquo;s possible to use an existing item as a template for a new item</li> <li>Ask on the dspace-tech mailing list if it&rsquo;s possible to use an existing item as a template for a new item</li>
<li>To follow up on the CORE bot traffic, there were almost 300,000 request yesterday:</li>
</ul> </ul>
<pre><code># grep &quot;CORE/0.6&quot; /var/log/nginx/access.log.1 | awk '{print $1}' | sort -n | uniq -c | sort -h
139109 137.108.70.6
139253 137.108.70.7
</code></pre>
<ul>
<li>I&rsquo;ve emailed the CORE people to ask if they can update the repository information from CGIAR Library to CGSpace</li>
<li>Also, I asked if they could perhaps use the <code>sitemap.xml</code>, OAI-PMH, or REST APIs to index us more efficiently, because they mostly seem to be crawling the nearly endless Discovery facets</li>
<li>I added <a href="https://goaccess.io/">GoAccess</a> to the list of package to install in the DSpace role of the <a href="https://github.com/ilri/rmg-ansible-public">Ansible infrastructure scripts</a></li>
<li>It makes it very easy to analyze nginx logs from the command line, to see where traffic is coming from:</li>
</ul>
<pre><code># goaccess /var/log/nginx/access.log --log-format=COMBINED
</code></pre>

View File

@ -4,7 +4,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/2017-10/</loc> <loc>https://alanorth.github.io/cgspace-notes/2017-10/</loc>
<lastmod>2017-10-31T11:35:24+02:00</lastmod> <lastmod>2017-10-31T13:35:56+02:00</lastmod>
</url> </url>
<url> <url>
@ -129,7 +129,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2017-10-31T11:35:24+02:00</lastmod> <lastmod>2017-10-31T13:35:56+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
@ -146,19 +146,19 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
<lastmod>2017-10-31T11:35:24+02:00</lastmod> <lastmod>2017-10-31T13:35:56+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/post/</loc> <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
<lastmod>2017-10-31T11:35:24+02:00</lastmod> <lastmod>2017-10-31T13:35:56+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
<lastmod>2017-10-31T11:35:24+02:00</lastmod> <lastmod>2017-10-31T13:35:56+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>