<!DOCTYPE html> <html lang="en-us"> <head prefix="og: http://ogp.me/ns#"> <meta charset="utf-8" /> <meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1" /> <meta property="og:title" content=" May, 2016 · CGSpace Notes" /> <meta property="og:site_name" content="CGSpace Notes" /> <meta property="og:url" content="/cgspace-notes/2016-05/" /> <meta property="og:type" content="article" /> <meta property="og:article:published_time" content="2016-05-01T23:06:00+03:00" /> <meta property="og:article:tag" content="notes" /> <title> May, 2016 · CGSpace Notes </title> <link rel="stylesheet" href="/cgspace-notes/css/bootstrap.min.css" /> <link rel="stylesheet" href="/cgspace-notes/css/main.css" /> <link rel="stylesheet" href="/cgspace-notes/css/font-awesome.min.css" /> <link rel="stylesheet" href="/cgspace-notes/css/github.css" /> <link rel="stylesheet" href="//fonts.googleapis.com/css?family=Source+Sans+Pro:200,300,400" type="text/css"> <link rel="shortcut icon" href="/cgspace-notes/images/favicon.ico" /> <link rel="apple-touch-icon" href="/cgspace-notes/images/apple-touch-icon.png" /> </head> <body> <header class="global-header" style="background-image:url(../images/bg.jpg )"> <section class="header-text"> <h1><a href="/cgspace-notes/">CGSpace Notes</a></h1> <div class="sns-links hidden-print"> </div> <a href="/cgspace-notes/" class="btn-header btn-back hidden-xs"> <i class="fa fa-angle-left" aria-hidden="true"></i> Home </a> </section> </header> <main class="container"> <article> <header> <h1 class="text-primary">May, 2016</h1> <div class="post-meta clearfix"> <div class="post-date pull-left"> Posted on <time datetime="2016-05-01T23:06:00+03:00"> May 1, 2016 </time> </div> <div class="pull-right"> <span class="post-tag small"><a href="/cgspace-notes//tags/notes">#notes</a></span> </div> </div> </header> <section> <h2 id="2016-05-01:b7bf1a0f8f2415a40e1e11e343b04c0d">2016-05-01</h2> <ul> <li>Since yesterday there have been 10,000 REST errors and the site has been unstable again</li> <li>I have blocked access to the API now</li> <li>There are 3,000 IPs accessing the REST API in a 24-hour period!</li> </ul> <pre><code># awk '{print $1}' /var/log/nginx/rest.log | uniq | wc -l 3168 </code></pre> <ul> <li>The two most often requesters are in Ethiopia and Colombia: 213.55.99.121 and 181.118.144.29</li> <li>100% of the requests coming from Ethiopia are like this and result in an HTTP 500:</li> </ul> <pre><code>GET /rest/handle/10568/NaN?expand=parentCommunityList,metadata HTTP/1.1 </code></pre> <ul> <li>For now I’ll block just the Ethiopian IP</li> <li>The owner of that application has said that the <code>NaN</code> (not a number) is an error in his code and he’ll fix it</li> </ul> <h2 id="2016-05-03:b7bf1a0f8f2415a40e1e11e343b04c0d">2016-05-03</h2> <ul> <li>Update nginx to 1.10.x branch on CGSpace</li> <li>Fix a reference to <code>dc.type.output</code> in Discovery that I had missed when we migrated to <code>dc.type</code> last month (<a href="https://github.com/ilri/DSpace/pull/223">#223</a>)</li> </ul> <p><img src="../images/2016/05/discovery-types.png" alt="Item type in Discovery results" /></p> <h2 id="2016-05-06:b7bf1a0f8f2415a40e1e11e343b04c0d">2016-05-06</h2> <ul> <li>DSpace Test is down, <code>catalina.out</code> has lots of messages about heap space from some time yesterday (!)</li> <li>It looks like Sisay was doing some batch imports</li> <li>Hmm, also disk space is full</li> <li>I decided to blow away the solr indexes, since they are 50GB and we don’t really need all the Atmire stuff there right now</li> <li>I will re-generate the Discovery indexes after re-deploying</li> <li>Testing <code>renew-letsencrypt.sh</code> script for nginx</li> </ul> <pre><code>#!/usr/bin/env bash readonly SERVICE_BIN=/usr/sbin/service readonly LETSENCRYPT_BIN=/opt/letsencrypt/letsencrypt-auto # stop nginx so LE can listen on port 443 $SERVICE_BIN nginx stop $LETSENCRYPT_BIN renew -nvv --standalone --standalone-supported-challenges tls-sni-01 > /var/log/letsencrypt/renew.log 2>&1 LE_RESULT=$? $SERVICE_BIN nginx start if [[ "$LE_RESULT" != 0 ]]; then echo 'Automated renewal failed:' cat /var/log/letsencrypt/renew.log exit 1 fi </code></pre> <ul> <li>Seems to work well</li> </ul> <h2 id="2016-05-10:b7bf1a0f8f2415a40e1e11e343b04c0d">2016-05-10</h2> <ul> <li>Start looking at more metadata migrations</li> <li>There are lots of fields in <code>dcterms</code> namespace that look interesting, like: <ul> <li>dcterms.type</li> <li>dcterms.spatial</li> </ul></li> <li>Not sure what <code>dcterms</code> is…</li> <li>Looks like these were <a href="https://wiki.duraspace.org/display/DSDOC5x/Metadata+and+Bitstream+Format+Registries#MetadataandBitstreamFormatRegistries-DublinCoreTermsRegistry(DCTERMS)">added in DSpace 4</a> to allow for future work to make DSpace more flexible</li> <li>CGSpace’s <code>dc</code> registry has 96 items, and the default DSpace one has 73.</li> </ul> </section> <footer> <section class="author-info row"> <div class="author-avatar col-md-2"> </div> <div class="author-meta col-md-6"> <h1 class="author-name text-primary">Alan Orth</h1> </div> </section> <ul class="pager"> <li class="previous"><a href="/cgspace-notes/2016-04/"><span aria-hidden="true">←</span> Older</a></li> <li class="next disabled"><a href="#">Newer <span aria-hidden="true">→</span></a></li> </ul> </footer> </article> </main> <footer class="container global-footer"> <div class="copyright-note pull-left"> </div> <div class="sns-links hidden-print"> </div> </footer> <script src="/cgspace-notes/js/highlight.pack.js"></script> <script> hljs.initHighlightingOnLoad(); </script> </body> </html>