mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2021-03-08
This commit is contained in:
@ -19,7 +19,7 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2021-03/" />
|
||||
<meta property="article:published_time" content="2021-03-01T10:13:54+02:00" />
|
||||
<meta property="article:modified_time" content="2021-03-06T13:35:20+02:00" />
|
||||
<meta property="article:modified_time" content="2021-03-07T15:51:12+02:00" />
|
||||
|
||||
|
||||
|
||||
@ -44,9 +44,9 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
|
||||
"@type": "BlogPosting",
|
||||
"headline": "March, 2021",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2021-03/",
|
||||
"wordCount": "1306",
|
||||
"wordCount": "1673",
|
||||
"datePublished": "2021-03-01T10:13:54+02:00",
|
||||
"dateModified": "2021-03-06T13:35:20+02:00",
|
||||
"dateModified": "2021-03-07T15:51:12+02:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -362,6 +362,59 @@ $ curl -X PUT "localhost:9200/openrxv-items/_settings" -H 'Content-Typ
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<h2 id="2021-03-08">2021-03-08</h2>
|
||||
<ul>
|
||||
<li>I approved the WLE item that I edited last week, and all the metadata is there: <a href="https://hdl.handle.net/10568/111810">https://hdl.handle.net/10568/111810</a>
|
||||
<ul>
|
||||
<li>So I’m not sure what Niroshini’s issue with metadata is…</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>Peter sent a message yesterday saying that his item finally got committed
|
||||
<ul>
|
||||
<li>I looked at the Munin graphs and there was a MASSIVE spike in database activity two days ago, and now database locks are back down to normal levels (from 1000+):</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<pre><code class="language-console" data-lang="console">$ psql -c 'SELECT * FROM pg_locks pl LEFT JOIN pg_stat_activity psa ON pl.pid = psa.pid;' | wc -l
|
||||
13
|
||||
</code></pre><ul>
|
||||
<li>On 2021-03-03 the PostgreSQL transactions started rising:</li>
|
||||
</ul>
|
||||
<p><img src="/cgspace-notes/2021/03/postgres_querylength_ALL-week.png" alt="PostgreSQL query length week"></p>
|
||||
<ul>
|
||||
<li>After that the connections and locks started going up, peaking on 2021-03-06:</li>
|
||||
</ul>
|
||||
<p><img src="/cgspace-notes/2021/03/postgres_locks_ALL-week.png" alt="PostgreSQL locks week">
|
||||
<img src="/cgspace-notes/2021/03/postgres_connections_ALL-week.png" alt="PostgreSQL connections week"></p>
|
||||
<ul>
|
||||
<li>I sent another message to Atmire to ask if they have time to look into this</li>
|
||||
<li>CIFOR is pressuring me to upload the batch items from last week
|
||||
<ul>
|
||||
<li>Vika sent me a final file with some duplicates that Peter identified removed</li>
|
||||
<li>I extracted and re-applied my basic corrections from last week in OpenRefine, then ran the items through <code>csv-metadata-quality</code> checker and uploaded them to CGSpace</li>
|
||||
<li>In total there are 1,088 items</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>Udana from IWMI emailed to ask about CGSpace thumbnails</li>
|
||||
<li>Udana from IWMI emailed to ask about an item uploaded recently that does not appear in AReS
|
||||
<ul>
|
||||
<li><a href="https://hdl.handle.net/10568/111794">The item</a> was added to the archive on 2021-03-05, and I last harvested on 2021-03-06, so this might be an issue of a missing item</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>Abenet got a quote from Atmire to buy 125 credits for 3750€</li>
|
||||
<li>Maria at Bioversity sent some feedback about duplicate items on AReS</li>
|
||||
<li>I’m wondering if the issue of the <code>openrxv-items-final</code> index not getting cleared after a successful harvest (which results in having 200,000, then 300,000, etc items) has to do with the alias issue I fixed yesterday
|
||||
<ul>
|
||||
<li>I will start a fresh harvest on AReS without now to check, but first back up the current index just in case:</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<pre><code class="language-console" data-lang="console">$ curl -X PUT "localhost:9200/openrxv-items-final/_settings" -H 'Content-Type: application/json' -d'{"settings": {"index.blocks.write": true}}'
|
||||
$ curl -s -X POST http://localhost:9200/openrxv-items-final/_clone/openrxv-items-final-2021-03-08
|
||||
# start harvesting on AReS
|
||||
</code></pre><ul>
|
||||
<li>As I saw on my local test instance, even when you cancel a harvesting, it replaces the <code>openrxv-items-final</code> index with whatever is in <code>openrxv-items-temp</code> automatically, so I assume it will do the same now</li>
|
||||
</ul>
|
||||
<!-- raw HTML omitted -->
|
||||
|
||||
|
||||
|
Reference in New Issue
Block a user