mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes
@ -32,7 +32,7 @@ $ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2021-02/" />
<meta property="article:published_time" content="2021-02-01T10:13:54+02:00" />
<meta property="article:modified_time" content="2021-02-21T20:37:27+02:00" />
<meta property="article:modified_time" content="2021-02-24T09:21:07+02:00" />
@ -60,7 +60,7 @@ $ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty
}
}
"/>
<meta name="generator" content="Hugo 0.80.0" />
<meta name="generator" content="Hugo 0.81.0" />
@ -70,9 +70,9 @@ $ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty
"@type": "BlogPosting",
"headline": "February, 2021",
"url": "https://alanorth.github.io/cgspace-notes/2021-02/",
"wordCount": "3754",
"wordCount": "4170",
"datePublished": "2021-02-01T10:13:54+02:00",
"dateModified": "2021-02-21T20:37:27+02:00",
"dateModified": "2021-02-24T09:21:07+02:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -779,7 +779,7 @@ UPDATE 18659
</ul>
<pre><code class="language-console" data-lang="console">$ dspace metadata-import -f /tmp/0.csv
</code></pre><ul>
<li>It took FOREVER to import each file… like several hours. MY GOD DSpace 6 is slow.</li>
<li>It took FOREVER to import each file… like several hours <em>each</em>. MY GOD DSpace 6 is slow.</li>
<li>Help Dominique Perera debug some issues with the WordPress DSpace importer plugin from Macaroni Bros
<ul>
<li>She is not seeing the community list for CGSpace, and I see weird requests like this in the logs:</li>
@ -791,7 +791,94 @@ UPDATE 18659
</code></pre><ul>
<li>The first request is OK, but the second one is malformed for sure</li>
</ul>
<!-- raw HTML omitted -->
<h2 id="2021-02-24">2021-02-24</h2>
<ul>
<li>Export a list of journals for Peter to look through:</li>
</ul>
<pre><code class="language-console" data-lang="console">localhost/dspace63= > \COPY (SELECT DISTINCT text_value as "cg.journal", count(*) FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=251 GROUP BY text_value ORDER BY count DESC) to /tmp/2021-02-24-journals.csv WITH CSV HEADER;
COPY 3345
</code></pre><ul>
<li>Start a fresh harvest on AReS because Udana mapped some items today and wants to include them in his report:</li>
</ul>
<pre><code class="language-console" data-lang="console">$ curl -XDELETE 'http://localhost:9200/openrxv-items-temp'
# start indexing in AReS
</code></pre><ul>
<li>Also, I want to include the new series name/number cleanups so it’s not a total waste of time</li>
</ul>
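<ul>
<li>For reference, a minimal Python sketch of the same aggregation as the journal export above, run against an in-memory SQLite stand-in rather than the real DSpace database. The table contents and the helper names (<code>journal_counts</code>, <code>to_csv</code>, <code>demo_connection</code>) are made up for illustration; only the table and column names come from the query. It also shows that <code>GROUP BY text_value</code> already yields one row per journal, so the <code>DISTINCT</code> should not be needed.</li>
</ul>

```python
import csv
import io
import sqlite3


def journal_counts(conn, field_id=251):
    """Count item metadata values per journal title, most frequent first."""
    query = """
        SELECT text_value, count(*) AS n
        FROM metadatavalue
        WHERE dspace_object_id IN (SELECT uuid FROM item)
          AND metadata_field_id = ?
        GROUP BY text_value
        ORDER BY n DESC, text_value
    """
    return conn.execute(query, (field_id,)).fetchall()


def to_csv(rows):
    """Render rows as CSV with a header line, similar to WITH CSV HEADER."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["cg.journal", "count"])
    writer.writerows(rows)
    return buffer.getvalue()


def demo_connection():
    """Build a tiny in-memory stand-in for the two tables (hypothetical rows)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE item (uuid TEXT PRIMARY KEY)")
    conn.execute(
        "CREATE TABLE metadatavalue ("
        "dspace_object_id TEXT, metadata_field_id INTEGER, text_value TEXT)"
    )
    conn.executemany("INSERT INTO item VALUES (?)", [("a",), ("b",), ("c",)])
    conn.executemany(
        "INSERT INTO metadatavalue VALUES (?, ?, ?)",
        [
            ("a", 251, "Journal A"),
            ("b", 251, "Journal A"),
            ("c", 251, "Journal B"),
            ("c", 64, "Not a journal field"),  # different field, ignored
            ("z", 251, "Journal C"),  # not an item, so it is filtered out
        ],
    )
    return conn


if __name__ == "__main__":
    print(to_csv(journal_counts(demo_connection())), end="")
```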
<h2 id="2021-02-25">2021-02-25</h2>
<ul>
<li>Hmm, the AReS harvest last night seems to have finished successfully, but the number of items is lower than I was expecting:</li>
</ul>
<pre><code class="language-console" data-lang="console">$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty'
{
  "count" : 99546,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  }
}
</code></pre><ul>
<li>The current items index has 101380 items… I wonder what happened
<ul>
<li>I started a new indexing run</li>
</ul>
</li>
</ul>
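<ul>
<li>A rough Python sketch of the comparison above: parse the <code>_count</code> response and work out how many items the fresh harvest is missing relative to the live index. The function names are hypothetical, <code>fetch_count</code> needs a running Elasticsearch and is not called here, and the name of the live index is left as a parameter because these notes only name <code>openrxv-items-temp</code>.</li>
</ul>

```python
import json
import urllib.request


def parse_count(body):
    """Extract the document count from an Elasticsearch _count response body."""
    payload = json.loads(body)
    if payload["_shards"]["failed"]:
        raise RuntimeError(f"some shards failed: {payload['_shards']}")
    return payload["count"]


def fetch_count(index, base_url="http://localhost:9200"):
    """Ask Elasticsearch for the count of one index (needs a running server)."""
    with urllib.request.urlopen(f"{base_url}/{index}/_count?q=*") as response:
        return parse_count(response.read().decode("utf-8"))


def missing_items(temp_count, live_count):
    """Return how many items the fresh harvest lacks relative to the live index."""
    return max(live_count - temp_count, 0)


# Response body copied from the curl output above
SAMPLE = (
    '{"count": 99546, "_shards": '
    '{"total": 1, "successful": 1, "skipped": 0, "failed": 0}}'
)

if __name__ == "__main__":
    temp = parse_count(SAMPLE)
    print(missing_items(temp, 101380))
```

With the numbers from this entry the gap is 1834 items, which is the kind of difference that would justify starting the indexing again.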
<h2 id="2021-02-26">2021-02-26</h2>
<ul>
<li>Last night’s indexing was more successful: there are now 101479 items in the index</li>
<li>Yesterday Yousef sent a <a href="https://github.com/ilri/OpenRXV/pull/77/">pull request</a> for the next/previous buttons on OpenRXV
<ul>
<li>I tested it this morning and it seems to be working</li>
</ul>
</li>
</ul>
<h2 id="2021-02-28">2021-02-28</h2>
<ul>
<li>Abenet asked me to import seventy-three records for CRP Forests, Trees and Agroforestry
<ul>
<li>I checked them briefly and found that there were more than thirty journal articles, and none of them had <code>cg.journal</code>, <code>cg.volume</code>, <code>cg.issue</code>, or <code>dcterms.license</code>, so I spent a little time adding them</li>
<li>I used a GREL expression to extract the journal volume and issue from the citation into new columns:</li>
</ul>
</li>
</ul>
<pre><code class="language-console" data-lang="console">value.partition(/[0-9]+\([0-9]+\)/)[1].replace(/\(.*\)/,"")
value.partition(/[0-9]+\([0-9]+\)/)[1].replace(/^\d+\((\d+)\)/,"$1")
</code></pre><ul>
<li>This <code>value.partition</code> was new to me… and it took me a bit of time to figure out whether I needed to escape the parentheses in the issue number or not (no) and how to reference a capture group with <code>value.replace</code></li>
<li>I tried to check the 1095 CIFOR records from last week for duplicates on DSpace Test, but the page says “Processing” and never loads
<ul>
<li>I don’t see any errors in the logs, but there are two jQuery errors in the browser console</li>
<li>I filed <a href="https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=934">an issue</a> with Atmire</li>
</ul>
</li>
<li>Upload twelve items to CGSpace for Peter</li>
<li>Niroshini from IWMI is still having issues adding WLE subjects to items during the metadata review step in the workflow</li>
<li>It seems the BatchEditConsumer log spam is gone since I applied <a href="https://github.com/ilri/DSpace/pull/462">Atmire’s patch</a></li>
</ul>
<pre><code class="language-console" data-lang="console">$ grep -c 'BatchEditConsumer should not have been given' dspace.log.2021-02-[12]*
dspace.log.2021-02-10:5067
dspace.log.2021-02-11:2647
dspace.log.2021-02-12:4231
dspace.log.2021-02-13:221
dspace.log.2021-02-14:0
dspace.log.2021-02-15:0
dspace.log.2021-02-16:0
dspace.log.2021-02-17:0
dspace.log.2021-02-18:0
dspace.log.2021-02-19:0
dspace.log.2021-02-20:0
dspace.log.2021-02-21:0
dspace.log.2021-02-22:0
dspace.log.2021-02-23:0
dspace.log.2021-02-24:0
dspace.log.2021-02-25:0
dspace.log.2021-02-26:0
dspace.log.2021-02-27:0
dspace.log.2021-02-28:0
</code></pre><!-- raw HTML omitted -->
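<ul>
<li>To sanity check the two GREL expressions used earlier today for the journal volume and issue, here is a small Python sketch of the same two steps: take the first <code>NN(NN)</code> fragment, then either strip the parenthesized part or keep only capture group 1. The function name and the sample citation are hypothetical, and returning empty strings when nothing matches is a choice made for this sketch, not something checked against OpenRefine. Note that Python’s <code>re.sub</code> references the group as <code>\1</code> where GREL uses <code>$1</code>.</li>
</ul>

```python
import re

# A volume immediately followed by a parenthesized issue, e.g. "95(2)"
VOLUME_ISSUE = re.compile(r"[0-9]+\([0-9]+\)")


def volume_and_issue(citation):
    """Return (volume, issue) from the first "NN(NN)" fragment in a citation."""
    match = VOLUME_ISSUE.search(citation)
    if match is None:
        # Sketch-only behaviour: no fragment means no volume or issue
        return "", ""
    fragment = match.group(0)  # plays the role of value.partition(...)[1]
    volume = re.sub(r"\(.*\)", "", fragment)  # drop the "(issue)" part
    issue = re.sub(r"^\d+\((\d+)\)", r"\1", fragment)  # keep capture group 1
    return volume, issue


if __name__ == "__main__":
    # Hypothetical citation, not one of the imported records
    citation = "Doe, J. 2020. Some title. Example Journal 95(2): 123-134."
    print(volume_and_issue(citation))
```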
@ -812,6 +899,8 @@ UPDATE 18659
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2021-03/">March, 2021</a></li>
<li><a href="/cgspace-notes/2021-02/">February, 2021</a></li>
<li><a href="/cgspace-notes/2021-01/">January, 2021</a></li>
@ -820,8 +909,6 @@ UPDATE 18659
<li><a href="/cgspace-notes/cgspace-dspace6-upgrade/">CGSpace DSpace 6 Upgrade</a></li>
<li><a href="/cgspace-notes/2020-11/">November, 2020</a></li>
</ol>
</section>