Add notes

This commit is contained in:
2024-03-25 18:53:18 +03:00
parent 0e43fc97d7
commit d71c430a7d
148 changed files with 241 additions and 187 deletions

View File

@ -19,7 +19,7 @@ It might be this issue: https://github.com/DSpace/dspace-angular/issues/2808
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2024-03/" />
<meta property="article:published_time" content="2024-03-01T09:55:00+03:00" />
<meta property="article:modified_time" content="2024-03-19T09:01:13+03:00" />
<meta property="article:modified_time" content="2024-03-19T16:24:20+03:00" />
@ -34,7 +34,7 @@ It might be this issue: https://github.com/DSpace/dspace-angular/issues/2808
"/>
<meta name="generator" content="Hugo 0.123.8">
<meta name="generator" content="Hugo 0.124.1">
@ -44,9 +44,9 @@ It might be this issue: https://github.com/DSpace/dspace-angular/issues/2808
"@type": "BlogPosting",
"headline": "March, 2024",
"url": "https://alanorth.github.io/cgspace-notes/2024-03/",
"wordCount": "1028",
"wordCount": "1169",
"datePublished": "2024-03-01T09:55:00+03:00",
"dateModified": "2024-03-19T09:01:13+03:00",
"dateModified": "2024-03-19T16:24:20+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -301,7 +301,37 @@ COPY 28024
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ csvcut -c subject /tmp/2024-03-19-cgspace-subjects.csv | sed <span style="color:#e6db74">&#39;1d&#39;</span> &gt; /tmp/2024-03-19-cgspace-subjects.txt
</span></span><span style="display:flex;"><span>$ ./ilri/agrovoc_lookup.py -i /tmp/2024-03-19-cgspace-subjects.txt -o /tmp/2024-03-19-cgspace-subjects-results.csv
</span></span></code></pre></div><!-- raw HTML omitted -->
</span></span></code></pre></div><h2 id="2024-03-20">2024-03-20</h2>
<ul>
<li>Identify seven duplicates on CGSpace from the PRMS results and withdraw them from CGSpace</li>
</ul>
<h2 id="2024-03-21">2024-03-21</h2>
<ul>
<li>Look more closely at duplicates on CGSpace based on a fresh export
<ul>
<li>Using DOIs I found ~842 that occur more than once for journal articles alone, so probably around 400 duplicates</li>
<li>I did a handful of them, merging the metadata and withdrawing the duplicate, and decided to add <code>dcterms.replaces</code> with the handle in the original</li>
</ul>
</li>
</ul>
<h2 id="2024-03-22">2024-03-22</h2>
<ul>
<li>Look at duplicate DOIs on CGSpace and address a dozen or so</li>
</ul>
<h2 id="2024-03-23">2024-03-23</h2>
<ul>
<li>Look at duplicate DOIs on CGSpace and address a dozen or so</li>
<li>Update Tomcat and Solr to latest versions
<ul>
<li>I had done some tests with these last week, and did a last minute test on DSpace 7 Test to make sure submission and searching worked</li>
</ul>
</li>
</ul>
<h2 id="2024-03-24">2024-03-24</h2>
<ul>
<li>Slowly process several dozen more duplicate DOIs on CGSpace, sigh&hellip;</li>
</ul>
<!-- raw HTML omitted -->