mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2023-01-10
@@ -19,7 +19,7 @@ I see we have some new ones that aren’t in our list if I combine with this
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2023-01/" />
<meta property="article:published_time" content="2023-01-01T08:44:36+03:00" />
<meta property="article:modified_time" content="2023-01-01T10:12:13+02:00" />
<meta property="article:modified_time" content="2023-01-04T17:08:14+03:00" />
@@ -34,7 +34,7 @@ I see we have some new ones that aren’t in our list if I combine with this
"/>
<meta name="generator" content="Hugo 0.108.0">
<meta name="generator" content="Hugo 0.109.0">
@@ -44,9 +44,9 @@ I see we have some new ones that aren’t in our list if I combine with this
"@type": "BlogPosting",
"headline": "January, 2023",
"url": "https://alanorth.github.io/cgspace-notes/2023-01/",
"wordCount": "567",
"wordCount": "768",
"datePublished": "2023-01-01T08:44:36+03:00",
"dateModified": "2023-01-01T10:12:13+02:00",
"dateModified": "2023-01-04T17:08:14+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@@ -207,6 +207,40 @@ I see we have some new ones that aren’t in our list if I combine with this
</li>
<li>Start a harvest on AReS</li>
</ul>
<h2 id="2023-01-08">2023-01-08</h2>
<ul>
<li>It’s Sunday and I see some PostgreSQL locks belonging to dspaceCli that started at 05:00
<ul>
<li>That’s strange because I disabled the <code>dspace checker</code> one last week, so I’m not sure which this is…</li>
<li>It’s currently 2:30PM on the server so these locks have been there for about nine and a half hours</li>
</ul>
</li>
<li>I exported the entire CGSpace to update the Initiative mappings
<ul>
<li>Items were mapped to ~58 new Initiative collections</li>
</ul>
</li>
<li>Then I ran the ORCID import to catch any new ones that might not have been tagged</li>
<li>Then I started a harvest on AReS</li>
</ul>
<h2 id="2023-01-09">2023-01-09</h2>
<ul>
<li>Fix some invalid Initiative names on CGSpace and then check for missing mappings</li>
<li>Check for missing regions in the Initiatives collection</li>
<li>Export a list of author affiliations from the Initiatives community for Peter to check
<ul>
<li>This was slightly hacky because I exported the Initiatives community to CSV, imported it into OpenRefine to split the multi-value fields, then used sed to handle the quoting:</li>
</ul>
</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ csvcut -c <span style="color:#e6db74">'cg.contributor.affiliation[en_US]'</span> ~/Downloads/2023-01-09-initiatives.csv | <span style="color:#ae81ff">\
</span></span></span><span style="display:flex;"><span><span style="color:#ae81ff"></span> sed -e 's/^"//' -e 's/"$//' -e 's/||/\n/g' | \
</span></span><span style="display:flex;"><span> sort -u | \
</span></span><span style="display:flex;"><span> sed -e 's/^\(.*\)/"\1/' -e 's/\(.*\)$/\1"/' > /tmp/2023-01-09-initiatives-affiliations.csv
</span></span></code></pre></div><h2 id="2023-01-10">2023-01-10</h2>
<ul>
<li>Export the CGSpace Initiatives collection to check for missing regions and collection mappings</li>
</ul>
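<p>For reference, the split, de-duplicate and re-quote steps from the 2023-01-09 affiliation export can be tried on a tiny sample without csvkit. This is only a rough sketch: the sample rows and the <code>split_affiliations</code> function name are made up for illustration, and awk is swapped in for the <code>\n</code> replacement in sed, which is a GNU extension.</p>

```shell
#!/bin/sh
# Hypothetical sample standing in for the csvcut output: one quoted cell per
# row, with multiple affiliations joined by "||".
sample='"Org B||Org A"
"Org A"
"Org C||Org B"'

# Hypothetical helper name; reads cells on stdin, writes unique quoted values.
split_affiliations() {
    # 1. Strip the surrounding CSV quotes from each cell.
    # 2. Put each "||"-separated value on its own line (awk instead of the
    #    GNU-only "\n" in a sed replacement).
    # 3. De-duplicate the values.
    # 4. Wrap each value in double quotes again.
    sed -e 's/^"//' -e 's/"$//' |
        awk '{ gsub(/\|\|/, "\n"); print }' |
        LC_ALL=C sort -u |
        sed -e 's/^.*$/"&"/'
}

result=$(printf '%s\n' "$sample" | split_affiliations)
printf '%s\n' "$result"
```

<p>With the sample above this prints <code>"Org A"</code>, <code>"Org B"</code> and <code>"Org C"</code> on separate lines. Values that themselves contain double quotes would need extra escaping, which this sketch does not attempt.</p>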
<!-- raw HTML omitted -->