mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2023-03-09
This commit is contained in:
@ -16,7 +16,7 @@ I finally got through with porting the input form from DSpace 6 to DSpace 7
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2023-03/" />
|
||||
<meta property="article:published_time" content="2023-03-01T07:58:36+03:00" />
|
||||
<meta property="article:modified_time" content="2023-03-07T17:15:26+03:00" />
|
||||
<meta property="article:modified_time" content="2023-03-08T18:53:32+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -38,9 +38,9 @@ I finally got through with porting the input form from DSpace 6 to DSpace 7
|
||||
"@type": "BlogPosting",
|
||||
"headline": "March, 2023",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2023-03/",
|
||||
"wordCount": "1336",
|
||||
"wordCount": "1433",
|
||||
"datePublished": "2023-03-01T07:58:36+03:00",
|
||||
"dateModified": "2023-03-07T17:15:26+03:00",
|
||||
"dateModified": "2023-03-08T18:53:32+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -294,6 +294,10 @@ pd.options.mode.nullable_dtypes = True
|
||||
</span></span><span style="display:flex;"><span>value.replace("<jats:sub>","").replace("</jats:sub>", "").replace("<jats:sup>","").replace("</jats:sup>", "")
|
||||
</span></span></code></pre></div><ul>
|
||||
<li>I uploaded the 350 items to DSpace Test so Peter and Abenet can explore them</li>
|
||||
<li>I exported a list of authors, affiliations, and funders from the new items to let Peter correct them:</li>
|
||||
</ul>
|
||||
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ csvcut -c dc.contributor.author /tmp/new-items.csv | sed -e 1d -e <span style="color:#e6db74">'s/"//g'</span> -e <span style="color:#e6db74">'s/||/\n/g'</span> | sort | uniq -c | sort -nr | awk <span style="color:#e6db74">'{$1=""; print $0}'</span> | sed -e <span style="color:#e6db74">'s/^ //'</span> > /tmp/new-authors.csv
|
||||
</span></span></code></pre></div><ul>
|
||||
<li>Meeting with FAO AGRIS team about how to detect duplicates
|
||||
<ul>
|
||||
<li>They are currently using a sha256 hash on titles, which will work, but will only return exact matches</li>
|
||||
@ -308,6 +312,15 @@ pd.options.mode.nullable_dtypes = True
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<h2 id="2023-03-09">2023-03-09</h2>
|
||||
<ul>
|
||||
<li>Apply a bunch of corrections to authors, affiliations, and donors on the new items on DSpace Test</li>
|
||||
<li>Meeting with Peter and Abenet about future OpenRXV developments, DSpace 7, etc
|
||||
<ul>
|
||||
<li>I submitted an <a href="https://github.com/CodeObia/MEL/issues/11173">issue on MEL asking them to add provenance metadata when submitting to CGSpace</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<!-- raw HTML omitted -->
|
||||
|
||||
|
||||
|
Reference in New Issue
Block a user