mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2020-01-27
This commit is contained in:
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="2016-06-01
|
||||
|
||||
Experimenting with IFPRI OAI (we want to harvest their publications)
|
||||
After reading the ContentDM documentation I found IFPRI's OAI endpoint: http://ebrary.ifpri.org/oai/oai.php
|
||||
After reading the ContentDM documentation I found IFPRI’s OAI endpoint: http://ebrary.ifpri.org/oai/oai.php
|
||||
After reading the OAI documentation and testing with an OAI validator I found out how to get their publications
|
||||
This is their publications set: http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc
|
||||
You can see the others by using the OAI ListSets verb: http://ebrary.ifpri.org/oai/oai.php?verb=ListSets
|
||||
@ -25,13 +25,13 @@ Working on second phase of metadata migration, looks like this will work for mov
|
||||
<meta name="twitter:description" content="2016-06-01
|
||||
|
||||
Experimenting with IFPRI OAI (we want to harvest their publications)
|
||||
After reading the ContentDM documentation I found IFPRI's OAI endpoint: http://ebrary.ifpri.org/oai/oai.php
|
||||
After reading the ContentDM documentation I found IFPRI’s OAI endpoint: http://ebrary.ifpri.org/oai/oai.php
|
||||
After reading the OAI documentation and testing with an OAI validator I found out how to get their publications
|
||||
This is their publications set: http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc
|
||||
You can see the others by using the OAI ListSets verb: http://ebrary.ifpri.org/oai/oai.php?verb=ListSets
|
||||
Working on second phase of metadata migration, looks like this will work for moving CPWF-specific data in dc.identifier.fund to cg.identifier.cpwfproject and then the rest to dc.description.sponsorship
|
||||
"/>
|
||||
<meta name="generator" content="Hugo 0.62.2" />
|
||||
<meta name="generator" content="Hugo 0.63.1" />
|
||||
|
||||
|
||||
|
||||
@ -61,7 +61,7 @@ Working on second phase of metadata migration, looks like this will work for mov
|
||||
|
||||
<!-- combined, minified CSS -->
|
||||
|
||||
<link href="https://alanorth.github.io/cgspace-notes/css/style.a20c1a4367639632cdb341d23c27ca44fedcc75b0f8b3cbea6203010da153d3c.css" rel="stylesheet" integrity="sha256-ogwaQ2djljLNs0HSPCfKRP7cx1sPizy+piAwENoVPTw=" crossorigin="anonymous">
|
||||
<link href="https://alanorth.github.io/cgspace-notes/css/style.23e2c3298bcc8c1136c19aba330c211ec94c36f7c4454ea15cf4d3548370042a.css" rel="stylesheet" integrity="sha256-I+LDKYvMjBE2wZq6MwwhHslMNvfERU6hXPTTVINwBCo=" crossorigin="anonymous">
|
||||
|
||||
|
||||
<!-- RSS 2.0 feed -->
|
||||
@ -109,14 +109,14 @@ Working on second phase of metadata migration, looks like this will work for mov
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2016-06/">June, 2016</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2016-06-01T10:53:00+03:00">Wed Jun 01, 2016</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
<span class="fas fa-tag" aria-hidden="true"></span> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2016-06-01">2016-06-01</h2>
|
||||
<ul>
|
||||
<li>Experimenting with IFPRI OAI (we want to harvest their publications)</li>
|
||||
<li>After reading the <a href="https://www.oclc.org/support/services/contentdm/help/server-admin-help/oai-support.en.html">ContentDM documentation</a> I found IFPRI's OAI endpoint: <a href="http://ebrary.ifpri.org/oai/oai.php">http://ebrary.ifpri.org/oai/oai.php</a></li>
|
||||
<li>After reading the <a href="https://www.oclc.org/support/services/contentdm/help/server-admin-help/oai-support.en.html">ContentDM documentation</a> I found IFPRI’s OAI endpoint: <a href="http://ebrary.ifpri.org/oai/oai.php">http://ebrary.ifpri.org/oai/oai.php</a></li>
|
||||
<li>After reading the <a href="https://www.openarchives.org/OAI/openarchivesprotocol.html">OAI documentation</a> and testing with an <a href="http://validator.oaipmh.com/">OAI validator</a> I found out how to get their publications</li>
|
||||
<li>This is their publications set: <a href="http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc">http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc</a></li>
|
||||
<li>You can see the others by using the OAI <code>ListSets</code> verb: <a href="http://ebrary.ifpri.org/oai/oai.php?verb=ListSets">http://ebrary.ifpri.org/oai/oai.php?verb=ListSets</a></li>
|
||||
@ -132,12 +132,12 @@ UPDATE 14
|
||||
<h2 id="2016-06-02">2016-06-02</h2>
|
||||
<ul>
|
||||
<li>Testing the configuration and theme changes for the upcoming metadata migration and I found some issues with <code>cg.coverage.admin-unit</code></li>
|
||||
<li>Seems that the Browse configuration in <code>dspace.cfg</code> can't handle the ‘-’ in the field name:</li>
|
||||
<li>Seems that the Browse configuration in <code>dspace.cfg</code> can’t handle the ‘-’ in the field name:</li>
|
||||
</ul>
|
||||
<pre><code>webui.browse.index.12 = subregion:metadata:cg.coverage.admin-unit:text
|
||||
</code></pre><ul>
|
||||
<li>But actually, I think since DSpace 4 or 5 (we are 5.1) the Browse indexes come from Discovery (defined in discovery.xml) so this is really just a parsing error</li>
|
||||
<li>I've sent a message to the DSpace mailing list to ask about the Browse index definition</li>
|
||||
<li>I’ve sent a message to the DSpace mailing list to ask about the Browse index definition</li>
|
||||
<li>A user was having problems with submission and from the stacktrace it looks like a Sherpa/Romeo issue</li>
|
||||
<li>I found a thread on the mailing list talking about it and there is bug report and a patch: <a href="https://jira.duraspace.org/browse/DS-2740">https://jira.duraspace.org/browse/DS-2740</a></li>
|
||||
<li>The patch applies successfully on DSpace 5.1 so I will try it later</li>
|
||||
@ -196,7 +196,7 @@ UPDATE 960
|
||||
</ul>
|
||||
<pre><code>webui.browse.index.2 = author:metadataAuthority:dc.contributor.author:authority
|
||||
</code></pre><ul>
|
||||
<li>That would only be for the “Browse by” function… so we'll have to see what effect that has later</li>
|
||||
<li>That would only be for the “Browse by” function… so we’ll have to see what effect that has later</li>
|
||||
</ul>
|
||||
<h2 id="2016-06-04">2016-06-04</h2>
|
||||
<ul>
|
||||
@ -225,10 +225,10 @@ UPDATE 960
|
||||
</ul>
|
||||
</li>
|
||||
<li>
|
||||
<p>Discuss pulling data from IFPRI's ContentDM with Ryan Miller</p>
|
||||
<p>Discuss pulling data from IFPRI’s ContentDM with Ryan Miller</p>
|
||||
</li>
|
||||
<li>
|
||||
<p>Looks like OAI is kinda obtuse for this, and if we use ContentDM's API we'll be able to access their internal field names (rather than trying to figure out how they stuffed them into various, repeated Dublin Core fields)</p>
|
||||
<p>Looks like OAI is kinda obtuse for this, and if we use ContentDM’s API we’ll be able to access their internal field names (rather than trying to figure out how they stuffed them into various, repeated Dublin Core fields)</p>
|
||||
</li>
|
||||
</ul>
|
||||
<h2 id="2016-06-08">2016-06-08</h2>
|
||||
@ -241,13 +241,13 @@ UPDATE 960
|
||||
</code></pre><ul>
|
||||
<li>Write to Atmire about the use of <code>atmire.orcid.id</code> to see if we can change it</li>
|
||||
<li>Seems to be a virtual field that is queried from the authority cache… hmm</li>
|
||||
<li>In other news, I found out that the About page that we haven't been using lives in <code>dspace/config/about.xml</code>, so now we can update the text</li>
|
||||
<li>In other news, I found out that the About page that we haven’t been using lives in <code>dspace/config/about.xml</code>, so now we can update the text</li>
|
||||
<li>File bug about <code>closed="true"</code> attribute of controlled vocabularies not working: <a href="https://jira.duraspace.org/browse/DS-3238">https://jira.duraspace.org/browse/DS-3238</a></li>
|
||||
</ul>
|
||||
<h2 id="2016-06-09">2016-06-09</h2>
|
||||
<ul>
|
||||
<li>Atmire explained that the <code>atmire.orcid.id</code> field doesn't exist in the schema, as it actually comes from the authority cache during XMLUI run time</li>
|
||||
<li>This means we don't see it when harvesting via OAI or REST, for example</li>
|
||||
<li>Atmire explained that the <code>atmire.orcid.id</code> field doesn’t exist in the schema, as it actually comes from the authority cache during XMLUI run time</li>
|
||||
<li>This means we don’t see it when harvesting via OAI or REST, for example</li>
|
||||
<li>They opened a feature ticket on the DSpace tracker to ask for support of this: <a href="https://jira.duraspace.org/browse/DS-3239">https://jira.duraspace.org/browse/DS-3239</a></li>
|
||||
</ul>
|
||||
<h2 id="2016-06-10">2016-06-10</h2>
|
||||
@ -311,7 +311,7 @@ UPDATE 960
|
||||
</ul>
|
||||
<h2 id="2016-06-20">2016-06-20</h2>
|
||||
<ul>
|
||||
<li>CGSpace's HTTPS certificate expired last night and I didn't notice, had to renew:</li>
|
||||
<li>CGSpace’s HTTPS certificate expired last night and I didn’t notice, had to renew:</li>
|
||||
</ul>
|
||||
<pre><code># /opt/letsencrypt/letsencrypt-auto renew --standalone --pre-hook "/usr/bin/service nginx stop" --post-hook "/usr/bin/service nginx start"
|
||||
</code></pre><ul>
|
||||
@ -336,7 +336,7 @@ $ ./delete-metadata-values.py -i investors-delete-82.csv -f dc.description.spons
|
||||
<h2 id="2016-06-28">2016-06-28</h2>
|
||||
<ul>
|
||||
<li>Testing the cleanup of <code>dc.contributor.corporate</code> with 13 deletions and 121 replacements</li>
|
||||
<li>There are still ~97 fields that weren't indicated to do anything</li>
|
||||
<li>There are still ~97 fields that weren’t indicated to do anything</li>
|
||||
<li>After the above deletions and replacements I regenerated a CSV and sent it to Peter <em>et al</em> to have a look</li>
|
||||
</ul>
|
||||
<pre><code>dspacetest=# \copy (select text_value, count(*) from metadatavalue where resource_type_id=2 and metadata_field_id=126 group by text_value order by count desc) to /tmp/contributors-june28.csv with csv;
|
||||
|
Reference in New Issue
Block a user