mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2019-09-01
This commit is contained in:
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
|
||||
<meta property="og:updated_time" content="2019-08-03T12:39:51+03:00" />
|
||||
<meta property="og:updated_time" content="2019-09-01T10:17:51+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
@ -29,7 +29,7 @@
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"dateModified": "2019-08-03T12:39:51\x2b03:00",
|
||||
"dateModified": "2019-09-01T10:17:51\x2b03:00",
|
||||
"keywords": "notes,notes,",
|
||||
"description": "Documenting day-to-day work on the [CGSpace](https:\/\/cgspace.cgiar.org) repository."
|
||||
}
|
||||
@ -89,6 +89,36 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2018-11/">November, 2018</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2018-11-01T16:41:30+02:00">Thu Nov 01, 2018</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2018-11-01">2018-11-01</h2>
|
||||
|
||||
<ul>
|
||||
<li>Finalize AReS Phase I and Phase II ToRs</li>
|
||||
<li>Send a note about my <a href="https://github.com/ilri/dspace-statistics-api">dspace-statistics-api</a> to the dspace-tech mailing list</li>
|
||||
</ul>
|
||||
|
||||
<h2 id="2018-11-03">2018-11-03</h2>
|
||||
|
||||
<ul>
|
||||
<li>Linode has been sending mails a few times a day recently that CGSpace (linode18) has had high CPU usage</li>
|
||||
<li>Today these are the top 10 IPs:</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2018-11/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2018-10/">October, 2018</a></h2>
|
||||
@ -347,98 +377,6 @@ sys 2m7.289s
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2018-01/">January, 2018</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2018-01-02T08:35:54-08:00">Tue Jan 02, 2018</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2018-01-02">2018-01-02</h2>
|
||||
|
||||
<ul>
|
||||
<li>Uptime Robot noticed that CGSpace went down and up a few times last night, for a few minutes each time</li>
|
||||
<li>I didn’t get any load alerts from Linode and the REST and XMLUI logs don’t show anything out of the ordinary</li>
|
||||
<li>The nginx logs show HTTP 200s until <code>02/Jan/2018:11:27:17 +0000</code> when Uptime Robot got an HTTP 500</li>
|
||||
<li>In dspace.log around that time I see many errors like “Client closed the connection before file download was complete”</li>
|
||||
|
||||
<li><p>And just before that I see this:</p>
|
||||
|
||||
<pre><code>Caused by: org.apache.tomcat.jdbc.pool.PoolExhaustedException: [http-bio-127.0.0.1-8443-exec-980] Timeout: Pool empty. Unable to fetch a connection in 5 seconds, none available[size:50; busy:50; idle:0; lastwait:5000].
|
||||
</code></pre></li>
|
||||
|
||||
<li><p>Ah hah! So the pool was actually empty!</p></li>
|
||||
|
||||
<li><p>I need to increase that, let’s try to bump it up from 50 to 75</p></li>
|
||||
|
||||
<li><p>After that one client got an HTTP 499 but then the rest were HTTP 200, so I don’t know what the hell Uptime Robot saw</p></li>
|
||||
|
||||
<li><p>I notice this error quite a few times in dspace.log:</p>
|
||||
|
||||
<pre><code>2018-01-02 01:21:19,137 ERROR org.dspace.app.xmlui.aspect.discovery.SidebarFacetsTransformer @ Error while searching for sidebar facets
|
||||
org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError: Cannot parse 'dateIssued_keyword:[1976+TO+1979]': Encountered " "]" "] "" at line 1, column 32.
|
||||
</code></pre></li>
|
||||
|
||||
<li><p>And there are many of these errors every day for the past month:</p>
|
||||
|
||||
<pre><code>$ grep -c "Error while searching for sidebar facets" dspace.log.*
|
||||
dspace.log.2017-11-21:4
|
||||
dspace.log.2017-11-22:1
|
||||
dspace.log.2017-11-23:4
|
||||
dspace.log.2017-11-24:11
|
||||
dspace.log.2017-11-25:0
|
||||
dspace.log.2017-11-26:1
|
||||
dspace.log.2017-11-27:7
|
||||
dspace.log.2017-11-28:21
|
||||
dspace.log.2017-11-29:31
|
||||
dspace.log.2017-11-30:15
|
||||
dspace.log.2017-12-01:15
|
||||
dspace.log.2017-12-02:20
|
||||
dspace.log.2017-12-03:38
|
||||
dspace.log.2017-12-04:65
|
||||
dspace.log.2017-12-05:43
|
||||
dspace.log.2017-12-06:72
|
||||
dspace.log.2017-12-07:27
|
||||
dspace.log.2017-12-08:15
|
||||
dspace.log.2017-12-09:29
|
||||
dspace.log.2017-12-10:35
|
||||
dspace.log.2017-12-11:20
|
||||
dspace.log.2017-12-12:44
|
||||
dspace.log.2017-12-13:36
|
||||
dspace.log.2017-12-14:59
|
||||
dspace.log.2017-12-15:104
|
||||
dspace.log.2017-12-16:53
|
||||
dspace.log.2017-12-17:66
|
||||
dspace.log.2017-12-18:83
|
||||
dspace.log.2017-12-19:101
|
||||
dspace.log.2017-12-20:74
|
||||
dspace.log.2017-12-21:55
|
||||
dspace.log.2017-12-22:66
|
||||
dspace.log.2017-12-23:50
|
||||
dspace.log.2017-12-24:85
|
||||
dspace.log.2017-12-25:62
|
||||
dspace.log.2017-12-26:49
|
||||
dspace.log.2017-12-27:30
|
||||
dspace.log.2017-12-28:54
|
||||
dspace.log.2017-12-29:68
|
||||
dspace.log.2017-12-30:89
|
||||
dspace.log.2017-12-31:53
|
||||
dspace.log.2018-01-01:45
|
||||
dspace.log.2018-01-02:34
|
||||
</code></pre></li>
|
||||
|
||||
<li><p>Danny wrote to ask for help renewing the wildcard ilri.org certificate and I advised that we should probably use Let’s Encrypt if it’s just a handful of domains</p></li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2018-01/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/" rel="prev" role="button">Previous page</a>
|
||||
@ -463,16 +401,16 @@ dspace.log.2018-01-02:34
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/posts/">Posts</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-09/">September, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-07/">July, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-06/">June, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-05/">May, 2019</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
|
||||
<meta property="og:updated_time" content="2019-08-03T12:39:51+03:00" />
|
||||
<meta property="og:updated_time" content="2019-09-01T10:17:51+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
@ -29,7 +29,7 @@
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"dateModified": "2019-08-03T12:39:51\x2b03:00",
|
||||
"dateModified": "2019-09-01T10:17:51\x2b03:00",
|
||||
"keywords": "notes,notes,",
|
||||
"description": "Documenting day-to-day work on the [CGSpace](https:\/\/cgspace.cgiar.org) repository."
|
||||
}
|
||||
@ -89,6 +89,98 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2018-01/">January, 2018</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2018-01-02T08:35:54-08:00">Tue Jan 02, 2018</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2018-01-02">2018-01-02</h2>
|
||||
|
||||
<ul>
|
||||
<li>Uptime Robot noticed that CGSpace went down and up a few times last night, for a few minutes each time</li>
|
||||
<li>I didn’t get any load alerts from Linode and the REST and XMLUI logs don’t show anything out of the ordinary</li>
|
||||
<li>The nginx logs show HTTP 200s until <code>02/Jan/2018:11:27:17 +0000</code> when Uptime Robot got an HTTP 500</li>
|
||||
<li>In dspace.log around that time I see many errors like “Client closed the connection before file download was complete”</li>
|
||||
|
||||
<li><p>And just before that I see this:</p>
|
||||
|
||||
<pre><code>Caused by: org.apache.tomcat.jdbc.pool.PoolExhaustedException: [http-bio-127.0.0.1-8443-exec-980] Timeout: Pool empty. Unable to fetch a connection in 5 seconds, none available[size:50; busy:50; idle:0; lastwait:5000].
|
||||
</code></pre></li>
|
||||
|
||||
<li><p>Ah hah! So the pool was actually empty!</p></li>
|
||||
|
||||
<li><p>I need to increase that, let’s try to bump it up from 50 to 75</p></li>
|
||||
|
||||
<li><p>After that one client got an HTTP 499 but then the rest were HTTP 200, so I don’t know what the hell Uptime Robot saw</p></li>
|
||||
|
||||
<li><p>I notice this error quite a few times in dspace.log:</p>
|
||||
|
||||
<pre><code>2018-01-02 01:21:19,137 ERROR org.dspace.app.xmlui.aspect.discovery.SidebarFacetsTransformer @ Error while searching for sidebar facets
|
||||
org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError: Cannot parse 'dateIssued_keyword:[1976+TO+1979]': Encountered " "]" "] "" at line 1, column 32.
|
||||
</code></pre></li>
|
||||
|
||||
<li><p>And there are many of these errors every day for the past month:</p>
|
||||
|
||||
<pre><code>$ grep -c "Error while searching for sidebar facets" dspace.log.*
|
||||
dspace.log.2017-11-21:4
|
||||
dspace.log.2017-11-22:1
|
||||
dspace.log.2017-11-23:4
|
||||
dspace.log.2017-11-24:11
|
||||
dspace.log.2017-11-25:0
|
||||
dspace.log.2017-11-26:1
|
||||
dspace.log.2017-11-27:7
|
||||
dspace.log.2017-11-28:21
|
||||
dspace.log.2017-11-29:31
|
||||
dspace.log.2017-11-30:15
|
||||
dspace.log.2017-12-01:15
|
||||
dspace.log.2017-12-02:20
|
||||
dspace.log.2017-12-03:38
|
||||
dspace.log.2017-12-04:65
|
||||
dspace.log.2017-12-05:43
|
||||
dspace.log.2017-12-06:72
|
||||
dspace.log.2017-12-07:27
|
||||
dspace.log.2017-12-08:15
|
||||
dspace.log.2017-12-09:29
|
||||
dspace.log.2017-12-10:35
|
||||
dspace.log.2017-12-11:20
|
||||
dspace.log.2017-12-12:44
|
||||
dspace.log.2017-12-13:36
|
||||
dspace.log.2017-12-14:59
|
||||
dspace.log.2017-12-15:104
|
||||
dspace.log.2017-12-16:53
|
||||
dspace.log.2017-12-17:66
|
||||
dspace.log.2017-12-18:83
|
||||
dspace.log.2017-12-19:101
|
||||
dspace.log.2017-12-20:74
|
||||
dspace.log.2017-12-21:55
|
||||
dspace.log.2017-12-22:66
|
||||
dspace.log.2017-12-23:50
|
||||
dspace.log.2017-12-24:85
|
||||
dspace.log.2017-12-25:62
|
||||
dspace.log.2017-12-26:49
|
||||
dspace.log.2017-12-27:30
|
||||
dspace.log.2017-12-28:54
|
||||
dspace.log.2017-12-29:68
|
||||
dspace.log.2017-12-30:89
|
||||
dspace.log.2017-12-31:53
|
||||
dspace.log.2018-01-01:45
|
||||
dspace.log.2018-01-02:34
|
||||
</code></pre></li>
|
||||
|
||||
<li><p>Danny wrote to ask for help renewing the wildcard ilri.org certificate and I advised that we should probably use Let’s Encrypt if it’s just a handful of domains</p></li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2018-01/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2017-12/">December, 2017</a></h2>
|
||||
@ -332,40 +424,6 @@ COPY 54701
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2017-04/">April, 2017</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2017-04-02T17:08:52+02:00">Sun Apr 02, 2017</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2017-04-02">2017-04-02</h2>
|
||||
|
||||
<ul>
|
||||
<li>Merge one change to CCAFS flagships that I had forgotten to remove last month (“MANAGING CLIMATE RISK”): <a href="https://github.com/ilri/DSpace/pull/317">https://github.com/ilri/DSpace/pull/317</a></li>
|
||||
<li>Quick proof-of-concept hack to add <code>dc.rights</code> to the input form, including some inline instructions/hints:</li>
|
||||
</ul>
|
||||
|
||||
<p><img src="/cgspace-notes/2017/04/dc-rights.png" alt="dc.rights in the submission form" /></p>
|
||||
|
||||
<ul>
|
||||
<li>Remove redundant/duplicate text in the DSpace submission license</li>
|
||||
|
||||
<li><p>Testing the CMYK patch on a collection with 650 items:</p>
|
||||
|
||||
<pre><code>$ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Thumbnail" -v >& /tmp/filter-media-cmyk.txt
|
||||
</code></pre></li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2017-04/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/page/2/" rel="prev" role="button">Previous page</a>
|
||||
@ -390,16 +448,16 @@ COPY 54701
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/posts/">Posts</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-09/">September, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-07/">July, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-06/">June, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-05/">May, 2019</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
|
||||
<meta property="og:updated_time" content="2019-08-03T12:39:51+03:00" />
|
||||
<meta property="og:updated_time" content="2019-09-01T10:17:51+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
@ -29,7 +29,7 @@
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"dateModified": "2019-08-03T12:39:51\x2b03:00",
|
||||
"dateModified": "2019-09-01T10:17:51\x2b03:00",
|
||||
"keywords": "notes,notes,",
|
||||
"description": "Documenting day-to-day work on the [CGSpace](https:\/\/cgspace.cgiar.org) repository."
|
||||
}
|
||||
@ -89,6 +89,40 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2017-04/">April, 2017</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2017-04-02T17:08:52+02:00">Sun Apr 02, 2017</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2017-04-02">2017-04-02</h2>
|
||||
|
||||
<ul>
|
||||
<li>Merge one change to CCAFS flagships that I had forgotten to remove last month (“MANAGING CLIMATE RISK”): <a href="https://github.com/ilri/DSpace/pull/317">https://github.com/ilri/DSpace/pull/317</a></li>
|
||||
<li>Quick proof-of-concept hack to add <code>dc.rights</code> to the input form, including some inline instructions/hints:</li>
|
||||
</ul>
|
||||
|
||||
<p><img src="/cgspace-notes/2017/04/dc-rights.png" alt="dc.rights in the submission form" /></p>
|
||||
|
||||
<ul>
|
||||
<li>Remove redundant/duplicate text in the DSpace submission license</li>
|
||||
|
||||
<li><p>Testing the CMYK patch on a collection with 650 items:</p>
|
||||
|
||||
<pre><code>$ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Thumbnail" -v >& /tmp/filter-media-cmyk.txt
|
||||
</code></pre></li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2017-04/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2017-03/">March, 2017</a></h2>
|
||||
@ -380,33 +414,6 @@ text_value
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2016-06/">June, 2016</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2016-06-01T10:53:00+03:00">Wed Jun 01, 2016</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2016-06-01">2016-06-01</h2>
|
||||
|
||||
<ul>
|
||||
<li>Experimenting with IFPRI OAI (we want to harvest their publications)</li>
|
||||
<li>After reading the <a href="https://www.oclc.org/support/services/contentdm/help/server-admin-help/oai-support.en.html">ContentDM documentation</a> I found IFPRI’s OAI endpoint: <a href="http://ebrary.ifpri.org/oai/oai.php">http://ebrary.ifpri.org/oai/oai.php</a></li>
|
||||
<li>After reading the <a href="https://www.openarchives.org/OAI/openarchivesprotocol.html">OAI documentation</a> and testing with an <a href="http://validator.oaipmh.com/">OAI validator</a> I found out how to get their publications</li>
|
||||
<li>This is their publications set: <a href="http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc">http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc</a></li>
|
||||
<li>You can see the others by using the OAI <code>ListSets</code> verb: <a href="http://ebrary.ifpri.org/oai/oai.php?verb=ListSets">http://ebrary.ifpri.org/oai/oai.php?verb=ListSets</a></li>
|
||||
<li>Working on second phase of metadata migration, looks like this will work for moving CPWF-specific data in <code>dc.identifier.fund</code> to <code>cg.identifier.cpwfproject</code> and then the rest to <code>dc.description.sponsorship</code></li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2016-06/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/page/3/" rel="prev" role="button">Previous page</a>
|
||||
@ -431,16 +438,16 @@ text_value
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/posts/">Posts</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-09/">September, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-07/">July, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-06/">June, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-05/">May, 2019</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
|
||||
<meta property="og:updated_time" content="2019-08-03T12:39:51+03:00" />
|
||||
<meta property="og:updated_time" content="2019-09-01T10:17:51+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
@ -29,7 +29,7 @@
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"dateModified": "2019-08-03T12:39:51\x2b03:00",
|
||||
"dateModified": "2019-09-01T10:17:51\x2b03:00",
|
||||
"keywords": "notes,notes,",
|
||||
"description": "Documenting day-to-day work on the [CGSpace](https:\/\/cgspace.cgiar.org) repository."
|
||||
}
|
||||
@ -89,6 +89,33 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2016-06/">June, 2016</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2016-06-01T10:53:00+03:00">Wed Jun 01, 2016</time> by Alan Orth in
|
||||
|
||||
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2016-06-01">2016-06-01</h2>
|
||||
|
||||
<ul>
|
||||
<li>Experimenting with IFPRI OAI (we want to harvest their publications)</li>
|
||||
<li>After reading the <a href="https://www.oclc.org/support/services/contentdm/help/server-admin-help/oai-support.en.html">ContentDM documentation</a> I found IFPRI’s OAI endpoint: <a href="http://ebrary.ifpri.org/oai/oai.php">http://ebrary.ifpri.org/oai/oai.php</a></li>
|
||||
<li>After reading the <a href="https://www.openarchives.org/OAI/openarchivesprotocol.html">OAI documentation</a> and testing with an <a href="http://validator.oaipmh.com/">OAI validator</a> I found out how to get their publications</li>
|
||||
<li>This is their publications set: <a href="http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc">http://ebrary.ifpri.org/oai/oai.php?verb=ListRecords&from=2016-01-01&set=p15738coll2&metadataPrefix=oai_dc</a></li>
|
||||
<li>You can see the others by using the OAI <code>ListSets</code> verb: <a href="http://ebrary.ifpri.org/oai/oai.php?verb=ListSets">http://ebrary.ifpri.org/oai/oai.php?verb=ListSets</a></li>
|
||||
<li>Working on second phase of metadata migration, looks like this will work for moving CPWF-specific data in <code>dc.identifier.fund</code> to <code>cg.identifier.cpwfproject</code> and then the rest to <code>dc.description.sponsorship</code></li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2016-06/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title"><a href="https://alanorth.github.io/cgspace-notes/2016-05/">May, 2016</a></h2>
|
||||
@ -304,16 +331,16 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/posts/">Posts</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-09/">September, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-07/">July, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-06/">June, 2019</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2019-05/">May, 2019</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
Reference in New Issue
Block a user