Mirror of https://github.com/alanorth/cgspace-notes.git (synced 2024-12-22 13:12:19 +01:00)
Update notes for 2020-04-13
Parent: 246538db59
Commit: 3fab0c20e2
@ -119,4 +119,36 @@ $ ~/dspace63/bin/dspace solr-upgrade-statistics-6x
- More work on the DSpace 6.3 stuff, improving the GDPR consent logic to use [haven](https://github.com/chiiya/haven) instead of cookieconsent
- It works better by injecting the Google Analytics script after the user clicks agree, and it also has a preferences section that gets automatically injected on the privacy page!

## 2020-04-13

- I realized that `solr-upgrade-statistics-6x` only processes 100,000 records by default so I think we actually need to finish running it for all legacy Solr records before asking Atmire why CUA statlets and detailed statistics aren't working
- For now I am just doing 250,000 records at a time on my local environment:

```
$ export JAVA_OPTS="-Xmx2000m -Dfile.encoding=UTF-8"
$ ~/dspace63/bin/dspace solr-upgrade-statistics-6x -n 250000
```
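
- As a rough sanity check on how many runs this implies, here is a minimal sketch. It assumes the roughly 1.5 million legacy records in my local core and that each run processes at most the number given to `-n`; the helper name `runs_needed` is only for illustration:

```python
def runs_needed(total_records: int, batch_size: int) -> int:
    """Return how many batched runs are needed to cover total_records."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    # Integer ceiling division, so a partial last batch still counts as a run
    return (total_records + batch_size - 1) // batch_size


# 1.5 million records at 250,000 per run (the -n value used above)
print(runs_needed(1_500_000, 250_000))
# The same records at the default of 100,000 per run
print(runs_needed(1_500_000, 100_000))
```

- So about six runs at 250,000 each, versus fifteen at the default batch size, if those assumptions hold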

- Despite running the migration for all of my local 1.5 million Solr records, I still see a few hundred thousand with IDs like `-1` and `0-unmigrated`
- I will purge them all and try to import only a subset...
- After importing again I see there are indeed tens of thousands of these documents with IDs "-1" and "0"
- They are all `type: 5`, which is "SITE" according to `Constants.java`:

```
/** DSpace site type */
public static final int SITE = 5;
```
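
- To count these documents before purging them, a query along these lines might help. This is only a sketch: the Solr base URL, the `statistics` core name, and the use of the `id` field for the "-1" and "0" values are assumptions about my local setup, and `build_select_url` is a hypothetical helper:

```python
from urllib.parse import urlencode

# Assumed query: SITE-type (type 5) documents whose id is "-1" or "0"
SITE_QUERY = 'type:5 AND id:("-1" OR "0")'


def build_select_url(solr_base: str, core: str, query: str, rows: int = 0) -> str:
    """Build a Solr select URL; rows=0 asks only for the match count."""
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return f"{solr_base.rstrip('/')}/{core}/select?{params}"


# Hypothetical local Solr location; adjust to the real environment
url = build_select_url("http://localhost:8080/solr", "statistics", SITE_QUERY)
print(url)
```

- The printed URL can be opened in a browser or passed to `curl`; the `numFound` value in the JSON response would be the number of matching documents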

- Even after deleting those documents and re-running `solr-upgrade-statistics-6x` I still get the UUID errors when using CUA and the statlets
- I have sent some feedback and questions to Atmire (including about the issue with glyphicons in the header trail)
- In other news, my local Artifactory container stopped working for some reason, so I re-created it and it seems some things have changed upstream (port 8082 for web UI?):

```
$ podman rm artifactory
$ podman pull docker.bintray.io/jfrog/artifactory-oss:latest
$ podman create --ulimit nofile=32000:32000 --name artifactory -v artifactory_data:/var/opt/jfrog/artifactory -p 8081-8082:8081-8082 docker.bintray.io/jfrog/artifactory-oss
$ podman start artifactory
```
<!-- vim: set sw=2 ts=2: -->
@ -15,7 +15,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2016-03/" />
|
||||
<meta property="article:published_time" content="2016-03-02T16:50:00+03:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="March, 2016"/>
|
||||
@ -37,7 +37,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2016-03/",
|
||||
"wordCount": "1581",
|
||||
"datePublished": "2016-03-02T16:50:00+03:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -276,7 +276,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
|
||||
</ul>
|
||||
<h2 id="2016-03-28">2016-03-28</h2>
|
||||
<ul>
|
||||
<li>Look into enabling the embargo during item submission, see: <a href="https://wiki.duraspace.org/display/DSDOC5x/Embargo#Embargo-SubmissionProcess">https://wiki.duraspace.org/display/DSDOC5x/Embargo#Embargo-SubmissionProcess</a></li>
|
||||
<li>Look into enabling the embargo during item submission, see: <a href="https://wiki.lyrasis.org/display/DSDOC5x/Embargo#Embargo-SubmissionProcess">https://wiki.lyrasis.org/display/DSDOC5x/Embargo#Embargo-SubmissionProcess</a></li>
|
||||
<li>Seems we only want <code>AccessStep</code> because <code>UploadWithEmbargoStep</code> disables the ability to edit embargos at the item level</li>
|
||||
<li>This pull request enables the ability to set an item-level embargo during submission: <a href="https://github.com/ilri/DSpace/pull/203">https://github.com/ilri/DSpace/pull/203</a></li>
|
||||
<li>I figured out that the problem with Listings and Reports was because I disabled the <code>search.index.*</code> last week, and they are still used by JSPUI apparently</li>
|
||||
|
@ -18,7 +18,7 @@ There are 3,000 IPs accessing the REST API in a 24-hour period!
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2016-05/" />
|
||||
<meta property="article:published_time" content="2016-05-01T23:06:00+03:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="May, 2016"/>
|
||||
@ -43,7 +43,7 @@ There are 3,000 IPs accessing the REST API in a 24-hour period!
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2016-05/",
|
||||
"wordCount": "1349",
|
||||
"datePublished": "2016-05-01T23:06:00+03:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -183,7 +183,7 @@ fi
|
||||
</ul>
|
||||
</li>
|
||||
<li>Not sure what <code>dcterms</code> is…</li>
|
||||
<li>Looks like these were <a href="https://wiki.duraspace.org/display/DSDOC5x/Metadata+and+Bitstream+Format+Registries#MetadataandBitstreamFormatRegistries-DublinCoreTermsRegistry(DCTERMS)">added in DSpace 4</a> to allow for future work to make DSpace more flexible</li>
|
||||
<li>Looks like these were <a href="https://wiki.lyrasis.org/display/DSDOC5x/Metadata+and+Bitstream+Format+Registries#MetadataandBitstreamFormatRegistries-DublinCoreTermsRegistry(DCTERMS)">added in DSpace 4</a> to allow for future work to make DSpace more flexible</li>
|
||||
<li>CGSpace’s <code>dc</code> registry has 96 items, and the default DSpace one has 73.</li>
|
||||
</ul>
|
||||
<h2 id="2016-05-11">2016-05-11</h2>
|
||||
|
@ -18,7 +18,7 @@ Working on second phase of metadata migration, looks like this will work for mov
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2016-06/" />
|
||||
<meta property="article:published_time" content="2016-06-01T10:53:00+03:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="June, 2016"/>
|
||||
@ -43,7 +43,7 @@ Working on second phase of metadata migration, looks like this will work for mov
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2016-06/",
|
||||
"wordCount": "1549",
|
||||
"datePublished": "2016-06-01T10:53:00+03:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -238,7 +238,7 @@ UPDATE 960
|
||||
<h2 id="2016-06-08">2016-06-08</h2>
|
||||
<ul>
|
||||
<li>Discuss controlled vocabularies for ~28 fields</li>
|
||||
<li>Looks like this is all we need: <a href="https://wiki.duraspace.org/display/DSDOC5x/Submission+User+Interface#SubmissionUserInterface-ConfiguringControlledVocabularies">https://wiki.duraspace.org/display/DSDOC5x/Submission+User+Interface#SubmissionUserInterface-ConfiguringControlledVocabularies</a></li>
|
||||
<li>Looks like this is all we need: <a href="https://wiki.lyrasis.org/display/DSDOC5x/Submission+User+Interface#SubmissionUserInterface-ConfiguringControlledVocabularies">https://wiki.lyrasis.org/display/DSDOC5x/Submission+User+Interface#SubmissionUserInterface-ConfiguringControlledVocabularies</a></li>
|
||||
<li>I wrote an XPath expression to extract the ILRI subjects from <code>input-forms.xml</code> (uses xmlstarlet):</li>
|
||||
</ul>
|
||||
<pre><code>$ xml sel -t -m '//value-pairs[@value-pairs-name="ilrisubject"]/pair/displayed-value/text()' -c '.' -n dspace/config/input-forms.xml
|
||||
|
@ -22,7 +22,7 @@ I exported a random item’s metadata as CSV, deleted all columns except id
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2016-10/" />
|
||||
<meta property="article:published_time" content="2016-10-03T15:53:00+03:00" />
|
||||
<meta property="article:modified_time" content="2018-04-04T16:25:57+03:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="October, 2016"/>
|
||||
@ -51,7 +51,7 @@ I exported a random item’s metadata as CSV, deleted all columns except id
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2016-10/",
|
||||
"wordCount": "1828",
|
||||
"datePublished": "2016-10-03T15:53:00+03:00",
|
||||
"dateModified": "2018-04-04T16:25:57+03:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -127,7 +127,7 @@ I exported a random item’s metadata as CSV, deleted all columns except id
|
||||
</header>
|
||||
<h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -14,7 +14,7 @@ Add dc.type to the output options for Atmire’s Listings and Reports module
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2016-11/" />
|
||||
<meta property="article:published_time" content="2016-11-01T09:21:00+03:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="November, 2016"/>
|
||||
@ -35,7 +35,7 @@ Add dc.type to the output options for Atmire’s Listings and Reports module
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2016-11/",
|
||||
"wordCount": "2825",
|
||||
"datePublished": "2016-11-01T09:21:00+03:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -471,7 +471,7 @@ UPDATE 7
|
||||
[> ] 0% time remaining: 11 hour(s) 21 minute(s) 57 seconds. timestamp: 2016-11-28 03:00:20
|
||||
</code></pre><ul>
|
||||
<li>It says OAI, and seems to start at 3:00 AM, but I only see the <code>filter-media</code> cron job set to start then</li>
|
||||
<li>Double checking the <a href="https://wiki.duraspace.org/display/DSDOC5x/Upgrading+DSpace">DSpace 5.x upgrade notes</a> for anything I missed, or troubleshooting tips</li>
|
||||
<li>Double checking the <a href="https://wiki.lyrasis.org/display/DSDOC5x/Upgrading+DSpace">DSpace 5.x upgrade notes</a> for anything I missed, or troubleshooting tips</li>
|
||||
<li>Running some manual processes just in case:</li>
|
||||
</ul>
|
||||
<pre><code>$ /home/dspacetest.cgiar.org/bin/dspace registry-loader -metadata /home/dspacetest.cgiar.org/config/registries/dcterms-types.xml
|
||||
@ -480,7 +480,7 @@ $ /home/dspacetest.cgiar.org/bin/dspace registry-loader -metadata /home/dspacete
|
||||
$ /home/dspacetest.cgiar.org/bin/dspace registry-loader -metadata /home/dspacetest.cgiar.org/config/registries/workflow-types.xml
|
||||
</code></pre><ul>
|
||||
<li>Start working on paper for KM4Dev journal</li>
|
||||
<li>Wow, Bram from Atmire pointed out this solution for using multiple handles with one DSpace instance: <a href="https://wiki.duraspace.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296">https://wiki.duraspace.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296</a></li>
|
||||
<li>Wow, Bram from Atmire pointed out this solution for using multiple handles with one DSpace instance: <a href="https://wiki.lyrasis.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296">https://wiki.lyrasis.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296</a></li>
|
||||
<li>We might be able to migrate the <a href="http://library.cgiar.org/">CGIAR Library</a> now, as they had wanted to keep their handles</li>
|
||||
</ul>
|
||||
<h2 id="2016-11-29">2016-11-29</h2>
|
||||
|
@ -26,7 +26,7 @@ Looks like we’ll be using cg.identifier.ccafsprojectpii as the field name
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-02/" />
|
||||
<meta property="article:published_time" content="2017-02-07T07:04:52-08:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="February, 2017"/>
|
||||
@ -59,7 +59,7 @@ Looks like we’ll be using cg.identifier.ccafsprojectpii as the field name
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-02/",
|
||||
"wordCount": "2028",
|
||||
"datePublished": "2017-02-07T07:04:52-08:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -263,7 +263,7 @@ dspace=# update metadatavalue set text_value = 'https://dx.doi.org/10.15446/agro
|
||||
<pre><code>dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'http://dx.doi.org', 'https://dx.doi.org') where resource_type_id=2 and metadata_field_id IN (select metadata_field_id from metadatafieldregistry where element = 'identifier' and qualifier = 'doi') and text_value like 'http://dx.doi.org%';
|
||||
</code></pre><ul>
|
||||
<li>Run all DOI corrections on CGSpace</li>
|
||||
<li>Something to think about here is to write a <a href="https://wiki.duraspace.org/display/DSDOC5x/Curation+System#CurationSystem-ScriptedTasks">Curation Task</a> in Java to do these sanity checks / corrections every night</li>
|
||||
<li>Something to think about here is to write a <a href="https://wiki.lyrasis.org/display/DSDOC5x/Curation+System#CurationSystem-ScriptedTasks">Curation Task</a> in Java to do these sanity checks / corrections every night</li>
|
||||
<li>Then we could add a cron job for them and run them from the command line like:</li>
|
||||
</ul>
|
||||
<pre><code>[dspace]/bin/dspace curate -t noop -i 10568/79891
|
||||
|
@ -28,7 +28,7 @@ $ identify ~/Desktop/alc_contrastes_desafios.jpg
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-03/" />
|
||||
<meta property="article:published_time" content="2017-03-01T17:08:52+02:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="March, 2017"/>
|
||||
@ -63,7 +63,7 @@ $ identify ~/Desktop/alc_contrastes_desafios.jpg
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-03/",
|
||||
"wordCount": "1538",
|
||||
"datePublished": "2017-03-01T17:08:52+02:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -198,7 +198,7 @@ DirectClass sRGB Alpha
|
||||
<li>But there are hundreds of combinations of fields and values (like <code>dc.subject</code> and all the center subjects), and we can’t use wildcards in REST!</li>
|
||||
<li>Reading about enabling multiple handle prefixes in DSpace</li>
|
||||
<li>There is a mailing list thread from 2011 about it: <a href="http://dspace.2283337.n4.nabble.com/Multiple-handle-prefixes-merged-DSpace-instances-td3427192.html">http://dspace.2283337.n4.nabble.com/Multiple-handle-prefixes-merged-DSpace-instances-td3427192.html</a></li>
|
||||
<li>And a comment from Atmire’s Bram about it on the DSpace wiki: <a href="https://wiki.duraspace.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296">https://wiki.duraspace.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296</a></li>
|
||||
<li>And a comment from Atmire’s Bram about it on the DSpace wiki: <a href="https://wiki.lyrasis.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296">https://wiki.lyrasis.org/display/DSDOC5x/Installing+DSpace?focusedCommentId=78163296#comment-78163296</a></li>
|
||||
<li>Bram mentions an undocumented configuration option <code>handle.plugin.checknameauthority</code>, but I noticed another one in <code>dspace.cfg</code>:</li>
|
||||
</ul>
|
||||
<pre><code># List any additional prefixes that need to be managed by this handle server
|
||||
|
@ -21,7 +21,7 @@ $ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Th
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-04/" />
|
||||
<meta property="article:published_time" content="2017-04-02T17:08:52+02:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="April, 2017"/>
|
||||
@ -49,7 +49,7 @@ $ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Th
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-04/",
|
||||
"wordCount": "2917",
|
||||
"datePublished": "2017-04-02T17:08:52+02:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -190,7 +190,7 @@ ILAC_Brief21_PMCA.pdf: 113462 bytes, checksum: 249fef468f401c066a119f5db687add0
|
||||
2505
|
||||
</code></pre><h2 id="2017-04-06">2017-04-06</h2>
|
||||
<ul>
|
||||
<li>After reading the <a href="https://wiki.duraspace.org/display/cmtygp/DCAT+Meeting+April+2017">notes for DCAT April 2017</a> I am testing some new settings for PostgreSQL on DSpace Test:
|
||||
<li>After reading the <a href="https://wiki.lyrasis.org/display/cmtygp/DCAT+Meeting+April+2017">notes for DCAT April 2017</a> I am testing some new settings for PostgreSQL on DSpace Test:
|
||||
<ul>
|
||||
<li><code>db.maxconnections</code> 30→70 (the default PostgreSQL config allows 100 connections, so DSpace’s default of 30 is quite low)</li>
|
||||
<li><code>db.maxwait</code> 5000→10000</li>
|
||||
@ -201,7 +201,7 @@ ILAC_Brief21_PMCA.pdf: 113462 bytes, checksum: 249fef468f401c066a119f5db687add0
|
||||
<li>Run system updates on DSpace Test and reboot the server</li>
|
||||
<li>Discussing harvesting CIFOR’s DSpace via OAI</li>
|
||||
<li>Sisay added their OAI as a source to a new collection, but using the Simple Dublin Core method, so many fields are unqualified and duplicated</li>
|
||||
<li>Looking at the <a href="https://wiki.duraspace.org/display/DSDOC5x/XMLUI+Configuration+and+Customization">documentation</a> it seems that we probably want to be using DSpace Intermediate Metadata</li>
|
||||
<li>Looking at the <a href="https://wiki.lyrasis.org/display/DSDOC5x/XMLUI+Configuration+and+Customization">documentation</a> it seems that we probably want to be using DSpace Intermediate Metadata</li>
|
||||
</ul>
|
||||
<h2 id="2017-04-10">2017-04-10</h2>
|
||||
<ul>
|
||||
@ -306,7 +306,7 @@ sys 1m29.310s
|
||||
<ul>
|
||||
<li>DSpace committers reviewed my patch for DS-3516 and proposed a simpler idea involving incorrect use of <code>SelfRegisteredInputFormats</code></li>
|
||||
<li>I tested the idea and it works, so I made a new patch: <a href="https://github.com/DSpace/DSpace/pull/1709">https://github.com/DSpace/DSpace/pull/1709</a></li>
|
||||
<li>I discovered that we can override metadata formats in OAI by creating a new “context”: <a href="https://wiki.duraspace.org/display/DSDOC5x/OAI+2.0+Server">https://wiki.duraspace.org/display/DSDOC5x/OAI+2.0+Server</a></li>
|
||||
<li>I discovered that we can override metadata formats in OAI by creating a new “context”: <a href="https://wiki.lyrasis.org/display/DSDOC5x/OAI+2.0+Server">https://wiki.lyrasis.org/display/DSDOC5x/OAI+2.0+Server</a></li>
|
||||
<li>This allows us to have, say, a default “request” context and a “cgiar” context, both of which implement the DSpace Intermediate Metadata formats, but have the latter use an overridden version that exposes CG metadata</li>
|
||||
<li>Compare the following results:
|
||||
<ul>
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-05/" />
|
||||
<meta property="article:published_time" content="2017-05-01T16:21:52+02:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="May, 2017"/>
|
||||
@ -27,7 +27,7 @@
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-05/",
|
||||
"wordCount": "2398",
|
||||
"datePublished": "2017-05-01T16:21:52+02:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -136,7 +136,7 @@
|
||||
<ul>
|
||||
<li>Add “Blog Post” to <code>dc.type</code></li>
|
||||
<li>Create ticket on Atmire tracker to ask about commissioning them to develop the feature to expose ORCID via REST/OAI: <a href="https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=510">https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=510</a></li>
|
||||
<li>According to the <a href="https://wiki.duraspace.org/display/DSDOC5x/Curation+System">DSpace curation docs</a> the fact that the <code>requiredmetadata</code> curation task stops when it finds a missing metadata field is by design</li>
|
||||
<li>According to the <a href="https://wiki.lyrasis.org/display/DSDOC5x/Curation+System">DSpace curation docs</a> the fact that the <code>requiredmetadata</code> curation task stops when it finds a missing metadata field is by design</li>
|
||||
</ul>
|
||||
<h2 id="2017-05-07">2017-05-07</h2>
|
||||
<ul>
|
||||
@ -337,7 +337,7 @@ COPY 111
|
||||
<ul>
|
||||
<li>File an issue on GitHub to explore/track migration to proper country/region codes (ISO 2/3 and UN M.49): <a href="https://github.com/ilri/DSpace/issues/326">#326</a></li>
|
||||
<li>Ask Peter how the Landportal.info people should acknowledge us as the source of data on their website</li>
|
||||
<li>Communicate with MARLO people about progress on exposing ORCIDs via the REST API, as it is set to be discussed in the <a href="https://wiki.duraspace.org/display/cmtygp/DCAT+Meeting+June+2017">June, 2017 DCAT meeting</a></li>
|
||||
<li>Communicate with MARLO people about progress on exposing ORCIDs via the REST API, as it is set to be discussed in the <a href="https://wiki.lyrasis.org/display/cmtygp/DCAT+Meeting+June+2017">June, 2017 DCAT meeting</a></li>
|
||||
<li>Find all of Amos Omore’s author name variations so I can link them to his authority entry that has an ORCID:</li>
|
||||
</ul>
|
||||
<pre><code>dspace=# select distinct text_value, authority, confidence from metadatavalue where metadata_field_id=3 and text_value like 'Omore, A%';
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-06/" />
|
||||
<meta property="article:published_time" content="2017-06-01T10:14:52+03:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="June, 2017"/>
|
||||
@ -27,7 +27,7 @@
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-06/",
|
||||
"wordCount": "1261",
|
||||
"datePublished": "2017-06-01T10:14:52+03:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -233,7 +233,7 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace impo
|
||||
<pre><code>Java stacktrace: java.util.NoSuchElementException: Timeout waiting for idle object
|
||||
</code></pre><ul>
|
||||
<li>After looking at the Tomcat logs, Munin graphs, and PostgreSQL connection stats, it seems there is just a high load</li>
|
||||
<li>Might be a good time to adjust DSpace’s database connection settings, like I first mentioned in April, 2017 after reading the <a href="https://wiki.duraspace.org/display/cmtygp/DCAT+Meeting+April+2017">2017-04 DCAT comments</a></li>
|
||||
<li>Might be a good time to adjust DSpace’s database connection settings, like I first mentioned in April, 2017 after reading the <a href="https://wiki.lyrasis.org/display/cmtygp/DCAT+Meeting+April+2017">2017-04 DCAT comments</a></li>
|
||||
<li>I’ve adjusted the following in CGSpace’s config:
|
||||
<ul>
|
||||
<li><code>db.maxconnections</code> 30→70 (the default PostgreSQL config allows 100 connections, so DSpace’s default of 30 is quite low)</li>
|
||||
|
@ -19,7 +19,7 @@ We can use PostgreSQL’s extended output format (-x) plus sed to format the
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-07/" />
|
||||
<meta property="article:published_time" content="2017-07-01T18:03:52+03:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="July, 2017"/>
|
||||
@ -45,7 +45,7 @@ We can use PostgreSQL’s extended output format (-x) plus sed to format the
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-07/",
|
||||
"wordCount": "1151",
|
||||
"datePublished": "2017-07-01T18:03:52+03:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -141,7 +141,7 @@ We can use PostgreSQL’s extended output format (-x) plus sed to format the
|
||||
<li>We also need to check to see which ORCID authority core this uses, because it seems to be using an entirely new one rather than the one for <code>dc.contributor.author</code> (which makes sense of course, but fuck, all the author problems aren’t bad enough?!)</li>
|
||||
<li>Also would need to edit XMLUI item displays to incorporate this into authors list</li>
|
||||
<li>And fuck, then anyone consuming our data via REST / OAI will not notice that we have an author outside of <code>dc.contributor.authors</code>… ugh</li>
|
||||
<li>What if we modify the item submission form to use <a href="https://wiki.duraspace.org/display/DSDOC5x/Submission+User+Interface#SubmissionUserInterface-ItemtypeBasedMetadataCollection"><code>type-bind</code> fields to show/hide certain fields depending on the type</a>?</li>
|
||||
<li>What if we modify the item submission form to use <a href="https://wiki.lyrasis.org/display/DSDOC5x/Submission+User+Interface#SubmissionUserInterface-ItemtypeBasedMetadataCollection"><code>type-bind</code> fields to show/hide certain fields depending on the type</a>?</li>
|
||||
</ul>
|
||||
<h2 id="2017-07-05">2017-07-05</h2>
|
||||
<ul>
|
||||
|
@ -31,7 +31,7 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-08/" />
|
||||
<meta property="article:published_time" content="2017-08-01T11:51:52+03:00" />
|
||||
<meta property="article:modified_time" content="2019-03-04T02:43:22+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="August, 2017"/>
|
||||
@ -69,7 +69,7 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-08/",
|
||||
"wordCount": "3542",
|
||||
"datePublished": "2017-08-01T11:51:52+03:00",
|
||||
"dateModified": "2019-03-04T02:43:22+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -218,7 +218,7 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
|
||||
<ul>
|
||||
<li>Alan to follow up with ICARDA about depositing in CGSpace, we want ICARDA and Drylands legacy content but not duplicates</li>
|
||||
<li>Alan to follow up on dc.rights, where are we?</li>
|
||||
<li>Alan to follow up with Atmire about a dedicated field for ORCIDs, based on the discussion in the <a href="https://wiki.duraspace.org/display/cmtygp/DCAT+Meeting+June+2017">June, 2017 DCAT meeting</a></li>
|
||||
<li>Alan to follow up with Atmire about a dedicated field for ORCIDs, based on the discussion in the <a href="https://wiki.lyrasis.org/display/cmtygp/DCAT+Meeting+June+2017">June, 2017 DCAT meeting</a></li>
|
||||
<li>Alan to ask about how to query external services like AGROVOC in the DSpace submission form</li>
|
||||
</ul>
|
||||
</li>
|
||||
@ -272,7 +272,7 @@ DELETE 1
|
||||
dspace=# delete from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value='WSSD';
|
||||
</code></pre><ul>
|
||||
<li>Generate a new list of authors from the CGIAR Library community for Peter to look through now that the initial corrections have been done</li>
|
||||
<li>Thinking about resource limits for PostgreSQL again after last week’s CGSpace crash and related to a recent discussion I had in the comments of the <a href="https://wiki.duraspace.org/display/cmtygp/DCAT+Meeting+April+2017">April, 2017 DCAT meeting notes</a></li>
|
||||
<li>Thinking about resource limits for PostgreSQL again after last week’s CGSpace crash and related to a recent discussion I had in the comments of the <a href="https://wiki.lyrasis.org/display/cmtygp/DCAT+Meeting+April+2017">April, 2017 DCAT meeting notes</a></li>
|
||||
<li>In that thread Chris Wilper suggests a new default of 35 max connections for <code>db.maxconnections</code> (from the current default of 30), knowing that <em>each DSpace web application</em> gets to use up to this many on its own</li>
|
||||
<li>It would be good to approximate what the theoretical maximum number of connections on a busy server would be, perhaps by looking to see which apps use SQL:</li>
|
||||
</ul>
|
||||
@ -406,7 +406,7 @@ dspace.log.2017-08-17:584
|
||||
<li>He linked to some examples from DSpace-CRIS that use this functionality: <a href="https://github.com/4Science/DSpace/blob/dspace-5_x_x-cris/dspace-api/src/main/java/org/dspace/content/authority/VIAFAuthority.java">VIAFAuthority</a></li>
|
||||
<li>I wired it up to the <code>dc.subject</code> field of the submission interface using the “lookup” type and it works!</li>
|
||||
<li>I think we can use this example to get a working AGROVOC query</li>
|
||||
<li>More information about authority framework: <a href="https://wiki.duraspace.org/display/DSPACE/Authority+Control+of+Metadata+Values">https://wiki.duraspace.org/display/DSPACE/Authority+Control+of+Metadata+Values</a></li>
|
||||
<li>More information about authority framework: <a href="https://wiki.lyrasis.org/display/DSPACE/Authority+Control+of+Metadata+Values">https://wiki.lyrasis.org/display/DSPACE/Authority+Control+of+Metadata+Values</a></li>
|
||||
<li>Wow, I’m playing with the AGROVOC SPARQL endpoint using the <a href="https://github.com/tialaramex/sparql-query">sparql-query tool</a>:</li>
|
||||
</ul>
|
||||
<pre><code>$ ./sparql-query http://202.45.139.84:10035/catalogs/fao/repositories/agrovoc
|
||||
|
@ -17,7 +17,7 @@ Ask Sisay to clean up the WLE approvers a bit, as Marianne’s user account
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-09/" />
|
||||
<meta property="article:published_time" content="2017-09-07T16:54:52+07:00" />
|
||||
<meta property="article:modified_time" content="2018-03-09T22:10:33+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="September, 2017"/>
|
||||
@ -41,7 +41,7 @@ Ask Sisay to clean up the WLE approvers a bit, as Marianne’s user account
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-09/",
|
||||
"wordCount": "4199",
|
||||
"datePublished": "2017-09-07T16:54:52+07:00",
|
||||
"dateModified": "2018-03-09T22:10:33+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -167,7 +167,7 @@ dspace.log.2017-09-10:0
|
||||
</ul>
|
||||
<h2 id="2017-09-12">2017-09-12</h2>
|
||||
<ul>
|
||||
<li>I was testing the <a href="https://wiki.duraspace.org/display/DSDOC5x/AIP+Backup+and+Restore#AIPBackupandRestore-AIPConfigurationsToImproveIngestionSpeedwhileValidating">METS XSD caching during AIP ingest</a> but it doesn’t seem to help actually</li>
|
||||
<li>I was testing the <a href="https://wiki.lyrasis.org/display/DSDOC5x/AIP+Backup+and+Restore#AIPBackupandRestore-AIPConfigurationsToImproveIngestionSpeedwhileValidating">METS XSD caching during AIP ingest</a> but it doesn’t seem to help actually</li>
|
||||
<li>The import process takes the same amount of time with and without the caching</li>
|
||||
<li>Also, I captured TCP packets destined for port 80 and both imports only captured ONE packet (an update check from some component in Java):</li>
|
||||
</ul>
|
||||
@ -453,7 +453,7 @@ DELETE 207
|
||||
</code></pre><ul>
|
||||
<li>More work on the CGIAR Library migration test run locally, as I was having problems importing the last fourteen items from the CGIAR System Management Office community</li>
|
||||
<li>The problem was that we remapped the items to new collections after the initial import, so the items were using the 10947 prefix but the community and collection were using 10568</li>
|
||||
<li>I ended up having to read the <a href="https://wiki.duraspace.org/display/DSDOC5x/AIP+Backup+and+Restore#AIPBackupandRestore-ForceReplaceMode">AIP Backup and Restore</a> closely a few times and then explicitly preserve handles and ignore parents:</li>
|
||||
<li>I ended up having to read the <a href="https://wiki.lyrasis.org/display/DSDOC5x/AIP+Backup+and+Restore#AIPBackupandRestore-ForceReplaceMode">AIP Backup and Restore</a> closely a few times and then explicitly preserve handles and ignore parents:</li>
|
||||
</ul>
|
||||
<pre><code>$ for item in 10568-93759/ITEM@10947-46*; do ~/dspace/bin/dspace packager -r -t AIP -o ignoreHandle=false -o ignoreParent=true -e aorth@mjanja.ch -p 10568/87738 $item; done
|
||||
</code></pre><ul>
|
||||
@ -503,7 +503,7 @@ DELETE 207
|
||||
</ul>
|
||||
<h2 id="2017-09-22">2017-09-22</h2>
|
||||
<ul>
|
||||
<li>Experimenting with setting up a global JNDI database resource that can be pooled among all the DSpace webapps (reference the <a href="https://wiki.duraspace.org/display/cmtygp/DCAT+Meeting+April+2017">April, 2017 DCAT meeting</a> comments)</li>
|
||||
<li>Experimenting with setting up a global JNDI database resource that can be pooled among all the DSpace webapps (reference the <a href="https://wiki.lyrasis.org/display/cmtygp/DCAT+Meeting+April+2017">April, 2017 DCAT meeting</a> comments)</li>
|
||||
<li>See: <a href="https://www.journaldev.com/2513/tomcat-datasource-jndi-example-java">https://www.journaldev.com/2513/tomcat-datasource-jndi-example-java</a></li>
|
||||
<li>See: <a href="http://memorynotfound.com/configure-jndi-datasource-tomcat/">http://memorynotfound.com/configure-jndi-datasource-tomcat/</a></li>
|
||||
</ul>
|
||||
|
@ -16,7 +16,7 @@ The list of connections to XMLUI and REST API for today:
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2017-12/" />
|
||||
<meta property="article:published_time" content="2017-12-01T13:53:54+03:00" />
|
||||
<meta property="article:modified_time" content="2019-10-28T13:39:25+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="December, 2017"/>
|
||||
@ -39,7 +39,7 @@ The list of connections to XMLUI and REST API for today:
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2017-12/",
|
||||
"wordCount": "4088",
|
||||
"datePublished": "2017-12-01T13:53:54+03:00",
|
||||
"dateModified": "2019-10-28T13:39:25+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -477,7 +477,7 @@ $ schedtool -D -e ionice -c2 -n7 nice -n19 dspace index-discovery
|
||||
</code></pre><ul>
|
||||
<li>I don’t have time now to look into this but the Solr sharding has long been an issue!</li>
|
||||
<li>Looking into using JDBC / JNDI to provide a database pool to DSpace</li>
|
||||
<li>The <a href="https://wiki.duraspace.org/display/DSDOC6x/Configuration+Reference">DSpace 6.x configuration docs</a> have more notes about setting up the database pool than the 5.x ones (which actually have none!)</li>
|
||||
<li>The <a href="https://wiki.lyrasis.org/display/DSDOC6x/Configuration+Reference">DSpace 6.x configuration docs</a> have more notes about setting up the database pool than the 5.x ones (which actually have none!)</li>
|
||||
<li>First, I uncomment <code>db.jndi</code> in <em>dspace/config/dspace.cfg</em></li>
|
||||
<li>Then I create a global <code>Resource</code> in the main Tomcat <em>server.xml</em> (inside <code>GlobalNamingResources</code>):</li>
|
||||
</ul>
|
||||
|
@ -76,7 +76,7 @@ Danny wrote to ask for help renewing the wildcard ilri.org certificate and I adv
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2018-01/" />
|
||||
<meta property="article:published_time" content="2018-01-02T08:35:54-08:00" />
|
||||
<meta property="article:modified_time" content="2019-10-28T13:39:25+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="January, 2018"/>
|
||||
@ -159,7 +159,7 @@ Danny wrote to ask for help renewing the wildcard ilri.org certificate and I adv
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2018-01/",
|
||||
"wordCount": "7940",
|
||||
"datePublished": "2018-01-02T08:35:54-08:00",
|
||||
"dateModified": "2019-10-28T13:39:25+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -545,7 +545,7 @@ Caused by: org.apache.http.client.ClientProtocolException
|
||||
at org.apache.solr.client.solrj.impl.HttpSolrServer.executeMethod(HttpSolrServer.java:448)
|
||||
... 10 more
|
||||
</code></pre><ul>
|
||||
<li>There is interesting documentation about this on the DSpace Wiki: <a href="https://wiki.duraspace.org/display/DSDOC5x/SOLR+Statistics+Maintenance#SOLRStatisticsMaintenance-SolrShardingByYear">https://wiki.duraspace.org/display/DSDOC5x/SOLR+Statistics+Maintenance#SOLRStatisticsMaintenance-SolrShardingByYear</a></li>
|
||||
<li>There is interesting documentation about this on the DSpace Wiki: <a href="https://wiki.lyrasis.org/display/DSDOC5x/SOLR+Statistics+Maintenance#SOLRStatisticsMaintenance-SolrShardingByYear">https://wiki.lyrasis.org/display/DSDOC5x/SOLR+Statistics+Maintenance#SOLRStatisticsMaintenance-SolrShardingByYear</a></li>
|
||||
<li>I’m looking to see maybe if we’re hitting the issues mentioned in <a href="https://jira.duraspace.org/browse/DS-2212">DS-2212</a> that were apparently fixed in DSpace 5.2</li>
|
||||
<li>I can apparently search for records in the Solr stats core that have an empty <code>owningColl</code> field using this in the Solr admin query: <code>-owningColl:*</code></li>
|
||||
<li>On CGSpace I see 48,000,000 records that have an <code>owningColl</code> field and 34,000,000 that don’t:</li>
|
||||
@ -561,7 +561,7 @@ $ http 'http://localhost:3000/solr/statistics/select?q=-owningColl%3A*&wt=js
|
||||
<pre><code>2018-01-10 10:51:19,301 INFO org.dspace.statistics.SolrLogger @ Created core with name: statistics-2016
|
||||
2018-01-10 10:51:19,301 INFO org.dspace.statistics.SolrLogger @ Moving: 3821 records into core statistics-2016
|
||||
</code></pre><ul>
|
||||
<li>Terry Brady has written some notes on the DSpace Wiki about Solr sharding issues: <a href="https://wiki.duraspace.org/display/%7Eterrywbrady/Statistics+Import+Export+Issues">https://wiki.duraspace.org/display/%7Eterrywbrady/Statistics+Import+Export+Issues</a></li>
|
||||
<li>Terry Brady has written some notes on the DSpace Wiki about Solr sharding issues: <a href="https://wiki.lyrasis.org/display/%7Eterrywbrady/Statistics+Import+Export+Issues">https://wiki.lyrasis.org/display/%7Eterrywbrady/Statistics+Import+Export+Issues</a></li>
|
||||
<li>Uptime Robot said that CGSpace went down at around 9:43 AM</li>
|
||||
<li>I looked at PostgreSQL’s <code>pg_stat_activity</code> table and saw 161 active connections, but no pool errors in the DSpace logs:</li>
|
||||
</ul>
|
||||
@ -680,7 +680,7 @@ cache_alignment : 64
|
||||
</ul>
|
||||
<h2 id="2018-01-12">2018-01-12</h2>
|
||||
<ul>
|
||||
<li>I’m looking at the <a href="https://wiki.duraspace.org/display/DSDOC6x/Installing+DSpace#InstallingDSpace-ServletEngine(ApacheTomcat7orlater,Jetty,CauchoResinorequivalent)">DSpace 6.0 Install docs</a> and notice they tweak the number of threads in their Tomcat connector:</li>
|
||||
<li>I’m looking at the <a href="https://wiki.lyrasis.org/display/DSDOC6x/Installing+DSpace#InstallingDSpace-ServletEngine(ApacheTomcat7orlater,Jetty,CauchoResinorequivalent)">DSpace 6.0 Install docs</a> and notice they tweak the number of threads in their Tomcat connector:</li>
|
||||
</ul>
|
||||
<pre><code><!-- Define a non-SSL HTTP/1.1 Connector on port 8080 -->
|
||||
<Connector port="8080"
|
||||
@ -790,7 +790,7 @@ dspace=# select handle from item, handle where handle.resource_id = item.item_id
|
||||
(0 rows)
|
||||
</code></pre><ul>
|
||||
<li>Even searching in the DSpace advanced search for author equals “Tarawali” produces nothing…</li>
|
||||
<li>Otherwise, the <a href="https://wiki.duraspace.org/display/DSPACE/Helper+SQL+functions+for+DSpace+5">DSpace 5 SQL Helper Functions</a> provide <code>ds5_item2itemhandle()</code>, which is much easier than my long query above that I always have to go search for</li>
|
||||
<li>Otherwise, the <a href="https://wiki.lyrasis.org/display/DSPACE/Helper+SQL+functions+for+DSpace+5">DSpace 5 SQL Helper Functions</a> provide <code>ds5_item2itemhandle()</code>, which is much easier than my long query above that I always have to go search for</li>
|
||||
<li>For example, to find the Handle for an item that has the author “Erni”:</li>
|
||||
</ul>
|
||||
<pre><code>dspace=# select * from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value='Erni';
|
||||
|
@ -20,7 +20,7 @@ Also, I switched it to use OpenJDK instead of Oracle Java, as well as re-worked
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2018-05/" />
|
||||
<meta property="article:published_time" content="2018-05-01T16:43:54+03:00" />
|
||||
<meta property="article:modified_time" content="2019-10-28T13:39:25+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="May, 2018"/>
|
||||
@ -47,7 +47,7 @@ Also, I switched it to use OpenJDK instead of Oracle Java, as well as re-worked
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2018-05/",
|
||||
"wordCount": "3503",
|
||||
"datePublished": "2018-05-01T16:43:54+03:00",
|
||||
"dateModified": "2019-10-28T13:39:25+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -419,7 +419,7 @@ $ ./bin/post -c countries ~/src/git/DSpace/2018-05-10-countries.csv
|
||||
<h2 id="2018-05-21">2018-05-21</h2>
|
||||
<ul>
|
||||
<li>Geoffrey from IITA got back with more questions about depositing items programmatically into the CGSpace workflow</li>
|
||||
<li>I pointed out that <a href="http://swordapp.org/">SWORD</a> might be an option, as <a href="https://wiki.duraspace.org/display/DSDOC5x/SWORDv2+Server">DSpace supports the SWORDv2 protocol</a> (although we have never tested it)</li>
|
||||
<li>I pointed out that <a href="http://swordapp.org/">SWORD</a> might be an option, as <a href="https://wiki.lyrasis.org/display/DSDOC5x/SWORDv2+Server">DSpace supports the SWORDv2 protocol</a> (although we have never tested it)</li>
|
||||
<li>Work on implementing <a href="https://cookieconsent.insites.com">cookie consent</a> popup for all XMLUI themes (SASS theme with primary / secondary branding from Bootstrap)</li>
|
||||
</ul>
|
||||
<h2 id="2018-05-22">2018-05-22</h2>
|
||||
|
@ -16,7 +16,7 @@ I’m testing the new DSpace 5.8 branch in my Ubuntu 18.04 environment and I
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2018-09/" />
|
||||
<meta property="article:published_time" content="2018-09-02T09:55:54+03:00" />
|
||||
<meta property="article:modified_time" content="2020-01-21T09:37:11+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="September, 2018"/>
|
||||
@ -39,7 +39,7 @@ I’m testing the new DSpace 5.8 branch in my Ubuntu 18.04 environment and I
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2018-09/",
|
||||
"wordCount": "5246",
|
||||
"datePublished": "2018-09-02T09:55:54+03:00",
|
||||
"dateModified": "2020-01-21T09:37:11+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -391,7 +391,7 @@ $ grep -c -E 'session_id=[A-Z0-9]{32}:ip_addr=50.116.102.77' dspace.log.2018-09-
|
||||
</li>
|
||||
<li>I want to explore creating a thin API to make the item view and download stats available from Solr so CodeObia can use them in the AReS explorer</li>
|
||||
<li>Currently CodeObia is exploring using the Atmire statlets internal API, but I don’t really like that…</li>
|
||||
<li>There are some example queries on the <a href="https://wiki.duraspace.org/display/DSPACE/Solr">DSpace Solr wiki</a></li>
|
||||
<li>There are some example queries on the <a href="https://wiki.lyrasis.org/display/DSPACE/Solr">DSpace Solr wiki</a></li>
|
||||
<li>For example, this query returns 1655 rows for item <a href="https://cgspace.cgiar.org/handle/10568/10630">10568/10630</a>:</li>
|
||||
</ul>
|
||||
<pre><code>$ http 'http://localhost:3000/solr/statistics/select?indent=on&rows=0&q=type:0+owningItem:11576&fq=isBot:false'
|
||||
@ -553,7 +553,7 @@ dspacestatistics-> (id INT PRIMARY KEY, views INT DEFAULT 0, downloads INT DE
|
||||
<li>I want to purge the bot hits from the Solr statistics core, as I am now realizing that I don’t give a shit about tens of millions of hits by Google and Bing indexing my shit every day (at least not in Solr!)</li>
|
||||
<li>CGSpace’s Solr core has 150,000,000 documents in it… and it’s still pretty fast to query, but it’s really a maintenance and backup burden</li>
|
||||
<li>DSpace Test currently has about 2,000,000 documents with <code>isBot:true</code> in its Solr statistics core, and the size on disk is 2GB (it’s not much, but I have to test this somewhere!)</li>
|
||||
<li>According to the <a href="https://wiki.duraspace.org/display/DSDOC5x/SOLR+Statistics+Maintenance">DSpace 5.x Solr documentation</a> I can use <code>dspace stats-util -f</code>, so let’s try it:</li>
|
||||
<li>According to the <a href="https://wiki.lyrasis.org/display/DSDOC5x/SOLR+Statistics+Maintenance">DSpace 5.x Solr documentation</a> I can use <code>dspace stats-util -f</code>, so let’s try it:</li>
|
||||
</ul>
|
||||
<pre><code>$ dspace stats-util -f
|
||||
</code></pre><ul>
|
||||
|
@ -14,7 +14,7 @@ I created a GitHub issue to track this #389, because I’m super busy in Nai
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2018-10/" />
|
||||
<meta property="article:published_time" content="2018-10-01T22:31:54+03:00" />
|
||||
<meta property="article:modified_time" content="2020-02-23T20:10:47+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="October, 2018"/>
|
||||
@ -35,7 +35,7 @@ I created a GitHub issue to track this #389, because I’m super busy in Nai
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2018-10/",
|
||||
"wordCount": "4518",
|
||||
"datePublished": "2018-10-01T22:31:54+03:00",
|
||||
"dateModified": "2020-02-23T20:10:47+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -619,7 +619,7 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: app
|
||||
<ul>
|
||||
<li>Meet with the COPO guys to walk them through the CGSpace submission workflow and discuss CG core, REST API, etc
|
||||
<ul>
|
||||
<li>I suggested that they look into submitting via the <a href="https://wiki.duraspace.org/display/DSDOC5x/SWORDv2+Server">SWORDv2</a> protocol because it respects the workflows</li>
|
||||
<li>I suggested that they look into submitting via the <a href="https://wiki.lyrasis.org/display/DSDOC5x/SWORDv2+Server">SWORDv2</a> protocol because it respects the workflows</li>
|
||||
<li>They said that they’re not too worried about the hierarchical CG core schema, that they would just flatten metadata like affiliations when depositing to a DSpace repository</li>
|
||||
<li>I said that it might be time to engage the DSpace community to add support for more advanced schemas in DSpace 7+ (perhaps partnership with Atmire?)</li>
|
||||
</ul>
|
||||
|
@ -24,7 +24,7 @@ I think I will need to ask Udana to re-copy and paste the abstracts with more ca
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-03/" />
|
||||
<meta property="article:published_time" content="2019-03-01T12:16:30+01:00" />
|
||||
<meta property="article:modified_time" content="2019-10-28T13:39:25+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="March, 2019"/>
|
||||
@ -55,7 +55,7 @@ I think I will need to ask Udana to re-copy and paste the abstracts with more ca
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2019-03/",
|
||||
"wordCount": "7105",
|
||||
"datePublished": "2019-03-01T12:16:30+01:00",
|
||||
"dateModified": "2019-10-28T13:39:25+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -283,7 +283,7 @@ UPDATE 44
|
||||
<li>I think that I could get the resource IDs from SQL and then export them using <code>dspace metadata-export</code>…</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>This is a bit ugly, but it works (using the <a href="https://wiki.duraspace.org/display/DSPACE/Helper+SQL+functions+for+DSpace+5">DSpace 5 SQL helper function</a> to resolve ID to handle):</li>
|
||||
<li>This is a bit ugly, but it works (using the <a href="https://wiki.lyrasis.org/display/DSPACE/Helper+SQL+functions+for+DSpace+5">DSpace 5 SQL helper function</a> to resolve ID to handle):</li>
|
||||
</ul>
|
||||
<pre><code>for id in $(psql -U postgres -d dspacetest -h localhost -c "SELECT resource_id FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=228 AND text_value LIKE '%SWAZILAND%'" | grep -oE '[0-9]{3,}'); do
|
||||
|
||||
|
@ -33,7 +33,7 @@ $ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-1-region.csv -db dspace
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-04/" />
|
||||
<meta property="article:published_time" content="2019-04-01T09:00:43+03:00" />
|
||||
<meta property="article:modified_time" content="2019-10-28T13:39:25+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="April, 2019"/>
|
||||
@ -73,7 +73,7 @@ $ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-1-region.csv -db dspace
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2019-04/",
|
||||
"wordCount": "6778",
|
||||
"datePublished": "2019-04-01T09:00:43+03:00",
|
||||
"dateModified": "2019-10-28T13:39:25+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -442,7 +442,7 @@ X-XSS-Protection: 1; mode=block
|
||||
<li>I will try to re-deploy the <code>5_x-dev</code> branch and test again</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>According to the <a href="https://wiki.duraspace.org/display/DSDOC5x/SOLR+Statistics">DSpace 5.x Solr documentation</a> the default commit time is after 15 minutes or 10,000 documents (see <code>solrconfig.xml</code>)</li>
|
||||
<li>According to the <a href="https://wiki.lyrasis.org/display/DSDOC5x/SOLR+Statistics">DSpace 5.x Solr documentation</a> the default commit time is after 15 minutes or 10,000 documents (see <code>solrconfig.xml</code>)</li>
|
||||
<li>I looped some GET and HEAD requests to a bitstream on my local instance and after some time I see that they <em>do</em> register as downloads (even though they are internal):</li>
|
||||
</ul>
|
||||
<pre><code>$ http --print b 'http://localhost:8080/solr/statistics/select?q=type%3A0+AND+time%3A2019-04-07*&fq=statistics_type%3Aview&fq=isInternal%3Atrue&rows=0&wt=json&indent=true'
|
||||
@ -1229,7 +1229,7 @@ COPY 65752
|
||||
</ul>
|
||||
<pre><code>2019-04-28 08:53:07,170 ERROR org.dspace.rest.ItemsResource @ User(anonymous) has not permission to read item(id=74648)!
|
||||
</code></pre><ul>
|
||||
<li>I even tried to “expunge” the item using an <a href="https://wiki.duraspace.org/display/DSDOC5x/Batch+Metadata+Editing#BatchMetadataEditing-Performing'actions'onitems">action in CSV</a>, and it said “EXPUNGED!” but the item is still there…</li>
|
||||
<li>I even tried to “expunge” the item using an <a href="https://wiki.lyrasis.org/display/DSDOC5x/Batch+Metadata+Editing#BatchMetadataEditing-Performing'actions'onitems">action in CSV</a>, and it said “EXPUNGED!” but the item is still there…</li>
|
||||
</ul>
|
||||
<h2 id="2019-04-30">2019-04-30</h2>
|
||||
<ul>
|
||||
|
@ -25,7 +25,7 @@ But after this I tried to delete the item from the XMLUI and it is still present
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-05/" />
|
||||
<meta property="article:published_time" content="2019-05-01T07:37:43+03:00" />
|
||||
<meta property="article:modified_time" content="2020-02-24T18:07:35+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="May, 2019"/>
|
||||
@ -57,7 +57,7 @@ But after this I tried to delete the item from the XMLUI and it is still present
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2019-05/",
|
||||
"wordCount": "3190",
|
||||
"datePublished": "2019-05-01T07:37:43+03:00",
|
||||
"dateModified": "2020-02-24T18:07:35+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -189,7 +189,7 @@ curl: (22) The requested URL returned error: 401 Unauthorized
|
||||
<li>CIP is asking about embedding PDF thumbnail images in their RSS feeds again
|
||||
<ul>
|
||||
<li>They asked in 2018-09 as well and I told them it wasn’t possible</li>
|
||||
<li>To make sure, I looked at <a href="https://wiki.duraspace.org/display/DSPACE/Enable+Media+RSS+Feeds">the documentation for RSS media feeds</a> and tried it, but couldn’t get it to work</li>
|
||||
<li>To make sure, I looked at <a href="https://wiki.lyrasis.org/display/DSPACE/Enable+Media+RSS+Feeds">the documentation for RSS media feeds</a> and tried it, but couldn’t get it to work</li>
|
||||
<li>It seems to be geared towards iTunes and Podcasts… I dunno</li>
|
||||
</ul>
|
||||
</li>
|
||||
|
@ -37,7 +37,7 @@ Here are the top ten IPs in the nginx XMLUI and REST/OAI logs this morning:
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-09/" />
|
||||
<meta property="article:published_time" content="2019-09-01T10:17:51+03:00" />
|
||||
<meta property="article:modified_time" content="2019-10-28T13:39:25+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="September, 2019"/>
|
||||
@ -81,7 +81,7 @@ Here are the top ten IPs in the nginx XMLUI and REST/OAI logs this morning:
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2019-09/",
|
||||
"wordCount": "2870",
|
||||
"datePublished": "2019-09-01T10:17:51+03:00",
|
||||
"dateModified": "2019-10-28T13:39:25+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -230,7 +230,7 @@ Here are the top ten IPs in the nginx XMLUI and REST/OAI logs this morning:
|
||||
</li>
|
||||
<li>Continue working on CG Core v2 migration, focusing on the crosswalk mappings
|
||||
<ul>
|
||||
<li>I think we can skip the MODS crosswalk for now because it is only used in <a href="https://wiki.duraspace.org/display/DSDOC5x/DSpace+AIP+Format#DSpaceAIPFormat-MODSSchema">AIP exports that are meant for non-DSpace systems</a></li>
|
||||
<li>I think we can skip the MODS crosswalk for now because it is only used in <a href="https://wiki.lyrasis.org/display/DSDOC5x/DSpace+AIP+Format#DSpaceAIPFormat-MODSSchema">AIP exports that are meant for non-DSpace systems</a></li>
|
||||
<li>We should probably do the QDC crosswalk as well as those in <code>xhtml-head-item.properties</code>…</li>
|
||||
<li>Ouch, there is potentially a lot of work in the OAI metadata formats like DIM, METS, and QDC (see <code>dspace/config/crosswalks/oai/*.xsl</code>)</li>
|
||||
<li>In general I think I should only modify the left side of the crosswalk mappings (ie, where metadata is coming from) so we maintain the same exact output for search engines, etc</li>
|
||||
|
@ -25,7 +25,7 @@ On the same note, the one item Abenet pointed out last week now has a donut with
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-04/" />
|
||||
<meta property="article:published_time" content="2020-04-02T10:53:24+03:00" />
|
||||
<meta property="article:modified_time" content="2020-04-07T16:32:02+03:00" />
|
||||
<meta property="article:modified_time" content="2020-04-12T17:09:14+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="April, 2020"/>
|
||||
@ -55,9 +55,9 @@ On the same note, the one item Abenet pointed out last week now has a donut with
|
||||
"@type": "BlogPosting",
|
||||
"headline": "April, 2020",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2020-04/",
|
||||
"wordCount": "793",
|
||||
"wordCount": "1025",
|
||||
"datePublished": "2020-04-02T10:53:24+03:00",
|
||||
"dateModified": "2020-04-07T16:32:02+03:00",
|
||||
"dateModified": "2020-04-12T17:09:14+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -250,7 +250,34 @@ $ ~/dspace63/bin/dspace solr-upgrade-statistics-6x
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<!-- raw HTML omitted -->
|
||||
<h2 id="2020-04-13">2020-04-13</h2>
|
||||
<ul>
|
||||
<li>I realized that <code>solr-upgrade-statistics-6x</code> only processes 100,000 records by default, so I think we need to finish running it for all legacy Solr records before asking Atmire why CUA statlets and detailed statistics aren’t working</li>
|
||||
<li>For now I am just doing 250,000 records at a time on my local environment:</li>
|
||||
</ul>
|
||||
<pre><code>$ export JAVA_OPTS="-Xmx2000m -Dfile.encoding=UTF-8"
|
||||
$ ~/dspace63/bin/dspace solr-upgrade-statistics-6x -n 250000
|
||||
</code></pre><ul>
|
||||
<li>Despite running the migration for all of my local 1.5 million Solr records, I still see a few hundred thousand with IDs like <code>-1</code> and <code>0-unmigrated</code>
|
||||
<ul>
|
||||
<li>I will purge them all and try to import only a subset…</li>
|
||||
<li>After importing again I see there are indeed tens of thousands of these documents with IDs “-1” and “0”</li>
|
||||
<li>They are all <code>type: 5</code>, which is “SITE” according to <code>Constants.java</code>:</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<pre><code>/** DSpace site type */
|
||||
public static final int SITE = 5;
|
||||
</code></pre><ul>
|
||||
<li>Even after deleting those documents and re-running <code>solr-upgrade-statistics-6x</code> I still get the UUID errors when using CUA and the statlets</li>
|
||||
<li>I have sent some feedback and questions to Atmire (including about the issue with glyphicons in the header trail)</li>
|
||||
<li>In other news, my local Artifactory container stopped working for some reason so I re-created it and it seems some things have changed upstream (port 8082 for web UI?):</li>
|
||||
</ul>
|
||||
<pre><code>$ podman rm artifactory
|
||||
$ podman pull docker.bintray.io/jfrog/artifactory-oss:latest
|
||||
$ podman create --ulimit nofile=32000:32000 --name artifactory -v artifactory_data:/var/opt/jfrog/artifactory -p 8081-8082:8081-8082 docker.bintray.io/jfrog/artifactory-oss
|
||||
$ podman start artifactory
|
||||
</code></pre><!-- raw HTML omitted -->
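The batched migration in the 2020-04-13 notes above (250,000 records per run against roughly 1.5 million local Solr records) could be scripted along these lines. This is only a minimal sketch: the helper names `batches_needed` and `run_batches`, and the `DSPACE_BIN` and `DRY_RUN` variables, are hypothetical and not part of DSpace. It also assumes that each invocation picks up the next set of unmigrated records, which the notes imply but do not state outright. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Path to the dspace launcher; the default mirrors the path used in the notes.
DSPACE_BIN="${DSPACE_BIN:-$HOME/dspace63/bin/dspace}"

# Hypothetical helper: how many runs of size $2 are needed to cover $1 records
# (integer ceiling division).
batches_needed() {
    echo $(( ($1 + $2 - 1) / $2 ))
}

# Hypothetical helper: run (or, with DRY_RUN=1, just print) one upgrade
# command per batch. Only the -n option seen in the notes is used.
run_batches() {
    total=$1
    batch=$2
    n=$(batches_needed "$total" "$batch")
    i=1
    while [ "$i" -le "$n" ]; do
        if [ "${DRY_RUN:-1}" = "1" ]; then
            echo "$DSPACE_BIN solr-upgrade-statistics-6x -n $batch"
        else
            # Stop at the first failing run instead of continuing blindly.
            "$DSPACE_BIN" solr-upgrade-statistics-6x -n "$batch" || return 1
        fi
        i=$((i + 1))
    done
}
```

Calling `run_batches 1500000 250000` prints the command once per batch; setting `DRY_RUN=0` would execute the runs for real, so it is worth checking the printed commands first.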
|
||||
|
||||
|
||||
|
||||
|
@ -217,7 +217,7 @@ DELETE 1
|
||||
</header>
|
||||
<h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/" />
|
||||
<meta property="article:published_time" content="2019-10-28T13:27:35+02:00" />
|
||||
<meta property="article:modified_time" content="2019-12-22T11:56:45+02:00" />
|
||||
<meta property="article:modified_time" content="2020-04-13T15:30:24+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace CG Core v2 Migration"/>
|
||||
@ -27,7 +27,7 @@
|
||||
"url": "https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/",
|
||||
"wordCount": "555",
|
||||
"datePublished": "2019-10-28T13:27:35+02:00",
|
||||
"dateModified": "2019-12-22T11:56:45+02:00",
|
||||
"dateModified": "2020-04-13T15:30:24+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -424,7 +424,7 @@
|
||||
<p>There are a few things that I need to check once I get a deployment of this code up and running:</p>
|
||||
<ul>
|
||||
<li>Assess the XSL changes to see if things like <code>not(@qualifier)]</code> still make sense after we move fields from DC to DCTERMS, as some fields will no longer have qualifiers</li>
|
||||
<li>Do I need to edit crosswalks that we are not using, like <a href="https://wiki.duraspace.org/display/DSDOC5x/DSpace+AIP+Format#DSpaceAIPFormat-MODSSchema">MODS</a>?</li>
|
||||
<li>Do I need to edit crosswalks that we are not using, like <a href="https://wiki.lyrasis.org/display/DSDOC5x/DSpace+AIP+Format#DSpaceAIPFormat-MODSSchema">MODS</a>?</li>
|
||||
<li>There is potentially a lot of work in the OAI metadata formats like DIM, METS, and QDC (see <code>dspace/config/crosswalks/oai/*.xsl</code>)</li>
|
||||
</ul>
|
||||
<hr>
|
||||
|
@ -954,7 +954,7 @@ DELETE 1
|
||||
<guid>https://alanorth.github.io/cgspace-notes/2016-10/</guid>
|
||||
<description><h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -217,7 +217,7 @@ DELETE 1
|
||||
</header>
|
||||
<h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -954,7 +954,7 @@ DELETE 1
|
||||
<guid>https://alanorth.github.io/cgspace-notes/2016-10/</guid>
|
||||
<description><h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -217,7 +217,7 @@ DELETE 1
|
||||
</header>
|
||||
<h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -4,27 +4,27 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2020-04/</loc>
|
||||
<lastmod>2020-04-07T16:32:02+03:00</lastmod>
|
||||
<lastmod>2020-04-12T17:09:14+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
|
||||
<lastmod>2020-04-07T16:32:02+03:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||
<lastmod>2020-04-07T16:32:02+03:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
|
||||
<lastmod>2020-04-07T16:32:02+03:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
||||
<lastmod>2020-04-07T16:32:02+03:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -54,17 +54,17 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/</loc>
|
||||
<lastmod>2019-12-22T11:56:45+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/tags/migration/</loc>
|
||||
<lastmod>2019-12-22T11:56:45+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||
<lastmod>2019-12-22T11:56:45+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -74,7 +74,7 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2019-09/</loc>
|
||||
<lastmod>2019-10-28T13:39:25+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -94,17 +94,17 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2019-05/</loc>
|
||||
<lastmod>2020-02-24T18:07:35+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2019-04/</loc>
|
||||
<lastmod>2019-10-28T13:39:25+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2019-03/</loc>
|
||||
<lastmod>2019-10-28T13:39:25+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -129,12 +129,12 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2018-10/</loc>
|
||||
<lastmod>2020-02-23T20:10:47+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2018-09/</loc>
|
||||
<lastmod>2020-01-21T09:37:11+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -154,7 +154,7 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2018-05/</loc>
|
||||
<lastmod>2019-10-28T13:39:25+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -174,12 +174,12 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2018-01/</loc>
|
||||
<lastmod>2019-10-28T13:39:25+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
|
||||
<lastmod>2019-10-28T13:39:25+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -199,47 +199,47 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||
<lastmod>2019-03-04T02:43:22+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-09/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-08/</loc>
|
||||
<lastmod>2019-03-04T02:43:22+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-07/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-06/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-05/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-04/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-03/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2017-02/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -254,12 +254,12 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2016-11/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2016-10/</loc>
|
||||
<lastmod>2018-04-04T16:25:57+03:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -279,12 +279,12 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2016-06/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2016-05/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -294,7 +294,7 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2016-03/</loc>
|
||||
<lastmod>2018-03-09T22:10:33+02:00</lastmod>
|
||||
<lastmod>2020-04-13T15:30:24+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
|
@ -223,7 +223,7 @@ DELETE 1
|
||||
<guid>https://alanorth.github.io/cgspace-notes/2016-10/</guid>
|
||||
<description><h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -113,7 +113,7 @@
|
||||
</header>
|
||||
<h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|
@ -217,7 +217,7 @@ DELETE 1
|
||||
</header>
|
||||
<h2 id="2016-10-03">2016-10-03</h2>
|
||||
<ul>
|
||||
<li>Testing adding <a href="https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Testing adding <a href="https://wiki.lyrasis.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing">ORCIDs to a CSV</a> file for a single item to see if the author orders get messed up</li>
|
||||
<li>Need to test the following scenarios to see how author order is affected:
|
||||
<ul>
|
||||
<li>ORCIDs only</li>
|
||||
|