mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-26 08:28:18 +01:00
Add notes for 2019-05-17
This commit is contained in:
parent
856084655e
commit
32830944c2
@ -364,4 +364,24 @@ COPY 995
|
|||||||
- Fork the [ICARDA AReS v1 repository](https://github.com/icarda-git/AReS) to [ILRI's GitHub](https://github.com/ilri/AReS) and give access to CodeObia guys
|
- Fork the [ICARDA AReS v1 repository](https://github.com/icarda-git/AReS) to [ILRI's GitHub](https://github.com/ilri/AReS) and give access to CodeObia guys
|
||||||
- The plan is that we develop the v2 code here
|
- The plan is that we develop the v2 code here
|
||||||
|
|
||||||
|
## 2019-05-17
|
||||||
|
|
||||||
|
- Peter sent me a bunch of fixes for investors from yesterday
|
||||||
|
- I did a quick check in Open Refine (trim and collapse whitespace, clean smart quotes, etc) and then applied them on CGSpace:
|
||||||
|
|
||||||
|
```
|
||||||
|
$ ./fix-metadata-values.py -i /tmp/2019-05-16-fix-306-Investors.csv -db dspace-u dspace-p 'fuuu' -f dc.description.sponsorship -m 29 -t correct -d
|
||||||
|
$ ./delete-metadata-values.py -i /tmp/2019-05-16-delete-297-Investors.csv -db dspace -u dspace -p 'fuuu' -m 29 -f dc.description.sponsorship -d
|
||||||
|
```
|
||||||
|
|
||||||
|
- Then I started a full Discovery re-indexing:
|
||||||
|
|
||||||
|
```
|
||||||
|
$ export JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx1024m"
|
||||||
|
$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
|
||||||
|
```
|
||||||
|
|
||||||
|
- I was going to make a new controlled vocabulary of the top 100 terms after these corrections, but I noticed a bunch of duplicates and variations when I sorted them alphabetically
|
||||||
|
- Instead, I exported a new list and asked Peter to look at it again
|
||||||
|
|
||||||
<!-- vim: set sw=2 ts=2: -->
|
<!-- vim: set sw=2 ts=2: -->
|
||||||
|
@ -28,7 +28,7 @@ But after this I tried to delete the item from the XMLUI and it is still present
|
|||||||
<meta property="og:type" content="article" />
|
<meta property="og:type" content="article" />
|
||||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-05/" />
|
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-05/" />
|
||||||
<meta property="article:published_time" content="2019-05-01T07:37:43+03:00"/>
|
<meta property="article:published_time" content="2019-05-01T07:37:43+03:00"/>
|
||||||
<meta property="article:modified_time" content="2019-05-16T00:12:50+03:00"/>
|
<meta property="article:modified_time" content="2019-05-16T18:26:49+03:00"/>
|
||||||
|
|
||||||
<meta name="twitter:card" content="summary"/>
|
<meta name="twitter:card" content="summary"/>
|
||||||
<meta name="twitter:title" content="May, 2019"/>
|
<meta name="twitter:title" content="May, 2019"/>
|
||||||
@ -61,9 +61,9 @@ But after this I tried to delete the item from the XMLUI and it is still present
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "May, 2019",
|
"headline": "May, 2019",
|
||||||
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-05\/",
|
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-05\/",
|
||||||
"wordCount": "2541",
|
"wordCount": "2674",
|
||||||
"datePublished": "2019-05-01T07:37:43\x2b03:00",
|
"datePublished": "2019-05-01T07:37:43\x2b03:00",
|
||||||
"dateModified": "2019-05-16T00:12:50\x2b03:00",
|
"dateModified": "2019-05-16T18:26:49\x2b03:00",
|
||||||
"author": {
|
"author": {
|
||||||
"@type": "Person",
|
"@type": "Person",
|
||||||
"name": "Alan Orth"
|
"name": "Alan Orth"
|
||||||
@ -581,6 +581,28 @@ COPY 995
|
|||||||
</ul></li>
|
</ul></li>
|
||||||
</ul>
|
</ul>
|
||||||
|
|
||||||
|
<h2 id="2019-05-17">2019-05-17</h2>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>Peter sent me a bunch of fixes for investors from yesterday</li>
|
||||||
|
|
||||||
|
<li><p>I did a quick check in Open Refine (trim and collapse whitespace, clean smart quotes, etc) and then applied them on CGSpace:</p>
|
||||||
|
|
||||||
|
<pre><code>$ ./fix-metadata-values.py -i /tmp/2019-05-16-fix-306-Investors.csv -db dspace-u dspace-p 'fuuu' -f dc.description.sponsorship -m 29 -t correct -d
|
||||||
|
$ ./delete-metadata-values.py -i /tmp/2019-05-16-delete-297-Investors.csv -db dspace -u dspace -p 'fuuu' -m 29 -f dc.description.sponsorship -d
|
||||||
|
</code></pre></li>
|
||||||
|
|
||||||
|
<li><p>Then I started a full Discovery re-indexing:</p>
|
||||||
|
|
||||||
|
<pre><code>$ export JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx1024m"
|
||||||
|
$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
|
||||||
|
</code></pre></li>
|
||||||
|
|
||||||
|
<li><p>I was going to make a new controlled vocabulary of the top 100 terms after these corrections, but I noticed a bunch of duplicates and variations when I sorted them alphabetically</p></li>
|
||||||
|
|
||||||
|
<li><p>Instead, I exported a new list and asked Peter to look at it again</p></li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
<!-- vim: set sw=2 ts=2: -->
|
<!-- vim: set sw=2 ts=2: -->
|
||||||
|
|
||||||
|
|
||||||
|
@ -4,30 +4,30 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||||
<lastmod>2019-05-16T00:12:50+03:00</lastmod>
|
<lastmod>2019-05-16T18:26:49+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/2019-05/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/2019-05/</loc>
|
||||||
<lastmod>2019-05-16T00:12:50+03:00</lastmod>
|
<lastmod>2019-05-16T18:26:49+03:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||||
<lastmod>2019-05-16T00:12:50+03:00</lastmod>
|
<lastmod>2019-05-16T18:26:49+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
||||||
<lastmod>2019-05-16T00:12:50+03:00</lastmod>
|
<lastmod>2019-05-16T18:26:49+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||||
<lastmod>2019-05-16T00:12:50+03:00</lastmod>
|
<lastmod>2019-05-16T18:26:49+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user