Update notes for 2020-01

This commit is contained in:
Alan Orth 2020-02-10 10:34:19 +02:00
parent 139988b7f5
commit 4bd85f9323
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
4 changed files with 84 additions and 51 deletions

View File

@ -362,7 +362,7 @@ COPY 2900
- I re-applied all my corrections, filtering out things like multi-value separators and values that are actually ISBNs so I can fix them later
- Then I applied 181 fixes for ISSNs using `fix-metadata-values.py` on DSpace Test and CGSpace (after testing locally):
``
```
$ ./fix-metadata-values.py -i /tmp/2020-01-29-ISSNs-Distinct.csv -db dspace -u dspace -p 'fuuu' -f 'dc.identifier.issn[en_US]' -m 21 -t correct -d
```

View File

@ -63,7 +63,7 @@ I tweeted the CGSpace repository link
"@type": "BlogPosting",
"headline": "January, 2020",
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-01\/",
"wordCount": "3565",
"wordCount": "3523",
"datePublished": "2020-01-06T10:48:30+02:00",
"dateModified": "2020-02-04T13:06:23+02:00",
"author": {
@ -506,48 +506,81 @@ COPY 2900
<li>I re-applied all my corrections, filtering out things like multi-value separators and values that are actually ISBNs so I can fix them later</li>
<li>Then I applied 181 fixes for ISSNs using <code>fix-metadata-values.py</code> on DSpace Test and CGSpace (after testing locally):</li>
</ul>
<p>``
$ ./fix-metadata-values.py -i /tmp/2020-01-29-ISSNs-Distinct.csv -db dspace -u dspace -p &lsquo;fuuu&rsquo; -f &lsquo;dc.identifier.issn[en_US]&rsquo; -m 21 -t correct -d</p>
<pre><code>
## 2020-01-30
- About to start working on the DSpace 6 port and I'm looking at commits that are in the not-yet-tagged DSpace 6.4:
- [DS-4342] improve the performance of the collections/collection_id/items REST endpoint:
- c2e6719fa763e291b81b2d61da2f8c758fe38ff3
- [DS-4136] Improve OAI import performance for a large install:
- 3f81daf3d89b17ff4d08783ee9899e5a745851dc
- 37004bbcf4ca3ef2a74ebc6e4774cb605884864e
- DS-4110: fix issue in legacy id cleanup of stats records
- 3752247d6a4b83ee809cc9b197f34a8ff50b9e74
- e6004e57f0f2f3ce5f433647fe8a467b0176836b
- 2fb3751c9adfe7311c6df43dbd51a41479480f5e
- Fix DS-4066 by update all IDs to string type in schema:
- f15cb33ab4272a3970572e608810de3076d541a3
- DS-3914: Fix community defiliation:
- 19cc9719879cf69019acad72ee13915a4128e859
- b86a7b8d66608ee2bec67fb69b37e27c9a620aa3
- [DS-3849] Default ID 'order by' clause for other 'get items' queries:
- 7b888fa558e5792cd780d1d6a7f75564f4da3bf9
- 8d1aa33f7b9ea5a623e1ed13f139695671c598d4
- [DS-3664] ImageMagick: Only execute &quot;identify&quot; on first page:
- 33ba419f3560639bff8ea002cdfc38345c0fea8d
- DS-3658 Configure ReindexerThread disable reindex
- 1d2f10592ac2d86f28044749f34ac05347ea0e0a
- 05959ef315d2a1670e4b59eee4db21f93ba238fa
- 7253095b623069d7ef0a1a13cc5a21385d0878c9
- [DS-3602] 6x Port: Incremental Update of Legacy Id fields in Solr Statistics:
- 184f2b2153479045fba6239342c63e7f8564b8b6
- Dspace 6 ds 3545 mirage2: custom sitemap.xmap is ignored
- 71c68f2f54dead69329298810d0fecdf76b59c09
- It's annoying that we have to target DSpace 6.3... I think I should totally cherry-pick these when I'm done
- For now I just created a new DSpace repository and checked out the `dspace-6.3` tag and started diffing and copying changes over from our 5.8 repository
- There are some things I need to remember to check:
- `search.index` settings in DSpace 5's dspace.cfg (dunno where they are now)
- `thumbnail-fallback-files.xml`
- The code currently lives in the `6_x-dev` branch
&lt;!-- vim: set sw=2 ts=2: --&gt;
</code></pre>
<pre><code>$ ./fix-metadata-values.py -i /tmp/2020-01-29-ISSNs-Distinct.csv -db dspace -u dspace -p 'fuuu' -f 'dc.identifier.issn[en_US]' -m 21 -t correct -d
</code></pre><h2 id="2020-01-30">2020-01-30</h2>
<ul>
<li>About to start working on the DSpace 6 port and I&rsquo;m looking at commits that are in the not-yet-tagged DSpace 6.4:
<ul>
<li>[DS-4342] improve the performance of the collections/collection_id/items REST endpoint:
<ul>
<li>c2e6719fa763e291b81b2d61da2f8c758fe38ff3</li>
</ul>
</li>
<li>[DS-4136] Improve OAI import performance for a large install:
<ul>
<li>3f81daf3d89b17ff4d08783ee9899e5a745851dc</li>
<li>37004bbcf4ca3ef2a74ebc6e4774cb605884864e</li>
</ul>
</li>
<li>DS-4110: fix issue in legacy id cleanup of stats records
<ul>
<li>3752247d6a4b83ee809cc9b197f34a8ff50b9e74</li>
<li>e6004e57f0f2f3ce5f433647fe8a467b0176836b</li>
<li>2fb3751c9adfe7311c6df43dbd51a41479480f5e</li>
</ul>
</li>
<li>Fix DS-4066 by update all IDs to string type in schema:
<ul>
<li>f15cb33ab4272a3970572e608810de3076d541a3</li>
</ul>
</li>
<li>DS-3914: Fix community defiliation:
<ul>
<li>19cc9719879cf69019acad72ee13915a4128e859</li>
<li>b86a7b8d66608ee2bec67fb69b37e27c9a620aa3</li>
</ul>
</li>
<li>[DS-3849] Default ID &lsquo;order by&rsquo; clause for other &lsquo;get items&rsquo; queries:
<ul>
<li>7b888fa558e5792cd780d1d6a7f75564f4da3bf9</li>
<li>8d1aa33f7b9ea5a623e1ed13f139695671c598d4</li>
</ul>
</li>
<li>[DS-3664] ImageMagick: Only execute &ldquo;identify&rdquo; on first page:
<ul>
<li>33ba419f3560639bff8ea002cdfc38345c0fea8d</li>
</ul>
</li>
<li>DS-3658 Configure ReindexerThread disable reindex
<ul>
<li>1d2f10592ac2d86f28044749f34ac05347ea0e0a</li>
<li>05959ef315d2a1670e4b59eee4db21f93ba238fa</li>
<li>7253095b623069d7ef0a1a13cc5a21385d0878c9</li>
</ul>
</li>
<li>[DS-3602] 6x Port: Incremental Update of Legacy Id fields in Solr Statistics:
<ul>
<li>184f2b2153479045fba6239342c63e7f8564b8b6</li>
</ul>
</li>
<li>Dspace 6 ds 3545 mirage2: custom sitemap.xmap is ignored
<ul>
<li>71c68f2f54dead69329298810d0fecdf76b59c09</li>
</ul>
</li>
</ul>
</li>
<li>It&rsquo;s annoying that we have to target DSpace 6.3&hellip; I think I should totally cherry-pick these when I&rsquo;m done</li>
<li>For now I just created a new DSpace repository and checked out the <code>dspace-6.3</code> tag and started diffing and copying changes over from our 5.8 repository</li>
<li>There are some things I need to remember to check:
<ul>
<li><code>search.index</code> settings in DSpace 5&rsquo;s dspace.cfg (dunno where they are now)</li>
<li><code>thumbnail-fallback-files.xml</code></li>
</ul>
</li>
<li>The code currently lives in the <code>6_x-dev</code> branch</li>
</ul>
<!-- raw HTML omitted -->

View File

@ -20,7 +20,7 @@ The code finally builds and runs with a fresh install
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-02/" />
<meta property="article:published_time" content="2020-02-02T11:56:30+02:00" />
<meta property="article:modified_time" content="2020-02-09T15:55:49+02:00" />
<meta property="article:modified_time" content="2020-02-09T17:34:12+02:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="February, 2020"/>
@ -47,7 +47,7 @@ The code finally builds and runs with a fresh install
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-02\/",
"wordCount": "2551",
"datePublished": "2020-02-02T11:56:30+02:00",
"dateModified": "2020-02-09T15:55:49+02:00",
"dateModified": "2020-02-09T17:34:12+02:00",
"author": {
"@type": "Person",
"name": "Alan Orth"

View File

@ -4,27 +4,27 @@
<url>
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
<lastmod>2020-02-09T15:55:49+02:00</lastmod>
<lastmod>2020-02-09T17:34:12+02:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2020-02-09T15:55:49+02:00</lastmod>
<lastmod>2020-02-09T17:34:12+02:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/2020-02/</loc>
<lastmod>2020-02-09T15:55:49+02:00</lastmod>
<lastmod>2020-02-09T17:34:12+02:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2020-02-09T15:55:49+02:00</lastmod>
<lastmod>2020-02-09T17:34:12+02:00</lastmod>
</url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
<lastmod>2020-02-09T15:55:49+02:00</lastmod>
<lastmod>2020-02-09T17:34:12+02:00</lastmod>
</url>
<url>