mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-22 06:35:03 +01:00
Add notes for 2022-07-03
This commit is contained in:
parent
05cf7a26ec
commit
4d4bde3474
@ -284,5 +284,7 @@ $ csvgrep -c 'number of matches' -r '^0$' /tmp/2022-06-30-cgspace-subjects-resul
|
||||
|
||||
- Then I took all the terms with fifty or more occurences and put them on a Google Sheet
|
||||
- There I started removing any term that was a variation of an existing AGROVOC term (like cowpea/cowpeas, policy/policies) or a compound concept
|
||||
- pnbecker on DSpace Slack mentioned that they made a JSPUI deduplication step that is open source: https://github.com/the-library-code/deduplication
|
||||
- It uses Levenshtein distance via PostgreSQL's fuzzystrmatch extension
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
||||
|
35
content/posts/2022-07.md
Normal file
35
content/posts/2022-07.md
Normal file
@ -0,0 +1,35 @@
|
||||
---
|
||||
title: "July, 2022"
|
||||
date: 2022-07-02T14:07:36+03:00
|
||||
author: "Alan Orth"
|
||||
categories: ["Notes"]
|
||||
---
|
||||
|
||||
## 2022-07-02
|
||||
|
||||
- I learned how to use the Levenshtein functions in PostgreSQL
|
||||
- The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing
|
||||
- Also, the trgm functions I've used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first
|
||||
|
||||
<!--more-->
|
||||
|
||||
- A working query checking for duplicates in the recent AfricaRice items is:
|
||||
|
||||
```console
|
||||
localhost/dspace= ☘ SELECT text_value FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=64 AND levenshtein_less_equal(LOWER('International Trade and Exotic Pests: The Risks for Biodiversity and African Economies'), LEFT(LOWER(text_value), 255), 3) <= 3;
|
||||
text_value
|
||||
────────────────────────────────────────────────────────────────────────────────────────
|
||||
International trade and exotic pests: the risks for biodiversity and African economies
|
||||
(1 row)
|
||||
|
||||
Time: 399.751 ms
|
||||
```
|
||||
|
||||
- There is a great [blog post discussing Soundex with Levenshtein](https://www.crunchydata.com/blog/fuzzy-name-matching-in-postgresql) and creating indexes to make them faster
|
||||
- I want to do some proper checks of accuracy and speed against my trigram method
|
||||
|
||||
## 2022-07-03
|
||||
|
||||
- Start a harvest on AReS
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
@ -242,6 +242,8 @@ db.statementpool = true
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -250,8 +252,6 @@ db.statementpool = true
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -264,6 +264,8 @@ $ curl -o /dev/null -s -w %{time_total}\\n https://cgspace.cgiar.org/rest/handle
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -272,8 +274,6 @@ $ curl -o /dev/null -s -w %{time_total}\\n https://cgspace.cgiar.org/rest/handle
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -200,6 +200,8 @@ $ find SimpleArchiveForBio/ -iname “*.pdf” -exec basename {} ; | sor
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -208,8 +210,6 @@ $ find SimpleArchiveForBio/ -iname “*.pdf” -exec basename {} ; | sor
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -378,6 +378,8 @@ Bitstream: tést señora alimentación.pdf
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -386,8 +388,6 @@ Bitstream: tést señora alimentación.pdf
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -316,6 +316,8 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -324,8 +326,6 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -495,6 +495,8 @@ dspace.log.2016-04-27:7271
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -503,8 +505,6 @@ dspace.log.2016-04-27:7271
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -371,6 +371,8 @@ sys 0m20.540s
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -379,8 +381,6 @@ sys 0m20.540s
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -409,6 +409,8 @@ $ ./delete-metadata-values.py -f dc.contributor.corporate -i Corporate-Authors-D
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -417,8 +419,6 @@ $ ./delete-metadata-values.py -f dc.contributor.corporate -i Corporate-Authors-D
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -325,6 +325,8 @@ discovery.index.authority.ignore-variants=true
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -333,8 +335,6 @@ discovery.index.authority.ignore-variants=true
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -389,6 +389,8 @@ $ JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx512m" /home/cgspace.cgiar.org/bin
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -397,8 +399,6 @@ $ JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx512m" /home/cgspace.cgiar.org/bin
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -606,6 +606,8 @@ $ ./delete-metadata-values.py -i ilrisubjects-delete-13.csv -f cg.subject.ilri -
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -614,8 +616,6 @@ $ ./delete-metadata-values.py -i ilrisubjects-delete-13.csv -f cg.subject.ilri -
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -372,6 +372,8 @@ dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'h
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -380,8 +382,6 @@ dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'h
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -548,6 +548,8 @@ org.dspace.discovery.SearchServiceException: Error executing query
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -556,8 +558,6 @@ org.dspace.discovery.SearchServiceException: Error executing query
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -784,6 +784,8 @@ $ exit
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -792,8 +794,6 @@ $ exit
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -369,6 +369,8 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -377,8 +379,6 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -423,6 +423,8 @@ COPY 1968
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -431,8 +433,6 @@ COPY 1968
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -355,6 +355,8 @@ $ ./delete-metadata-values.py -i Investors-Delete-121.csv -f dc.description.spon
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -363,8 +365,6 @@ $ ./delete-metadata-values.py -i Investors-Delete-121.csv -f dc.description.spon
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -585,6 +585,8 @@ $ gem install compass -v 1.0.3
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -593,8 +595,6 @@ $ gem install compass -v 1.0.3
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -391,6 +391,8 @@ UPDATE 187
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -399,8 +401,6 @@ UPDATE 187
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -270,6 +270,8 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -278,8 +280,6 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -275,6 +275,8 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -283,8 +285,6 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -517,6 +517,8 @@ org.apache.commons.dbcp.SQLNestedException: Cannot get a connection, pool error
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -525,8 +527,6 @@ org.apache.commons.dbcp.SQLNestedException: Cannot get a connection, pool error
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -659,6 +659,8 @@ Cert Status: good
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -667,8 +669,6 @@ Cert Status: good
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -443,6 +443,8 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -451,8 +453,6 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -944,6 +944,8 @@ $ cat dspace.log.2017-11-28 | grep -o -E 'session_id=[A-Z0-9]{32}' | sor
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -952,8 +954,6 @@ $ cat dspace.log.2017-11-28 | grep -o -E 'session_id=[A-Z0-9]{32}' | sor
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -783,6 +783,8 @@ DELETE 20
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -791,8 +793,6 @@ DELETE 20
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1452,6 +1452,8 @@ Catalina:type=Manager,context=/,host=localhost activeSessions 8
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1460,8 +1462,6 @@ Catalina:type=Manager,context=/,host=localhost activeSessions 8
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1038,6 +1038,8 @@ UPDATE 3
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1046,8 +1048,6 @@ UPDATE 3
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -585,6 +585,8 @@ Fixed 5 occurences of: GENEBANKS
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -593,8 +595,6 @@ Fixed 5 occurences of: GENEBANKS
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -594,6 +594,8 @@ $ pg_restore -O -U dspacetest -d dspacetest -W -h localhost /tmp/dspace_2018-04-
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -602,8 +604,6 @@ $ pg_restore -O -U dspacetest -d dspacetest -W -h localhost /tmp/dspace_2018-04-
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -523,6 +523,8 @@ $ psql -h localhost -U postgres dspacetest
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -531,8 +533,6 @@ $ psql -h localhost -U postgres dspacetest
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -517,6 +517,8 @@ $ sed '/^id/d' 10568-*.csv | csvcut -c 1,2 > map-to-cifor-archive.csv
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -525,8 +527,6 @@ $ sed '/^id/d' 10568-*.csv | csvcut -c 1,2 > map-to-cifor-archive.csv
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -569,6 +569,8 @@ dspace=# select count(text_value) from metadatavalue where resource_type_id=2 an
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -577,8 +579,6 @@ dspace=# select count(text_value) from metadatavalue where resource_type_id=2 an
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -442,6 +442,8 @@ $ dspace database migrate ignored
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -450,8 +452,6 @@ $ dspace database migrate ignored
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -748,6 +748,8 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND me
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -756,8 +758,6 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND me
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -656,6 +656,8 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: applic
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -664,8 +666,6 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: applic
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -553,6 +553,8 @@ $ dspace dsrun org.dspace.eperson.Groomer -a -b 11/27/2016 -d
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -561,8 +563,6 @@ $ dspace dsrun org.dspace.eperson.Groomer -a -b 11/27/2016 -d
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -594,6 +594,8 @@ UPDATE 1
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -602,8 +604,6 @@ UPDATE 1
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1264,6 +1264,8 @@ identify: CorruptImageProfile `xmp' @ warning/profile.c/SetImageProfileInter
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1272,8 +1274,6 @@ identify: CorruptImageProfile `xmp' @ warning/profile.c/SetImageProfileInter
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1344,6 +1344,8 @@ Please see the DSpace documentation for assistance.
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1352,8 +1354,6 @@ Please see the DSpace documentation for assistance.
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1208,6 +1208,8 @@ sys 0m2.551s
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1216,8 +1218,6 @@ sys 0m2.551s
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1299,6 +1299,8 @@ UPDATE 14
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1307,8 +1309,6 @@ UPDATE 14
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -631,6 +631,8 @@ COPY 64871
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -639,8 +641,6 @@ COPY 64871
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -317,6 +317,8 @@ UPDATE 2
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -325,8 +327,6 @@ UPDATE 2
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -554,6 +554,8 @@ issn.validate('1020-3362')
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -562,8 +564,6 @@ issn.validate('1020-3362')
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -573,6 +573,8 @@ sys 2m27.496s
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -581,8 +583,6 @@ sys 2m27.496s
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -581,6 +581,8 @@ $ csv-metadata-quality -i /tmp/clarisa-institutions.csv -o /tmp/clarisa-institut
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -589,8 +591,6 @@ $ csv-metadata-quality -i /tmp/clarisa-institutions.csv -o /tmp/clarisa-institut
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -385,6 +385,8 @@ $ dspace import -a -c 10568/104057 -e fuu@cgiar.org -m 2019-10-15-Bioversity.map
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -393,8 +395,6 @@ $ dspace import -a -c 10568/104057 -e fuu@cgiar.org -m 2019-10-15-Bioversity.map
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -692,6 +692,8 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -700,8 +702,6 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -404,6 +404,8 @@ UPDATE 1
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -412,8 +414,6 @@ UPDATE 1
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -604,6 +604,8 @@ COPY 2900
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -612,8 +614,6 @@ COPY 2900
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1275,6 +1275,8 @@ Moving: 21993 into core statistics-2019
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1283,8 +1285,6 @@ Moving: 21993 into core statistics-2019
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -484,6 +484,8 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -492,8 +494,6 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -658,6 +658,8 @@ $ psql -c 'select * from pg_stat_activity' | wc -l
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -666,8 +668,6 @@ $ psql -c 'select * from pg_stat_activity' | wc -l
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -477,6 +477,8 @@ Caused by: java.lang.NullPointerException
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -485,8 +487,6 @@ Caused by: java.lang.NullPointerException
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -811,6 +811,8 @@ $ csvcut -c 'id,cg.subject.ilri[],cg.subject.ilri[en_US],dc.subject[en_US]&#
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -819,8 +821,6 @@ $ csvcut -c 'id,cg.subject.ilri[],cg.subject.ilri[en_US],dc.subject[en_US]&#
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1142,6 +1142,8 @@ Fixed 4 occurences of: Muloi, D.M.
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1150,8 +1152,6 @@ Fixed 4 occurences of: Muloi, D.M.
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -798,6 +798,8 @@ $ grep -c added /tmp/2020-08-27-countrycodetagger.log
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -806,8 +808,6 @@ $ grep -c added /tmp/2020-08-27-countrycodetagger.log
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -717,6 +717,8 @@ solr_query_params = {
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -725,8 +727,6 @@ solr_query_params = {
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1241,6 +1241,8 @@ $ ./delete-metadata-values.py -i 2020-10-31-delete-74-sponsors.csv -db dspace -u
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1249,8 +1251,6 @@ $ ./delete-metadata-values.py -i 2020-10-31-delete-74-sponsors.csv -db dspace -u
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -731,6 +731,8 @@ $ ./fix-metadata-values.py -i 2020-11-30-fix-hung-orcid.csv -db dspace63 -u dspa
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -739,8 +741,6 @@ $ ./fix-metadata-values.py -i 2020-11-30-fix-hung-orcid.csv -db dspace63 -u dspa
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -869,6 +869,8 @@ $ query-json '.items | length' /tmp/policy2.json
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -877,8 +879,6 @@ $ query-json '.items | length' /tmp/policy2.json
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -688,6 +688,8 @@ java.lang.IllegalArgumentException: Invalid character found in the request targe
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -696,8 +698,6 @@ java.lang.IllegalArgumentException: Invalid character found in the request targe
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -898,6 +898,8 @@ dspace=# UPDATE metadatavalue SET text_lang='en_US' WHERE dspace_object_
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -906,8 +908,6 @@ dspace=# UPDATE metadatavalue SET text_lang='en_US' WHERE dspace_object_
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -875,6 +875,8 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -883,8 +885,6 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -1042,6 +1042,8 @@ $ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisti
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -1050,8 +1052,6 @@ $ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisti
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -685,6 +685,8 @@ May 26, 02:57 UTC
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -693,8 +695,6 @@ May 26, 02:57 UTC
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -693,6 +693,8 @@ I simply started it and AReS was running again:
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -701,8 +703,6 @@ I simply started it and AReS was running again:
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -715,6 +715,8 @@ COPY 20994
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -723,8 +725,6 @@ COPY 20994
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -606,6 +606,8 @@ I decided to upgrade linode20 from Ubuntu 18.04 to 20.04
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -614,8 +616,6 @@ I decided to upgrade linode20 from Ubuntu 18.04 to 20.04
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -588,6 +588,8 @@ The syntax Moayad showed me last month doesn’t seem to honor the search qu
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -596,8 +598,6 @@ The syntax Moayad showed me last month doesn’t seem to honor the search qu
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -791,6 +791,8 @@ Try doing it in two imports. In first import, remove all authors. In second impo
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -799,8 +801,6 @@ Try doing it in two imports. In first import, remove all authors. In second impo
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -494,6 +494,8 @@ $ zstd statistics-2019.json
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -502,8 +504,6 @@ $ zstd statistics-2019.json
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -577,6 +577,8 @@ Total number of bot hits purged: 3679
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -585,8 +587,6 @@ Total number of bot hits purged: 3679
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -380,6 +380,8 @@ Start a full harvest on AReS
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -388,8 +390,6 @@ Start a full harvest on AReS
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -724,6 +724,8 @@ isNotNull(value.match('699'))
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -732,8 +734,6 @@ isNotNull(value.match('699'))
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -476,6 +476,8 @@ isNotNull(value.match('889'))
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -484,8 +486,6 @@ isNotNull(value.match('889'))
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -509,6 +509,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -517,8 +519,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -445,6 +445,8 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -453,8 +455,6 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -26,7 +26,7 @@ There seem to be many more of these:
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-06/" />
|
||||
<meta property="article:published_time" content="2022-06-06T09:01:36+03:00" />
|
||||
<meta property="article:modified_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="article:modified_time" content="2022-06-30T16:48:03+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -58,9 +58,9 @@ There seem to be many more of these:
|
||||
"@type": "BlogPosting",
|
||||
"headline": "June, 2022",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2022-06/",
|
||||
"wordCount": "1761",
|
||||
"wordCount": "1786",
|
||||
"datePublished": "2022-06-06T09:01:36+03:00",
|
||||
"dateModified": "2022-06-30T09:41:54+03:00",
|
||||
"dateModified": "2022-06-30T16:48:03+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -431,6 +431,11 @@ There seem to be many more of these:
|
||||
<li>There I started removing any term that was a variation of an existing AGROVOC term (like cowpea/cowpeas, policy/policies) or a compound concept</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>pnbecker on DSpace Slack mentioned that they made a JSPUI deduplication step that is open source: <a href="https://github.com/the-library-code/deduplication">https://github.com/the-library-code/deduplication</a>
|
||||
<ul>
|
||||
<li>It uses Levenshtein distance via PostgreSQL’s fuzzystrmatch extension</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<!-- raw HTML omitted -->
|
||||
|
||||
@ -453,6 +458,8 @@ There seem to be many more of these:
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -461,8 +468,6 @@ There seem to be many more of these:
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
222
docs/2022-07/index.html
Normal file
222
docs/2022-07/index.html
Normal file
@ -0,0 +1,222 @@
|
||||
<!DOCTYPE html>
|
||||
<html lang="en" >
|
||||
|
||||
<head>
|
||||
<meta charset="utf-8">
|
||||
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">
|
||||
|
||||
|
||||
<meta property="og:title" content="July, 2022" />
|
||||
<meta property="og:description" content="2022-07-02
|
||||
|
||||
I learned how to use the Levenshtein functions in PostgreSQL
|
||||
|
||||
The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing
|
||||
Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first
|
||||
|
||||
|
||||
" />
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-07/" />
|
||||
<meta property="article:published_time" content="2022-07-02T14:07:36+03:00" />
|
||||
<meta property="article:modified_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="July, 2022"/>
|
||||
<meta name="twitter:description" content="2022-07-02
|
||||
|
||||
I learned how to use the Levenshtein functions in PostgreSQL
|
||||
|
||||
The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing
|
||||
Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first
|
||||
|
||||
|
||||
"/>
|
||||
<meta name="generator" content="Hugo 0.101.0" />
|
||||
|
||||
|
||||
|
||||
<script type="application/ld+json">
|
||||
{
|
||||
"@context": "http://schema.org",
|
||||
"@type": "BlogPosting",
|
||||
"headline": "July, 2022",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2022-07/",
|
||||
"wordCount": "164",
|
||||
"datePublished": "2022-07-02T14:07:36+03:00",
|
||||
"dateModified": "2022-07-02T14:07:36+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"keywords": "Notes"
|
||||
}
|
||||
</script>
|
||||
|
||||
|
||||
|
||||
<link rel="canonical" href="https://alanorth.github.io/cgspace-notes/2022-07/">
|
||||
|
||||
<title>July, 2022 | CGSpace Notes</title>
|
||||
|
||||
|
||||
<!-- combined, minified CSS -->
|
||||
|
||||
<link href="https://alanorth.github.io/cgspace-notes/css/style.beb8012edc08ba10be012f079d618dc243812267efe62e11f22fe49618f976a4.css" rel="stylesheet" integrity="sha256-vrgBLtwIuhC+AS8HnWGNwkOBImfv5i4R8i/klhj5dqQ=" crossorigin="anonymous">
|
||||
|
||||
|
||||
<!-- minified Font Awesome for SVG icons -->
|
||||
|
||||
<script defer src="https://alanorth.github.io/cgspace-notes/js/fontawesome.min.f5072c55a0721857184db93a50561d7dc13975b4de2e19db7f81eb5f3fa57270.js" integrity="sha256-9QcsVaByGFcYTbk6UFYdfcE5dbTeLhnbf4HrXz+lcnA=" crossorigin="anonymous"></script>
|
||||
|
||||
<!-- RSS 2.0 feed -->
|
||||
|
||||
|
||||
|
||||
|
||||
</head>
|
||||
|
||||
<body>
|
||||
|
||||
|
||||
<div class="blog-masthead">
|
||||
<div class="container">
|
||||
<nav class="nav blog-nav">
|
||||
<a class="nav-link " href="https://alanorth.github.io/cgspace-notes/">Home</a>
|
||||
</nav>
|
||||
</div>
|
||||
</div>
|
||||
|
||||
|
||||
|
||||
|
||||
<header class="blog-header">
|
||||
<div class="container">
|
||||
<h1 class="blog-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/" rel="home">CGSpace Notes</a></h1>
|
||||
<p class="lead blog-description" dir="auto">Documenting day-to-day work on the <a href="https://cgspace.cgiar.org">CGSpace</a> repository.</p>
|
||||
</div>
|
||||
</header>
|
||||
|
||||
|
||||
|
||||
|
||||
<div class="container">
|
||||
<div class="row">
|
||||
<div class="col-sm-8 blog-main">
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-07/">July, 2022</a></h2>
|
||||
<p class="blog-post-meta">
|
||||
<time datetime="2022-07-02T14:07:36+03:00">Sat Jul 02, 2022</time>
|
||||
in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2022-07-02">2022-07-02</h2>
|
||||
<ul>
|
||||
<li>I learned how to use the Levenshtein functions in PostgreSQL
|
||||
<ul>
|
||||
<li>The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing</li>
|
||||
<li>Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<ul>
|
||||
<li>A working query checking for duplicates in the recent AfricaRice items is:</li>
|
||||
</ul>
|
||||
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>localhost/dspace= ☘ SELECT text_value FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=64 AND levenshtein_less_equal(LOWER('International Trade and Exotic Pests: The Risks for Biodiversity and African Economies'), LEFT(LOWER(text_value), 255), 3) <= 3;
|
||||
</span></span><span style="display:flex;"><span> text_value
|
||||
</span></span><span style="display:flex;"><span>────────────────────────────────────────────────────────────────────────────────────────
|
||||
</span></span><span style="display:flex;"><span> International trade and exotic pests: the risks for biodiversity and African economies
|
||||
</span></span><span style="display:flex;"><span>(1 row)
|
||||
</span></span><span style="display:flex;"><span><span style="color:#960050;background-color:#1e0010">
|
||||
</span></span></span><span style="display:flex;"><span><span style="color:#960050;background-color:#1e0010"></span>Time: 399.751 ms
|
||||
</span></span></code></pre></div><ul>
|
||||
<li>There is a great <a href="https://www.crunchydata.com/blog/fuzzy-name-matching-in-postgresql">blog post discussing Soundex with Levenshtein</a> and creating indexes to make them faster</li>
|
||||
<li>I want to do some proper checks of accuracy and speed against my trigram method</li>
|
||||
</ul>
|
||||
<h2 id="2022-07-03">2022-07-03</h2>
|
||||
<ul>
|
||||
<li>Start a harvest on AReS</li>
|
||||
</ul>
|
||||
<!-- raw HTML omitted -->
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
</div> <!-- /.blog-main -->
|
||||
|
||||
<aside class="col-sm-3 ml-auto blog-sidebar">
|
||||
|
||||
|
||||
|
||||
<section class="sidebar-module">
|
||||
<h4>Recent Posts</h4>
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-04/">April, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
||||
|
||||
|
||||
<section class="sidebar-module">
|
||||
<h4>Links</h4>
|
||||
<ol class="list-unstyled">
|
||||
|
||||
<li><a href="https://cgspace.cgiar.org">CGSpace</a></li>
|
||||
|
||||
<li><a href="https://dspacetest.cgiar.org">DSpace Test</a></li>
|
||||
|
||||
<li><a href="https://github.com/ilri/DSpace">CGSpace @ GitHub</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
</aside>
|
||||
|
||||
|
||||
</div> <!-- /.row -->
|
||||
</div> <!-- /.container -->
|
||||
|
||||
|
||||
|
||||
<footer class="blog-footer">
|
||||
<p dir="auto">
|
||||
|
||||
Blog template created by <a href="https://twitter.com/mdo">@mdo</a>, ported to Hugo by <a href='https://twitter.com/mralanorth'>@mralanorth</a>.
|
||||
|
||||
</p>
|
||||
<p>
|
||||
<a href="#">Back to top</a>
|
||||
</p>
|
||||
</footer>
|
||||
|
||||
|
||||
</body>
|
||||
|
||||
</html>
|
@ -95,6 +95,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -103,8 +105,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -84,7 +84,7 @@
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/categories/notes/">Notes</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2022-06-06T09:01:36+03:00">Mon Jun 06, 2022</time> by Alan Orth</p>
|
||||
<p class="blog-post-meta"><time datetime="2022-07-02T14:07:36+03:00">Sat Jul 02, 2022</time> by Alan Orth</p>
|
||||
</header>
|
||||
|
||||
<a href='https://alanorth.github.io/cgspace-notes/categories/notes/'>Read more →</a>
|
||||
@ -108,6 +108,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -116,8 +118,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -6,11 +6,11 @@
|
||||
<description>Recent content in Categories on CGSpace Notes</description>
|
||||
<generator>Hugo -- gohugo.io</generator>
|
||||
<language>en-us</language>
|
||||
<lastBuildDate>Mon, 06 Jun 2022 09:01:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/index.xml" rel="self" type="application/rss+xml" />
|
||||
<lastBuildDate>Sat, 02 Jul 2022 14:07:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/index.xml" rel="self" type="application/rss+xml" />
|
||||
<item>
|
||||
<title>Notes</title>
|
||||
<link>https://alanorth.github.io/cgspace-notes/categories/notes/</link>
|
||||
<pubDate>Mon, 06 Jun 2022 09:01:36 +0300</pubDate>
|
||||
<pubDate>Sat, 02 Jul 2022 14:07:36 +0300</pubDate>
|
||||
|
||||
<guid>https://alanorth.github.io/cgspace-notes/categories/notes/</guid>
|
||||
<description></description>
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -81,6 +81,32 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-07/">July, 2022</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2022-07-02T14:07:36+03:00">Sat Jul 02, 2022</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2022-07-02">2022-07-02</h2>
|
||||
<ul>
|
||||
<li>I learned how to use the Levenshtein functions in PostgreSQL
|
||||
<ul>
|
||||
<li>The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing</li>
|
||||
<li>Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2022-07/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-06/">June, 2022</a></h2>
|
||||
@ -334,39 +360,6 @@
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-09/">September, 2021</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2021-09-01T09:14:07+03:00">Wed Sep 01, 2021</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2021-09-02">2021-09-02</h2>
|
||||
<ul>
|
||||
<li>Troubleshooting the missing Altmetric scores on AReS
|
||||
<ul>
|
||||
<li>Turns out that I didn’t actually fix them last month because the check for <code>content.altmetric</code> still exists, and I can’t access the DOIs using <code>_h.source.DOI</code> for some reason</li>
|
||||
<li>I can access all other kinds of item metadata using the Elasticsearch label, but not DOI!!!</li>
|
||||
<li>I will change <code>DOI</code> to <code>tomato</code> in the repository setup and start a re-harvest… I need to see if this is some kind of reserved word or something…</li>
|
||||
<li>Even as <code>tomato</code> I can’t access that field as <code>_h.source.tomato</code> in Angular, but it does work as a filter source… sigh</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>I’m having problems using the OpenRXV API
|
||||
<ul>
|
||||
<li>The syntax Moayad showed me last month doesn’t seem to honor the search query properly…</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2021-09/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
|
||||
@ -391,6 +384,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -399,8 +394,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -6,7 +6,24 @@
|
||||
<description>Recent content in Notes on CGSpace Notes</description>
|
||||
<generator>Hugo -- gohugo.io</generator>
|
||||
<language>en-us</language>
|
||||
<lastBuildDate>Mon, 06 Jun 2022 09:01:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" rel="self" type="application/rss+xml" />
|
||||
<lastBuildDate>Sat, 02 Jul 2022 14:07:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" rel="self" type="application/rss+xml" />
|
||||
<item>
|
||||
<title>July, 2022</title>
|
||||
<link>https://alanorth.github.io/cgspace-notes/2022-07/</link>
|
||||
<pubDate>Sat, 02 Jul 2022 14:07:36 +0300</pubDate>
|
||||
|
||||
<guid>https://alanorth.github.io/cgspace-notes/2022-07/</guid>
|
||||
<description><h2 id="2022-07-02">2022-07-02</h2>
|
||||
<ul>
|
||||
<li>I learned how to use the Levenshtein functions in PostgreSQL
|
||||
<ul>
|
||||
<li>The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing</li>
|
||||
<li>Also, the trgm functions I&rsquo;ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul></description>
|
||||
</item>
|
||||
|
||||
<item>
|
||||
<title>June, 2022</title>
|
||||
<link>https://alanorth.github.io/cgspace-notes/2022-06/</link>
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -81,6 +81,39 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-09/">September, 2021</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2021-09-01T09:14:07+03:00">Wed Sep 01, 2021</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2021-09-02">2021-09-02</h2>
|
||||
<ul>
|
||||
<li>Troubleshooting the missing Altmetric scores on AReS
|
||||
<ul>
|
||||
<li>Turns out that I didn’t actually fix them last month because the check for <code>content.altmetric</code> still exists, and I can’t access the DOIs using <code>_h.source.DOI</code> for some reason</li>
|
||||
<li>I can access all other kinds of item metadata using the Elasticsearch label, but not DOI!!!</li>
|
||||
<li>I will change <code>DOI</code> to <code>tomato</code> in the repository setup and start a re-harvest… I need to see if this is some kind of reserved word or something…</li>
|
||||
<li>Even as <code>tomato</code> I can’t access that field as <code>_h.source.tomato</code> in Angular, but it does work as a filter source… sigh</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>I’m having problems using the OpenRXV API
|
||||
<ul>
|
||||
<li>The syntax Moayad showed me last month doesn’t seem to honor the search query properly…</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2021-09/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-08/">August, 2021</a></h2>
|
||||
@ -334,33 +367,6 @@
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-12/">December, 2020</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2020-12-01T11:32:54+02:00">Tue Dec 01, 2020</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2020-12-01">2020-12-01</h2>
|
||||
<ul>
|
||||
<li>Atmire responded about the issue with duplicate data in our Solr statistics
|
||||
<ul>
|
||||
<li>They noticed that some records in the statistics-2015 core haven’t been migrated with the AtomicStatisticsUpdateCLI tool yet and assumed that I haven’t migrated any of the records yet</li>
|
||||
<li>That’s strange, as I checked all ten cores and 2015 is the only one with some unmigrated documents, as according to the <code>cua_version</code> field</li>
|
||||
<li>I started processing those (about 411,000 records):</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2020-12/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/" rel="prev" role="button">Previous page</a>
|
||||
@ -385,6 +391,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -393,8 +401,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -81,6 +81,33 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-12/">December, 2020</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2020-12-01T11:32:54+02:00">Tue Dec 01, 2020</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2020-12-01">2020-12-01</h2>
|
||||
<ul>
|
||||
<li>Atmire responded about the issue with duplicate data in our Solr statistics
|
||||
<ul>
|
||||
<li>They noticed that some records in the statistics-2015 core haven’t been migrated with the AtomicStatisticsUpdateCLI tool yet and assumed that I haven’t migrated any of the records yet</li>
|
||||
<li>That’s strange, as I checked all ten cores and 2015 is the only one with some unmigrated documents, as according to the <code>cua_version</code> field</li>
|
||||
<li>I started processing those (about 411,000 records):</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2020-12/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/cgspace-dspace6-upgrade/">CGSpace DSpace 6 Upgrade</a></h2>
|
||||
@ -330,36 +357,6 @@
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-03/">March, 2020</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2020-03-02T12:31:30+02:00">Mon Mar 02, 2020</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2020-03-02">2020-03-02</h2>
|
||||
<ul>
|
||||
<li>Update <a href="https://github.com/ilri/dspace-statistics-api">dspace-statistics-api</a> for DSpace 6+ UUIDs
|
||||
<ul>
|
||||
<li>Tag version 1.2.0 on GitHub</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>Test migrating legacy Solr statistics to UUIDs with the as-of-yet unreleased <a href="https://github.com/DSpace/DSpace/commit/184f2b2153479045fba6239342c63e7f8564b8b6#diff-0350ce2e13b28d5d61252b7a8f50a059">SolrUpgradePre6xStatistics.java</a>
|
||||
<ul>
|
||||
<li>You need to download this into the DSpace 6.x source and compile it</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2020-03/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/2/" rel="prev" role="button">Previous page</a>
|
||||
@ -384,6 +381,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -392,8 +391,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -81,6 +81,36 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-03/">March, 2020</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2020-03-02T12:31:30+02:00">Mon Mar 02, 2020</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2020-03-02">2020-03-02</h2>
|
||||
<ul>
|
||||
<li>Update <a href="https://github.com/ilri/dspace-statistics-api">dspace-statistics-api</a> for DSpace 6+ UUIDs
|
||||
<ul>
|
||||
<li>Tag version 1.2.0 on GitHub</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>Test migrating legacy Solr statistics to UUIDs with the as-of-yet unreleased <a href="https://github.com/DSpace/DSpace/commit/184f2b2153479045fba6239342c63e7f8564b8b6#diff-0350ce2e13b28d5d61252b7a8f50a059">SolrUpgradePre6xStatistics.java</a>
|
||||
<ul>
|
||||
<li>You need to download this into the DSpace 6.x source and compile it</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2020-03/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-02/">February, 2020</a></h2>
|
||||
@ -364,39 +394,6 @@
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-05/">May, 2019</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2019-05-01T07:37:43+03:00">Wed May 01, 2019</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2019-05-01">2019-05-01</h2>
|
||||
<ul>
|
||||
<li>Help CCAFS with regenerating some item thumbnails after they uploaded new PDFs to some items on CGSpace</li>
|
||||
<li>A user on the dspace-tech mailing list offered some suggestions for troubleshooting the problem with the inability to delete certain items
|
||||
<ul>
|
||||
<li>Apparently if the item is in the <code>workflowitem</code> table it is submitted to a workflow</li>
|
||||
<li>And if it is in the <code>workspaceitem</code> table it is in the pre-submitted state</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>The item seems to be in a pre-submitted state, so I tried to delete it from there:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code>dspace=# DELETE FROM workspaceitem WHERE item_id=74648;
|
||||
DELETE 1
|
||||
</code></pre><ul>
|
||||
<li>But after this I tried to delete the item from the XMLUI and it is <em>still</em> present…</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2019-05/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/3/" rel="prev" role="button">Previous page</a>
|
||||
@ -421,6 +418,8 @@ DELETE 1
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -429,8 +428,6 @@ DELETE 1
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -81,6 +81,39 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-05/">May, 2019</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2019-05-01T07:37:43+03:00">Wed May 01, 2019</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2019-05-01">2019-05-01</h2>
|
||||
<ul>
|
||||
<li>Help CCAFS with regenerating some item thumbnails after they uploaded new PDFs to some items on CGSpace</li>
|
||||
<li>A user on the dspace-tech mailing list offered some suggestions for troubleshooting the problem with the inability to delete certain items
|
||||
<ul>
|
||||
<li>Apparently if the item is in the <code>workflowitem</code> table it is submitted to a workflow</li>
|
||||
<li>And if it is in the <code>workspaceitem</code> table it is in the pre-submitted state</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>The item seems to be in a pre-submitted state, so I tried to delete it from there:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code>dspace=# DELETE FROM workspaceitem WHERE item_id=74648;
|
||||
DELETE 1
|
||||
</code></pre><ul>
|
||||
<li>But after this I tried to delete the item from the XMLUI and it is <em>still</em> present…</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2019-05/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-04/">April, 2019</a></h2>
|
||||
@ -364,33 +397,6 @@ sys 0m1.979s
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-07/">July, 2018</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2018-07-01T12:56:54+03:00">Sun Jul 01, 2018</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2018-07-01">2018-07-01</h2>
|
||||
<ul>
|
||||
<li>I want to upgrade DSpace Test to DSpace 5.8 so I took a backup of its current database just in case:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code>$ pg_dump -b -v -o --format=custom -U dspace -f dspace-2018-07-01.backup dspace
|
||||
</code></pre><ul>
|
||||
<li>During the <code>mvn package</code> stage on the 5.8 branch I kept getting issues with java running out of memory:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code>There is insufficient memory for the Java Runtime Environment to continue.
|
||||
</code></pre>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2018-07/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/4/" rel="prev" role="button">Previous page</a>
|
||||
@ -415,6 +421,8 @@ sys 0m1.979s
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -423,8 +431,6 @@ sys 0m1.979s
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -81,6 +81,33 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-07/">July, 2018</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2018-07-01T12:56:54+03:00">Sun Jul 01, 2018</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2018-07-01">2018-07-01</h2>
|
||||
<ul>
|
||||
<li>I want to upgrade DSpace Test to DSpace 5.8 so I took a backup of its current database just in case:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code>$ pg_dump -b -v -o --format=custom -U dspace -f dspace-2018-07-01.backup dspace
|
||||
</code></pre><ul>
|
||||
<li>During the <code>mvn package</code> stage on the 5.8 branch I kept getting issues with java running out of memory:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code>There is insufficient memory for the Java Runtime Environment to continue.
|
||||
</code></pre>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2018-07/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-06/">June, 2018</a></h2>
|
||||
@ -380,31 +407,11 @@ COPY 54701
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/cgiar-library-migration/">CGIAR Library Migration</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2017-09-18T16:38:35+03:00">Mon Sep 18, 2017</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
<span class="fas fa-tag" aria-hidden="true"></span> <a href="/tags/migration/" rel="tag">Migration</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<p>Rough notes for importing the CGIAR Library content. It was decided that this content would go to a new top-level community called <em>CGIAR System Organization</em>.</p>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/cgiar-library-migration/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/5/" rel="prev" role="button">Previous page</a>
|
||||
<a class="btn btn-outline-primary disabled" href="#" role="button" aria-disabled="true">Next page</a>
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/7/" rel="next" role="button">Next page</a>
|
||||
|
||||
|
||||
|
||||
</nav>
|
||||
@ -424,6 +431,8 @@ COPY 54701
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -432,8 +441,6 @@ COPY 54701
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
178
docs/categories/notes/page/7/index.html
Normal file
178
docs/categories/notes/page/7/index.html
Normal file
@ -0,0 +1,178 @@
|
||||
<!DOCTYPE html>
|
||||
<html lang="en" >
|
||||
|
||||
<head>
|
||||
<meta charset="utf-8">
|
||||
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">
|
||||
|
||||
|
||||
<meta property="og:title" content="Notes" />
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Notes"/>
|
||||
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
|
||||
<meta name="generator" content="Hugo 0.101.0" />
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<link rel="canonical" href="https://alanorth.github.io/cgspace-notes/categories/notes/">
|
||||
|
||||
<title>CGSpace Notes</title>
|
||||
|
||||
|
||||
<!-- combined, minified CSS -->
|
||||
|
||||
<link href="https://alanorth.github.io/cgspace-notes/css/style.beb8012edc08ba10be012f079d618dc243812267efe62e11f22fe49618f976a4.css" rel="stylesheet" integrity="sha256-vrgBLtwIuhC+AS8HnWGNwkOBImfv5i4R8i/klhj5dqQ=" crossorigin="anonymous">
|
||||
|
||||
|
||||
<!-- minified Font Awesome for SVG icons -->
|
||||
|
||||
<script defer src="https://alanorth.github.io/cgspace-notes/js/fontawesome.min.f5072c55a0721857184db93a50561d7dc13975b4de2e19db7f81eb5f3fa57270.js" integrity="sha256-9QcsVaByGFcYTbk6UFYdfcE5dbTeLhnbf4HrXz+lcnA=" crossorigin="anonymous"></script>
|
||||
|
||||
<!-- RSS 2.0 feed -->
|
||||
<link rel="alternate" type="application/rss+xml" href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" title="CGSpace Notes" />
|
||||
|
||||
|
||||
|
||||
|
||||
</head>
|
||||
|
||||
<body>
|
||||
|
||||
|
||||
<div class="blog-masthead">
|
||||
<div class="container">
|
||||
<nav class="nav blog-nav">
|
||||
<a class="nav-link " href="https://alanorth.github.io/cgspace-notes/">Home</a>
|
||||
</nav>
|
||||
</div>
|
||||
</div>
|
||||
|
||||
|
||||
|
||||
|
||||
<header class="blog-header">
|
||||
<div class="container">
|
||||
<h1 class="blog-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/" rel="home">CGSpace Notes</a></h1>
|
||||
<p class="lead blog-description" dir="auto">Documenting day-to-day work on the <a href="https://cgspace.cgiar.org">CGSpace</a> repository.</p>
|
||||
</div>
|
||||
</header>
|
||||
|
||||
|
||||
|
||||
|
||||
<div class="container">
|
||||
<div class="row">
|
||||
<div class="col-sm-8 blog-main">
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/cgiar-library-migration/">CGIAR Library Migration</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2017-09-18T16:38:35+03:00">Mon Sep 18, 2017</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
<span class="fas fa-tag" aria-hidden="true"></span> <a href="/tags/migration/" rel="tag">Migration</a>
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<p>Rough notes for importing the CGIAR Library content. It was decided that this content would go to a new top-level community called <em>CGIAR System Organization</em>.</p>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/cgiar-library-migration/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/6/" rel="prev" role="button">Previous page</a>
|
||||
<a class="btn btn-outline-primary disabled" href="#" role="button" aria-disabled="true">Next page</a>
|
||||
|
||||
|
||||
</nav>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
</div> <!-- /.blog-main -->
|
||||
|
||||
<aside class="col-sm-3 ml-auto blog-sidebar">
|
||||
|
||||
|
||||
|
||||
<section class="sidebar-module">
|
||||
<h4>Recent Posts</h4>
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-04/">April, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
||||
|
||||
|
||||
<section class="sidebar-module">
|
||||
<h4>Links</h4>
|
||||
<ol class="list-unstyled">
|
||||
|
||||
<li><a href="https://cgspace.cgiar.org">CGSpace</a></li>
|
||||
|
||||
<li><a href="https://dspacetest.cgiar.org">DSpace Test</a></li>
|
||||
|
||||
<li><a href="https://github.com/ilri/DSpace">CGSpace @ GitHub</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
</aside>
|
||||
|
||||
|
||||
</div> <!-- /.row -->
|
||||
</div> <!-- /.container -->
|
||||
|
||||
|
||||
|
||||
<footer class="blog-footer">
|
||||
<p dir="auto">
|
||||
|
||||
Blog template created by <a href="https://twitter.com/mdo">@mdo</a>, ported to Hugo by <a href='https://twitter.com/mralanorth'>@mralanorth</a>.
|
||||
|
||||
</p>
|
||||
<p>
|
||||
<a href="#">Back to top</a>
|
||||
</p>
|
||||
</footer>
|
||||
|
||||
|
||||
</body>
|
||||
|
||||
</html>
|
@ -282,6 +282,8 @@ dspace=# select setval('handle_seq',86873);
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -290,8 +292,6 @@ dspace=# select setval('handle_seq',86873);
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -467,6 +467,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -475,8 +477,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -471,6 +471,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -479,8 +481,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -31,7 +31,7 @@
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"dateModified": "2022-06-06T09:01:36+03:00",
|
||||
"dateModified": "2022-07-02T14:07:36+03:00",
|
||||
"keywords": "notes, migration, notes",
|
||||
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
|
||||
}
|
||||
@ -96,6 +96,32 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-07/">July, 2022</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2022-07-02T14:07:36+03:00">Sat Jul 02, 2022</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2022-07-02">2022-07-02</h2>
|
||||
<ul>
|
||||
<li>I learned how to use the Levenshtein functions in PostgreSQL
|
||||
<ul>
|
||||
<li>The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing</li>
|
||||
<li>Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2022-07/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-06/">June, 2022</a></h2>
|
||||
@ -349,39 +375,6 @@
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-09/">September, 2021</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2021-09-01T09:14:07+03:00">Wed Sep 01, 2021</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2021-09-02">2021-09-02</h2>
|
||||
<ul>
|
||||
<li>Troubleshooting the missing Altmetric scores on AReS
|
||||
<ul>
|
||||
<li>Turns out that I didn’t actually fix them last month because the check for <code>content.altmetric</code> still exists, and I can’t access the DOIs using <code>_h.source.DOI</code> for some reason</li>
|
||||
<li>I can access all other kinds of item metadata using the Elasticsearch label, but not DOI!!!</li>
|
||||
<li>I will change <code>DOI</code> to <code>tomato</code> in the repository setup and start a re-harvest… I need to see if this is some kind of reserved word or something…</li>
|
||||
<li>Even as <code>tomato</code> I can’t access that field as <code>_h.source.tomato</code> in Angular, but it does work as a filter source… sigh</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>I’m having problems using the OpenRXV API
|
||||
<ul>
|
||||
<li>The syntax Moayad showed me last month doesn’t seem to honor the search query properly…</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2021-09/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
|
||||
@ -406,6 +399,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -414,8 +409,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -6,7 +6,24 @@
|
||||
<description>Recent content on CGSpace Notes</description>
|
||||
<generator>Hugo -- gohugo.io</generator>
|
||||
<language>en-us</language>
|
||||
<lastBuildDate>Mon, 06 Jun 2022 09:01:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/index.xml" rel="self" type="application/rss+xml" />
|
||||
<lastBuildDate>Sat, 02 Jul 2022 14:07:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/index.xml" rel="self" type="application/rss+xml" />
|
||||
<item>
|
||||
<title>July, 2022</title>
|
||||
<link>https://alanorth.github.io/cgspace-notes/2022-07/</link>
|
||||
<pubDate>Sat, 02 Jul 2022 14:07:36 +0300</pubDate>
|
||||
|
||||
<guid>https://alanorth.github.io/cgspace-notes/2022-07/</guid>
|
||||
<description><h2 id="2022-07-02">2022-07-02</h2>
|
||||
<ul>
|
||||
<li>I learned how to use the Levenshtein functions in PostgreSQL
|
||||
<ul>
|
||||
<li>The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing</li>
|
||||
<li>Also, the trgm functions I&rsquo;ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul></description>
|
||||
</item>
|
||||
|
||||
<item>
|
||||
<title>June, 2022</title>
|
||||
<link>https://alanorth.github.io/cgspace-notes/2022-06/</link>
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2022-06-30T09:41:54+03:00" />
|
||||
<meta property="og:updated_time" content="2022-07-02T14:07:36+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -31,7 +31,7 @@
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
},
|
||||
"dateModified": "2022-06-06T09:01:36+03:00",
|
||||
"dateModified": "2022-07-02T14:07:36+03:00",
|
||||
"keywords": "notes, migration, notes",
|
||||
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
|
||||
}
|
||||
@ -96,6 +96,39 @@
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-09/">September, 2021</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2021-09-01T09:14:07+03:00">Wed Sep 01, 2021</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2021-09-02">2021-09-02</h2>
|
||||
<ul>
|
||||
<li>Troubleshooting the missing Altmetric scores on AReS
|
||||
<ul>
|
||||
<li>Turns out that I didn’t actually fix them last month because the check for <code>content.altmetric</code> still exists, and I can’t access the DOIs using <code>_h.source.DOI</code> for some reason</li>
|
||||
<li>I can access all other kinds of item metadata using the Elasticsearch label, but not DOI!!!</li>
|
||||
<li>I will change <code>DOI</code> to <code>tomato</code> in the repository setup and start a re-harvest… I need to see if this is some kind of reserved word or something…</li>
|
||||
<li>Even as <code>tomato</code> I can’t access that field as <code>_h.source.tomato</code> in Angular, but it does work as a filter source… sigh</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>I’m having problems using the OpenRXV API
|
||||
<ul>
|
||||
<li>The syntax Moayad showed me last month doesn’t seem to honor the search query properly…</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2021-09/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-08/">August, 2021</a></h2>
|
||||
@ -349,33 +382,6 @@
|
||||
|
||||
|
||||
|
||||
|
||||
<article class="blog-post">
|
||||
<header>
|
||||
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-12/">December, 2020</a></h2>
|
||||
<p class="blog-post-meta"><time datetime="2020-12-01T11:32:54+02:00">Tue Dec 01, 2020</time> by Alan Orth in
|
||||
<span class="fas fa-folder" aria-hidden="true"></span> <a href="/categories/notes/" rel="category tag">Notes</a>
|
||||
|
||||
|
||||
</p>
|
||||
</header>
|
||||
<h2 id="2020-12-01">2020-12-01</h2>
|
||||
<ul>
|
||||
<li>Atmire responded about the issue with duplicate data in our Solr statistics
|
||||
<ul>
|
||||
<li>They noticed that some records in the statistics-2015 core haven’t been migrated with the AtomicStatisticsUpdateCLI tool yet and assumed that I haven’t migrated any of the records yet</li>
|
||||
<li>That’s strange, as I checked all ten cores and 2015 is the only one with some unmigrated documents, as according to the <code>cua_version</code> field</li>
|
||||
<li>I started processing those (about 411,000 records):</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<a href='https://alanorth.github.io/cgspace-notes/2020-12/'>Read more →</a>
|
||||
</article>
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
<nav class="blog-pagination">
|
||||
|
||||
<a class="btn btn-outline-primary" href="/cgspace-notes/" rel="prev" role="button">Previous page</a>
|
||||
@ -400,6 +406,8 @@
|
||||
<ol class="list-unstyled">
|
||||
|
||||
|
||||
<li><a href="/cgspace-notes/2022-07/">July, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-06/">June, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-05/">May, 2022</a></li>
|
||||
@ -408,8 +416,6 @@
|
||||
|
||||
<li><a href="/cgspace-notes/2022-03/">March, 2022</a></li>
|
||||
|
||||
<li><a href="/cgspace-notes/2022-02/">February, 2022</a></li>
|
||||
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
Some files were not shown because too many files have changed in this diff Show More
Loading…
Reference in New Issue
Block a user