diff --git a/content/posts/2022-06.md b/content/posts/2022-06.md index d09c70f5e..087788205 100644 --- a/content/posts/2022-06.md +++ b/content/posts/2022-06.md @@ -284,5 +284,7 @@ $ csvgrep -c 'number of matches' -r '^0$' /tmp/2022-06-30-cgspace-subjects-resul - Then I took all the terms with fifty or more occurences and put them on a Google Sheet - There I started removing any term that was a variation of an existing AGROVOC term (like cowpea/cowpeas, policy/policies) or a compound concept +- pnbecker on DSpace Slack mentioned that they made a JSPUI deduplication step that is open source: https://github.com/the-library-code/deduplication + - It uses Levenshtein distance via PostgreSQL's fuzzystrmatch extension diff --git a/content/posts/2022-07.md b/content/posts/2022-07.md new file mode 100644 index 000000000..14e7568eb --- /dev/null +++ b/content/posts/2022-07.md @@ -0,0 +1,35 @@ +--- +title: "July, 2022" +date: 2022-07-02T14:07:36+03:00 +author: "Alan Orth" +categories: ["Notes"] +--- + +## 2022-07-02 + +- I learned how to use the Levenshtein functions in PostgreSQL + - The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing + - Also, the trgm functions I've used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first + + + +- A working query checking for duplicates in the recent AfricaRice items is: + +```console +localhost/dspace= ☘ SELECT text_value FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=64 AND levenshtein_less_equal(LOWER('International Trade and Exotic Pests: The Risks for Biodiversity and African Economies'), LEFT(LOWER(text_value), 255), 3) <= 3; + text_value +──────────────────────────────────────────────────────────────────────────────────────── + International trade and exotic pests: the risks for biodiversity and African economies +(1 row) + +Time: 399.751 ms +``` + +- There is a great [blog post discussing Soundex with Levenshtein](https://www.crunchydata.com/blog/fuzzy-name-matching-in-postgresql) and creating indexes to make them faster +- I want to do some proper checks of accuracy and speed against my trigram method + +## 2022-07-03 + +- Start a harvest on AReS + + diff --git a/docs/2015-11/index.html b/docs/2015-11/index.html index 3c77c3494..e4b8b830a 100644 --- a/docs/2015-11/index.html +++ b/docs/2015-11/index.html @@ -242,6 +242,8 @@ db.statementpool = true
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -250,8 +252,6 @@ db.statementpool = true
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2015-12/index.html b/docs/2015-12/index.html index 555c09285..592b081a0 100644 --- a/docs/2015-12/index.html +++ b/docs/2015-12/index.html @@ -264,6 +264,8 @@ $ curl -o /dev/null -s -w %{time_total}\\n https://cgspace.cgiar.org/rest/handle
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -272,8 +274,6 @@ $ curl -o /dev/null -s -w %{time_total}\\n https://cgspace.cgiar.org/rest/handle
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-01/index.html b/docs/2016-01/index.html index f0abe7b90..6b96c1796 100644 --- a/docs/2016-01/index.html +++ b/docs/2016-01/index.html @@ -200,6 +200,8 @@ $ find SimpleArchiveForBio/ -iname “*.pdf” -exec basename {} ; | sor
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -208,8 +210,6 @@ $ find SimpleArchiveForBio/ -iname “*.pdf” -exec basename {} ; | sor
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-02/index.html b/docs/2016-02/index.html index 217db062d..cd054813f 100644 --- a/docs/2016-02/index.html +++ b/docs/2016-02/index.html @@ -378,6 +378,8 @@ Bitstream: tést señora alimentación.pdf
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -386,8 +388,6 @@ Bitstream: tést señora alimentación.pdf
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-03/index.html b/docs/2016-03/index.html index 590c79eee..dd639e43d 100644 --- a/docs/2016-03/index.html +++ b/docs/2016-03/index.html @@ -316,6 +316,8 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -324,8 +326,6 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-04/index.html b/docs/2016-04/index.html index 3573bcfee..789f7af3b 100644 --- a/docs/2016-04/index.html +++ b/docs/2016-04/index.html @@ -495,6 +495,8 @@ dspace.log.2016-04-27:7271
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -503,8 +505,6 @@ dspace.log.2016-04-27:7271
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-05/index.html b/docs/2016-05/index.html index d719ee7e5..3728e4887 100644 --- a/docs/2016-05/index.html +++ b/docs/2016-05/index.html @@ -371,6 +371,8 @@ sys 0m20.540s
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -379,8 +381,6 @@ sys 0m20.540s
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-06/index.html b/docs/2016-06/index.html index 8c24e4d9d..6af220288 100644 --- a/docs/2016-06/index.html +++ b/docs/2016-06/index.html @@ -409,6 +409,8 @@ $ ./delete-metadata-values.py -f dc.contributor.corporate -i Corporate-Authors-D
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -417,8 +419,6 @@ $ ./delete-metadata-values.py -f dc.contributor.corporate -i Corporate-Authors-D
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-07/index.html b/docs/2016-07/index.html index 924e92a70..5a9bf66cd 100644 --- a/docs/2016-07/index.html +++ b/docs/2016-07/index.html @@ -325,6 +325,8 @@ discovery.index.authority.ignore-variants=true
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -333,8 +335,6 @@ discovery.index.authority.ignore-variants=true
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-08/index.html b/docs/2016-08/index.html index 54b79e5b8..6cfce356b 100644 --- a/docs/2016-08/index.html +++ b/docs/2016-08/index.html @@ -389,6 +389,8 @@ $ JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx512m" /home/cgspace.cgiar.org/bin
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -397,8 +399,6 @@ $ JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx512m" /home/cgspace.cgiar.org/bin
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-09/index.html b/docs/2016-09/index.html index 387c73dce..7f1b23869 100644 --- a/docs/2016-09/index.html +++ b/docs/2016-09/index.html @@ -606,6 +606,8 @@ $ ./delete-metadata-values.py -i ilrisubjects-delete-13.csv -f cg.subject.ilri -
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -614,8 +616,6 @@ $ ./delete-metadata-values.py -i ilrisubjects-delete-13.csv -f cg.subject.ilri -
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-10/index.html b/docs/2016-10/index.html index 07076b556..5bbfeb2a6 100644 --- a/docs/2016-10/index.html +++ b/docs/2016-10/index.html @@ -372,6 +372,8 @@ dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'h
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -380,8 +382,6 @@ dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'h
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-11/index.html b/docs/2016-11/index.html index 2ea568bd4..ce68953bc 100644 --- a/docs/2016-11/index.html +++ b/docs/2016-11/index.html @@ -548,6 +548,8 @@ org.dspace.discovery.SearchServiceException: Error executing query
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -556,8 +558,6 @@ org.dspace.discovery.SearchServiceException: Error executing query
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2016-12/index.html b/docs/2016-12/index.html index 8c0286974..3e8b6946d 100644 --- a/docs/2016-12/index.html +++ b/docs/2016-12/index.html @@ -784,6 +784,8 @@ $ exit
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -792,8 +794,6 @@ $ exit
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-01/index.html b/docs/2017-01/index.html index 3a16ced7d..6de34e8b6 100644 --- a/docs/2017-01/index.html +++ b/docs/2017-01/index.html @@ -369,6 +369,8 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -377,8 +379,6 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-02/index.html b/docs/2017-02/index.html index 42b4aaac5..4af52ffec 100644 --- a/docs/2017-02/index.html +++ b/docs/2017-02/index.html @@ -423,6 +423,8 @@ COPY 1968
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -431,8 +433,6 @@ COPY 1968
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-03/index.html b/docs/2017-03/index.html index 3399e1953..8d68bee17 100644 --- a/docs/2017-03/index.html +++ b/docs/2017-03/index.html @@ -355,6 +355,8 @@ $ ./delete-metadata-values.py -i Investors-Delete-121.csv -f dc.description.spon
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -363,8 +365,6 @@ $ ./delete-metadata-values.py -i Investors-Delete-121.csv -f dc.description.spon
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-04/index.html b/docs/2017-04/index.html index 11a13a031..974805c17 100644 --- a/docs/2017-04/index.html +++ b/docs/2017-04/index.html @@ -585,6 +585,8 @@ $ gem install compass -v 1.0.3
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -593,8 +595,6 @@ $ gem install compass -v 1.0.3
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-05/index.html b/docs/2017-05/index.html index 722ffc64a..49a413737 100644 --- a/docs/2017-05/index.html +++ b/docs/2017-05/index.html @@ -391,6 +391,8 @@ UPDATE 187
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -399,8 +401,6 @@ UPDATE 187
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-06/index.html b/docs/2017-06/index.html index 9ab03cd57..1cd9c0133 100644 --- a/docs/2017-06/index.html +++ b/docs/2017-06/index.html @@ -270,6 +270,8 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -278,8 +280,6 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-07/index.html b/docs/2017-07/index.html index d13f34b72..c675070cc 100644 --- a/docs/2017-07/index.html +++ b/docs/2017-07/index.html @@ -275,6 +275,8 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -283,8 +285,6 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-08/index.html b/docs/2017-08/index.html index 1014b1966..56e5e8293 100644 --- a/docs/2017-08/index.html +++ b/docs/2017-08/index.html @@ -517,6 +517,8 @@ org.apache.commons.dbcp.SQLNestedException: Cannot get a connection, pool error
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -525,8 +527,6 @@ org.apache.commons.dbcp.SQLNestedException: Cannot get a connection, pool error
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-09/index.html b/docs/2017-09/index.html index abff4642d..3901e578d 100644 --- a/docs/2017-09/index.html +++ b/docs/2017-09/index.html @@ -659,6 +659,8 @@ Cert Status: good
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -667,8 +669,6 @@ Cert Status: good
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-10/index.html b/docs/2017-10/index.html index b62841d62..013889988 100644 --- a/docs/2017-10/index.html +++ b/docs/2017-10/index.html @@ -443,6 +443,8 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -451,8 +453,6 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-11/index.html b/docs/2017-11/index.html index 753bdd8c1..8da4b2e1c 100644 --- a/docs/2017-11/index.html +++ b/docs/2017-11/index.html @@ -944,6 +944,8 @@ $ cat dspace.log.2017-11-28 | grep -o -E 'session_id=[A-Z0-9]{32}' | sor
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -952,8 +954,6 @@ $ cat dspace.log.2017-11-28 | grep -o -E 'session_id=[A-Z0-9]{32}' | sor
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2017-12/index.html b/docs/2017-12/index.html index c98338ec7..5e6a91636 100644 --- a/docs/2017-12/index.html +++ b/docs/2017-12/index.html @@ -783,6 +783,8 @@ DELETE 20
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -791,8 +793,6 @@ DELETE 20
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-01/index.html b/docs/2018-01/index.html index c754bc1c9..f05e6f3ba 100644 --- a/docs/2018-01/index.html +++ b/docs/2018-01/index.html @@ -1452,6 +1452,8 @@ Catalina:type=Manager,context=/,host=localhost activeSessions 8
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1460,8 +1462,6 @@ Catalina:type=Manager,context=/,host=localhost activeSessions 8
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-02/index.html b/docs/2018-02/index.html index 8b2b06d27..ff603f424 100644 --- a/docs/2018-02/index.html +++ b/docs/2018-02/index.html @@ -1038,6 +1038,8 @@ UPDATE 3
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1046,8 +1048,6 @@ UPDATE 3
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-03/index.html b/docs/2018-03/index.html index 356df0cde..1f6144911 100644 --- a/docs/2018-03/index.html +++ b/docs/2018-03/index.html @@ -585,6 +585,8 @@ Fixed 5 occurences of: GENEBANKS
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -593,8 +595,6 @@ Fixed 5 occurences of: GENEBANKS
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-04/index.html b/docs/2018-04/index.html index 8ce71a75d..cac614654 100644 --- a/docs/2018-04/index.html +++ b/docs/2018-04/index.html @@ -594,6 +594,8 @@ $ pg_restore -O -U dspacetest -d dspacetest -W -h localhost /tmp/dspace_2018-04-
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -602,8 +604,6 @@ $ pg_restore -O -U dspacetest -d dspacetest -W -h localhost /tmp/dspace_2018-04-
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-05/index.html b/docs/2018-05/index.html index b93293ccf..f9524b5d2 100644 --- a/docs/2018-05/index.html +++ b/docs/2018-05/index.html @@ -523,6 +523,8 @@ $ psql -h localhost -U postgres dspacetest
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -531,8 +533,6 @@ $ psql -h localhost -U postgres dspacetest
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-06/index.html b/docs/2018-06/index.html index 3379e3b3e..23ac0a8a6 100644 --- a/docs/2018-06/index.html +++ b/docs/2018-06/index.html @@ -517,6 +517,8 @@ $ sed '/^id/d' 10568-*.csv | csvcut -c 1,2 > map-to-cifor-archive.csv
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -525,8 +527,6 @@ $ sed '/^id/d' 10568-*.csv | csvcut -c 1,2 > map-to-cifor-archive.csv
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-07/index.html b/docs/2018-07/index.html index bd2b3dc4b..f03b78846 100644 --- a/docs/2018-07/index.html +++ b/docs/2018-07/index.html @@ -569,6 +569,8 @@ dspace=# select count(text_value) from metadatavalue where resource_type_id=2 an
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -577,8 +579,6 @@ dspace=# select count(text_value) from metadatavalue where resource_type_id=2 an
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-08/index.html b/docs/2018-08/index.html index 93248adec..4300c2230 100644 --- a/docs/2018-08/index.html +++ b/docs/2018-08/index.html @@ -442,6 +442,8 @@ $ dspace database migrate ignored
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -450,8 +452,6 @@ $ dspace database migrate ignored
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-09/index.html b/docs/2018-09/index.html index 9fbc885dd..0209a3198 100644 --- a/docs/2018-09/index.html +++ b/docs/2018-09/index.html @@ -748,6 +748,8 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND me
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -756,8 +758,6 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND me
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-10/index.html b/docs/2018-10/index.html index 78527c86f..2c5f9fa57 100644 --- a/docs/2018-10/index.html +++ b/docs/2018-10/index.html @@ -656,6 +656,8 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: applic
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -664,8 +666,6 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: applic
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-11/index.html b/docs/2018-11/index.html index d770a926e..11db760d6 100644 --- a/docs/2018-11/index.html +++ b/docs/2018-11/index.html @@ -553,6 +553,8 @@ $ dspace dsrun org.dspace.eperson.Groomer -a -b 11/27/2016 -d
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -561,8 +563,6 @@ $ dspace dsrun org.dspace.eperson.Groomer -a -b 11/27/2016 -d
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2018-12/index.html b/docs/2018-12/index.html index ac150b594..c1e9a6c50 100644 --- a/docs/2018-12/index.html +++ b/docs/2018-12/index.html @@ -594,6 +594,8 @@ UPDATE 1
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -602,8 +604,6 @@ UPDATE 1
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-01/index.html b/docs/2019-01/index.html index d111fc46b..8eb3068a4 100644 --- a/docs/2019-01/index.html +++ b/docs/2019-01/index.html @@ -1264,6 +1264,8 @@ identify: CorruptImageProfile `xmp' @ warning/profile.c/SetImageProfileInter
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1272,8 +1274,6 @@ identify: CorruptImageProfile `xmp' @ warning/profile.c/SetImageProfileInter
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-02/index.html b/docs/2019-02/index.html index 0db9738f7..d4667d117 100644 --- a/docs/2019-02/index.html +++ b/docs/2019-02/index.html @@ -1344,6 +1344,8 @@ Please see the DSpace documentation for assistance.
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1352,8 +1354,6 @@ Please see the DSpace documentation for assistance.
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-03/index.html b/docs/2019-03/index.html index df5149197..46f1e8262 100644 --- a/docs/2019-03/index.html +++ b/docs/2019-03/index.html @@ -1208,6 +1208,8 @@ sys 0m2.551s
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1216,8 +1218,6 @@ sys 0m2.551s
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-04/index.html b/docs/2019-04/index.html index 332fcdd4a..29ae0b124 100644 --- a/docs/2019-04/index.html +++ b/docs/2019-04/index.html @@ -1299,6 +1299,8 @@ UPDATE 14
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1307,8 +1309,6 @@ UPDATE 14
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-05/index.html b/docs/2019-05/index.html index 42562e3b6..b4642c20c 100644 --- a/docs/2019-05/index.html +++ b/docs/2019-05/index.html @@ -631,6 +631,8 @@ COPY 64871
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -639,8 +641,6 @@ COPY 64871
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-06/index.html b/docs/2019-06/index.html index 25aaa1076..01ac1fbbc 100644 --- a/docs/2019-06/index.html +++ b/docs/2019-06/index.html @@ -317,6 +317,8 @@ UPDATE 2
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -325,8 +327,6 @@ UPDATE 2
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-07/index.html b/docs/2019-07/index.html index a06e1a8d1..5aab6abdd 100644 --- a/docs/2019-07/index.html +++ b/docs/2019-07/index.html @@ -554,6 +554,8 @@ issn.validate('1020-3362')
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -562,8 +564,6 @@ issn.validate('1020-3362')
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-08/index.html b/docs/2019-08/index.html index a57246373..1336a3e8c 100644 --- a/docs/2019-08/index.html +++ b/docs/2019-08/index.html @@ -573,6 +573,8 @@ sys 2m27.496s
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -581,8 +583,6 @@ sys 2m27.496s
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-09/index.html b/docs/2019-09/index.html index 7c2a5b47f..a17f5b488 100644 --- a/docs/2019-09/index.html +++ b/docs/2019-09/index.html @@ -581,6 +581,8 @@ $ csv-metadata-quality -i /tmp/clarisa-institutions.csv -o /tmp/clarisa-institut
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -589,8 +591,6 @@ $ csv-metadata-quality -i /tmp/clarisa-institutions.csv -o /tmp/clarisa-institut
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-10/index.html b/docs/2019-10/index.html index 294edb35e..0134932dc 100644 --- a/docs/2019-10/index.html +++ b/docs/2019-10/index.html @@ -385,6 +385,8 @@ $ dspace import -a -c 10568/104057 -e fuu@cgiar.org -m 2019-10-15-Bioversity.map
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -393,8 +395,6 @@ $ dspace import -a -c 10568/104057 -e fuu@cgiar.org -m 2019-10-15-Bioversity.map
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-11/index.html b/docs/2019-11/index.html index 6f2ebf3cc..856f984dc 100644 --- a/docs/2019-11/index.html +++ b/docs/2019-11/index.html @@ -692,6 +692,8 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -700,8 +702,6 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2019-12/index.html b/docs/2019-12/index.html index ea1bee863..219ea4d74 100644 --- a/docs/2019-12/index.html +++ b/docs/2019-12/index.html @@ -404,6 +404,8 @@ UPDATE 1
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -412,8 +414,6 @@ UPDATE 1
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-01/index.html b/docs/2020-01/index.html index 4e5c73a8d..893a80622 100644 --- a/docs/2020-01/index.html +++ b/docs/2020-01/index.html @@ -604,6 +604,8 @@ COPY 2900
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -612,8 +614,6 @@ COPY 2900
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-02/index.html b/docs/2020-02/index.html index b1ed62724..45f0720e8 100644 --- a/docs/2020-02/index.html +++ b/docs/2020-02/index.html @@ -1275,6 +1275,8 @@ Moving: 21993 into core statistics-2019
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1283,8 +1285,6 @@ Moving: 21993 into core statistics-2019
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-03/index.html b/docs/2020-03/index.html index 960574904..54c517ee4 100644 --- a/docs/2020-03/index.html +++ b/docs/2020-03/index.html @@ -484,6 +484,8 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -492,8 +494,6 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-04/index.html b/docs/2020-04/index.html index a15fcc30d..634556b1c 100644 --- a/docs/2020-04/index.html +++ b/docs/2020-04/index.html @@ -658,6 +658,8 @@ $ psql -c 'select * from pg_stat_activity' | wc -l
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -666,8 +668,6 @@ $ psql -c 'select * from pg_stat_activity' | wc -l
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-05/index.html b/docs/2020-05/index.html index d83f404f6..9222bc5e8 100644 --- a/docs/2020-05/index.html +++ b/docs/2020-05/index.html @@ -477,6 +477,8 @@ Caused by: java.lang.NullPointerException
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -485,8 +487,6 @@ Caused by: java.lang.NullPointerException
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-06/index.html b/docs/2020-06/index.html index eff2a4f38..c5c33ec79 100644 --- a/docs/2020-06/index.html +++ b/docs/2020-06/index.html @@ -811,6 +811,8 @@ $ csvcut -c 'id,cg.subject.ilri[],cg.subject.ilri[en_US],dc.subject[en_US]&#
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -819,8 +821,6 @@ $ csvcut -c 'id,cg.subject.ilri[],cg.subject.ilri[en_US],dc.subject[en_US]&#
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-07/index.html b/docs/2020-07/index.html index da39cbf10..bfca38b7c 100644 --- a/docs/2020-07/index.html +++ b/docs/2020-07/index.html @@ -1142,6 +1142,8 @@ Fixed 4 occurences of: Muloi, D.M.
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1150,8 +1152,6 @@ Fixed 4 occurences of: Muloi, D.M.
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-08/index.html b/docs/2020-08/index.html index 1c68c611a..3502d4d91 100644 --- a/docs/2020-08/index.html +++ b/docs/2020-08/index.html @@ -798,6 +798,8 @@ $ grep -c added /tmp/2020-08-27-countrycodetagger.log
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -806,8 +808,6 @@ $ grep -c added /tmp/2020-08-27-countrycodetagger.log
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-09/index.html b/docs/2020-09/index.html index 0e865e8d2..19df865a3 100644 --- a/docs/2020-09/index.html +++ b/docs/2020-09/index.html @@ -717,6 +717,8 @@ solr_query_params = {
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -725,8 +727,6 @@ solr_query_params = {
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-10/index.html b/docs/2020-10/index.html index 92a0d12ac..253fa5cbb 100644 --- a/docs/2020-10/index.html +++ b/docs/2020-10/index.html @@ -1241,6 +1241,8 @@ $ ./delete-metadata-values.py -i 2020-10-31-delete-74-sponsors.csv -db dspace -u
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1249,8 +1251,6 @@ $ ./delete-metadata-values.py -i 2020-10-31-delete-74-sponsors.csv -db dspace -u
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-11/index.html b/docs/2020-11/index.html index 985932203..fb89425cb 100644 --- a/docs/2020-11/index.html +++ b/docs/2020-11/index.html @@ -731,6 +731,8 @@ $ ./fix-metadata-values.py -i 2020-11-30-fix-hung-orcid.csv -db dspace63 -u dspa
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -739,8 +741,6 @@ $ ./fix-metadata-values.py -i 2020-11-30-fix-hung-orcid.csv -db dspace63 -u dspa
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2020-12/index.html b/docs/2020-12/index.html index 3d38e243d..4a305f22a 100644 --- a/docs/2020-12/index.html +++ b/docs/2020-12/index.html @@ -869,6 +869,8 @@ $ query-json '.items | length' /tmp/policy2.json
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -877,8 +879,6 @@ $ query-json '.items | length' /tmp/policy2.json
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-01/index.html b/docs/2021-01/index.html index 0173433b8..3aa80f181 100644 --- a/docs/2021-01/index.html +++ b/docs/2021-01/index.html @@ -688,6 +688,8 @@ java.lang.IllegalArgumentException: Invalid character found in the request targe
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -696,8 +698,6 @@ java.lang.IllegalArgumentException: Invalid character found in the request targe
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-02/index.html b/docs/2021-02/index.html index 84390ab62..67044dbfe 100644 --- a/docs/2021-02/index.html +++ b/docs/2021-02/index.html @@ -898,6 +898,8 @@ dspace=# UPDATE metadatavalue SET text_lang='en_US' WHERE dspace_object_
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -906,8 +908,6 @@ dspace=# UPDATE metadatavalue SET text_lang='en_US' WHERE dspace_object_
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-03/index.html b/docs/2021-03/index.html index 982e11b1d..ad824b4af 100644 --- a/docs/2021-03/index.html +++ b/docs/2021-03/index.html @@ -875,6 +875,8 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -883,8 +885,6 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-04/index.html b/docs/2021-04/index.html index 0675fa9b3..05bfdde8e 100644 --- a/docs/2021-04/index.html +++ b/docs/2021-04/index.html @@ -1042,6 +1042,8 @@ $ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisti
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -1050,8 +1052,6 @@ $ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisti
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-05/index.html b/docs/2021-05/index.html index 78d046071..3ab67ce42 100644 --- a/docs/2021-05/index.html +++ b/docs/2021-05/index.html @@ -685,6 +685,8 @@ May 26, 02:57 UTC
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -693,8 +695,6 @@ May 26, 02:57 UTC
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-06/index.html b/docs/2021-06/index.html index 17186efe0..4dc37a33a 100644 --- a/docs/2021-06/index.html +++ b/docs/2021-06/index.html @@ -693,6 +693,8 @@ I simply started it and AReS was running again:
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -701,8 +703,6 @@ I simply started it and AReS was running again:
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-07/index.html b/docs/2021-07/index.html index e01cf5fd9..2035cbea8 100644 --- a/docs/2021-07/index.html +++ b/docs/2021-07/index.html @@ -715,6 +715,8 @@ COPY 20994
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -723,8 +725,6 @@ COPY 20994
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-08/index.html b/docs/2021-08/index.html index b14351bab..e3e03952f 100644 --- a/docs/2021-08/index.html +++ b/docs/2021-08/index.html @@ -606,6 +606,8 @@ I decided to upgrade linode20 from Ubuntu 18.04 to 20.04
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -614,8 +616,6 @@ I decided to upgrade linode20 from Ubuntu 18.04 to 20.04
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-09/index.html b/docs/2021-09/index.html index 8bc8bc8d9..f9afc688d 100644 --- a/docs/2021-09/index.html +++ b/docs/2021-09/index.html @@ -588,6 +588,8 @@ The syntax Moayad showed me last month doesn’t seem to honor the search qu
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -596,8 +598,6 @@ The syntax Moayad showed me last month doesn’t seem to honor the search qu
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-10/index.html b/docs/2021-10/index.html index e77ce0c13..7080243ca 100644 --- a/docs/2021-10/index.html +++ b/docs/2021-10/index.html @@ -791,6 +791,8 @@ Try doing it in two imports. In first import, remove all authors. In second impo
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -799,8 +801,6 @@ Try doing it in two imports. In first import, remove all authors. In second impo
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-11/index.html b/docs/2021-11/index.html index a59b4deaa..b6661357a 100644 --- a/docs/2021-11/index.html +++ b/docs/2021-11/index.html @@ -494,6 +494,8 @@ $ zstd statistics-2019.json
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -502,8 +504,6 @@ $ zstd statistics-2019.json
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2021-12/index.html b/docs/2021-12/index.html index 51545fdac..7f43ad504 100644 --- a/docs/2021-12/index.html +++ b/docs/2021-12/index.html @@ -577,6 +577,8 @@ Total number of bot hits purged: 3679
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -585,8 +587,6 @@ Total number of bot hits purged: 3679
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2022-01/index.html b/docs/2022-01/index.html index dc8268174..1a9fb4193 100644 --- a/docs/2022-01/index.html +++ b/docs/2022-01/index.html @@ -380,6 +380,8 @@ Start a full harvest on AReS
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -388,8 +390,6 @@ Start a full harvest on AReS
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2022-02/index.html b/docs/2022-02/index.html index 71ea0319c..22e9ebbd4 100644 --- a/docs/2022-02/index.html +++ b/docs/2022-02/index.html @@ -724,6 +724,8 @@ isNotNull(value.match('699'))
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -732,8 +734,6 @@ isNotNull(value.match('699'))
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2022-03/index.html b/docs/2022-03/index.html index 3f94da53f..656ceb409 100644 --- a/docs/2022-03/index.html +++ b/docs/2022-03/index.html @@ -476,6 +476,8 @@ isNotNull(value.match('889'))
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -484,8 +486,6 @@ isNotNull(value.match('889'))
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2022-04/index.html b/docs/2022-04/index.html index 3f136e857..27fa1849c 100644 --- a/docs/2022-04/index.html +++ b/docs/2022-04/index.html @@ -509,6 +509,8 @@
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -517,8 +519,6 @@
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2022-05/index.html b/docs/2022-05/index.html index d0b9f1b18..94a4a00a8 100644 --- a/docs/2022-05/index.html +++ b/docs/2022-05/index.html @@ -445,6 +445,8 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
    +
  1. July, 2022
  2. +
  3. June, 2022
  4. May, 2022
  5. @@ -453,8 +455,6 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
  6. March, 2022
  7. -
  8. February, 2022
  9. -
diff --git a/docs/2022-06/index.html b/docs/2022-06/index.html index 751e58e93..ebdc6e2d2 100644 --- a/docs/2022-06/index.html +++ b/docs/2022-06/index.html @@ -26,7 +26,7 @@ There seem to be many more of these: - + @@ -58,9 +58,9 @@ There seem to be many more of these: "@type": "BlogPosting", "headline": "June, 2022", "url": "https://alanorth.github.io/cgspace-notes/2022-06/", - "wordCount": "1761", + "wordCount": "1786", "datePublished": "2022-06-06T09:01:36+03:00", - "dateModified": "2022-06-30T09:41:54+03:00", + "dateModified": "2022-06-30T16:48:03+03:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -431,6 +431,11 @@ There seem to be many more of these:
  • There I started removing any term that was a variation of an existing AGROVOC term (like cowpea/cowpeas, policy/policies) or a compound concept
  • +
  • pnbecker on DSpace Slack mentioned that they made a JSPUI deduplication step that is open source: https://github.com/the-library-code/deduplication + +
  • @@ -453,6 +458,8 @@ There seem to be many more of these:
      +
    1. July, 2022
    2. +
    3. June, 2022
    4. May, 2022
    5. @@ -461,8 +468,6 @@ There seem to be many more of these:
    6. March, 2022
    7. -
    8. February, 2022
    9. -
    diff --git a/docs/2022-07/index.html b/docs/2022-07/index.html new file mode 100644 index 000000000..80e7262e7 --- /dev/null +++ b/docs/2022-07/index.html @@ -0,0 +1,222 @@ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + July, 2022 | CGSpace Notes + + + + + + + + + + + + + + + + + + + + + +
    +
    + +
    +
    + + + + +
    +
    +

    CGSpace Notes

    +

    Documenting day-to-day work on the CGSpace repository.

    +
    +
    + + + + +
    +
    +
    + + + + +
    +
    +

    July, 2022

    + +
    +

    2022-07-02

    +
      +
    • I learned how to use the Levenshtein functions in PostgreSQL +
        +
      • The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing
      • +
      • Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first
      • +
      +
    • +
    +
      +
    • A working query checking for duplicates in the recent AfricaRice items is:
    • +
    +
    localhost/dspace= ☘ SELECT text_value FROM metadatavalue WHERE  dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=64 AND levenshtein_less_equal(LOWER('International Trade and Exotic Pests: The Risks for Biodiversity and African Economies'), LEFT(LOWER(text_value), 255), 3) <= 3;
    +                                       text_value                                       
    +────────────────────────────────────────────────────────────────────────────────────────
    + International trade and exotic pests: the risks for biodiversity and African economies
    +(1 row)
    +
    +Time: 399.751 ms
    +
    +

    2022-07-03

    +
      +
    • Start a harvest on AReS
    • +
    + + + + + + +
    + + + +
    + + + + +
    +
    + + + + + + + + + diff --git a/docs/404.html b/docs/404.html index 1568cdcef..2bf1e0165 100644 --- a/docs/404.html +++ b/docs/404.html @@ -95,6 +95,8 @@
      +
    1. July, 2022
    2. +
    3. June, 2022
    4. May, 2022
    5. @@ -103,8 +105,6 @@
    6. March, 2022
    7. -
    8. February, 2022
    9. -
    diff --git a/docs/categories/index.html b/docs/categories/index.html index ff4e3799c..bec37df3f 100644 --- a/docs/categories/index.html +++ b/docs/categories/index.html @@ -10,7 +10,7 @@ - + @@ -84,7 +84,7 @@

    Notes

    - +
    Read more → @@ -108,6 +108,8 @@
      +
    1. July, 2022
    2. +
    3. June, 2022
    4. May, 2022
    5. @@ -116,8 +118,6 @@
    6. March, 2022
    7. -
    8. February, 2022
    9. -
    diff --git a/docs/categories/index.xml b/docs/categories/index.xml index a4494aaca..564592e2c 100644 --- a/docs/categories/index.xml +++ b/docs/categories/index.xml @@ -6,11 +6,11 @@ Recent content in Categories on CGSpace Notes Hugo -- gohugo.io en-us - Mon, 06 Jun 2022 09:01:36 +0300 + Sat, 02 Jul 2022 14:07:36 +0300 Notes https://alanorth.github.io/cgspace-notes/categories/notes/ - Mon, 06 Jun 2022 09:01:36 +0300 + Sat, 02 Jul 2022 14:07:36 +0300 https://alanorth.github.io/cgspace-notes/categories/notes/ diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html index 5429adf40..b5cb216a6 100644 --- a/docs/categories/notes/index.html +++ b/docs/categories/notes/index.html @@ -10,7 +10,7 @@ - + @@ -81,6 +81,32 @@ +
    +
    +

    July, 2022

    + +
    +

    2022-07-02

    +
      +
    • I learned how to use the Levenshtein functions in PostgreSQL +
        +
      • The thing is that there is a limit of 255 characters for these functions in PostgreSQL so you need to truncate the strings before comparing
      • +
      • Also, the trgm functions I’ve used before are case insensitive, but Levenshtein is not, so you need to make sure to lower case both strings first
      • +
      +
    • +
    + Read more → +
    + + + + + +

    June, 2022

    @@ -334,39 +360,6 @@ - -
    -
    -

    September, 2021

    - -
    -

    2021-09-02

    -
      -
    • Troubleshooting the missing Altmetric scores on AReS -
        -
      • Turns out that I didn’t actually fix them last month because the check for content.altmetric still exists, and I can’t access the DOIs using _h.source.DOI for some reason
      • -
      • I can access all other kinds of item metadata using the Elasticsearch label, but not DOI!!!
      • -
      • I will change DOI to tomato in the repository setup and start a re-harvest… I need to see if this is some kind of reserved word or something…
      • -
      • Even as tomato I can’t access that field as _h.source.tomato in Angular, but it does work as a filter source… sigh
      • -
      -
    • -
    • I’m having problems using the OpenRXV API -
        -
      • The syntax Moayad showed me last month doesn’t seem to honor the search query properly…
      • -
      -
    • -
    - Read more → -
    - - - - -