From baf29b7f8074170445bc758e8dc100fe61eb3b03 Mon Sep 17 00:00:00 2001 From: Alan Orth Date: Thu, 4 Feb 2021 17:28:20 +0200 Subject: [PATCH] Add notes for 2021-02-04 --- content/posts/2021-02.md | 157 ++++++++++++++++++ docs/2021-02/index.html | 208 +++++++++++++++++++++++- docs/categories/index.html | 2 +- docs/categories/notes/index.html | 9 +- docs/categories/notes/index.xml | 7 + docs/categories/notes/page/2/index.html | 2 +- docs/categories/notes/page/3/index.html | 2 +- docs/categories/notes/page/4/index.html | 2 +- docs/categories/notes/page/5/index.html | 2 +- docs/index.html | 9 +- docs/index.xml | 7 + docs/page/2/index.html | 2 +- docs/page/3/index.html | 2 +- docs/page/4/index.html | 2 +- docs/page/5/index.html | 2 +- docs/page/6/index.html | 2 +- docs/page/7/index.html | 2 +- docs/posts/index.html | 9 +- docs/posts/index.xml | 7 + docs/posts/page/2/index.html | 2 +- docs/posts/page/3/index.html | 2 +- docs/posts/page/4/index.html | 2 +- docs/posts/page/5/index.html | 2 +- docs/posts/page/6/index.html | 2 +- docs/posts/page/7/index.html | 2 +- docs/sitemap.xml | 10 +- 26 files changed, 428 insertions(+), 29 deletions(-) diff --git a/content/posts/2021-02.md b/content/posts/2021-02.md index 19b58a0d7..b02505879 100644 --- a/content/posts/2021-02.md +++ b/content/posts/2021-02.md @@ -7,6 +7,10 @@ categories: ["Notes"] ## 2021-02-01 +- Abenet said that CIP found more duplicate records in their export from AReS + - I re-opened [the issue](https://github.com/ilri/OpenRXV/issues/67) on OpenRXV where we had previously noticed this + - The shared link where the duplicates are is here: https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6 +- I had a call with CodeObia to discuss the work on OpenRXV - Check the results of the AReS harvesting from last night: ```console @@ -48,4 +52,157 @@ $ curl -XDELETE 'http://localhost:9200/openrxv-items-temp' $ curl -XDELETE 'http://localhost:9200/openrxv-items-2021-02-01' ``` +- Meeting with Peter and Abenet about CGSpace goals and progress +- Test submission to DSpace via REST API to see if Abenet can fix / reject it (submit workflow?) +- Get Peter a list of users who have submitted or approved on DSpace everrrrrrr, so he can remove some +- Ask MEL for a dump of their types to reconcile with ours and CG Core +- Need to tag ILRI collection with license!! For pre-2010 use "Other" unless a license is already there; 2010-2020 do the ilri content in batches (2010-2015: CC-BY-NC-SA; 2016-onwards: CC-BY); + - ONLY if ILRI / International Livestock Research Institute is the publisher, no journal articles, no book chapters... +- I tried to export the ILRI community from CGSpace but I got an error: + +```console +$ dspace metadata-export -i 10568/1 -f /tmp/2021-02-01-ILRI.csv +Loading @mire database changes for module MQM +Changes have been processed +Exporting community 'International Livestock Research Institute (ILRI)' (10568/1) + Exception: null +java.lang.NullPointerException + at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:212) + at com.google.common.collect.Iterators.concat(Iterators.java:464) + at org.dspace.app.bulkedit.MetadataExport.addItemsToResult(MetadataExport.java:136) + at org.dspace.app.bulkedit.MetadataExport.buildFromCommunity(MetadataExport.java:125) + at org.dspace.app.bulkedit.MetadataExport.(MetadataExport.java:77) + at org.dspace.app.bulkedit.MetadataExport.main(MetadataExport.java:282) + at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) + at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) + at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) + at java.lang.reflect.Method.invoke(Method.java:498) + at org.dspace.app.launcher.ScriptLauncher.runOneCommand(ScriptLauncher.java:229) + at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:81) +``` + +- I imported the production database to my local development environment and I get the same error... WTF is this? + - I was able to export another smaller community + - I filed [an issue](https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=919) with Atmire to see if it is likely something of theirs, or if I need to ask on the dspace-tech mailing list +- CodeObia sent a [pull request](https://github.com/ilri/OpenRXV/pull/71) with fixes for several issues we highlighted in OpenRXV + - I deployed the fixes on production, as they only affect minor parts of the frontend, and two of the four are working + - I sent feedback to CodeObia + +## 2021-02-02 + +- Communicate more with CodeObia about some fixes for OpenRXV +- Maria Garruccio sent me some new ORCID iDs for Bioversity authors, as well as a correction for Stefan Burkart's iD +- I saved the new ones to a text file, combined them with the others, extracted the ORCID iDs themselves, and updated the names using `resolve-orcids.py`: +- Then for the rest, I saved them to a text file, combined them with the others, extracted the ORCID iDs themselves, and updated the names using `resolve-orcids.py`: + +```console +$ cat ~/src/git/DSpace/dspace/config/controlled-vocabularies/cg-creator-id.xml /tmp/bioversity-orcid-ids.txt | grep -oE '[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}' | sort | uniq > /tmp/2021-02-02-combined-orcids.txt +$ ./ilri/resolve-orcids.py -i /tmp/2021-02-02-combined-orcids.txt -o /tmp/2021-02-02-combined-orcid-names.txt +``` + +- I sorted the names and added the XML formatting in vim, then ran it through tidy: + +```console +$ tidy -xml -utf8 -m -iq -w 0 dspace/config/controlled-vocabularies/cg-creator-id.xml +``` + +- Then I added all the changed names plus Stefan's incorrect ones to a CSV and processed them with `fix-metadata-values.py`: + +```console +$ cat 2021-02-02-fix-orcid-ids.csv +cg.creator.id,correct +Burkart Stefan: 0000-0001-5297-2184,Stefan Burkart: 0000-0001-5297-2184 +Burkart Stefan: 0000-0002-7558-9177,Stefan Burkart: 0000-0001-5297-2184 +Stefan Burkart: 0000-0001-5297-2184,Stefan Burkart: 0000-0001-5297-2184 +Stefan Burkart: 0000-0002-7558-9177,Stefan Burkart: 0000-0001-5297-2184 +Adina Chain Guadarrama: 0000-0002-6944-2064,Adina Chain-Guadarrama: 0000-0002-6944-2064 +Bedru: 0000-0002-7344-5743,Bedru B. Balana: 0000-0002-7344-5743 +Leigh Winowiecki: 0000-0001-5572-1284,Leigh Ann Winowiecki: 0000-0001-5572-1284 +Sander J. Zwart: 0000-0002-5091-1801,Sander Zwart: 0000-0002-5091-1801 +saul lozano-fuentes: 0000-0003-1517-6853,Saul Lozano: 0000-0003-1517-6853 +$ ./ilri/fix-metadata-values.py -i 2021-02-02-fix-orcid-ids.csv -db dspace63 -u dspace -p 'fuuu' -f cg.creator.id -t 'correct' -m 240 +``` + +- I also looked up which of these new authors might have existing items that are missing ORCID iDs +- I had to port my `add-orcid-identifiers-csv.py` to DSpace 6 UUIDs and I think it's working but I want to do a few more tests because it uses a sequence for the metadata_value_id + +## 2021-02-03 + +- Tag forty-three items from Bioversity's new authors with ORCID iDs using `add-orcid-identifiers-csv.py`: + +```console +$ cat /tmp/2021-02-02-add-orcid-ids.csv +dc.contributor.author,cg.creator.id +"Nchanji, E.",Eileen Bogweh Nchanji: 0000-0002-6859-0962 +"Nchanji, Eileen",Eileen Bogweh Nchanji: 0000-0002-6859-0962 +"Nchanji, Eileen Bogweh",Eileen Bogweh Nchanji: 0000-0002-6859-0962 +"Machida, Lewis",Lewis Machida: 0000-0002-0012-3997 +"Mockshell, Jonathan",Jonathan Mockshell: 0000-0003-1990-6657" +"Aubert, C.",Celine Aubert: 0000-0001-6284-4821 +"Aubert, Céline",Celine Aubert: 0000-0001-6284-4821 +"Devare, M.",Medha Devare: 0000-0003-0041-4812 +"Devare, Medha",Medha Devare: 0000-0003-0041-4812 +"Benites-Alfaro, O.E.",Omar E. Benites-Alfaro: 0000-0002-6852-9598 +"Benites-Alfaro, Omar Eduardo",Omar E. Benites-Alfaro: 0000-0002-6852-9598 +"Johnson, Vincent",VINCENT JOHNSON: 0000-0001-7874-178X +"Lesueur, Didier",didier lesueur: 0000-0002-6694-0869 +$ ./ilri/add-orcid-identifiers-csv.py -i /tmp/2021-02-02-add-orcid-ids.csv -db dspace -u dspace -p 'fuuu' -d +``` + +- I'm working on the CGSpace accession for Karl Rich's [Viet Nam Pig Model 2018](https://github.com/ilri/vietnam-pig-model-2018) and I noticed his ORCID iD is missing from CGSpace + - I added it and tagged 141 items of his with the iD +- I [uploaded a metadata-only accession](https://hdl.handle.net/10568/111126) for the impact of ILRI book by John McIntire and Delia Grace to CGSpace + - The source code itself is here: https://github.com/ilri/impact-book +- A little bit more work on CG Core v2 + +## 2021-02-04 + +- Re-sync CGSpace database and Solr to DSpace Test to start a public test of CG Core v2 + - Afterwards I updated Discovery and OAI: + +```console +$ time chrt -b 0 dspace index-discovery -b +$ dspace oai import -c +``` + +- Attend Accenture meeting for repository managers + - Not clear what the SMO wants to get out of us +- Enrico asked for some notes about our work on AReS in 2020 for CRP Livestock reporting + - Abenet and I came up with the following: + +> In 2020 we funded the third phase of development on the OpenRXV platform that powers AReS. This phase focused mainly on improving the search filtering, graphical visualizations, and reporting capabilities. It is now possible to create custom reports in Excel, Word, and PDF formats using a templating system. We also concentrated on making the vanilla OpenRXV platform easier to deploy and administer in hopes that other organizations would begin using it. Lastly, we identified and fixed a handful of bugs in the system. All development takes place publicly on GitHub: https://github.com/ilri/OpenRXV. + +> In the last quarter of 2020, ILRI conducted a briefing for nearly 100 scientists and communications staff on how to use ARes as a visualization tool for repository outputs and as a reporting tool (https://hdl.handle.net/10568/110527). Staff will begin using AReS to generate lists of their outputs to upload in the performance evaluation system to assist in their performance evaluation. The list of publications they will upload from AReS to Performax will indicate the open access status of each publication to help start discussion why some outputs are not open access given the open access policies of the CGIAR. + +- Call Moayad to discuss OpenRXV development + - We talked about the "reporting period" (date-based statistics) and some of the issues Abdullah is working on on GitHub + - I suggested that we offer the date-range statistics in a modal dialog with other sorting and grouping options during report generation +- Peter sent me the cleaned up series that I had originally sent him in 2020-10 + - I quickly applied all the deletions on CGSpace: + +```console +$ ./ilri/delete-metadata-values.py -i /tmp/2020-10-28-Series-PB.csv -db dspace -u dspace -p 'fuuu' -f dc.relation.ispartofseries -m 43 +``` + +- The corrected versions have a lot of encoding issues so I asked Peter to give me the correct ones so I can search/replace them: + - CIAT Publicaçao + - CIAT Publicación + - CIAT Série + - CIAT Séries + - Colección investigación y desarrollo + - CTA Guias práticos + - CTA Guias técnicas + - Curso de adiestramiento en producción y utilización de pastos tropicales + - Folheto Técnico + - ILRI Nota Informativa de Investigação + - Influencia de los actores sociales en América Central + - Institutionalization of quality assurance mechanism and dissemination of top quality commercial products to increase crop yields and improve food security of smallholder farmers in sub-Saharan Africa – COMPRO-II + - Manuel pour les Banques de Gènes;1 + - Sistematización de experiencias Proyecto ACORDAR + - Strüngmann Forum + - Unité de Recherche +- Help Peter debug an issue with one of Alan Duncan's new FEAST Data reports on CGSpace + - For some reason the default policy for the item was "COLLECTION_492_DEFAULT_READ" group, which had zero members + - I changed them all to Anonymous and the item was accessible + diff --git a/docs/2021-02/index.html b/docs/2021-02/index.html index c4b4fa108..f7e1ba54f 100644 --- a/docs/2021-02/index.html +++ b/docs/2021-02/index.html @@ -9,6 +9,13 @@ - + @@ -33,6 +40,13 @@ $ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty

2021-02-01

$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty'
@@ -164,7 +185,186 @@ $ curl -s -X POST http://localhost:9200/openrxv-items-temp/_clone/openrxv-items
 
$ curl -XDELETE 'http://localhost:9200/openrxv-items-temp'       
 {"acknowledged":true}%                                                                                                               
 $ curl -XDELETE 'http://localhost:9200/openrxv-items-2021-02-01'
-
+
+
$ dspace metadata-export -i 10568/1 -f /tmp/2021-02-01-ILRI.csv
+Loading @mire database changes for module MQM
+Changes have been processed
+Exporting community 'International Livestock Research Institute (ILRI)' (10568/1)
+           Exception: null
+java.lang.NullPointerException
+        at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:212)
+        at com.google.common.collect.Iterators.concat(Iterators.java:464)
+        at org.dspace.app.bulkedit.MetadataExport.addItemsToResult(MetadataExport.java:136)
+        at org.dspace.app.bulkedit.MetadataExport.buildFromCommunity(MetadataExport.java:125)
+        at org.dspace.app.bulkedit.MetadataExport.<init>(MetadataExport.java:77)
+        at org.dspace.app.bulkedit.MetadataExport.main(MetadataExport.java:282)
+        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
+        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
+        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
+        at java.lang.reflect.Method.invoke(Method.java:498)
+        at org.dspace.app.launcher.ScriptLauncher.runOneCommand(ScriptLauncher.java:229)
+        at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:81)
+
+

2021-02-02

+ +
$ cat ~/src/git/DSpace/dspace/config/controlled-vocabularies/cg-creator-id.xml /tmp/bioversity-orcid-ids.txt | grep -oE '[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}' | sort | uniq > /tmp/2021-02-02-combined-orcids.txt
+$ ./ilri/resolve-orcids.py -i /tmp/2021-02-02-combined-orcids.txt -o /tmp/2021-02-02-combined-orcid-names.txt
+
+
$ tidy -xml -utf8 -m -iq -w 0 dspace/config/controlled-vocabularies/cg-creator-id.xml
+
+
$ cat 2021-02-02-fix-orcid-ids.csv 
+cg.creator.id,correct
+Burkart Stefan: 0000-0001-5297-2184,Stefan Burkart: 0000-0001-5297-2184
+Burkart Stefan: 0000-0002-7558-9177,Stefan Burkart: 0000-0001-5297-2184
+Stefan  Burkart: 0000-0001-5297-2184,Stefan Burkart: 0000-0001-5297-2184
+Stefan Burkart: 0000-0002-7558-9177,Stefan Burkart: 0000-0001-5297-2184
+Adina Chain Guadarrama: 0000-0002-6944-2064,Adina Chain-Guadarrama: 0000-0002-6944-2064
+Bedru: 0000-0002-7344-5743,Bedru B. Balana: 0000-0002-7344-5743
+Leigh Winowiecki: 0000-0001-5572-1284,Leigh Ann Winowiecki: 0000-0001-5572-1284
+Sander J. Zwart: 0000-0002-5091-1801,Sander Zwart: 0000-0002-5091-1801
+saul lozano-fuentes: 0000-0003-1517-6853,Saul Lozano: 0000-0003-1517-6853
+$ ./ilri/fix-metadata-values.py -i 2021-02-02-fix-orcid-ids.csv -db dspace63 -u dspace -p 'fuuu' -f cg.creator.id -t 'correct' -m 240
+
+

2021-02-03

+ +
$ cat /tmp/2021-02-02-add-orcid-ids.csv
+dc.contributor.author,cg.creator.id
+"Nchanji, E.",Eileen Bogweh Nchanji: 0000-0002-6859-0962
+"Nchanji, Eileen",Eileen Bogweh Nchanji: 0000-0002-6859-0962
+"Nchanji, Eileen Bogweh",Eileen Bogweh Nchanji: 0000-0002-6859-0962
+"Machida, Lewis",Lewis Machida: 0000-0002-0012-3997
+"Mockshell, Jonathan",Jonathan Mockshell: 0000-0003-1990-6657"
+"Aubert, C.",Celine Aubert: 0000-0001-6284-4821
+"Aubert, Céline",Celine Aubert: 0000-0001-6284-4821
+"Devare, M.",Medha Devare: 0000-0003-0041-4812
+"Devare, Medha",Medha Devare: 0000-0003-0041-4812
+"Benites-Alfaro, O.E.",Omar E. Benites-Alfaro: 0000-0002-6852-9598
+"Benites-Alfaro, Omar Eduardo",Omar E. Benites-Alfaro: 0000-0002-6852-9598
+"Johnson, Vincent",VINCENT JOHNSON: 0000-0001-7874-178X
+"Lesueur, Didier",didier lesueur: 0000-0002-6694-0869
+$ ./ilri/add-orcid-identifiers-csv.py -i /tmp/2021-02-02-add-orcid-ids.csv -db dspace -u dspace -p 'fuuu' -d
+
+

2021-02-04

+ +
$ time chrt -b 0 dspace index-discovery -b
+$ dspace oai import -c
+
+
+

In 2020 we funded the third phase of development on the OpenRXV platform that powers AReS. This phase focused mainly on improving the search filtering, graphical visualizations, and reporting capabilities. It is now possible to create custom reports in Excel, Word, and PDF formats using a templating system. We also concentrated on making the vanilla OpenRXV platform easier to deploy and administer in hopes that other organizations would begin using it. Lastly, we identified and fixed a handful of bugs in the system. All development takes place publicly on GitHub: https://github.com/ilri/OpenRXV.

+
+
+

In the last quarter of 2020, ILRI conducted a briefing for nearly 100 scientists and communications staff on how to use ARes as a visualization tool for repository outputs and as a reporting tool (https://hdl.handle.net/10568/110527). Staff will begin using AReS to generate lists of their outputs to upload in the performance evaluation system to assist in their performance evaluation. The list of publications they will upload from AReS to Performax will indicate the open access status of each publication to help start discussion why some outputs are not open access given the open access policies of the CGIAR.

+
+ +
$ ./ilri/delete-metadata-values.py -i /tmp/2020-10-28-Series-PB.csv -db dspace -u dspace -p 'fuuu' -f dc.relation.ispartofseries -m 43
+
+ diff --git a/docs/categories/index.html b/docs/categories/index.html index 726592d11..d202b2348 100644 --- a/docs/categories/index.html +++ b/docs/categories/index.html @@ -10,7 +10,7 @@ - + diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html index e784898f3..4623d614c 100644 --- a/docs/categories/notes/index.html +++ b/docs/categories/notes/index.html @@ -10,7 +10,7 @@ - + @@ -92,6 +92,13 @@

2021-02-01

$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty'
diff --git a/docs/categories/notes/index.xml b/docs/categories/notes/index.xml
index 53eae0fe8..fc20294a8 100644
--- a/docs/categories/notes/index.xml
+++ b/docs/categories/notes/index.xml
@@ -15,6 +15,13 @@
       https://alanorth.github.io/cgspace-notes/2021-02/
       <h2 id="2021-02-01">2021-02-01</h2>
 <ul>
+<li>Abenet said that CIP found more duplicate records in their export from AReS
+<ul>
+<li>I re-opened <a href="https://github.com/ilri/OpenRXV/issues/67">the issue</a> on OpenRXV where we had previously noticed this</li>
+<li>The shared link where the duplicates are is here: <a href="https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6">https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6</a></li>
+</ul>
+</li>
+<li>I had a call with CodeObia to discuss the work on OpenRXV</li>
 <li>Check the results of the AReS harvesting from last night:</li>
 </ul>
 <pre><code class="language-console" data-lang="console">$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&amp;pretty'
diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html
index 665e29c45..ac616f811 100644
--- a/docs/categories/notes/page/2/index.html
+++ b/docs/categories/notes/page/2/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/categories/notes/page/3/index.html b/docs/categories/notes/page/3/index.html
index d78a4b116..4adf8efff 100644
--- a/docs/categories/notes/page/3/index.html
+++ b/docs/categories/notes/page/3/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/categories/notes/page/4/index.html b/docs/categories/notes/page/4/index.html
index baa14d68c..345984e2f 100644
--- a/docs/categories/notes/page/4/index.html
+++ b/docs/categories/notes/page/4/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/categories/notes/page/5/index.html b/docs/categories/notes/page/5/index.html
index 9f04b6f69..96595673b 100644
--- a/docs/categories/notes/page/5/index.html
+++ b/docs/categories/notes/page/5/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/index.html b/docs/index.html
index 8cb4fd100..ae9e55eb3 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
@@ -107,6 +107,13 @@
   
   

2021-02-01

    +
  • Abenet said that CIP found more duplicate records in their export from AReS + +
  • +
  • I had a call with CodeObia to discuss the work on OpenRXV
  • Check the results of the AReS harvesting from last night:
$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty'
diff --git a/docs/index.xml b/docs/index.xml
index 4a3a737c7..aff3dfa84 100644
--- a/docs/index.xml
+++ b/docs/index.xml
@@ -15,6 +15,13 @@
       https://alanorth.github.io/cgspace-notes/2021-02/
       <h2 id="2021-02-01">2021-02-01</h2>
 <ul>
+<li>Abenet said that CIP found more duplicate records in their export from AReS
+<ul>
+<li>I re-opened <a href="https://github.com/ilri/OpenRXV/issues/67">the issue</a> on OpenRXV where we had previously noticed this</li>
+<li>The shared link where the duplicates are is here: <a href="https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6">https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6</a></li>
+</ul>
+</li>
+<li>I had a call with CodeObia to discuss the work on OpenRXV</li>
 <li>Check the results of the AReS harvesting from last night:</li>
 </ul>
 <pre><code class="language-console" data-lang="console">$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&amp;pretty'
diff --git a/docs/page/2/index.html b/docs/page/2/index.html
index 40ac7fdd8..1ea69f8c0 100644
--- a/docs/page/2/index.html
+++ b/docs/page/2/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/page/3/index.html b/docs/page/3/index.html
index b11fe1bc6..f598eb21b 100644
--- a/docs/page/3/index.html
+++ b/docs/page/3/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/page/4/index.html b/docs/page/4/index.html
index 36c9ab550..b1e0c8e83 100644
--- a/docs/page/4/index.html
+++ b/docs/page/4/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/page/5/index.html b/docs/page/5/index.html
index 0dcb7be52..e8c2617ca 100644
--- a/docs/page/5/index.html
+++ b/docs/page/5/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/page/6/index.html b/docs/page/6/index.html
index 55e3167c9..c66f30ba3 100644
--- a/docs/page/6/index.html
+++ b/docs/page/6/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/page/7/index.html b/docs/page/7/index.html
index 75e914d33..3507d7653 100644
--- a/docs/page/7/index.html
+++ b/docs/page/7/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/posts/index.html b/docs/posts/index.html
index c0805e976..297aa8674 100644
--- a/docs/posts/index.html
+++ b/docs/posts/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
@@ -107,6 +107,13 @@
   
   

2021-02-01

    +
  • Abenet said that CIP found more duplicate records in their export from AReS + +
  • +
  • I had a call with CodeObia to discuss the work on OpenRXV
  • Check the results of the AReS harvesting from last night:
$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&pretty'
diff --git a/docs/posts/index.xml b/docs/posts/index.xml
index 32fd6ccd6..85d579476 100644
--- a/docs/posts/index.xml
+++ b/docs/posts/index.xml
@@ -15,6 +15,13 @@
       https://alanorth.github.io/cgspace-notes/2021-02/
       <h2 id="2021-02-01">2021-02-01</h2>
 <ul>
+<li>Abenet said that CIP found more duplicate records in their export from AReS
+<ul>
+<li>I re-opened <a href="https://github.com/ilri/OpenRXV/issues/67">the issue</a> on OpenRXV where we had previously noticed this</li>
+<li>The shared link where the duplicates are is here: <a href="https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6">https://cgspace.cgiar.org/explorer/shared/heEOz3YBnXdK69bR2ra6</a></li>
+</ul>
+</li>
+<li>I had a call with CodeObia to discuss the work on OpenRXV</li>
 <li>Check the results of the AReS harvesting from last night:</li>
 </ul>
 <pre><code class="language-console" data-lang="console">$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&amp;pretty'
diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html
index 46c8332b5..587e220f7 100644
--- a/docs/posts/page/2/index.html
+++ b/docs/posts/page/2/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/posts/page/3/index.html b/docs/posts/page/3/index.html
index 999750407..1044b6d9e 100644
--- a/docs/posts/page/3/index.html
+++ b/docs/posts/page/3/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/posts/page/4/index.html b/docs/posts/page/4/index.html
index 25329710f..a5d7f7740 100644
--- a/docs/posts/page/4/index.html
+++ b/docs/posts/page/4/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/posts/page/5/index.html b/docs/posts/page/5/index.html
index b5282d756..e49abeaea 100644
--- a/docs/posts/page/5/index.html
+++ b/docs/posts/page/5/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/posts/page/6/index.html b/docs/posts/page/6/index.html
index 56c9f04b3..949b0577b 100644
--- a/docs/posts/page/6/index.html
+++ b/docs/posts/page/6/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/posts/page/7/index.html b/docs/posts/page/7/index.html
index 3cd99f64d..1459f0252 100644
--- a/docs/posts/page/7/index.html
+++ b/docs/posts/page/7/index.html
@@ -10,7 +10,7 @@
 
 
 
-
+
 
 
 
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index 2a16a0618..c031d1d91 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -4,27 +4,27 @@
   
   
     https://alanorth.github.io/cgspace-notes/categories/
-    2021-02-01T10:13:54+02:00
+    2021-02-01T12:28:54+02:00
   
   
   
     https://alanorth.github.io/cgspace-notes/
-    2021-02-01T10:13:54+02:00
+    2021-02-01T12:28:54+02:00
   
   
   
     https://alanorth.github.io/cgspace-notes/2021-02/
-    2021-02-01T10:13:54+02:00
+    2021-02-01T12:28:54+02:00
   
   
   
     https://alanorth.github.io/cgspace-notes/categories/notes/
-    2021-02-01T10:13:54+02:00
+    2021-02-01T12:28:54+02:00
   
   
   
     https://alanorth.github.io/cgspace-notes/posts/
-    2021-02-01T10:13:54+02:00
+    2021-02-01T12:28:54+02:00