diff --git a/content/posts/2022-08.md b/content/posts/2022-08.md
index 7fd3a1dc3..cfc26b9dc 100644
--- a/content/posts/2022-08.md
+++ b/content/posts/2022-08.md
@@ -96,4 +96,26 @@ $ dspace import --add --eperson=aorth@mjanja.ch --source /tmp/SimpleArchiveForma
- Add CONSERVATION to ILRI subjects on CGSpace
- I see that AGROVOC has `conservation agriculture` and I suggested that we use that instead
+## 2022-08-17
+
+- Peter and Jose sent more feedback about the CRP Innovation records from MARLO
+ - We expanded the CRP names in the citation and removed the `cg.identifier.url` URLs because they are ugly and will stop working eventually
+ - The mappings of MARLO links will be done internally with the `cg.number` IDs like "IN-1119" and the Handle URIs
+
+## 2022-08-18
+
+- I talked to Jose about the CCAFS MARLO records
+ - He still hasn't finished re-processing the PDFs to update the internal MARLO links
+ - I started looking at the other records (MELIAs, OICRs, Policies) and found some minor issues in the MELIAs so I sent feedback to Jose
+ - On second thought, I opened the MELIAs file in OpenRefine and it looks OK, so this must have been a parsing issue in LibreOffice when I was checking the file (or perhaps I didn't use the correct quoting when importing)
+- Import the original MELIA v2 CSV file into OpenRefine to fix encoding before processing with csvcut/csvjoin
+ - Then extract the IDs and filenames from the original V2 file and join with the UTF-8 file:
+
+```console
+$ csvcut -c 'cg.number (series/report No.)',File ~/Downloads/MELIA-Metadata-v2-csv.csv > MELIA-v2-IDs-Files.csv
+$ csvjoin -c 'cg.number (series/report No.)' MELIAs\ metadata\ utf8\ 20220816_JM.csv MELIA-v2-IDs-Files.csv > MELIAs-UTF-8-with-files.csv
+```
+
+- Then I imported them into OpenRefine to start metadata cleaning and enrichment
+
diff --git a/docs/2022-08/index.html b/docs/2022-08/index.html
index 3bfc3b3ac..8399b64f5 100644
--- a/docs/2022-08/index.html
+++ b/docs/2022-08/index.html
@@ -14,7 +14,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
-
+
@@ -34,9 +34,9 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
"@type": "BlogPosting",
"headline": "August, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-08/",
- "wordCount": "871",
+ "wordCount": "1080",
"datePublished": "2022-08-01T10:22:36+03:00",
- "dateModified": "2022-08-13T21:51:49-07:00",
+ "dateModified": "2022-08-15T18:46:57-07:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@@ -221,6 +221,35 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
+
2022-08-17
+
+- Peter and Jose sent more feedback about the CRP Innovation records from MARLO
+
+- We expanded the CRP names in the citation and removed the
cg.identifier.url
URLs because they are ugly and will stop working eventually
+- The mappings of MARLO links will be done internally with the
cg.number
IDs like “IN-1119” and the Handle URIs
+
+
+
+2022-08-18
+
+- I talked to Jose about the CCAFS MARLO records
+
+- He still hasn’t finished re-processing the PDFs to update the internal MARLO links
+- I started looking at the other records (MELIAs, OICRs, Policies) and found some minor issues in the MELIAs so I sent feedback to Jose
+- On second thought, I opened the MELIAs file in OpenRefine and it looks OK, so this must have been a parsing issue in LibreOffice when I was checking the file (or perhaps I didn’t use the correct quoting when importing)
+
+
+- Import the original MELIA v2 CSV file into OpenRefine to fix encoding before processing with csvcut/csvjoin
+
+- Then extract the IDs and filenames from the original V2 file and join with the UTF-8 file:
+
+
+
+$ csvcut -c 'cg.number (series/report No.)',File ~/Downloads/MELIA-Metadata-v2-csv.csv > MELIA-v2-IDs-Files.csv
+$ csvjoin -c 'cg.number (series/report No.)' MELIAs\ metadata\ utf8\ 20220816_JM.csv MELIA-v2-IDs-Files.csv > MELIAs-UTF-8-with-files.csv
+
+- Then I imported them into OpenRefine to start metadata cleaning and enrichment
+
diff --git a/docs/categories/index.html b/docs/categories/index.html
index d22e9bf1a..6d069ec56 100644
--- a/docs/categories/index.html
+++ b/docs/categories/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html
index c68614b38..ec759e39d 100644
--- a/docs/categories/notes/index.html
+++ b/docs/categories/notes/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html
index 37dac6463..96f541f21 100644
--- a/docs/categories/notes/page/2/index.html
+++ b/docs/categories/notes/page/2/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/page/3/index.html b/docs/categories/notes/page/3/index.html
index 810ef6a7f..0c018706b 100644
--- a/docs/categories/notes/page/3/index.html
+++ b/docs/categories/notes/page/3/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/page/4/index.html b/docs/categories/notes/page/4/index.html
index 17b689bd4..cff4facf4 100644
--- a/docs/categories/notes/page/4/index.html
+++ b/docs/categories/notes/page/4/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/page/5/index.html b/docs/categories/notes/page/5/index.html
index 4ce9fba18..24a95783c 100644
--- a/docs/categories/notes/page/5/index.html
+++ b/docs/categories/notes/page/5/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/page/6/index.html b/docs/categories/notes/page/6/index.html
index 266d3c530..751ef39c5 100644
--- a/docs/categories/notes/page/6/index.html
+++ b/docs/categories/notes/page/6/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/categories/notes/page/7/index.html b/docs/categories/notes/page/7/index.html
index 20d244a30..1f053759f 100644
--- a/docs/categories/notes/page/7/index.html
+++ b/docs/categories/notes/page/7/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/index.html b/docs/index.html
index 4fd3ddec1..022ae2002 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/2/index.html b/docs/page/2/index.html
index 74890e308..2a72c7905 100644
--- a/docs/page/2/index.html
+++ b/docs/page/2/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/3/index.html b/docs/page/3/index.html
index 6c3f8788c..1d1bfc6c8 100644
--- a/docs/page/3/index.html
+++ b/docs/page/3/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/4/index.html b/docs/page/4/index.html
index 95f3bd04f..bb921c81a 100644
--- a/docs/page/4/index.html
+++ b/docs/page/4/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/5/index.html b/docs/page/5/index.html
index ba2a2b720..9265507ad 100644
--- a/docs/page/5/index.html
+++ b/docs/page/5/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/6/index.html b/docs/page/6/index.html
index 555edfe14..5faf718d9 100644
--- a/docs/page/6/index.html
+++ b/docs/page/6/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/7/index.html b/docs/page/7/index.html
index af99ab30e..1b74f5022 100644
--- a/docs/page/7/index.html
+++ b/docs/page/7/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/8/index.html b/docs/page/8/index.html
index b4ffed3da..cc04936d7 100644
--- a/docs/page/8/index.html
+++ b/docs/page/8/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/page/9/index.html b/docs/page/9/index.html
index 26deb7a23..8094229d2 100644
--- a/docs/page/9/index.html
+++ b/docs/page/9/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/index.html b/docs/posts/index.html
index 2dafd4415..fdb0e5615 100644
--- a/docs/posts/index.html
+++ b/docs/posts/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html
index d44af3a7d..1c65751c8 100644
--- a/docs/posts/page/2/index.html
+++ b/docs/posts/page/2/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/3/index.html b/docs/posts/page/3/index.html
index e3a3bc3c7..d9ccf12b7 100644
--- a/docs/posts/page/3/index.html
+++ b/docs/posts/page/3/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/4/index.html b/docs/posts/page/4/index.html
index f12d4b2c8..282e59e87 100644
--- a/docs/posts/page/4/index.html
+++ b/docs/posts/page/4/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/5/index.html b/docs/posts/page/5/index.html
index 3d0995384..a2a7a0571 100644
--- a/docs/posts/page/5/index.html
+++ b/docs/posts/page/5/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/6/index.html b/docs/posts/page/6/index.html
index 72d05d6aa..2637a2f56 100644
--- a/docs/posts/page/6/index.html
+++ b/docs/posts/page/6/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/7/index.html b/docs/posts/page/7/index.html
index 1f74c01b0..15e53a428 100644
--- a/docs/posts/page/7/index.html
+++ b/docs/posts/page/7/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/8/index.html b/docs/posts/page/8/index.html
index beea82838..ce8daef00 100644
--- a/docs/posts/page/8/index.html
+++ b/docs/posts/page/8/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/posts/page/9/index.html b/docs/posts/page/9/index.html
index bc30cdb33..e6d2054dd 100644
--- a/docs/posts/page/9/index.html
+++ b/docs/posts/page/9/index.html
@@ -10,7 +10,7 @@
-
+
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index 2c2b201d7..96ebcd575 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -3,19 +3,19 @@
xmlns:xhtml="http://www.w3.org/1999/xhtml">
https://alanorth.github.io/cgspace-notes/2022-08/
- 2022-08-13T21:51:49-07:00
+ 2022-08-15T18:46:57-07:00
https://alanorth.github.io/cgspace-notes/categories/
- 2022-08-13T21:51:49-07:00
+ 2022-08-15T18:46:57-07:00
https://alanorth.github.io/cgspace-notes/
- 2022-08-13T21:51:49-07:00
+ 2022-08-15T18:46:57-07:00
https://alanorth.github.io/cgspace-notes/categories/notes/
- 2022-08-13T21:51:49-07:00
+ 2022-08-15T18:46:57-07:00
https://alanorth.github.io/cgspace-notes/posts/
- 2022-08-13T21:51:49-07:00
+ 2022-08-15T18:46:57-07:00
https://alanorth.github.io/cgspace-notes/2022-07/
2022-07-31T15:49:35+03:00