diff --git a/content/posts/2020-06.md b/content/posts/2020-06.md
index baae002c3..3b329224b 100644
--- a/content/posts/2020-06.md
+++ b/content/posts/2020-06.md
@@ -352,7 +352,7 @@ Total number of bot hits purged: 29025
## 2020-06-14
-- Abenet asked for a list of authors from CIP's community so that Gabriel can make some corrections
+- Abenet asked for a list of authors from CIP's community so that Gabriela can make some corrections
- I generated a list of collections in CIPs two communities using the REST API:
```
diff --git a/content/posts/2020-07.md b/content/posts/2020-07.md
index 4ef2517f6..df2923cb0 100644
--- a/content/posts/2020-07.md
+++ b/content/posts/2020-07.md
@@ -303,4 +303,38 @@ $ ./fix-metadata-values.py -i 2020-07-07-fix-sponsors.csv -db dspace -u dspace -
![Altmetric and Dimensions.ai badge](/cgspace-notes/2020/07/dimensions-badge2.png)
+## 2020-07-08
+
+- Generate a CSV of all the AGROVOC subjects that didn't match from the top 6500 I exported earlier this week:
+
+```
+$ csvgrep -c 'number of matches' -r "^0$" 2020-07-05-cgspace-subjects.csv | csvcut -c 1 > 2020-07-05-cgspace-invalid-subjects.csv
+```
+
+- Yesterday Gabriela from CIP emailed to say that she was removing the accents from her authors' names because of "funny character" issues with reports generated from CGSpace
+ - I told her that it's probably her Windows / Excel that is messing up the data, and she figured out how to open them correctly!
+ - Now she says she doesn't want to remove the accents after all and she sent me a new list of corrections
+ - I used csvgrep and found a few where she is still removing accents:
+
+```
+$ csvgrep -c 2 -r "^.+$" ~/Downloads/cip-authors-GH-20200706.csv | csvgrep -c 1 -r "^.*[À-ú].*$" | csvgrep -c 2 -r "^.*[À-ú].*$" -i | csvcut -c 1,2
+dc.contributor.author,correction
+"López, G.","Lopez, G."
+"Gómez, R.","Gomez, R."
+"García, M.","Garcia, M."
+"Mejía, A.","Mejia, A."
+"Quiróz, Roberto A.","Quiroz, R."
+```
+
+- csvgrep from the csvkit suite is *so cool*:
+ - Select lines with column two (the correction) having a value
+ - Select lines with column one (the original author name) having an accent / diacritic
+ - Select lines with column two (the correction) NOT having an accent (ie, she's not removing an accent)
+ - Select columns one and two
+
+- Peter said he liked the work I didn on the badges yesterday so I put some finishing touches on it to detect more DOI URI styles and pushed it to the `5_x-prod` branch
+ - I will port it to DSpace 6 soon
+
+![Altmetric and Dimensions badges](/cgspace-notes/2020/07/altmetrics-dimensions-badges.png)
+
diff --git a/docs/2020-06/index.html b/docs/2020-06/index.html
index cffdb4b5f..12f4ae01e 100644
--- a/docs/2020-06/index.html
+++ b/docs/2020-06/index.html
@@ -488,7 +488,7 @@ Total number of bot hits purged: 29025
2020-06-14
-- Abenet asked for a list of authors from CIP’s community so that Gabriel can make some corrections
+
- Abenet asked for a list of authors from CIP’s community so that Gabriela can make some corrections
- I generated a list of collections in CIPs two communities using the REST API:
diff --git a/docs/2020-07/index.html b/docs/2020-07/index.html
index f1549fd65..6e6f92591 100644
--- a/docs/2020-07/index.html
+++ b/docs/2020-07/index.html
@@ -20,7 +20,7 @@ Since I was restarting Tomcat anyways I decided to redeploy the latest changes f
-
+
@@ -45,9 +45,9 @@ Since I was restarting Tomcat anyways I decided to redeploy the latest changes f
"@type": "BlogPosting",
"headline": "July, 2020",
"url": "https://alanorth.github.io/cgspace-notes/2020-07/",
- "wordCount": "1858",
+ "wordCount": "2116",
"datePublished": "2020-07-01T10:53:54+03:00",
- "dateModified": "2020-07-07T12:53:16+03:00",
+ "dateModified": "2020-07-07T16:14:49+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@@ -425,6 +425,45 @@ $ ./fix-metadata-values.py -i 2020-07-07-fix-sponsors.csv -db dspace -u dspace -
+2020-07-08
+
+- Generate a CSV of all the AGROVOC subjects that didn’t match from the top 6500 I exported earlier this week:
+
+$ csvgrep -c 'number of matches' -r "^0$" 2020-07-05-cgspace-subjects.csv | csvcut -c 1 > 2020-07-05-cgspace-invalid-subjects.csv
+
+- Yesterday Gabriela from CIP emailed to say that she was removing the accents from her authors’ names because of “funny character” issues with reports generated from CGSpace
+
+- I told her that it’s probably her Windows / Excel that is messing up the data, and she figured out how to open them correctly!
+- Now she says she doesn’t want to remove the accents after all and she sent me a new list of corrections
+- I used csvgrep and found a few where she is still removing accents:
+
+
+
+$ csvgrep -c 2 -r "^.+$" ~/Downloads/cip-authors-GH-20200706.csv | csvgrep -c 1 -r "^.*[À-ú].*$" | csvgrep -c 2 -r "^.*[À-ú].*$" -i | csvcut -c 1,2
+dc.contributor.author,correction
+"López, G.","Lopez, G."
+"Gómez, R.","Gomez, R."
+"García, M.","Garcia, M."
+"Mejía, A.","Mejia, A."
+"Quiróz, Roberto A.","Quiroz, R."
+
+-
+
csvgrep from the csvkit suite is so cool:
+
+- Select lines with column two (the correction) having a value
+- Select lines with column one (the original author name) having an accent / diacritic
+- Select lines with column two (the correction) NOT having an accent (ie, she’s not removing an accent)
+- Select columns one and two
+
+
+-
+
Peter said he liked the work I didn on the badges yesterday so I put some finishing touches on it to detect more DOI URI styles and pushed it to the 5_x-prod
branch
+
+- I will port it to DSpace 6 soon
+
+
+
+
diff --git a/docs/2020/07/altmetrics-dimensions-badges.png b/docs/2020/07/altmetrics-dimensions-badges.png
new file mode 100644
index 000000000..0420689c3
Binary files /dev/null and b/docs/2020/07/altmetrics-dimensions-badges.png differ
diff --git a/docs/categories/index.html b/docs/categories/index.html
index 386318579..25148e024 100644
--- a/docs/categories/index.html
+++ b/docs/categories/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html
index 2aaea2891..cd0dc0fae 100644
--- a/docs/categories/notes/index.html
+++ b/docs/categories/notes/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html
index b87113ec3..f1a696de3 100644
--- a/docs/categories/notes/page/2/index.html
+++ b/docs/categories/notes/page/2/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/page/3/index.html b/docs/categories/notes/page/3/index.html
index f930b7231..32b087c13 100644
--- a/docs/categories/notes/page/3/index.html
+++ b/docs/categories/notes/page/3/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/page/4/index.html b/docs/categories/notes/page/4/index.html
index 6d0d75831..dbbb5bd06 100644
--- a/docs/categories/notes/page/4/index.html
+++ b/docs/categories/notes/page/4/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/index.html b/docs/index.html
index 1be880b3c..8457b6fdf 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/2/index.html b/docs/page/2/index.html
index 4b9f72436..1a4a41a70 100644
--- a/docs/page/2/index.html
+++ b/docs/page/2/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/3/index.html b/docs/page/3/index.html
index 29fa48b1b..4c0b36e8e 100644
--- a/docs/page/3/index.html
+++ b/docs/page/3/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/4/index.html b/docs/page/4/index.html
index fbe723dac..47a1967a8 100644
--- a/docs/page/4/index.html
+++ b/docs/page/4/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/5/index.html b/docs/page/5/index.html
index ffb739e13..e3731472c 100644
--- a/docs/page/5/index.html
+++ b/docs/page/5/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/6/index.html b/docs/page/6/index.html
index d504a6c18..281969542 100644
--- a/docs/page/6/index.html
+++ b/docs/page/6/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/index.html b/docs/posts/index.html
index f1caef133..4a5524498 100644
--- a/docs/posts/index.html
+++ b/docs/posts/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html
index 3f7fa9530..a576b519c 100644
--- a/docs/posts/page/2/index.html
+++ b/docs/posts/page/2/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/3/index.html b/docs/posts/page/3/index.html
index 1cc69b61f..a5eee51a5 100644
--- a/docs/posts/page/3/index.html
+++ b/docs/posts/page/3/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/4/index.html b/docs/posts/page/4/index.html
index 1ae73206f..46216190d 100644
--- a/docs/posts/page/4/index.html
+++ b/docs/posts/page/4/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/5/index.html b/docs/posts/page/5/index.html
index 75d6e3c44..96fb70447 100644
--- a/docs/posts/page/5/index.html
+++ b/docs/posts/page/5/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/6/index.html b/docs/posts/page/6/index.html
index 2b36948d7..6858b6f70 100644
--- a/docs/posts/page/6/index.html
+++ b/docs/posts/page/6/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index e38ec843c..d5596621b 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -4,27 +4,27 @@
https://alanorth.github.io/cgspace-notes/categories/
- 2020-07-07T12:53:16+03:00
+ 2020-07-07T16:14:49+03:00
https://alanorth.github.io/cgspace-notes/
- 2020-07-07T12:53:16+03:00
+ 2020-07-07T16:14:49+03:00
https://alanorth.github.io/cgspace-notes/2020-07/
- 2020-07-07T12:53:16+03:00
+ 2020-07-07T16:14:49+03:00
https://alanorth.github.io/cgspace-notes/categories/notes/
- 2020-07-07T12:53:16+03:00
+ 2020-07-07T16:14:49+03:00
https://alanorth.github.io/cgspace-notes/posts/
- 2020-07-07T12:53:16+03:00
+ 2020-07-07T16:14:49+03:00
diff --git a/static/2020/07/altmetrics-dimensions-badges.png b/static/2020/07/altmetrics-dimensions-badges.png
new file mode 100644
index 000000000..0420689c3
Binary files /dev/null and b/static/2020/07/altmetrics-dimensions-badges.png differ