diff --git a/content/posts/2020-06.md b/content/posts/2020-06.md index baae002c3..3b329224b 100644 --- a/content/posts/2020-06.md +++ b/content/posts/2020-06.md @@ -352,7 +352,7 @@ Total number of bot hits purged: 29025 ## 2020-06-14 -- Abenet asked for a list of authors from CIP's community so that Gabriel can make some corrections +- Abenet asked for a list of authors from CIP's community so that Gabriela can make some corrections - I generated a list of collections in CIPs two communities using the REST API: ``` diff --git a/content/posts/2020-07.md b/content/posts/2020-07.md index 4ef2517f6..df2923cb0 100644 --- a/content/posts/2020-07.md +++ b/content/posts/2020-07.md @@ -303,4 +303,38 @@ $ ./fix-metadata-values.py -i 2020-07-07-fix-sponsors.csv -db dspace -u dspace - ![Altmetric and Dimensions.ai badge](/cgspace-notes/2020/07/dimensions-badge2.png) +## 2020-07-08 + +- Generate a CSV of all the AGROVOC subjects that didn't match from the top 6500 I exported earlier this week: + +``` +$ csvgrep -c 'number of matches' -r "^0$" 2020-07-05-cgspace-subjects.csv | csvcut -c 1 > 2020-07-05-cgspace-invalid-subjects.csv +``` + +- Yesterday Gabriela from CIP emailed to say that she was removing the accents from her authors' names because of "funny character" issues with reports generated from CGSpace + - I told her that it's probably her Windows / Excel that is messing up the data, and she figured out how to open them correctly! + - Now she says she doesn't want to remove the accents after all and she sent me a new list of corrections + - I used csvgrep and found a few where she is still removing accents: + +``` +$ csvgrep -c 2 -r "^.+$" ~/Downloads/cip-authors-GH-20200706.csv | csvgrep -c 1 -r "^.*[À-ú].*$" | csvgrep -c 2 -r "^.*[À-ú].*$" -i | csvcut -c 1,2 +dc.contributor.author,correction +"López, G.","Lopez, G." +"Gómez, R.","Gomez, R." +"García, M.","Garcia, M." +"Mejía, A.","Mejia, A." +"Quiróz, Roberto A.","Quiroz, R." +``` + +- csvgrep from the csvkit suite is *so cool*: + - Select lines with column two (the correction) having a value + - Select lines with column one (the original author name) having an accent / diacritic + - Select lines with column two (the correction) NOT having an accent (ie, she's not removing an accent) + - Select columns one and two + +- Peter said he liked the work I didn on the badges yesterday so I put some finishing touches on it to detect more DOI URI styles and pushed it to the `5_x-prod` branch + - I will port it to DSpace 6 soon + +![Altmetric and Dimensions badges](/cgspace-notes/2020/07/altmetrics-dimensions-badges.png) + diff --git a/docs/2020-06/index.html b/docs/2020-06/index.html index cffdb4b5f..12f4ae01e 100644 --- a/docs/2020-06/index.html +++ b/docs/2020-06/index.html @@ -488,7 +488,7 @@ Total number of bot hits purged: 29025

2020-06-14

Altmetric and Dimensions.ai badge

+

2020-07-08

+ +
$ csvgrep -c 'number of matches' -r "^0$" 2020-07-05-cgspace-subjects.csv | csvcut -c 1 > 2020-07-05-cgspace-invalid-subjects.csv
+
+
$ csvgrep -c 2 -r "^.+$" ~/Downloads/cip-authors-GH-20200706.csv | csvgrep -c 1 -r "^.*[À-ú].*$" | csvgrep -c 2 -r "^.*[À-ú].*$" -i | csvcut -c 1,2
+dc.contributor.author,correction
+"López, G.","Lopez, G."
+"Gómez, R.","Gomez, R."
+"García, M.","Garcia, M."
+"Mejía, A.","Mejia, A."
+"Quiróz, Roberto A.","Quiroz, R."
+
+

Altmetric and Dimensions badges

diff --git a/docs/2020/07/altmetrics-dimensions-badges.png b/docs/2020/07/altmetrics-dimensions-badges.png new file mode 100644 index 000000000..0420689c3 Binary files /dev/null and b/docs/2020/07/altmetrics-dimensions-badges.png differ diff --git a/docs/categories/index.html b/docs/categories/index.html index 386318579..25148e024 100644 --- a/docs/categories/index.html +++ b/docs/categories/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html index 2aaea2891..cd0dc0fae 100644 --- a/docs/categories/notes/index.html +++ b/docs/categories/notes/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html index b87113ec3..f1a696de3 100644 --- a/docs/categories/notes/page/2/index.html +++ b/docs/categories/notes/page/2/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/categories/notes/page/3/index.html b/docs/categories/notes/page/3/index.html index f930b7231..32b087c13 100644 --- a/docs/categories/notes/page/3/index.html +++ b/docs/categories/notes/page/3/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/categories/notes/page/4/index.html b/docs/categories/notes/page/4/index.html index 6d0d75831..dbbb5bd06 100644 --- a/docs/categories/notes/page/4/index.html +++ b/docs/categories/notes/page/4/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/index.html b/docs/index.html index 1be880b3c..8457b6fdf 100644 --- a/docs/index.html +++ b/docs/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/page/2/index.html b/docs/page/2/index.html index 4b9f72436..1a4a41a70 100644 --- a/docs/page/2/index.html +++ b/docs/page/2/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/page/3/index.html b/docs/page/3/index.html index 29fa48b1b..4c0b36e8e 100644 --- a/docs/page/3/index.html +++ b/docs/page/3/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/page/4/index.html b/docs/page/4/index.html index fbe723dac..47a1967a8 100644 --- a/docs/page/4/index.html +++ b/docs/page/4/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/page/5/index.html b/docs/page/5/index.html index ffb739e13..e3731472c 100644 --- a/docs/page/5/index.html +++ b/docs/page/5/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/page/6/index.html b/docs/page/6/index.html index d504a6c18..281969542 100644 --- a/docs/page/6/index.html +++ b/docs/page/6/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/posts/index.html b/docs/posts/index.html index f1caef133..4a5524498 100644 --- a/docs/posts/index.html +++ b/docs/posts/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html index 3f7fa9530..a576b519c 100644 --- a/docs/posts/page/2/index.html +++ b/docs/posts/page/2/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/posts/page/3/index.html b/docs/posts/page/3/index.html index 1cc69b61f..a5eee51a5 100644 --- a/docs/posts/page/3/index.html +++ b/docs/posts/page/3/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/posts/page/4/index.html b/docs/posts/page/4/index.html index 1ae73206f..46216190d 100644 --- a/docs/posts/page/4/index.html +++ b/docs/posts/page/4/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/posts/page/5/index.html b/docs/posts/page/5/index.html index 75d6e3c44..96fb70447 100644 --- a/docs/posts/page/5/index.html +++ b/docs/posts/page/5/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/posts/page/6/index.html b/docs/posts/page/6/index.html index 2b36948d7..6858b6f70 100644 --- a/docs/posts/page/6/index.html +++ b/docs/posts/page/6/index.html @@ -9,7 +9,7 @@ - + diff --git a/docs/sitemap.xml b/docs/sitemap.xml index e38ec843c..d5596621b 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -4,27 +4,27 @@ https://alanorth.github.io/cgspace-notes/categories/ - 2020-07-07T12:53:16+03:00 + 2020-07-07T16:14:49+03:00 https://alanorth.github.io/cgspace-notes/ - 2020-07-07T12:53:16+03:00 + 2020-07-07T16:14:49+03:00 https://alanorth.github.io/cgspace-notes/2020-07/ - 2020-07-07T12:53:16+03:00 + 2020-07-07T16:14:49+03:00 https://alanorth.github.io/cgspace-notes/categories/notes/ - 2020-07-07T12:53:16+03:00 + 2020-07-07T16:14:49+03:00 https://alanorth.github.io/cgspace-notes/posts/ - 2020-07-07T12:53:16+03:00 + 2020-07-07T16:14:49+03:00 diff --git a/static/2020/07/altmetrics-dimensions-badges.png b/static/2020/07/altmetrics-dimensions-badges.png new file mode 100644 index 000000000..0420689c3 Binary files /dev/null and b/static/2020/07/altmetrics-dimensions-badges.png differ