mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-24 07:30:18 +01:00
Add notes for 2020-07-08
This commit is contained in:
parent
5291baa539
commit
8d42c71a44
@ -352,7 +352,7 @@ Total number of bot hits purged: 29025
|
||||
|
||||
## 2020-06-14
|
||||
|
||||
- Abenet asked for a list of authors from CIP's community so that Gabriel can make some corrections
|
||||
- Abenet asked for a list of authors from CIP's community so that Gabriela can make some corrections
|
||||
- I generated a list of collections in CIPs two communities using the REST API:
|
||||
|
||||
```
|
||||
|
@ -303,4 +303,38 @@ $ ./fix-metadata-values.py -i 2020-07-07-fix-sponsors.csv -db dspace -u dspace -
|
||||
|
||||
![Altmetric and Dimensions.ai badge](/cgspace-notes/2020/07/dimensions-badge2.png)
|
||||
|
||||
## 2020-07-08
|
||||
|
||||
- Generate a CSV of all the AGROVOC subjects that didn't match from the top 6500 I exported earlier this week:
|
||||
|
||||
```
|
||||
$ csvgrep -c 'number of matches' -r "^0$" 2020-07-05-cgspace-subjects.csv | csvcut -c 1 > 2020-07-05-cgspace-invalid-subjects.csv
|
||||
```
|
||||
|
||||
- Yesterday Gabriela from CIP emailed to say that she was removing the accents from her authors' names because of "funny character" issues with reports generated from CGSpace
|
||||
- I told her that it's probably her Windows / Excel that is messing up the data, and she figured out how to open them correctly!
|
||||
- Now she says she doesn't want to remove the accents after all and she sent me a new list of corrections
|
||||
- I used csvgrep and found a few where she is still removing accents:
|
||||
|
||||
```
|
||||
$ csvgrep -c 2 -r "^.+$" ~/Downloads/cip-authors-GH-20200706.csv | csvgrep -c 1 -r "^.*[À-ú].*$" | csvgrep -c 2 -r "^.*[À-ú].*$" -i | csvcut -c 1,2
|
||||
dc.contributor.author,correction
|
||||
"López, G.","Lopez, G."
|
||||
"Gómez, R.","Gomez, R."
|
||||
"García, M.","Garcia, M."
|
||||
"Mejía, A.","Mejia, A."
|
||||
"Quiróz, Roberto A.","Quiroz, R."
|
||||
```
|
||||
|
||||
- csvgrep from the csvkit suite is *so cool*:
|
||||
- Select lines with column two (the correction) having a value
|
||||
- Select lines with column one (the original author name) having an accent / diacritic
|
||||
- Select lines with column two (the correction) NOT having an accent (ie, she's not removing an accent)
|
||||
- Select columns one and two
|
||||
|
||||
- Peter said he liked the work I didn on the badges yesterday so I put some finishing touches on it to detect more DOI URI styles and pushed it to the `5_x-prod` branch
|
||||
- I will port it to DSpace 6 soon
|
||||
|
||||
![Altmetric and Dimensions badges](/cgspace-notes/2020/07/altmetrics-dimensions-badges.png)
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
||||
|
@ -488,7 +488,7 @@ Total number of bot hits purged: 29025
|
||||
</ul>
|
||||
<h2 id="2020-06-14">2020-06-14</h2>
|
||||
<ul>
|
||||
<li>Abenet asked for a list of authors from CIP’s community so that Gabriel can make some corrections
|
||||
<li>Abenet asked for a list of authors from CIP’s community so that Gabriela can make some corrections
|
||||
<ul>
|
||||
<li>I generated a list of collections in CIPs two communities using the REST API:</li>
|
||||
</ul>
|
||||
|
@ -20,7 +20,7 @@ Since I was restarting Tomcat anyways I decided to redeploy the latest changes f
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-07/" />
|
||||
<meta property="article:published_time" content="2020-07-01T10:53:54+03:00" />
|
||||
<meta property="article:modified_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="article:modified_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="July, 2020"/>
|
||||
@ -45,9 +45,9 @@ Since I was restarting Tomcat anyways I decided to redeploy the latest changes f
|
||||
"@type": "BlogPosting",
|
||||
"headline": "July, 2020",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2020-07/",
|
||||
"wordCount": "1858",
|
||||
"wordCount": "2116",
|
||||
"datePublished": "2020-07-01T10:53:54+03:00",
|
||||
"dateModified": "2020-07-07T12:53:16+03:00",
|
||||
"dateModified": "2020-07-07T16:14:49+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -425,6 +425,45 @@ $ ./fix-metadata-values.py -i 2020-07-07-fix-sponsors.csv -db dspace -u dspace -
|
||||
</li>
|
||||
</ul>
|
||||
<p><img src="/cgspace-notes/2020/07/dimensions-badge2.png" alt="Altmetric and Dimensions.ai badge"></p>
|
||||
<h2 id="2020-07-08">2020-07-08</h2>
|
||||
<ul>
|
||||
<li>Generate a CSV of all the AGROVOC subjects that didn’t match from the top 6500 I exported earlier this week:</li>
|
||||
</ul>
|
||||
<pre><code>$ csvgrep -c 'number of matches' -r "^0$" 2020-07-05-cgspace-subjects.csv | csvcut -c 1 > 2020-07-05-cgspace-invalid-subjects.csv
|
||||
</code></pre><ul>
|
||||
<li>Yesterday Gabriela from CIP emailed to say that she was removing the accents from her authors’ names because of “funny character” issues with reports generated from CGSpace
|
||||
<ul>
|
||||
<li>I told her that it’s probably her Windows / Excel that is messing up the data, and she figured out how to open them correctly!</li>
|
||||
<li>Now she says she doesn’t want to remove the accents after all and she sent me a new list of corrections</li>
|
||||
<li>I used csvgrep and found a few where she is still removing accents:</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<pre><code>$ csvgrep -c 2 -r "^.+$" ~/Downloads/cip-authors-GH-20200706.csv | csvgrep -c 1 -r "^.*[À-ú].*$" | csvgrep -c 2 -r "^.*[À-ú].*$" -i | csvcut -c 1,2
|
||||
dc.contributor.author,correction
|
||||
"López, G.","Lopez, G."
|
||||
"Gómez, R.","Gomez, R."
|
||||
"García, M.","Garcia, M."
|
||||
"Mejía, A.","Mejia, A."
|
||||
"Quiróz, Roberto A.","Quiroz, R."
|
||||
</code></pre><ul>
|
||||
<li>
|
||||
<p>csvgrep from the csvkit suite is <em>so cool</em>:</p>
|
||||
<ul>
|
||||
<li>Select lines with column two (the correction) having a value</li>
|
||||
<li>Select lines with column one (the original author name) having an accent / diacritic</li>
|
||||
<li>Select lines with column two (the correction) NOT having an accent (ie, she’s not removing an accent)</li>
|
||||
<li>Select columns one and two</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>
|
||||
<p>Peter said he liked the work I didn on the badges yesterday so I put some finishing touches on it to detect more DOI URI styles and pushed it to the <code>5_x-prod</code> branch</p>
|
||||
<ul>
|
||||
<li>I will port it to DSpace 6 soon</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<p><img src="/cgspace-notes/2020/07/altmetrics-dimensions-badges.png" alt="Altmetric and Dimensions badges"></p>
|
||||
<!-- raw HTML omitted -->
|
||||
|
||||
|
||||
|
BIN
docs/2020/07/altmetrics-dimensions-badges.png
Normal file
BIN
docs/2020/07/altmetrics-dimensions-badges.png
Normal file
Binary file not shown.
After Width: | Height: | Size: 114 KiB |
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Categories"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="CGSpace Notes"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Posts"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Posts"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Posts"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Posts"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Posts"/>
|
||||
|
@ -9,7 +9,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2020-07-07T12:53:16+03:00" />
|
||||
<meta property="og:updated_time" content="2020-07-07T16:14:49+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="Posts"/>
|
||||
|
@ -4,27 +4,27 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
|
||||
<lastmod>2020-07-07T12:53:16+03:00</lastmod>
|
||||
<lastmod>2020-07-07T16:14:49+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||
<lastmod>2020-07-07T12:53:16+03:00</lastmod>
|
||||
<lastmod>2020-07-07T16:14:49+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2020-07/</loc>
|
||||
<lastmod>2020-07-07T12:53:16+03:00</lastmod>
|
||||
<lastmod>2020-07-07T16:14:49+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
|
||||
<lastmod>2020-07-07T12:53:16+03:00</lastmod>
|
||||
<lastmod>2020-07-07T16:14:49+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
||||
<lastmod>2020-07-07T12:53:16+03:00</lastmod>
|
||||
<lastmod>2020-07-07T16:14:49+03:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
|
BIN
static/2020/07/altmetrics-dimensions-badges.png
Normal file
BIN
static/2020/07/altmetrics-dimensions-badges.png
Normal file
Binary file not shown.
After Width: | Height: | Size: 114 KiB |
Loading…
Reference in New Issue
Block a user