mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-26 08:28:18 +01:00
Update notes for 2019-10-13
parent be1e2a283c
commit 0171ace573
@@ -136,5 +136,44 @@ $ ./fix-metadata-values.py -i /tmp/affiliations.csv -db dspace -u dspace -p 'fuu
 
 - More cleanup work on the authors in the Bioversity migration
 - Now I sent the final feedback to Francesca, Carol, and Abenet
+- Peter is still seeing some authors listed with "|" in the "Top Authors" statistics for some collections
+- I looked in some of the items that are listed and the author field does not contain those invalid separators
+- I decided to try doing a full Discovery re-indexing on CGSpace (linode18):
+
+```
+$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
+
+real 82m35.993s
+```
+
+- After the re-indexing the top authors still list the following:
+
+```
+Jagwe, J.|Ouma, E.A.|Brandes-van Dorresteijn, D.|Kawuma, Brian|Smith, J.
+```
+
+- I looked in the database to find authors that had "|" in them:
+
+```
+dspace=# SELECT text_value, resource_id FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value LIKE '%|%';
+            text_value            | resource_id
+----------------------------------+-------------
+ Anandajayasekeram, P.|Puskur, R. |         157
+ Morales, J.|Renner, I.           |       22779
+ Zahid, A.|Haque, M.A.            |       25492
+(3 rows)
+```
+
+- Then I found their handles and corrected them, for example:
+
+```
+dspacetest=# select handle from item, handle where handle.resource_id = item.item_id AND item.item_id = '157' and handle.resource_type_id=2;
+  handle
+-----------
+ 10568/129
+(1 row)
+```
+
+- So I'm still not sure where these weird authors in the "Top Author" stats are coming from
 
 <!-- vim: set sw=2 ts=2: -->
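The two lookups in the notes above (find author values containing "|", then find each item's handle) could be combined into a single join, with the offending value split into separate names for correction. The following is only a sketch under stated assumptions: it uses an in-memory SQLite database with a simplified two-table schema standing in for the real DSpace PostgreSQL tables, and the helper names `split_authors`, `find_piped_authors`, and `build_demo_db` are hypothetical. The only real data is item 157 and its handle, both taken from the notes.

```python
import sqlite3


def split_authors(text_value, separator="|"):
    """Split a multi-author string into individual, whitespace-trimmed names."""
    return [name.strip() for name in text_value.split(separator) if name.strip()]


def find_piped_authors(conn):
    """Return (handle, original value, split names) for author values containing '|'."""
    rows = conn.execute(
        """
        SELECT h.handle, m.text_value
          FROM metadatavalue AS m
          JOIN handle AS h
            ON h.resource_id = m.resource_id
           AND h.resource_type_id = 2
         WHERE m.resource_type_id = 2
           AND m.metadata_field_id = 3
           AND m.text_value LIKE '%|%'
         ORDER BY m.resource_id
        """
    ).fetchall()
    return [(handle, value, split_authors(value)) for handle, value in rows]


def build_demo_db():
    """Create a simplified in-memory stand-in for the two tables (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE metadatavalue (
            text_value TEXT, resource_id INTEGER,
            resource_type_id INTEGER, metadata_field_id INTEGER
        );
        CREATE TABLE handle (
            handle TEXT, resource_id INTEGER, resource_type_id INTEGER
        );
        -- Row taken from the notes: item 157 has handle 10568/129.
        INSERT INTO metadatavalue VALUES ('Anandajayasekeram, P.|Puskur, R.', 157, 2, 3);
        INSERT INTO handle VALUES ('10568/129', 157, 2);
        -- Hypothetical clean row, included only to show that it is not matched.
        INSERT INTO metadatavalue VALUES ('Example, A.', 1, 2, 3);
        INSERT INTO handle VALUES ('hypothetical/1', 1, 2);
        """
    )
    return conn


if __name__ == "__main__":
    for handle, value, names in find_piped_authors(build_demo_db()):
        print(handle, value, "->", names)
```

Against the real database the same `SELECT` would presumably be run in `psql`; the Python wrapper is only there to make the sketch self-contained.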
@@ -11,7 +11,7 @@
 <meta property="og:type" content="article" />
 <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2019-10/" />
 <meta property="article:published_time" content="2019-10-01T13:20:51+03:00" />
-<meta property="article:modified_time" content="2019-10-12T23:28:50+03:00" />
+<meta property="article:modified_time" content="2019-10-13T11:59:11+03:00" />
 
 <meta name="twitter:card" content="summary"/>
 <meta name="twitter:title" content="October, 2019"/>
@@ -27,9 +27,9 @@
 "@type": "BlogPosting",
 "headline": "October, 2019",
 "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-10\/",
-"wordCount": "1073",
+"wordCount": "1250",
 "datePublished": "2019-10-01T13:20:51+03:00",
-"dateModified": "2019-10-12T23:28:50+03:00",
+"dateModified": "2019-10-13T11:59:11+03:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -286,6 +286,46 @@ International Maize and Wheat Improvement Centre,International Maize and Wheat I
 <ul>
 <li>Now I sent the final feedback to Francesca, Carol, and Abenet</li>
 </ul></li>
+
+<li><p>Peter is still seeing some authors listed with “|” in the “Top Authors” statistics for some collections</p>
+
+<ul>
+<li>I looked in some of the items that are listed and the author field does not contain those invalid separators</li>
+
+<li><p>I decided to try doing a full Discovery re-indexing on CGSpace (linode18):</p>
+
+<pre><code>$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
+
+real 82m35.993s
+</code></pre></li>
+</ul></li>
+
+<li><p>After the re-indexing the top authors still list the following:</p>
+
+<pre><code>Jagwe, J.|Ouma, E.A.|Brandes-van Dorresteijn, D.|Kawuma, Brian|Smith, J.
+</code></pre></li>
+
+<li><p>I looked in the database to find authors that had “|” in them:</p>
+
+<pre><code>dspace=# SELECT text_value, resource_id FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value LIKE '%|%';
+            text_value            | resource_id
+----------------------------------+-------------
+ Anandajayasekeram, P.|Puskur, R. |         157
+ Morales, J.|Renner, I.           |       22779
+ Zahid, A.|Haque, M.A.            |       25492
+(3 rows)
+</code></pre></li>
+
+<li><p>Then I found their handles and corrected them, for example:</p>
+
+<pre><code>dspacetest=# select handle from item, handle where handle.resource_id = item.item_id AND item.item_id = '157' and handle.resource_type_id=2;
+  handle
+-----------
+ 10568/129
+(1 row)
+</code></pre></li>
+
+<li><p>So I’m still not sure where these weird authors in the “Top Author” stats are coming from</p></li>
 </ul>
 
 <!-- vim: set sw=2 ts=2: -->
@@ -4,27 +4,27 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/</loc>
-<lastmod>2019-10-12T23:28:50+03:00</lastmod>
+<lastmod>2019-10-13T11:59:11+03:00</lastmod>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
-<lastmod>2019-10-12T23:28:50+03:00</lastmod>
+<lastmod>2019-10-13T11:59:11+03:00</lastmod>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/2019-10/</loc>
-<lastmod>2019-10-12T23:28:50+03:00</lastmod>
+<lastmod>2019-10-13T11:59:11+03:00</lastmod>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
-<lastmod>2019-10-12T23:28:50+03:00</lastmod>
+<lastmod>2019-10-13T11:59:11+03:00</lastmod>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
-<lastmod>2019-10-12T23:28:50+03:00</lastmod>
+<lastmod>2019-10-13T11:59:11+03:00</lastmod>
 </url>
 
 <url>