mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-26 00:18:21 +01:00
Update notes for 2020-01-28
This commit is contained in:
parent
914646453a
commit
13f4d47ed8
@ -301,4 +301,27 @@ org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError:
|
|||||||
- I made a [pull request](https://github.com/ilri/DSpace/pull/443) and merged it to the `5_x-prod` branch and will deploy on CGSpace later tonight
|
- I made a [pull request](https://github.com/ilri/DSpace/pull/443) and merged it to the `5_x-prod` branch and will deploy on CGSpace later tonight
|
||||||
- I am curious if anyone on the dspace-tech mailing list has run into this, so I will try to send a message about this there when I get a chance
|
- I am curious if anyone on the dspace-tech mailing list has run into this, so I will try to send a message about this there when I get a chance
|
||||||
|
|
||||||
|
## 2020-01-28
|
||||||
|
|
||||||
|
- Generate a list of CIP subjects for Abenet:
|
||||||
|
|
||||||
|
```
|
||||||
|
dspace=# \COPY (SELECT DISTINCT text_value as "cg.subject.cip", count(*) FROM metadatavalue WHERE resource_type_id = 2 AND metadata_field_id = 127 GROUP BY text_value ORDER BY count DESC) to /tmp/2020-01-28-cip-subjects.csv WITH CSV HEADER;
|
||||||
|
COPY 77
|
||||||
|
```
|
||||||
|
|
||||||
|
- Start looking over the IITA records from earlier this month ([IITA_201907_Jan13](https://dspacetest.cgiar.org/handle/10568/106567))
|
||||||
|
- Delete one duplicate, map one item from ILRI community
|
||||||
|
- The following items are duplicates or something (there is not enough metadata to tell for sure):
|
||||||
|
- https://dspacetest.cgiar.org/handle/10568/106682
|
||||||
|
- https://dspacetest.cgiar.org/handle/10568/106653
|
||||||
|
- https://dspacetest.cgiar.org/handle/10568/106694
|
||||||
|
- This item doesn't exist in the journal, and Weed Science volume 55 was published in 2007, not 2003:
|
||||||
|
- https://dspacetest.cgiar.org/handle/10568/106665
|
||||||
|
- All items using `cg.journal.title` instead of `dc.source`
|
||||||
|
- Several items were missing ISSN despite having a journal title
|
||||||
|
- Many items were missing DOIs, abstracts, etc
|
||||||
|
- I did some metadata enrichment by searching for the items and copying relevant data from journal pages
|
||||||
|
- I asked Bosede to try to do the same for the rest of the journal articles
|
||||||
|
|
||||||
<!-- vim: set sw=2 ts=2: -->
|
<!-- vim: set sw=2 ts=2: -->
|
||||||
|
@ -63,7 +63,7 @@ I tweeted the CGSpace repository link
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "January, 2020",
|
"headline": "January, 2020",
|
||||||
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-01\/",
|
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-01\/",
|
||||||
"wordCount": "2754",
|
"wordCount": "2910",
|
||||||
"datePublished": "2020-01-06T10:48:30+02:00",
|
"datePublished": "2020-01-06T10:48:30+02:00",
|
||||||
"dateModified": "2020-01-27T16:20:44+02:00",
|
"dateModified": "2020-01-27T16:20:44+02:00",
|
||||||
"author": {
|
"author": {
|
||||||
@ -446,6 +446,36 @@ org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError:
|
|||||||
</ul>
|
</ul>
|
||||||
</li>
|
</li>
|
||||||
</ul>
|
</ul>
|
||||||
|
<h2 id="2020-01-28">2020-01-28</h2>
|
||||||
|
<ul>
|
||||||
|
<li>Generate a list of CIP subjects for Abenet:</li>
|
||||||
|
</ul>
|
||||||
|
<pre><code>dspace=# \COPY (SELECT DISTINCT text_value as "cg.subject.cip", count(*) FROM metadatavalue WHERE resource_type_id = 2 AND metadata_field_id = 127 GROUP BY text_value ORDER BY count DESC) to /tmp/2020-01-28-cip-subjects.csv WITH CSV HEADER;
|
||||||
|
COPY 77
|
||||||
|
</code></pre><ul>
|
||||||
|
<li>Start looking over the IITA records from earlier this month (<a href="https://dspacetest.cgiar.org/handle/10568/106567">IITA_201907_Jan13</a>)
|
||||||
|
<ul>
|
||||||
|
<li>Delete one duplicate, map one item from ILRI community</li>
|
||||||
|
<li>The following items are duplicates or something (there is not enough metadata to tell for sure):
|
||||||
|
<ul>
|
||||||
|
<li><a href="https://dspacetest.cgiar.org/handle/10568/106682">https://dspacetest.cgiar.org/handle/10568/106682</a></li>
|
||||||
|
<li><a href="https://dspacetest.cgiar.org/handle/10568/106653">https://dspacetest.cgiar.org/handle/10568/106653</a></li>
|
||||||
|
<li><a href="https://dspacetest.cgiar.org/handle/10568/106694">https://dspacetest.cgiar.org/handle/10568/106694</a></li>
|
||||||
|
</ul>
|
||||||
|
</li>
|
||||||
|
<li>This item doesn’t exist in the journal, and Weed Science volume 55 was published in 2007, not 2003:
|
||||||
|
<ul>
|
||||||
|
<li><a href="https://dspacetest.cgiar.org/handle/10568/106665">https://dspacetest.cgiar.org/handle/10568/106665</a></li>
|
||||||
|
</ul>
|
||||||
|
</li>
|
||||||
|
<li>All items using <code>cg.journal.title</code> instead of <code>dc.source</code></li>
|
||||||
|
<li>Several items were missing ISSN despite having a journal title</li>
|
||||||
|
<li>Many items were missing DOIs, abstracts, etc</li>
|
||||||
|
<li>I did some metadata enrichment by searching for the items and copying relevant data from journal pages</li>
|
||||||
|
<li>I asked Bosede to try to do the same for the rest of the journal articles</li>
|
||||||
|
</ul>
|
||||||
|
</li>
|
||||||
|
</ul>
|
||||||
<!-- raw HTML omitted -->
|
<!-- raw HTML omitted -->
|
||||||
|
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user