mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2021-12
@@ -50,7 +50,7 @@ Total number of bot hits purged: 3679
"@type": "BlogPosting",
"headline": "December, 2021",
"url": "https://alanorth.github.io/cgspace-notes/2021-12/",
-"wordCount": "404",
+"wordCount": "597",
"datePublished": "2021-12-01T16:07:07+02:00",
"dateModified": "2021-12-01T16:07:07+02:00",
"author": {
@@ -191,10 +191,38 @@ Purging 455 hits from WhatsApp in statistics
<ul>
<li>I see GARDIAN is now using a “GARDIAN” user agent finally
<ul>
-<li>I will add them to our local bot override for Solr</li>
+<li>I will add them to our local spider agent override in DSpace so that the hits don’t get counted in Solr</li>
</ul>
</li>
</ul>
+<h2 id="2021-12-05">2021-12-05</h2>
+<ul>
+<li>Proofed fifty records Abenet sent me from Africa Rice Center (“AfricaRice 1st batch Import”)
+<ul>
+<li>Fixed forty-six incorrect collections</li>
+<li>Cleaned up and normalized affiliations</li>
+<li>Cleaned up dates (extra <code>*</code> character in all?)</li>
+<li>Cleaned up citation format</li>
+<li>Fixed some encoding issues in abstracts</li>
+<li>Removed empty columns</li>
+<li>Removed one duplicate: Enhancing Rice Productivity and Soil Nitrogen Using Dual-Purpose Cowpea-NERICA® Rice Sequence in Degraded Savanna</li>
+<li>Added volume and issue metadata by extracting it from the citations</li>
+<li>All PDFs hosted on davidpublishing.com are dead…</li>
+<li>All DOIs linking to African Journal of Agricultural Research are dead…</li>
+<li>Fixed a handful of items marked as “Open Access” that are actually closed</li>
+<li>Added many missing ISSNs</li>
+<li>Added many missing countries/regions</li>
+<li>Fixed invalid AGROVOC terms and added some more based on article subjects</li>
+</ul>
+</li>
+<li>I also made some minor changes to the <a href="https://github.com/ilri/csv-metadata-quality">CSV Metadata Quality Checker</a>
+<ul>
+<li>Added the ability to check if the item’s title exists in the citation</li>
+<li>Updated to only run the mojibake check if we’re not running in unsafe mode (so we don’t print the same warning during both the check and fix steps)</li>
+</ul>
+</li>
+<li>I ran the re-harvesting on AReS</li>
+</ul>
<!-- raw HTML omitted -->
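As a rough illustration of two steps mentioned in the 2021-12-05 notes (checking that an item's title appears in its citation, and pulling volume and issue numbers out of a citation), here is a minimal Python sketch. It is not the code from the CSV Metadata Quality Checker. The function names `title_in_citation` and `extract_volume_issue` are hypothetical, and the assumed citation layout (`volume(issue):` as in `12(3): 45-67`) is a guess, since the notes do not show the actual citation format.

```python
import re
from typing import Optional, Tuple


def _normalize(text: str) -> str:
    """Case-fold and collapse runs of whitespace so comparisons are lenient."""
    return " ".join(text.casefold().split())


def title_in_citation(title: str, citation: str) -> bool:
    """Return True if the (normalized) title occurs inside the citation."""
    return _normalize(title) in _normalize(citation)


# Assumed pattern: digits, an issue in parentheses, then a colon, e.g. "12(3):".
# Real citations vary widely, so treat this as illustrative only.
VOLUME_ISSUE = re.compile(r"(?P<volume>\d+)\s*\((?P<issue>[^)]+)\)\s*:")


def extract_volume_issue(citation: str) -> Optional[Tuple[str, str]]:
    """Return (volume, issue) if the assumed pattern is found, else None."""
    match = VOLUME_ISSUE.search(citation)
    if match is None:
        return None
    return match.group("volume"), match.group("issue")
```

In practice a check like this would only flag rows for manual review, since titles are often abbreviated or punctuated differently in citations.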