mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2017-05-13
@ -13,7 +13,7 @@
<meta property="article:published_time" content="2017-05-01T16:21:52+02:00"/>
<meta property="article:modified_time" content="2017-05-10T11:20:27+03:00"/>
<meta property="article:modified_time" content="2017-05-10T23:44:44+03:00"/>
@ -45,9 +45,9 @@
"@type": "BlogPosting",
"headline": "May, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-05/",
"wordCount": "1037",
"wordCount": "1122",
"datePublished": "2017-05-01T16:21:52+02:00",
"dateModified": "2017-05-10T11:20:27+03:00",
"dateModified": "2017-05-10T23:44:44+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -263,6 +263,14 @@ $ for item in /home/aorth/10947-1/ITEM@10947-*; do [dspace]/bin/dspace packager
<pre><code>dspace=# delete from metadatavalue where resource_type_id=2 and text_value='';
</code></pre>
<h2 id="2017-05-13">2017-05-13</h2>
<ul>
<li>After quite a bit of troubleshooting with importing cleaned up data as CSV, it seems that there are actually <a href="https://en.wikipedia.org/wiki/Null_character">NUL</a> characters in the <code>dc.description.abstract</code> field (at least) on the lines where CSV importing was failing</li>
<li>I tried to find a way to remove the characters in vim or OpenRefine, but decided it was quicker to just remove the column temporarily and import the rest</li>
<li>The import was successful and detected 2022 changes, which are most likely the remaining records that were failing to import before</li>
</ul>
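As an alternative to dropping the whole column, the NUL characters could probably be stripped from the CSV before importing. The following is a minimal sketch, not what was done in these notes: the helper names `strip_nul` and `clean_csv` and the file paths are hypothetical, and it assumes the export is UTF-8 and small enough to read into memory.

```python
def strip_nul(text: str) -> str:
    """Return text with every NUL (U+0000) character removed."""
    return text.replace("\x00", "")


def clean_csv(src_path: str, dst_path: str) -> int:
    """Copy src_path to dst_path without NUL characters.

    Returns the number of NUL characters that were removed.
    Assumes a UTF-8 encoded file (an assumption, not confirmed by the notes).
    """
    # newline="" keeps the original line endings untouched
    with open(src_path, "r", encoding="utf-8", newline="") as src:
        data = src.read()
    removed = data.count("\x00")
    with open(dst_path, "w", encoding="utf-8", newline="") as dst:
        dst.write(strip_nul(data))
    return removed


if __name__ == "__main__":
    # Hypothetical file names, for illustration only
    count = clean_csv("export.csv", "export-clean.csv")
    print(f"Removed {count} NUL characters")
```

A non-zero count would confirm that NUL characters were present, and the cleaned file could then be imported with the `dc.description.abstract` column intact.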