Mirror of https://github.com/alanorth/cgspace-notes.git, synced 2024-11-22 22:55:04 +01:00
Add notes for 2017-05-15
parent 516e5ecd1d
commit ad891890c7
@@ -132,3 +132,8 @@ dspace=# delete from metadatavalue where resource_type_id=2 and text_value='';
 - After quite a bit of troubleshooting with importing cleaned up data as CSV, it seems that there are actually [NUL](https://en.wikipedia.org/wiki/Null_character) characters in the `dc.description.abstract` field (at least) on the lines where CSV importing was failing
 - I tried to find a way to remove the characters in vim or Open Refine, but decided it was quicker to just remove the column temporarily and import it
 - The import was successful and detected 2022 changes, which should likely be the rest that were failing to import before
+
+## 2017-05-15
+
+- To delete the blank lines that cause issues during import, we need to use a regex in vim: `g/^$/d`
+- After that I started looking in the `dc.subject` field to try to pull countries and regions out, but there are too many values in there
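The two cleanup problems described in these notes (NUL characters in a field and blank lines that break the import) could also be handled in a small script rather than by hand. The following is a minimal sketch, not what the author did: the notes say the column was removed temporarily and blank lines were deleted in vim. The function name `clean_csv_text` is hypothetical, and the sketch assumes the CSV fits in memory and that dropping every empty line is acceptable, including empty lines inside quoted multi-line fields (the same behavior as `g/^$/d`).

```python
def clean_csv_text(raw: str) -> str:
    """Remove NUL characters and drop empty lines from CSV text.

    Hypothetical helper for illustration only; it is not part of DSpace
    or of the workflow described in the notes.
    """
    # Strip NUL (U+0000) characters wherever they occur.
    without_nul = raw.replace("\x00", "")
    # Keep only non-empty lines, similar in spirit to vim's g/^$/d.
    # A line holding only a carriage return (CRLF files) counts as empty.
    lines = [line for line in without_nul.split("\n") if line.strip("\r") != ""]
    # Re-join with a trailing newline; return an empty string if nothing is left.
    return "\n".join(lines) + "\n" if lines else ""


if __name__ == "__main__":
    sample = "id,dc.description.abstract\n\n1,foo\x00bar\n\n"
    print(clean_csv_text(sample), end="")
```

Operating on the raw text instead of parsing with a CSV library keeps the sketch close to the vim command, at the cost of not distinguishing blank rows from blank lines inside quoted fields.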
@@ -13,7 +13,7 @@
 
 
 <meta property="article:published_time" content="2017-05-01T16:21:52+02:00"/>
-<meta property="article:modified_time" content="2017-05-10T23:44:44+03:00"/>
+<meta property="article:modified_time" content="2017-05-13T13:48:40+03:00"/>
 
 
 
@@ -45,9 +45,9 @@
 "@type": "BlogPosting",
 "headline": "May, 2017",
 "url": "https://alanorth.github.io/cgspace-notes/2017-05/",
-"wordCount": "1122",
+"wordCount": "1167",
 "datePublished": "2017-05-01T16:21:52+02:00",
-"dateModified": "2017-05-10T23:44:44+03:00",
+"dateModified": "2017-05-13T13:48:40+03:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -271,6 +271,13 @@ $ for item in /home/aorth/10947-1/ITEM@10947-*; do [dspace]/bin/dspace packager
 <li>The import was successful and detected 2022 changes, which should likely be the rest that were failing to import before</li>
 </ul>
 
+<h2 id="2017-05-15">2017-05-15</h2>
+
+<ul>
+<li>To delete the blank lines that cause issues during import, we need to use a regex in vim: <code>g/^$/d</code></li>
+<li>After that I started looking in the <code>dc.subject</code> field to try to pull countries and regions out, but there are too many values in there</li>
+</ul>
+
 
 
 
@@ -4,7 +4,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/2017-05/</loc>
-<lastmod>2017-05-10T23:44:44+03:00</lastmod>
+<lastmod>2017-05-13T13:48:40+03:00</lastmod>
 </url>
 
 <url>
@@ -99,7 +99,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/</loc>
-<lastmod>2017-05-10T23:44:44+03:00</lastmod>
+<lastmod>2017-05-13T13:48:40+03:00</lastmod>
 <priority>0</priority>
 </url>
 
@@ -110,19 +110,19 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
-<lastmod>2017-05-10T23:44:44+03:00</lastmod>
+<lastmod>2017-05-13T13:48:40+03:00</lastmod>
 <priority>0</priority>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
-<lastmod>2017-05-10T23:44:44+03:00</lastmod>
+<lastmod>2017-05-13T13:48:40+03:00</lastmod>
 <priority>0</priority>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
-<lastmod>2017-05-10T23:44:44+03:00</lastmod>
+<lastmod>2017-05-13T13:48:40+03:00</lastmod>
 <priority>0</priority>
 </url>
 