diff --git a/public/2016-02/index.html b/public/2016-02/index.html
index 077d4b18b..b04801b97 100644
--- a/public/2016-02/index.html
+++ b/public/2016-02/index.html
@@ -308,6 +308,33 @@ CIAT_COLOMBIA_000169_Técnicas_para_el_aislamiento_y_cultivo_de_protoplastos_de_
- Merge pull requests for submission form theming (#178) and missing center subjects in XMLUI item views (#176)
- They will be deployed on CGSpace the next time I re-deploy
+
+
+2016-02-16
+
+
+- Turns out OpenRefine has an unescape function!
+
+
+value.unescape("url")
+
+
+
+- This turns the URLs into human-readable versions that we can use as proper filenames
+- Run web server and system updates on DSpace Test and reboot
+- To merge
dc.identifier.url
and dc.identifier.url[]
, rename the second column so it doesn’t have the brackets, like dc.identifier.url2
+- Then you create a facet for blank values on each column, show the rows that have values for one and not the other, then transform each independently to have the contents of the other, with “||” in between
+- Work on Python script for parsing and downloading PDF records from
dc.identifier.url
+- To turn
dc.identifier.url
into filenames, create a new column based o
+- To get filenames from
dc.identifier.url
, create a new column based on this transform: forEach(value.split('||'), v, v.split('/')[-1]).join('||')
+- This also works for records that have multiple URLs (separated by “||”)
+
+
+2016-02-17
+
+
+- Re-deploy CGSpace, run all system updates, and reboot
+- More work on CIAT data, cleaning and doing a last metadata-only import into DSpace Test
diff --git a/public/index.xml b/public/index.xml
index 4b6964298..c41ac3077 100644
--- a/public/index.xml
+++ b/public/index.xml
@@ -247,6 +247,33 @@ CIAT_COLOMBIA_000169_Técnicas_para_el_aislamiento_y_cultivo_de_protoplastos_de_
<li>Merge pull requests for submission form theming (<a href="https://github.com/ilri/DSpace/pull/178">#178</a>) and missing center subjects in XMLUI item views (<a href="https://github.com/ilri/DSpace/pull/176">#176</a>)</li>
<li>They will be deployed on CGSpace the next time I re-deploy</li>
</ul>
+
+<h2 id="2016-02-16:124a59adbaa8ef13e1518d003fc03981">2016-02-16</h2>
+
+<ul>
+<li>Turns out OpenRefine has an unescape function!</li>
+</ul>
+
+<pre><code>value.unescape("url")
+</code></pre>
+
+<ul>
+<li>This turns the URLs into human-readable versions that we can use as proper filenames</li>
+<li>Run web server and system updates on DSpace Test and reboot</li>
+<li>To merge <code>dc.identifier.url</code> and <code>dc.identifier.url[]</code>, rename the second column so it doesn’t have the brackets, like <code>dc.identifier.url2</code></li>
+<li>Then you create a facet for blank values on each column, show the rows that have values for one and not the other, then transform each independently to have the contents of the other, with “||” in between</li>
+<li>Work on Python script for parsing and downloading PDF records from <code>dc.identifier.url</code></li>
+<li>To turn <code>dc.identifier.url</code> into filenames, create a new column based o</li>
+<li>To get filenames from <code>dc.identifier.url</code>, create a new column based on this transform: <code>forEach(value.split('||'), v, v.split('/')[-1]).join('||')</code></li>
+<li>This also works for records that have multiple URLs (separated by “||”)</li>
+</ul>
+
+<h2 id="2016-02-17:124a59adbaa8ef13e1518d003fc03981">2016-02-17</h2>
+
+<ul>
+<li>Re-deploy CGSpace, run all system updates, and reboot</li>
+<li>More work on CIAT data, cleaning and doing a last metadata-only import into DSpace Test</li>
+</ul>
diff --git a/public/tags/notes/index.xml b/public/tags/notes/index.xml
index cf1cc6abe..1b5cde904 100644
--- a/public/tags/notes/index.xml
+++ b/public/tags/notes/index.xml
@@ -247,6 +247,33 @@ CIAT_COLOMBIA_000169_Técnicas_para_el_aislamiento_y_cultivo_de_protoplastos_de_
<li>Merge pull requests for submission form theming (<a href="https://github.com/ilri/DSpace/pull/178">#178</a>) and missing center subjects in XMLUI item views (<a href="https://github.com/ilri/DSpace/pull/176">#176</a>)</li>
<li>They will be deployed on CGSpace the next time I re-deploy</li>
</ul>
+
+<h2 id="2016-02-16:124a59adbaa8ef13e1518d003fc03981">2016-02-16</h2>
+
+<ul>
+<li>Turns out OpenRefine has an unescape function!</li>
+</ul>
+
+<pre><code>value.unescape("url")
+</code></pre>
+
+<ul>
+<li>This turns the URLs into human-readable versions that we can use as proper filenames</li>
+<li>Run web server and system updates on DSpace Test and reboot</li>
+<li>To merge <code>dc.identifier.url</code> and <code>dc.identifier.url[]</code>, rename the second column so it doesn’t have the brackets, like <code>dc.identifier.url2</code></li>
+<li>Then you create a facet for blank values on each column, show the rows that have values for one and not the other, then transform each independently to have the contents of the other, with “||” in between</li>
+<li>Work on Python script for parsing and downloading PDF records from <code>dc.identifier.url</code></li>
+<li>To turn <code>dc.identifier.url</code> into filenames, create a new column based o</li>
+<li>To get filenames from <code>dc.identifier.url</code>, create a new column based on this transform: <code>forEach(value.split('||'), v, v.split('/')[-1]).join('||')</code></li>
+<li>This also works for records that have multiple URLs (separated by “||”)</li>
+</ul>
+
+<h2 id="2016-02-17:124a59adbaa8ef13e1518d003fc03981">2016-02-17</h2>
+
+<ul>
+<li>Re-deploy CGSpace, run all system updates, and reboot</li>
+<li>More work on CIAT data, cleaning and doing a last metadata-only import into DSpace Test</li>
+</ul>