Add notes for 2016-10-30

This commit is contained in:
Alan Orth 2016-10-30 14:38:01 +02:00
parent 092ed3fda5
commit 54bd802b0d
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
5 changed files with 186 additions and 0 deletions

View File

@ -209,3 +209,37 @@ UPDATE 0
![DSpace Test with Font Awesome icons](2016/10/dspacetest-fontawesome-icons.png)
- Run the same replacements on CGSpace
## 2016-10-30
- Fix some messed up authors on CGSpace:
```
dspace=# update metadatavalue set authority='799da1d8-22f3-43f5-8233-3d2ef5ebf8a8', confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like 'Charleston, B.%';
UPDATE 10
dspace=# update metadatavalue set authority='e936f5c5-343d-4c46-aa91-7a1fff6277ed', confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like 'Knight-Jones%';
UPDATE 36
```
- I updated the authority index but nothing seemed to change, so I'll wait and do it again after I update Discovery below
- Skype chat with Tsega about the [IFPRI contentdm bridge](https://github.com/ilri/ckm-cgspace-contentdm-bridge)
- We tested harvesting OAI in an example collection to see how it works
- Talk to Carlos Quiros about CG Core metadata in CGSpace
- Get a list of countries from CGSpace so I can do some batch corrections:
```
dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=228 group by text_value order by count desc) to /tmp/countries.csv with csv;
```
- Fix a bunch of countries in Open Refine and run the corrections on CGSpace:
```
$ ./fix-metadata-values.py -i countries-fix-18.csv -f dc.coverage.country -t 'correct' -m 228 -d dspace -u dspace -p fuuu
$ ./delete-metadata-values.py -i countries-delete-2.csv -f dc.coverage.country -m 228 -d dspace -u dspace -p fuuu
```
- Run a shit ton of author fixes from Peter Ballantyne that we've been cleaning up for two months:
```
$ ./fix-metadata-values.py -i /tmp/authors-fix-pb2.csv -f dc.contributor.author -t correct -m 3 -u dspace -d dspace -p fuuu
```

View File

@ -349,6 +349,44 @@ UPDATE 0
<li>Run the same replacements on CGSpace</li>
</ul>
<h2 id="2016-10-30">2016-10-30</h2>
<ul>
<li>Fix some messed up authors on CGSpace:</li>
</ul>
<pre><code>dspace=# update metadatavalue set authority='799da1d8-22f3-43f5-8233-3d2ef5ebf8a8', confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like 'Charleston, B.%';
UPDATE 10
dspace=# update metadatavalue set authority='e936f5c5-343d-4c46-aa91-7a1fff6277ed', confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like 'Knight-Jones%';
UPDATE 36
</code></pre>
<ul>
<li>I updated the authority index but nothing seemed to change, so I&rsquo;ll wait and do it again after I update Discovery below</li>
<li>Skype chat with Tsega about the <a href="https://github.com/ilri/ckm-cgspace-contentdm-bridge">IFPRI contentdm bridge</a></li>
<li>We tested harvesting OAI in an example collection to see how it works</li>
<li>Talk to Carlos Quiros about CG Core metadata in CGSpace</li>
<li>Get a list of countries from CGSpace so I can do some batch corrections:</li>
</ul>
<pre><code>dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=228 group by text_value order by count desc) to /tmp/countries.csv with csv;
</code></pre>
<ul>
<li>Fix a bunch of countries in Open Refine and run the corrections on CGSpace:</li>
</ul>
<pre><code>$ ./fix-metadata-values.py -i countries-fix-18.csv -f dc.coverage.country -t 'correct' -m 228 -d dspace -u dspace -p fuuu
$ ./delete-metadata-values.py -i countries-delete-2.csv -f dc.coverage.country -m 228 -d dspace -u dspace -p fuuu
</code></pre>
<ul>
<li>Run a shit ton of author fixes from Peter Ballantyne that we&rsquo;ve been cleaning up for two months:</li>
</ul>
<pre><code>$ ./fix-metadata-values.py -i /tmp/authors-fix-pb2.csv -f dc.contributor.author -t correct -m 3 -u dspace -d dspace -p fuuu
</code></pre>

View File

@ -275,6 +275,44 @@ UPDATE 0
&lt;ul&gt;
&lt;li&gt;Run the same replacements on CGSpace&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-10-30&#34;&gt;2016-10-30&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Fix some messed up authors on CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# update metadatavalue set authority=&#39;799da1d8-22f3-43f5-8233-3d2ef5ebf8a8&#39;, confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like &#39;Charleston, B.%&#39;;
UPDATE 10
dspace=# update metadatavalue set authority=&#39;e936f5c5-343d-4c46-aa91-7a1fff6277ed&#39;, confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like &#39;Knight-Jones%&#39;;
UPDATE 36
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;I updated the authority index but nothing seemed to change, so I&amp;rsquo;ll wait and do it again after I update Discovery below&lt;/li&gt;
&lt;li&gt;Skype chat with Tsega about the &lt;a href=&#34;https://github.com/ilri/ckm-cgspace-contentdm-bridge&#34;&gt;IFPRI contentdm bridge&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We tested harvesting OAI in an example collection to see how it works&lt;/li&gt;
&lt;li&gt;Talk to Carlos Quiros about CG Core metadata in CGSpace&lt;/li&gt;
&lt;li&gt;Get a list of countries from CGSpace so I can do some batch corrections:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=228 group by text_value order by count desc) to /tmp/countries.csv with csv;
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Fix a bunch of countries in Open Refine and run the corrections on CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i countries-fix-18.csv -f dc.coverage.country -t &#39;correct&#39; -m 228 -d dspace -u dspace -p fuuu
$ ./delete-metadata-values.py -i countries-delete-2.csv -f dc.coverage.country -m 228 -d dspace -u dspace -p fuuu
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Run a shit ton of author fixes from Peter Ballantyne that we&amp;rsquo;ve been cleaning up for two months:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i /tmp/authors-fix-pb2.csv -f dc.contributor.author -t correct -m 3 -u dspace -d dspace -p fuuu
&lt;/code&gt;&lt;/pre&gt;
</description>
</item>

View File

@ -275,6 +275,44 @@ UPDATE 0
&lt;ul&gt;
&lt;li&gt;Run the same replacements on CGSpace&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-10-30&#34;&gt;2016-10-30&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Fix some messed up authors on CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# update metadatavalue set authority=&#39;799da1d8-22f3-43f5-8233-3d2ef5ebf8a8&#39;, confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like &#39;Charleston, B.%&#39;;
UPDATE 10
dspace=# update metadatavalue set authority=&#39;e936f5c5-343d-4c46-aa91-7a1fff6277ed&#39;, confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like &#39;Knight-Jones%&#39;;
UPDATE 36
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;I updated the authority index but nothing seemed to change, so I&amp;rsquo;ll wait and do it again after I update Discovery below&lt;/li&gt;
&lt;li&gt;Skype chat with Tsega about the &lt;a href=&#34;https://github.com/ilri/ckm-cgspace-contentdm-bridge&#34;&gt;IFPRI contentdm bridge&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We tested harvesting OAI in an example collection to see how it works&lt;/li&gt;
&lt;li&gt;Talk to Carlos Quiros about CG Core metadata in CGSpace&lt;/li&gt;
&lt;li&gt;Get a list of countries from CGSpace so I can do some batch corrections:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=228 group by text_value order by count desc) to /tmp/countries.csv with csv;
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Fix a bunch of countries in Open Refine and run the corrections on CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i countries-fix-18.csv -f dc.coverage.country -t &#39;correct&#39; -m 228 -d dspace -u dspace -p fuuu
$ ./delete-metadata-values.py -i countries-delete-2.csv -f dc.coverage.country -m 228 -d dspace -u dspace -p fuuu
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Run a shit ton of author fixes from Peter Ballantyne that we&amp;rsquo;ve been cleaning up for two months:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i /tmp/authors-fix-pb2.csv -f dc.contributor.author -t correct -m 3 -u dspace -d dspace -p fuuu
&lt;/code&gt;&lt;/pre&gt;
</description>
</item>

View File

@ -274,6 +274,44 @@ UPDATE 0
&lt;ul&gt;
&lt;li&gt;Run the same replacements on CGSpace&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-10-30&#34;&gt;2016-10-30&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Fix some messed up authors on CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# update metadatavalue set authority=&#39;799da1d8-22f3-43f5-8233-3d2ef5ebf8a8&#39;, confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like &#39;Charleston, B.%&#39;;
UPDATE 10
dspace=# update metadatavalue set authority=&#39;e936f5c5-343d-4c46-aa91-7a1fff6277ed&#39;, confidence=600 where metadata_field_id=3 and resource_type_id=2 and text_value like &#39;Knight-Jones%&#39;;
UPDATE 36
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;I updated the authority index but nothing seemed to change, so I&amp;rsquo;ll wait and do it again after I update Discovery below&lt;/li&gt;
&lt;li&gt;Skype chat with Tsega about the &lt;a href=&#34;https://github.com/ilri/ckm-cgspace-contentdm-bridge&#34;&gt;IFPRI contentdm bridge&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We tested harvesting OAI in an example collection to see how it works&lt;/li&gt;
&lt;li&gt;Talk to Carlos Quiros about CG Core metadata in CGSpace&lt;/li&gt;
&lt;li&gt;Get a list of countries from CGSpace so I can do some batch corrections:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=228 group by text_value order by count desc) to /tmp/countries.csv with csv;
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Fix a bunch of countries in Open Refine and run the corrections on CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i countries-fix-18.csv -f dc.coverage.country -t &#39;correct&#39; -m 228 -d dspace -u dspace -p fuuu
$ ./delete-metadata-values.py -i countries-delete-2.csv -f dc.coverage.country -m 228 -d dspace -u dspace -p fuuu
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Run a shit ton of author fixes from Peter Ballantyne that we&amp;rsquo;ve been cleaning up for two months:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i /tmp/authors-fix-pb2.csv -f dc.contributor.author -t correct -m 3 -u dspace -d dspace -p fuuu
&lt;/code&gt;&lt;/pre&gt;
</description>
</item>