Add notes for 2016-10-04

This commit is contained in:
Alan Orth 2016-10-04 11:34:57 +03:00
parent 4a1eb3ce16
commit d36443d3e8
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
5 changed files with 104 additions and 0 deletions

View File

@ -24,3 +24,19 @@ tags = ["Notes"]
![Bootstrap issue with in-page anchors](2016/10/bootstrap-issue.png)
- Looks like we'll just have to add the text to the About page (without a link) or add a separate page
## 2016-10-04
- Start testing cleanups of authors that Peter sent last week
- Out of 40,000+ rows, Peter had indicated corrections for ~3,200 of them—too many to look through carefully, so I did some basic quality checking:
- Trim leading/trailing whitespace
- Find invalid characters
- Cluster values to merge obvious authors
- That left us with 3,180 valid corrections and 3 deletions:
```
$ ./fix-metadata-values.py -i authors-fix-3180.csv -f dc.contributor.author -t correct -m 3 -d dspacetest -u dspacetest -p fuuu
$ ./delete-metadata-values.py -i authors-delete-3.csv -f dc.contributor.author -m 3 -d dspacetest -u dspacetest -p fuuu
```
- Remove old about page ([#284](https://github.com/ilri/DSpace/pull/284))

View File

@ -112,6 +112,28 @@
<li>Looks like we&rsquo;ll just have to add the text to the About page (without a link) or add a separate page</li>
</ul>
<h2 id="2016-10-04">2016-10-04</h2>
<ul>
<li>Start testing cleanups of authors that Peter sent last week</li>
<li>Out of 40,000+ rows, Peter had indicated corrections for ~3,200 of them—too many to look through carefully, so I did some basic quality checking:
<ul>
<li>Trim leading/trailing whitespace</li>
<li>Find invalid characters</li>
<li>Cluster values to merge obvious authors</li>
</ul></li>
<li>That left us with 3,180 valid corrections and 3 deletions:</li>
</ul>
<pre><code>$ ./fix-metadata-values.py -i authors-fix-3180.csv -f dc.contributor.author -t correct -m 3 -d dspacetest -u dspacetest -p fuuu
$ ./delete-metadata-values.py -i authors-delete-3.csv -f dc.contributor.author -m 3 -d dspacetest -u dspacetest -p fuuu
</code></pre>
<ul>
<li>Remove old about page (<a href="https://github.com/ilri/DSpace/pull/284">#284</a>)</li>
</ul>
</article>

View File

@ -44,6 +44,28 @@
&lt;ul&gt;
&lt;li&gt;Looks like we&amp;rsquo;ll just have to add the text to the About page (without a link) or add a separate page&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-10-04&#34;&gt;2016-10-04&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Start testing cleanups of authors that Peter sent last week&lt;/li&gt;
&lt;li&gt;Out of 40,000+ rows, Peter had indicated corrections for ~3,200 of them—too many to look through carefully, so I did some basic quality checking:
&lt;ul&gt;
&lt;li&gt;Trim leading/trailing whitespace&lt;/li&gt;
&lt;li&gt;Find invalid characters&lt;/li&gt;
&lt;li&gt;Cluster values to merge obvious authors&lt;/li&gt;
&lt;/ul&gt;&lt;/li&gt;
&lt;li&gt;That left us with 3,180 valid corrections and 3 deletions:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i authors-fix-3180.csv -f dc.contributor.author -t correct -m 3 -d dspacetest -u dspacetest -p fuuu
$ ./delete-metadata-values.py -i authors-delete-3.csv -f dc.contributor.author -m 3 -d dspacetest -u dspacetest -p fuuu
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Remove old about page (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/284&#34;&gt;#284&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>

View File

@ -44,6 +44,28 @@
&lt;ul&gt;
&lt;li&gt;Looks like we&amp;rsquo;ll just have to add the text to the About page (without a link) or add a separate page&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-10-04&#34;&gt;2016-10-04&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Start testing cleanups of authors that Peter sent last week&lt;/li&gt;
&lt;li&gt;Out of 40,000+ rows, Peter had indicated corrections for ~3,200 of them—too many to look through carefully, so I did some basic quality checking:
&lt;ul&gt;
&lt;li&gt;Trim leading/trailing whitespace&lt;/li&gt;
&lt;li&gt;Find invalid characters&lt;/li&gt;
&lt;li&gt;Cluster values to merge obvious authors&lt;/li&gt;
&lt;/ul&gt;&lt;/li&gt;
&lt;li&gt;That left us with 3,180 valid corrections and 3 deletions:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i authors-fix-3180.csv -f dc.contributor.author -t correct -m 3 -d dspacetest -u dspacetest -p fuuu
$ ./delete-metadata-values.py -i authors-delete-3.csv -f dc.contributor.author -m 3 -d dspacetest -u dspacetest -p fuuu
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Remove old about page (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/284&#34;&gt;#284&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>

View File

@ -43,6 +43,28 @@
&lt;ul&gt;
&lt;li&gt;Looks like we&amp;rsquo;ll just have to add the text to the About page (without a link) or add a separate page&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-10-04&#34;&gt;2016-10-04&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Start testing cleanups of authors that Peter sent last week&lt;/li&gt;
&lt;li&gt;Out of 40,000+ rows, Peter had indicated corrections for ~3,200 of them—too many to look through carefully, so I did some basic quality checking:
&lt;ul&gt;
&lt;li&gt;Trim leading/trailing whitespace&lt;/li&gt;
&lt;li&gt;Find invalid characters&lt;/li&gt;
&lt;li&gt;Cluster values to merge obvious authors&lt;/li&gt;
&lt;/ul&gt;&lt;/li&gt;
&lt;li&gt;That left us with 3,180 valid corrections and 3 deletions:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;$ ./fix-metadata-values.py -i authors-fix-3180.csv -f dc.contributor.author -t correct -m 3 -d dspacetest -u dspacetest -p fuuu
$ ./delete-metadata-values.py -i authors-delete-3.csv -f dc.contributor.author -m 3 -d dspacetest -u dspacetest -p fuuu
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;Remove old about page (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/284&#34;&gt;#284&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>