Add notes for 2016-05-19

Alan Orth 2016-05-19 11:22:29 +03:00
parent 7363c63a9c
commit e9e1dbc3d1
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
4 changed files with 74 additions and 0 deletions


@ -143,3 +143,17 @@ if(cells['thumbnails'].value.contains('hqdefault'), cells['thumbnails'].value.sp
- Because ~400 records had the same filename on Flickr (hqdefault.jpg) but different UUIDs in the URL
- So for the `hqdefault.jpg` ones I just take the UUID (-2) and use it as the filename
- Before importing with SAFBuilder I tested adding "__bundle:THUMBNAIL" to the `filename` column and it works fine
## 2016-05-19
- More quality control on `filename` field of CCAFS records to make processing in shell and SAFBuilder more reliable:
```
value.replace('_','').replace('-','')
```
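- For checking the same cleanup outside OpenRefine, a rough Python equivalent of the GREL expression above might look like this (the function name `normalize_filename` and the sample value are hypothetical, not from the actual records):

```python
def normalize_filename(value: str) -> str:
    """Strip every underscore and hyphen from a filename.

    Mirrors the GREL expression value.replace('_','').replace('-','').
    The function name is an assumption made for this sketch.
    """
    return value.replace("_", "").replace("-", "")


if __name__ == "__main__":
    # Hypothetical sample value, only for illustration
    print(normalize_filename("some_report-final.pdf"))
```

- Like the GREL version, this removes all occurrences, so two filenames that differ only by `_` or `-` would collide afterwards; that is probably worth checking before import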
- We need to hold off on moving `dc.Species` to `cg.species` because it is only used for plants, and might be better to move it to something like `cg.species.plant`
- And `dc.identifier.fund` is MOSTLY used for CPWF project identifier but has some other sponsorship things
- We should move PN\*, SG\*, CBA, IA, and PHASE\* values to `cg.identifier.cpwfproject`
- The rest, like BMGF and USAID etc, might have to go to either `dc.description.sponsorship` or `cg.identifier.fund` (not sure yet)
- There are also some mistakes in CPWF's things, like "PN 47"
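- A minimal sketch of how the `dc.identifier.fund` values could be triaged before moving them. It assumes PN, SG, and PHASE are prefixes while CBA and IA are exact values; the pattern, the function name `target_field`, and the `"undecided"` label are all hypothetical:

```python
import re

# Assumption: PN*, SG*, and PHASE* are prefix matches; CBA and IA are exact values.
CPWF_PATTERN = re.compile(r"^(?:(?:PN|SG|PHASE).*|CBA|IA)$")


def target_field(value: str) -> str:
    """Suggest a destination field for one dc.identifier.fund value."""
    if CPWF_PATTERN.match(value.strip()):
        return "cg.identifier.cpwfproject"
    # Sponsors such as BMGF or USAID: destination not decided yet
    return "undecided"


if __name__ == "__main__":
    for sample in ["PN47", "PN 47", "CBA", "BMGF", "USAID"]:
        print(sample, "->", target_field(sample))
```

- Note that a value like "PN 47" still matches the PN prefix here, so the mistakes would move along with the rest and need fixing separately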


@ -252,6 +252,26 @@ fi
<li>Because ~400 records had the same filename on Flickr (hqdefault.jpg) but different UUIDs in the URL</li>
<li>So for the <code>hqdefault.jpg</code> ones I just take the UUID (-2) and use it as the filename</li>
<li>Before importing with SAFBuilder I tested adding &ldquo;__bundle:THUMBNAIL&rdquo; to the <code>filename</code> column and it works fine</li>
</ul>
<h2 id="2016-05-19:b7bf1a0f8f2415a40e1e11e343b04c0d">2016-05-19</h2>
<ul>
<li>More quality control on <code>filename</code> field of CCAFS records to make processing in shell and SAFBuilder more reliable:</li>
</ul>
<pre><code>value.replace('_','').replace('-','')
</code></pre>
<ul>
<li>We need to hold off on moving <code>dc.Species</code> to <code>cg.species</code> because it is only used for plants, and might be better to move it to something like <code>cg.species.plant</code></li>
<li>And <code>dc.identifier.fund</code> is MOSTLY used for CPWF project identifier but has some other sponsorship things
<ul>
<li>We should move PN*, SG*, CBA, IA, and PHASE* values to <code>cg.identifier.cpwfproject</code></li>
<li>The rest, like BMGF and USAID etc, might have to go to either <code>dc.description.sponsorship</code> or <code>cg.identifier.fund</code> (not sure yet)</li>
<li>There are also some mistakes in CPWF&rsquo;s things, like &ldquo;PN 47&rdquo;</li>
</ul></li>
</ul>
</section>


@ -191,6 +191,26 @@ fi
&lt;li&gt;So for the &lt;code&gt;hqdefault.jpg&lt;/code&gt; ones I just take the UUID (-2) and use it as the filename&lt;/li&gt;
&lt;li&gt;Before importing with SAFBuilder I tested adding &amp;ldquo;__bundle:THUMBNAIL&amp;rdquo; to the &lt;code&gt;filename&lt;/code&gt; column and it works fine&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-05-19:b7bf1a0f8f2415a40e1e11e343b04c0d&#34;&gt;2016-05-19&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;More quality control on &lt;code&gt;filename&lt;/code&gt; field of CCAFS records to make processing in shell and SAFBuilder more reliable:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;value.replace(&#39;_&#39;,&#39;&#39;).replace(&#39;-&#39;,&#39;&#39;)
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;We need to hold off on moving &lt;code&gt;dc.Species&lt;/code&gt; to &lt;code&gt;cg.species&lt;/code&gt; because it is only used for plants, and might be better to move it to something like &lt;code&gt;cg.species.plant&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;And &lt;code&gt;dc.identifier.fund&lt;/code&gt; is MOSTLY used for CPWF project identifier but has some other sponsorship things
&lt;ul&gt;
&lt;li&gt;We should move PN*, SG*, CBA, IA, and PHASE* values to &lt;code&gt;cg.identifier.cpwfproject&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;The rest, like BMGF and USAID etc, might have to go to either &lt;code&gt;dc.description.sponsorship&lt;/code&gt; or &lt;code&gt;cg.identifier.fund&lt;/code&gt; (not sure yet)&lt;/li&gt;
&lt;li&gt;There are also some mistakes in CPWF&amp;rsquo;s things, like &amp;ldquo;PN 47&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>


@ -191,6 +191,26 @@ fi
&lt;li&gt;So for the &lt;code&gt;hqdefault.jpg&lt;/code&gt; ones I just take the UUID (-2) and use it as the filename&lt;/li&gt;
&lt;li&gt;Before importing with SAFBuilder I tested adding &amp;ldquo;__bundle:THUMBNAIL&amp;rdquo; to the &lt;code&gt;filename&lt;/code&gt; column and it works fine&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-05-19:b7bf1a0f8f2415a40e1e11e343b04c0d&#34;&gt;2016-05-19&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;More quality control on &lt;code&gt;filename&lt;/code&gt; field of CCAFS records to make processing in shell and SAFBuilder more reliable:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;value.replace(&#39;_&#39;,&#39;&#39;).replace(&#39;-&#39;,&#39;&#39;)
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;We need to hold off on moving &lt;code&gt;dc.Species&lt;/code&gt; to &lt;code&gt;cg.species&lt;/code&gt; because it is only used for plants, and might be better to move it to something like &lt;code&gt;cg.species.plant&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;And &lt;code&gt;dc.identifier.fund&lt;/code&gt; is MOSTLY used for CPWF project identifier but has some other sponsorship things
&lt;ul&gt;
&lt;li&gt;We should move PN*, SG*, CBA, IA, and PHASE* values to &lt;code&gt;cg.identifier.cpwfproject&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;The rest, like BMGF and USAID etc, might have to go to either &lt;code&gt;dc.description.sponsorship&lt;/code&gt; or &lt;code&gt;cg.identifier.fund&lt;/code&gt; (not sure yet)&lt;/li&gt;
&lt;li&gt;There are also some mistakes in CPWF&amp;rsquo;s things, like &amp;ldquo;PN 47&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>