mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2016-02-15
Signed-off-by: Alan Orth <alan.orth@gmail.com>
@ -283,6 +283,21 @@ Processing 64195.pdf
<li>A few items link to PDFs on IFPRI’s e-Library or Research Gate</li>
<li>A few items have no item</li>
<li>Also, I’m not sure: if we import these items, will we remove the <code>dc.identifier.url</code> field from the records?</li>
</ul>
<h2 id="2016-02-12-1:124a59adbaa8ef13e1518d003fc03981">2016-02-12</h2>
<ul>
<li>Looking at CIAT’s records again, there are some files linking to PDFs on Slide Share, Embrapa, UEA UK, and Condesan, so I’m not sure if we can use those</li>
<li>265 items have dirty, URL-encoded filenames:</li>
</ul>
<pre><code>$ ls | grep -c -E "%"
265
</code></pre>
<ul>
<li>I suggest that we import the ~850 clean ones first, then do the rest once I find a clean, reliable way to decode the filenames</li>
</ul>
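<p>One possible way to decode the URL-encoded filenames is sketched below in Python, using the standard library's <code>urllib.parse.unquote</code>. This is only a dry-run sketch, not a tested migration script: the helper name <code>plan_renames</code> is hypothetical, and it assumes the percent-escapes encode UTF-8 bytes. Names that fail strict UTF-8 decoding, would decode to something containing a path separator, or would collide with an existing file are skipped so they can be checked by hand.</p>

```python
import os
from urllib.parse import unquote


def plan_renames(filenames):
    """Return {encoded_name: decoded_name} for names that are safe to rename.

    Hypothetical helper: assumes percent-escapes are UTF-8 encoded.
    """
    taken = set(filenames)  # names already in use, including planned targets
    plan = {}
    for name in sorted(filenames):
        if "%" not in name:
            continue  # nothing to decode
        try:
            # errors="strict" raises instead of silently inserting U+FFFD
            decoded = unquote(name, errors="strict")
        except UnicodeDecodeError:
            continue  # not valid UTF-8 after decoding; leave for manual review
        if decoded == name:
            continue  # a literal "%" that is not part of an escape
        if "/" in decoded or "\x00" in decoded:
            continue  # would not be a valid single filename
        if decoded in taken:
            continue  # would overwrite another file
        plan[name] = decoded
        taken.add(decoded)
    return plan


if __name__ == "__main__":
    # Dry run: only print what would be renamed in the current directory.
    for old, new in plan_renames(os.listdir(".")).items():
        print(f"{old} -> {new}")
```

<p>Printing the plan first makes it possible to review the decoded names before calling <code>os.rename</code> on any of them.</p>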
</section>