Add notes for 2016-02-15

Signed-off-by: Alan Orth <alan.orth@gmail.com>
This commit is contained in:
2016-02-15 11:36:31 +02:00
parent 450965091c
commit 6a4cb0aca6
4 changed files with 57 additions and 0 deletions

View File

@ -185,3 +185,15 @@ Processing 64195.pdf
- A few items link to PDFs on IFPRI's e-Library or Research Gate
- A few items have no item
- Also, I'm not sure if we import these items, will be remove the `dc.identifier.url` field from the records?
## 2016-02-12
- Looking at CIAT's records again, there are some files linking to PDFs on Slide Share, Embrapa, UEA UK, and Condesan, so I'm not sure if we can use those
- 265 items have dirty, URL-encoded filenames:
```
$ ls | grep -c -E "%"
265
```
- I suggest that we import ~850 or so of the clean ones first, then do the rest after I can find a clean/reliable way to decode the filenames