Add notes for 2019-08-06

This commit is contained in:
2019-08-06 20:07:44 +03:00
parent 833aa58bac
commit 133c1d6d3f
3 changed files with 31 additions and 8 deletions

View File

@ -34,6 +34,7 @@ tags: ["Notes"]
- The following PDFs are used by several items incorrectly:
- `Report_of_a_Working_Group_on_Allium_7.pdf`
- `Report_of_a_Working_Group_on_Allium_Fourth_meeting_1696.pdf`
- I checked the SHA1 hashes of each PDF and found that some appear more than once...
- The following items use the same PDF with a different name, but seem to be duplicates (pick one?):
- https://www.bioversityinternational.org/index.php?id=244&tx_news_pi1[news]=433
- https://www.bioversityinternational.org/index.php?id=244&tx_news_pi1[news]=10189
@ -61,4 +62,12 @@ or(
- I tried to extract the filenames and construct a URL to download the PDFs with my `generate-thumbnails.py` script, but there seem to be several paths for PDFs so I can't guess it properly
- I will have to wait for Francesco to respond about the PDFs, or perhaps proceed with a metadata-only upload so we can do other checks on DSpace Test
## 2019-08-06
- Francesca responded to address my feedback yesterday
- I made some changes to the CSV based on her feedback (remove two duplicates, change one PDF file name, change two titles)
- Then I found some items that have PDFs in multiple languages that only list one language in `dc.language.iso` so I changed them
- Strangley, one item was referring to a 7zip file...
- After removing the two duplicates there are now 1427 records
<!-- vim: set sw=2 ts=2: -->