diff --git a/content/posts/2022-02.md b/content/posts/2022-02.md
index e2753eb5d..050779287 100644
--- a/content/posts/2022-02.md
+++ b/content/posts/2022-02.md
@@ -116,4 +116,44 @@ $ ./ilri/ror-lookup.py -i /tmp/2022-02-02-donors.txt -r 2021-09-23-ror-data.json
 - According to the logs her account is failing on LDAP authentication
 - I checked CGSpace's LDAP credentials using ldapsearch and was able to connect so it's gotta be something with her account
+## 2022-02-03
+
+- I synchronized DSpace Test with a fresh snapshot of CGSpace
+- I noticed a bunch of thumbnails missing for items submitted in the last week on CGSpace so I ran the `dspace filter-media` script manually and eventually it crashed:
+
+```console
+$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" dspace filter-media
+...
+SKIPPED: bitstream 48612de7-eec5-4990-8f1b-589a87219a39 (item: 10568/67391) because 'ilri_establishiment.pdf.txt' already exists
+Generated Thumbnail ilri_establishiment.pdf matches pattern and is replacable.
+SKIPPED: bitstream 48612de7-eec5-4990-8f1b-589a87219a39 (item: 10568/67391) because 'ilri_establishiment.pdf.jpg' already exists
+File: Agreement_on_the_Estab_of_ILRI.doc.txt
+Exception: org.apache.poi.util.LittleEndian.getUnsignedByte([BI)I
+java.lang.NoSuchMethodError: org.apache.poi.util.LittleEndian.getUnsignedByte([BI)I
+    at org.textmining.extraction.word.model.FormattedDiskPage.<init>(FormattedDiskPage.java:66)
+    at org.textmining.extraction.word.model.CHPFormattedDiskPage.<init>(CHPFormattedDiskPage.java:62)
+    at org.textmining.extraction.word.model.CHPBinTable.<init>(CHPBinTable.java:70)
+    at org.textmining.extraction.word.Word97TextExtractor.getText(Word97TextExtractor.java:122)
+    at org.textmining.extraction.word.Word97TextExtractor.getText(Word97TextExtractor.java:63)
+    at org.dspace.app.mediafilter.WordFilter.getDestinationStream(WordFilter.java:83)
+    at com.atmire.dspace.app.mediafilter.AtmireMediaFilter.processBitstream(AtmireMediaFilter.java:103)
+    at com.atmire.dspace.app.mediafilter.AtmireMediaFilterServiceImpl.filterBitstream(AtmireMediaFilterServiceImpl.java:61)
+    at org.dspace.app.mediafilter.MediaFilterServiceImpl.filterItem(MediaFilterServiceImpl.java:181)
+    at org.dspace.app.mediafilter.MediaFilterServiceImpl.applyFiltersItem(MediaFilterServiceImpl.java:159)
+    at org.dspace.app.mediafilter.MediaFilterServiceImpl.applyFiltersAllItems(MediaFilterServiceImpl.java:111)
+    at org.dspace.app.mediafilter.MediaFilterCLITool.main(MediaFilterCLITool.java:212)
+    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
+    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
+    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
+    at java.lang.reflect.Method.invoke(Method.java:498)
+    at org.dspace.app.launcher.ScriptLauncher.runOneCommand(ScriptLauncher.java:229)
+    at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:81)
+```
+
+- I should look up that issue and report a bug somewhere perhaps, but for now I just forced the JPG thumbnails with:
+
+```console
+$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" dspace filter-media -p "ImageMagick PDF Thumbnail" -v >& /tmp/filter-media.log
+```
+
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index 133069e28..a30b3a824 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -3,19 +3,19 @@
   xmlns:xhtml="http://www.w3.org/1999/xhtml">
   <url>
     <loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
-    <lastmod>2022-02-02T09:11:43+03:00</lastmod>
+    <lastmod>2022-02-02T23:51:22+03:00</lastmod>
   </url><url>
     <loc>https://alanorth.github.io/cgspace-notes/</loc>
-    <lastmod>2022-02-02T09:11:43+03:00</lastmod>
+    <lastmod>2022-02-02T23:51:22+03:00</lastmod>
   </url><url>
     <loc>https://alanorth.github.io/cgspace-notes/2022-02/</loc>
-    <lastmod>2022-02-02T09:11:43+03:00</lastmod>
+    <lastmod>2022-02-02T23:51:22+03:00</lastmod>
   </url><url>
     <loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
-    <lastmod>2022-02-02T09:11:43+03:00</lastmod>
+    <lastmod>2022-02-02T23:51:22+03:00</lastmod>
   </url><url>
     <loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
-    <lastmod>2022-02-02T09:11:43+03:00</lastmod>
+    <lastmod>2022-02-02T23:51:22+03:00</lastmod>
   </url><url>
     <loc>https://alanorth.github.io/cgspace-notes/2022-01/</loc>
     <lastmod>2022-01-31T09:00:59+03:00</lastmod>