diff --git a/content/posts/2020-08.md b/content/posts/2020-08.md
index adb375104..97170138f 100644
--- a/content/posts/2020-08.md
+++ b/content/posts/2020-08.md
@@ -201,6 +201,12 @@ on_id=[A-Z0-9]{32}' | sort | uniq | wc -l
- I developed a small Java class called `FixJpgJpgThumbnails` to remove ".jpg.jpg" thumbnails from the `THUMBNAIL` bundle and replace them with their originals from the `ORIGINAL` bundle
- The code is based on [RemovePNGThumbnailsForPDFs.java](https://github.com/UoW-IRRs/DSpace-Scripts/blob/master/src/main/java/nz/ac/waikato/its/irr/scripts/RemovePNGThumbnailsForPDFs.java) by Andrea Schweer
- I incorporated it into my dspace-curation-tasks repository, then renamed it to [cgspace-java-helpers](https://github.com/ilri/cgspace-java-helpers)
- - In testing I found that I can replace ~3,500 thumbnails on CGSpace!
+ - In testing I found that I can replace ~4,000 thumbnails on CGSpace!
+
+## 2020-08-07
+
+- I improved the `RemovePNGThumbnailsForPDFs.java` a bit more to exclude infographics and original bitstreams larger than 100KiB
+ - I ran it on CGSpace and it cleaned up 3,769 thumbnails!
+ - Afterwards I ran `dspace cleanup -v` to remove the deleted thumbnails
diff --git a/docs/2020-08/index.html b/docs/2020-08/index.html
index ea340c211..fd0540c52 100644
--- a/docs/2020-08/index.html
+++ b/docs/2020-08/index.html
@@ -19,7 +19,7 @@ It is class based so I can easily add support for other vocabularies, and the te
-
+
@@ -43,9 +43,9 @@ It is class based so I can easily add support for other vocabularies, and the te
"@type": "BlogPosting",
"headline": "August, 2020",
"url": "https://alanorth.github.io/cgspace-notes/2020-08/",
- "wordCount": "1382",
+ "wordCount": "1421",
"datePublished": "2020-08-02T15:35:54+03:00",
- "dateModified": "2020-08-06T10:56:13+03:00",
+ "dateModified": "2020-08-06T16:24:01+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@@ -357,7 +357,16 @@ on_id=[A-Z0-9]{32}' | sort | uniq | wc -l
- The code is based on RemovePNGThumbnailsForPDFs.java by Andrea Schweer
- I incorporated it into my dspace-curation-tasks repository, then renamed it to cgspace-java-helpers
-- In testing I found that I can replace ~3,500 thumbnails on CGSpace!
+- In testing I found that I can replace ~4,000 thumbnails on CGSpace!
+
+
+
+2020-08-07
+
+- I improved the
RemovePNGThumbnailsForPDFs.java
a bit more to exclude infographics and original bitstreams larger than 100KiB
+
+- I ran it on CGSpace and it cleaned up 3,769 thumbnails!
+- Afterwards I ran
dspace cleanup -v
to remove the deleted thumbnails
diff --git a/docs/categories/index.html b/docs/categories/index.html
index f0188b048..a898e832c 100644
--- a/docs/categories/index.html
+++ b/docs/categories/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html
index 48ad611a8..e785deea7 100644
--- a/docs/categories/notes/index.html
+++ b/docs/categories/notes/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html
index 26aacb3ee..a46729cad 100644
--- a/docs/categories/notes/page/2/index.html
+++ b/docs/categories/notes/page/2/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/page/3/index.html b/docs/categories/notes/page/3/index.html
index 91e4514c2..340c2dff9 100644
--- a/docs/categories/notes/page/3/index.html
+++ b/docs/categories/notes/page/3/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/categories/notes/page/4/index.html b/docs/categories/notes/page/4/index.html
index 9a0a1363d..3ca6a693f 100644
--- a/docs/categories/notes/page/4/index.html
+++ b/docs/categories/notes/page/4/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/index.html b/docs/index.html
index 9715258c8..f9e8b347a 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/2/index.html b/docs/page/2/index.html
index 6beb051f1..ceea736ea 100644
--- a/docs/page/2/index.html
+++ b/docs/page/2/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/3/index.html b/docs/page/3/index.html
index efc34ac1d..537dca946 100644
--- a/docs/page/3/index.html
+++ b/docs/page/3/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/4/index.html b/docs/page/4/index.html
index be323f486..bd175d7d8 100644
--- a/docs/page/4/index.html
+++ b/docs/page/4/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/5/index.html b/docs/page/5/index.html
index 7c6ab3a94..cf405ac4d 100644
--- a/docs/page/5/index.html
+++ b/docs/page/5/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/page/6/index.html b/docs/page/6/index.html
index 6cb63e81d..ede77718d 100644
--- a/docs/page/6/index.html
+++ b/docs/page/6/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/index.html b/docs/posts/index.html
index e78ea4207..a701be457 100644
--- a/docs/posts/index.html
+++ b/docs/posts/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html
index 0acde5589..11563debd 100644
--- a/docs/posts/page/2/index.html
+++ b/docs/posts/page/2/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/3/index.html b/docs/posts/page/3/index.html
index 1fc117436..a17f1147b 100644
--- a/docs/posts/page/3/index.html
+++ b/docs/posts/page/3/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/4/index.html b/docs/posts/page/4/index.html
index 098957cb1..c11669e9c 100644
--- a/docs/posts/page/4/index.html
+++ b/docs/posts/page/4/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/5/index.html b/docs/posts/page/5/index.html
index e358282ce..230c2b9c6 100644
--- a/docs/posts/page/5/index.html
+++ b/docs/posts/page/5/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/posts/page/6/index.html b/docs/posts/page/6/index.html
index b3926ca7b..c63d300b7 100644
--- a/docs/posts/page/6/index.html
+++ b/docs/posts/page/6/index.html
@@ -9,7 +9,7 @@
-
+
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index ef9779b41..02dec6ed5 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -4,27 +4,27 @@
https://alanorth.github.io/cgspace-notes/2020-08/
- 2020-08-06T10:56:13+03:00
+ 2020-08-06T16:24:01+03:00
https://alanorth.github.io/cgspace-notes/categories/
- 2020-08-06T10:56:13+03:00
+ 2020-08-06T16:24:01+03:00
https://alanorth.github.io/cgspace-notes/
- 2020-08-06T10:56:13+03:00
+ 2020-08-06T16:24:01+03:00
https://alanorth.github.io/cgspace-notes/categories/notes/
- 2020-08-06T10:56:13+03:00
+ 2020-08-06T16:24:01+03:00
https://alanorth.github.io/cgspace-notes/posts/
- 2020-08-06T10:56:13+03:00
+ 2020-08-06T16:24:01+03:00