mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-22 22:55:04 +01:00
Add notes for 2020-01-23
This commit is contained in:
parent
7c401b10bf
commit
832b60c906
@ -219,4 +219,30 @@ $ wc -l hung-nguyen-a*handles.txt
|
|||||||
- Comparing the lists of items, I see that nine of the ten missing items were added less than twenty-four hours ago, and the other was added last week, so they apparently just haven't been indexed yet
|
- Comparing the lists of items, I see that nine of the ten missing items were added less than twenty-four hours ago, and the other was added last week, so they apparently just haven't been indexed yet
|
||||||
- I am curious to check tomorrow to see if they are there
|
- I am curious to check tomorrow to see if they are there
|
||||||
|
|
||||||
|
## 2020-01-23
|
||||||
|
|
||||||
|
- I checked AReS and I see that there are now 55 items for author "Hung Nguyen-Viet"
|
||||||
|
- Linode sent an alert that the outbound traffic rate of CGSpace (linode18) was high for several hours this morning around 5AM UTC+1
|
||||||
|
- I checked the nginx logs this morning for the few hours before and after that using goaccess:
|
||||||
|
|
||||||
|
```
|
||||||
|
# cat /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E "23/Jan/2020:0[12345678]" | goaccess --log-format=COMBINED -
|
||||||
|
```
|
||||||
|
|
||||||
|
- The top two hosts according to the amount of data transferred are:
|
||||||
|
- 2a01:7e00::f03c:91ff:fe9a:3a37
|
||||||
|
- 2a01:7e00::f03c:91ff:fe18:7396
|
||||||
|
- Both are on Linode, and appear to be the new and old ilri.org servers
|
||||||
|
- I will ask the web team
|
||||||
|
- Judging from the [ILRI publications site](https://www.ilri.org/publications/trade-offs-related-agricultural-use-antimicrobials-and-synergies-emanating-efforts) it seems they are downloading the PDFs so they can generate higher-quality thumbnails:
|
||||||
|
- They are apparently using this Drupal module to generate the thumbnails: `sites/all/modules/contrib/pdf_to_imagefield`
|
||||||
|
- I see some excellent suggestions in this [ImageMagick thread from 2012](https://www.imagemagick.org/discourse-server/viewtopic.php?t=21589) that lead me to some nice thumbnails (default PDF density is 72, so supersample to 4X and then resize back to 25%) as well as [this blog post](https://duncanlock.net/blog/2013/11/18/how-to-create-thumbnails-for-pdfs-with-imagemagick-on-linux/):
|
||||||
|
|
||||||
|
```
|
||||||
|
$ convert -density 288 -filter lagrange -thumbnail 25% -background white -alpha remove -sampling-factor 1:1 -colorspace sRGB 10568-97925.pdf\[0\] 10568-97925.jpg
|
||||||
|
```
|
||||||
|
|
||||||
|
- Here I'm also explicitly setting the background to white and removing any alpha layers, but I could probably also just keep using `-flatten` like DSpace already does
|
||||||
|
- I wonder if I could hack this into DSpace code to get better thumbnails...
|
||||||
|
|
||||||
<!-- vim: set sw=2 ts=2: -->
|
<!-- vim: set sw=2 ts=2: -->
|
||||||
|
@ -29,7 +29,7 @@ I tweeted the CGSpace repository link
|
|||||||
<meta property="og:type" content="article" />
|
<meta property="og:type" content="article" />
|
||||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-01/" />
|
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-01/" />
|
||||||
<meta property="article:published_time" content="2020-01-06T10:48:30+02:00" />
|
<meta property="article:published_time" content="2020-01-06T10:48:30+02:00" />
|
||||||
<meta property="article:modified_time" content="2020-01-22T10:35:46+02:00" />
|
<meta property="article:modified_time" content="2020-01-22T14:16:08+02:00" />
|
||||||
|
|
||||||
<meta name="twitter:card" content="summary"/>
|
<meta name="twitter:card" content="summary"/>
|
||||||
<meta name="twitter:title" content="January, 2020"/>
|
<meta name="twitter:title" content="January, 2020"/>
|
||||||
@ -63,9 +63,9 @@ I tweeted the CGSpace repository link
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "January, 2020",
|
"headline": "January, 2020",
|
||||||
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-01\/",
|
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-01\/",
|
||||||
"wordCount": "1674",
|
"wordCount": "1905",
|
||||||
"datePublished": "2020-01-06T10:48:30+02:00",
|
"datePublished": "2020-01-06T10:48:30+02:00",
|
||||||
"dateModified": "2020-01-22T10:35:46+02:00",
|
"dateModified": "2020-01-22T14:16:08+02:00",
|
||||||
"author": {
|
"author": {
|
||||||
"@type": "Person",
|
"@type": "Person",
|
||||||
"name": "Alan Orth"
|
"name": "Alan Orth"
|
||||||
@ -357,6 +357,34 @@ $ wc -l hung-nguyen-a*handles.txt
|
|||||||
</ul>
|
</ul>
|
||||||
</li>
|
</li>
|
||||||
</ul>
|
</ul>
|
||||||
|
<h2 id="2020-01-23">2020-01-23</h2>
|
||||||
|
<ul>
|
||||||
|
<li>I checked AReS and I see that there are now 55 items for author “Hung Nguyen-Viet”</li>
|
||||||
|
<li>Linode sent an alert that the outbound traffic rate of CGSpace (linode18) was high for several hours this morning around 5AM UTC+1
|
||||||
|
<ul>
|
||||||
|
<li>I checked the nginx logs this morning for the few hours before and after that using goaccess:</li>
|
||||||
|
</ul>
|
||||||
|
</li>
|
||||||
|
</ul>
|
||||||
|
<pre><code># cat /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E "23/Jan/2020:0[12345678]" | goaccess --log-format=COMBINED -
|
||||||
|
</code></pre><ul>
|
||||||
|
<li>The top two hosts according to the amount of data transferred are:
|
||||||
|
<ul>
|
||||||
|
<li>2a01:7e00::f03c:91ff:fe9a:3a37</li>
|
||||||
|
<li>2a01:7e00::f03c:91ff:fe18:7396</li>
|
||||||
|
</ul>
|
||||||
|
</li>
|
||||||
|
<li>Both are on Linode, and appear to be the new and old ilri.org servers</li>
|
||||||
|
<li>I will ask the web team</li>
|
||||||
|
<li>Judging from the <a href="https://www.ilri.org/publications/trade-offs-related-agricultural-use-antimicrobials-and-synergies-emanating-efforts">ILRI publications site</a> it seems they are downloading the PDFs so they can generate higher-quality thumbnails:</li>
|
||||||
|
<li>They are apparently using this Drupal module to generate the thumbnails: <code>sites/all/modules/contrib/pdf_to_imagefield</code></li>
|
||||||
|
<li>I see some excellent suggestions in this <a href="https://www.imagemagick.org/discourse-server/viewtopic.php?t=21589">ImageMagick thread from 2012</a> that lead me to some nice thumbnails (default PDF density is 72, so supersample to 4X and then resize back to 25%) as well as <a href="https://duncanlock.net/blog/2013/11/18/how-to-create-thumbnails-for-pdfs-with-imagemagick-on-linux/">this blog post</a>:</li>
|
||||||
|
</ul>
|
||||||
|
<pre><code>$ convert -density 288 -filter lagrange -thumbnail 25% -background white -alpha remove -sampling-factor 1:1 -colorspace sRGB 10568-97925.pdf\[0\] 10568-97925.jpg
|
||||||
|
</code></pre><ul>
|
||||||
|
<li>Here I'm also explicitly setting the background to white and removing any alpha layers, but I could probably also just keep using <code>-flatten</code> like DSpace already does</li>
|
||||||
|
<li>I wonder if I could hack this into DSpace code to get better thumbnails…</li>
|
||||||
|
</ul>
|
||||||
<!-- raw HTML omitted -->
|
<!-- raw HTML omitted -->
|
||||||
|
|
||||||
|
|
||||||
|
@ -4,27 +4,27 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
|
||||||
<lastmod>2020-01-22T10:35:46+02:00</lastmod>
|
<lastmod>2020-01-22T14:16:08+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||||
<lastmod>2020-01-22T10:35:46+02:00</lastmod>
|
<lastmod>2020-01-22T14:16:08+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/2020-01/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/2020-01/</loc>
|
||||||
<lastmod>2020-01-22T10:35:46+02:00</lastmod>
|
<lastmod>2020-01-22T14:16:08+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
|
||||||
<lastmod>2020-01-22T10:35:46+02:00</lastmod>
|
<lastmod>2020-01-22T14:16:08+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
||||||
<lastmod>2020-01-22T10:35:46+02:00</lastmod>
|
<lastmod>2020-01-22T14:16:08+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
|
Loading…
Reference in New Issue
Block a user