<li>For <ahref="https://docs.openalex.org/api-entities/works/work-object#abstract_inverted_index">copyright reasons they don’t include plain abstracts</a>, but the <ahref="https://github.com/J535D165/pyalex">pyalex</a> library can convert them on the fly</li>
</ul>
</li>
</ul>
<ul>
<li>I filtered for journal articles that were Creative Commons and missing abstracts:</li>
<li>After inspecting and cleaning a few dozen up in OpenRefine (removing “Keywords:” and copyright, and HTML entities, etc) I managed to get about 440</li>
<li>Since I increase Solr’s heap from 2 to 3G a few weeks ago it seems like Solr is always using 100% CPU
<ul>
<li>I don’t understand this because it was running well before, and I only increased it in anticipation of running the dspace-statistics-api-js, though never got around to it</li>
<li>I just realized that this may be related to the JMX monitoring, as I’ve seen gaps in the Grafana dashboards and remember that it took surprisingly long to scrape the metrics</li>
<li>Maybe I need to change the scrape interval</li>
</ul>
</li>
</ul>
<h2id="2024-10-08">2024-10-08</h2>
<ul>
<li>I checked the VictoriaMetrics vmagent dashboard and saw that there were thousands of errors scraping the <code>jvm_solr</code> target from Solr
<ul>
<li>So it seems like I do need to change the scrape interval</li>
<li>I will increase it from 15s (global) to 20s for that job</li>
<li>Reading some documentation I found <ahref="https://www.robustperception.io/keep-it-simple-scrape_interval-id/">this reference from Brian Brazil that discusses this very problem</a></li>
<li>He recommends keeping a single scrape interval for all targets, but also checking the slow exporter (<code>jmx_exporter</code> in this case) and seeing if we can limit the data we scrape</li>
<li>To keep things simple for now I will increase the global scrape interval to 20s</li>
<li>Long term I should limit the metrics…</li>
<li>Oh wow, I found out that <ahref="https://solr.apache.org/guide/8_11/monitoring-solr-with-prometheus-and-grafana.html">Solr ships with a Prometheus exporter!</a> and even includes a Grafana dashboard</li>
</ul>
</li>
<li>I’m trying to run the Solr prometheus-exporter as a one-off systemd unit to test it:</li>
</ul>
<divclass="highlight"><pretabindex="0"style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><codeclass="language-console"data-lang="console"><spanstyle="display:flex;"><span># cd /opt/solr-8.11.3/contrib/prometheus-exporter