Add notes for 2023-12-08

This commit is contained in:
2023-12-09 09:55:16 +03:00
parent 2ecafafc17
commit bc6412de09
140 changed files with 191 additions and 177 deletions

View File

@ -11,14 +11,14 @@
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2023-12/" />
<meta property="article:published_time" content="2023-12-01T08:48:36+03:00" />
<meta property="article:modified_time" content="2023-12-06T09:55:57+03:00" />
<meta property="article:modified_time" content="2023-12-08T16:32:48+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="December, 2023"/>
<meta name="twitter:description" content="2023-12-01 There is still high load on CGSpace and I don&rsquo;t know why I don&rsquo;t see a high number of sessions compared to previous days in the last few weeks $ for file in dspace.log.2023-11-[23]*; do echo &#34;$file&#34;; grep -a -oE &#39;session_id=[A-Z0-9]{32}&#39; &#34;$file&#34; | sort | uniq | wc -l; done dspace.log.2023-11-20 22865 dspace.log.2023-11-21 20296 dspace.log.2023-11-22 19688 dspace.log.2023-11-23 17906 dspace.log.2023-11-24 18453 dspace.log.2023-11-25 17513 dspace.log.2023-11-26 19037 dspace.log.2023-11-27 21103 dspace.log.2023-11-28 23023 dspace.log.2023-11-29 23545 dspace."/>
<meta name="generator" content="Hugo 0.121.0">
<meta name="generator" content="Hugo 0.121.1">
@ -28,9 +28,9 @@
"@type": "BlogPosting",
"headline": "December, 2023",
"url": "https://alanorth.github.io/cgspace-notes/2023-12/",
"wordCount": "643",
"wordCount": "695",
"datePublished": "2023-12-01T08:48:36+03:00",
"dateModified": "2023-12-06T09:55:57+03:00",
"dateModified": "2023-12-08T16:32:48+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -220,6 +220,7 @@
<li>I generated ~3,200 statistics based on her records of the download statistics of <a href="https://hdl.handle.net/10568/131997">that item</a> and imported them on CGSpace</li>
</ul>
</li>
<li>Did some work on the DSpace 7 submission form</li>
<li>Peter asked for lists of affiliations, investors, and publishers to do some cleanups
<ul>
<li>I generated a list from a CSV export instead of doing it based on a SQL dump&hellip;</li>
@ -232,6 +233,11 @@
</span></span><span style="display:flex;"><span> | awk &#39;BEGIN { FS = &#34;^[[:space:]]+[[:digit:]]+[[:space:]]+&#34; } {print $2}&#39;\
</span></span><span style="display:flex;"><span> | sed -e &#39;1i cg.contributor.affiliation&#39; -e &#39;s/^\(.*\)$/&#34;\1&#34;/&#39; \
</span></span><span style="display:flex;"><span> &gt; /tmp/2023-12-08-initiatives-affiliations.csv
</span></span></code></pre></div><ul>
<li>Export a list of authors as well:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>localhost/dspace7= ☘ \COPY (SELECT DISTINCT text_value AS &#34;dc.contributor.author&#34;, count(*) FROM metadatavalue WHERE dspace_object_id in (SELECT dspace_object_id FROM item) AND metadata_field_id = 3 GROUP BY &#34;dc.contributor.author&#34; ORDER BY count DESC) to /tmp/2023-12-08-authors.csv WITH CSV HEADER;
</span></span><span style="display:flex;"><span>COPY 102435
</span></span></code></pre></div><!-- raw HTML omitted -->