Notes for 2023-12-08

This commit is contained in:
2023-12-08 16:32:48 +03:00
parent 804a505ae2
commit 2ecafafc17
2 changed files with 39 additions and 2 deletions

View File

@ -109,4 +109,21 @@ $ for network in $(csvcut -c network /tmp/ips.csv | sed 1d | sort -u); do grepci
- I will remove those for now so that Altmetric doesn't have any unexpected issues harvesting
## 2023-12-08
- Finalized the script to generate Solr statistics for Alliance research Mirjam
- The script is `ilri/generate_solr_statistics.py`
- I generated ~3,200 statistics based on her records of the download statistics of [that item](https://hdl.handle.net/10568/131997) and imported them on CGSpace
- Peter asked for lists of affiliations, investors, and publishers to do some cleanups
- I generated a list from a CSV export instead of doing it based on a SQL dump...
```console
$ csvcut -c 'cg.contributor.affiliation[en_US]' /tmp/initiatives.csv \
| sed -e 1d -e 's/^"//' -e 's/"$//' -e 's/||/\n/g' -e '/^$/d' \
| sort | uniq -c | sort -hr \
| awk 'BEGIN { FS = "^[[:space:]]+[[:digit:]]+[[:space:]]+" } {print $2}'\
| sed -e '1i cg.contributor.affiliation' -e 's/^\(.*\)$/"\1"/' \
> /tmp/2023-12-08-initiatives-affiliations.csv
```
<!-- vim: set sw=2 ts=2: -->