📚

2023-03-05 09:59:48 -07:00 · 2022-10-29 10:24:57 -06:00 · 2022-08-07 13:05:43 -06:00 · 2022-03-31 09:24:47 -06:00 · 2022-01-07 12:51:53 -07:00 · 2021-08-14 10:43:44 -06:00
8 changed files with 37123 additions and 247 deletions
--- a/2
+++ b/2
@ -0,0 +1,2 @@
 default: books.csv genfeed.py
 	./genfeed.py < books.csv > feed.xml
--- a/README.md
+++ b/README.md
@ -4,9 +4,17 @@ This is a catalog of all the books in my calibre library!
 If you see a title here you would like to borrow, let me know! I'd be happy to share :)
-## How To
+## How To (For me)
-If you're going to explore this dataset, I recommend using the awesome csvkit.
+- go to calibre and "convert books" -> "create a catalog...."
 - save it to the dir
 - `j all`
 ## How To (For you)
 If you want to browse the collection, I would look at `books.rec`.
 If you're going to really explore this dataset, I recommend using the awesome csvkit.
 => <https://csvkit.readthedocs.io/en/latest/index.html>
@ -17,4 +25,20 @@ It will allow you to do stuff like:
 - look at some stats: `csvcut -c languages,size,formats books.csv | csvstat`
 - find the largest pdfs in the collection: `csvcut -c title_sort,formats,size books.csv | csvgrep -c formats -m pdf | csvsort -c size -r | head`
 - `csvjson books.csv | jq | whatever`
 - show the most recently added books: `csvcut -c 13,1,3 books.csv | csvsort -c timestamp -r | head -n 20`
 - You can also perform actual SQL queries on it, and convert the data between csv and sqlite database: <https://csvkit.readthedocs.io/en/latest/tutorial/3_power_tools.html>
 ## RSS feed
 An RSS feed has been kindly provided by [the Rsszard of Syndication](https://tilde.town/~lucidiot)
 and is available at https://git.tilde.town/dozens/books/raw/branch/main/feed.xml
 Generating the feed requires you to have Python 3.7 or later installed, as well
 as the [xmltodict](https://pypi.org/project/xmltodict) package:
 `pip3 install xmltodict`.
 To generate the feed, run `./geenfeed.py <books.csv >lefeed.xml`.
 ## TODO
 - type definitions for Book
--- a/books.csv
+++ b/books.csv
--- a/books.rec
+++ b/books.rec
--- a/feed.xml
+++ b/feed.xml
--- a/genfeed.py
+++ b/genfeed.py
@ -0,0 +1,88 @@
 #!/usr/bin/env python3
 # -*- coding: utf-8 -*-
 from datetime import datetime, timezone
 from typing import Mapping, MutableMapping
 import csv
 import sys
 import xmltodict
 RSS_DATE_FORMAT = '%a, %d %b %Y %T %z'
 ISO_DATE_FORMAT = '%Y-%m-%dT%H:%M:%S%z'
 def parse_book(book: MutableMapping[str, str]) -> Mapping:
    item = {
        "title": book["title_sort"],
        "pubDate": datetime.strptime(book.pop("timestamp"), ISO_DATE_FORMAT)
                           .strftime(RSS_DATE_FORMAT),
        "guid": {
            "@isPermaLink": "false",
            "#text": book.pop("uuid"),
        },
        "description": book.pop("comments"),
        # The CSV's first character is a non-breaking space for some reason,
        # which breaks the author column
        "author": book.get("author_sort") or book["\ufeffauthor_sort"],
    }
    # Prepend metadata to the item description
    item["description"] = "<dl>{}</dl>{}".format(
        "".join(
            "<dt>{}</dt><dd>{}</dd>".format(
                key.replace('_sort', '').replace('_', ' ').replace('\ufeff', '').capitalize(),
                value,
            )
            for key, value in book.items()
            # Ignore empty columns
            if value
        ),
        item['description']
    )
    if book.get("tags"):
        item["category"] = [
            {
                "@domain": "https://git.tilde.town/dozens/books",
                "#text": tag
            }
            for tag in book["tags"].split(", ")
        ]
    return item
 def main():
    sys.stdout.write(xmltodict.unparse({
        "rss": {
            "@version": "2.0",
            "@xmlns:atom": "http://www.w3.org/2005/Atom",
            "@xmlns:sy": "http://purl.org/rss/1.0/modules/syndication/",
            "channel": {
                "title": "dozens books",
                "description": "the cool calibre library of dozens",
                "link": "https://git.tilde.town/dozens/books",
                "atom:link": {
                    "@rel": "self",
                    "@type": "application/rss+xml",
                    "@href": "https://git.tilde.town/dozens/books/raw/branch/main/feed.xml",
                },
                "language": "en-US",
                "pubDate": datetime.now(timezone.utc)
                                   .strftime(RSS_DATE_FORMAT),
                "docs": "https://www.rssboard.org/rss-specification",
                "webMaster": "dozens@tilde.town (~dozens)",
                "generator": "Python " + ".".join(map(str, sys.version_info[:3])),
                # Update on the first of every month, at midnight UTC
                "sy:updatePeriod": "monthly",
                "sy:updateFrequency": "1",
                "sy:updateBase": "1971-01-01T00:00+00:00",
                # One month, roughly, for clients that do not support mod_syndication
                "ttl": 60 * 24 * 30,
                "item": list(map(parse_book, csv.DictReader(sys.stdin))),
            }
        }
    }, pretty=True, short_empty_elements=True))
 if __name__ == '__main__':
    main()
--- a/14
+++ b/14
@ -0,0 +1,14 @@
 # show all commands
 default:
  just --list
 # generate rss
 rss:
  ./genfeed.py < books.csv > feed.xml
 # make rec
 rec:
  csvformat books.csv | csv2rec > books.rec
 # do the damn thing
 all: rec rss
--- a/requirements.txt
+++ b/requirements.txt
@ -0,0 +1 @@
 xmltodict>=0.12
Author	SHA1	Message	Date
dozens	c3e42563c6	📚	2023-03-05 09:59:48 -07:00
dozens	8a59c47485	📚	2022-10-29 10:24:57 -06:00
dozens	12fa2770d5	📚	2022-08-07 13:05:43 -06:00
dozens	2dc0d374cd	📚	2022-03-31 09:24:47 -06:00
Christopher P. Brown	23d6d84d43	📚 - export catalog - add books.rec - switch from makefile to justfile	2022-01-07 12:51:53 -07:00
Christopher P. Brown	44bf8fabbd	📚	2021-08-14 10:43:44 -06:00
Christopher P. Brown	70fb11b3a1	📚	2021-07-30 17:47:53 -06:00
Christopher P. Brown	a10152ff67	📝	2021-05-09 11:16:24 -06:00
Christopher P. Brown	9238b1816d	📚	2021-05-09 11:09:52 -06:00
Christopher P. Brown	a04db2f312	add makefile	2021-04-17 11:02:34 -06:00
Christopher P. Brown	dbceefe37c	add xml feed	2021-04-17 11:01:19 -06:00
dozens	53d8718ee9	Merge pull request 'Add an RSS feed' (#1 ) from lucidiot/books:rss into main Reviewed-on: http://git.tilde.town/dozens/books/pulls/1	2021-04-17 16:58:06 +00:00
Lucidiot	2dcb679567	Set monthly updates	2021-04-17 18:53:32 +02:00
Lucidiot	b00b383411	Add an RSS feed	2021-04-17 18:48:15 +02:00
		`@ -0,0 +1,2 @@`
							`default: books.csv genfeed.py`
							`./genfeed.py < books.csv > feed.xml`