pilcrow - Run-It-Yourself web chat, maybe

	Commit message (Collapse)	Author	Age
*	Add a `--umask` option to determine what permissions new files/databases get.	Owen Jacobson	2025-07-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The new `--umask` option takes one of three values: * `--umask masked`, the default, takes the inherited umask and forces o+rwx on. * `--umask inherit` takes the inherited umask as-is. * `--umask OCTAL` sets the umask to exactly `OCTAL` and is broadly equivalent to `umask OCTAL && pilcrow --umask inherit`. This fell out of a conversation with @wlonk, who is working on notifications. Since notifications may require [VAPID] keys, the server will need a way to store those keys. That would generally be "in the pilcrow database," which lead me to the observation that Pilcrow creates that database as world-readable by default. "World-readable" and "encryption/signing keys" are not things that belong in the same sentence. [VAPID]: https://datatracker.ietf.org/doc/html/rfc8292 The most "obvious" solution would be to set the permissions used for the sqlite database when it's created. That's harder than it sounds: sqlite has no built-in facility for doing this. The closest thing that exists today is the [`modeof`] query parameter, which copies the permissions (and ownership) from some other file. We also can't reliably set the permissions ourselves, as sqlite may - depending on build options and configuration - [create multiple files][wal]. [`modeof`]: https://www.sqlite.org/uri.html [wal]: https://www.sqlite.org/wal.html Using `umask` is a whole-process solution to this. As Pilcrow doesn't attempt to create other files, there's little issue with doing it this way, but this is a design risk for future work if it creates files that are _intended_ to be readable by more than just the Pilcrow daemon user.
*	Unify `setup_required` middlewares.	Owen Jacobson	2025-06-17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The two middlewares were identical but for the specific `IntoResponse` impl used to generate the response when setup has not been completed. However, unifying them while still using `from_fn_with_state` lead to this horrorshow: .route_layer(middleware::from_fn_with_state( app.clone(), \|state, req, next\| { setup::middeware::setup_required(UNAVAILABLE, state, req, next) } )) It's a lot to read, and it surfaces the entire signature of a state-driven middleware `fn` into the call site solely to close over one argument (`UNAVAILABLE`). Rather than doing that, I've converted this middleware into a full blown Tower middleware, following <https://docs.rs/axum/latest/axum/middleware/index.html#towerservice-and-pinboxdyn-future>. I considered taking this further and implementing a custom future to remove the allocation for `Box::pin`, but honestly, that allocation isn't hurting anyone and this code already got long enough in the translation. The new API looks like: .route_layer(setup::Required::or_unavailable(app.clone())) Or like: .route_layer(setup::Required::with_fallback(app.clone(), RESPONSE)) One thing I would have liked to have avoided is the additional `app.clone()` argument, but there isn't a way to extract the _state_ from a request inside of an Axum middleware. It has to be passed in externally - that's what `from_fn_with_state` is doing under the hood, as well. Using `State` as an extractor doesn't work; the `State` extractor is special in a _bunch_ of ways, and this is one of them. Other extractors would work. Realistically, I'd probably want to explore interfaces like .route_layer(setup::Required(app).or_unavailable()) or .route_layer(app.setup().required().or_unavailable())
*	Build the Sveltekit UI into Cargo's OUT_DIR.	Owen Jacobson	2025-05-28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This has a couple of material consequences: * It will be (much) easier to reorganize the source tree, as the path to the output is no longer relative to where the config files are when building the final binary. If we do decide to move `ui` into its own child crate, we won't have to make a bunch of (very similar) changes to the Svelte build process at that time. * There is less chance of a stale build contaminating a new one, since changes to the crate change the project hash in `OUT_DIR`. For example, while working on this change, `OUT_DIR` was at various points: * `target/debug/build/pilcrow-7cfeef3536ddd3e7/out` * `target/debug/build/pilcrow-09d4ddbc12bef36b/out` * `target/release/build/pilcrow-070d373bd5f850a1` This may use more space on disk, but it's all reclaimable with `cargo clean` and Rust is _far_ more profligate with disk space than Svelte will ever be. * It's more consistent with Cargo's expectations around generated source files, and thus potentially easier to onboard Rust developers into.
*	We no longer need async-trait.	Owen Jacobson	2025-02-20
\| \| \| \|	It was here to support axum 0.7.x.
*	Upgrade to latest thiserror	Owen Jacobson	2025-02-19
\|
*	Upgrade Axum to 0.8.1.	Owen Jacobson	2025-02-19
\|
*	Upgrade sqlx to latest.	Owen Jacobson	2025-02-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can't quite update rusqlite to latest as well, as it uses a slightly newer libsqlite3-sys crate. I made sure this pair of versions is valid: % cargo tree --invert libsqlite3-sys libsqlite3-sys v0.30.1 ├── rusqlite v0.32.1 │ └── pilcrow v0.1.0 (/Users/owen/Projects/grimoire.ca/pilcrow) └── sqlx-sqlite v0.8.3 └── sqlx v0.8.3 └── pilcrow v0.1.0 (/Users/owen/Projects/grimoire.ca/pilcrow) libsqlite3-sys v0.30.1 └── sqlx-sqlite v0.8.3 └── sqlx-macros-core v0.8.3 └── sqlx-macros v0.8.3 (proc-macro) └── sqlx v0.8.3 (*) As both sqlx and rusqlite resolve to use the same version of libsqlite3-sys, we're fine.
*	Mass-upgrade Rust dependencies	Owen Jacobson	2025-02-18
\|
*	Rename the project to `pilcrow`.	Owen Jacobson	2024-11-08
\|
*	Upgrade dependencies.	Owen Jacobson	2024-11-02
\| \| \| \|	(Svelte 5 upgrade not included.)
*	Restrict login names.	Owen Jacobson	2024-10-29
\| \| \| \| \| \| \| \|	There's no good reason to use an empty string as your login name, or to use one so long as to annoy others. Names beginning or ending with whitespace, or containing runs of whitespace, are also a technical problem, so they're also prohibited. This change does not implement [UTS #39], as I haven't yet fully understood how to do so. [UTS #39]: https://www.unicode.org/reports/tr39/
*	Tests for purged channels and messages.	Owen Jacobson	2024-10-25
\| \| \| \|	This required a re-think of the `.immediately()` combinator, to generalize it to cases where a message is _not_ expected. That (more or less immediately) suggested some mixed combinators, particularly for stream futures (futures of `Option<T>`).
*	Set `charset` params on returned content types.	Owen Jacobson	2024-10-22
\| \| \| \|	This is a somewhat indirect change; it removes `mime_guess` in favour of some very, uh, "bespoke" mime detection logic that hardcodes mime types for the small repertoire of file extensions actually present in the UI. `mime_guess` doesn't provide a way to set params as it exports its own `Mime` struct, which doesn't provide `with_params()`.
*	Canonicalize login and channel names.	Owen Jacobson	2024-10-22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Canonicalization does two things: * It prevents duplicate names that differ only by case or only by normalization/encoding sequence; and * It makes certain name-based comparisons "case-insensitive" (generalizing via Unicode's case-folding rules). This change is complicated, as it means that every name now needs to be stored in two forms. Unfortunately, this is _very likely_ a breaking schema change. The migrations in this commit perform a best-effort attempt to canonicalize existing channel or login names, but it's likely any existing channels or logins with non-ASCII characters will not be canonicalize correctly. Since clients look at all channel names and all login names on boot, and since the code in this commit verifies canonicalization when reading from the database, this will effectively make the server un-usuable until any incorrectly-canonicalized values are either manually canonicalized, or removed It might be possible to do better with [the `icu` sqlite3 extension][icu], but (a) I'm not convinced of that and (b) this commit is already huge; adding database extension support would make it far larger. [icu]: https://sqlite.org/src/dir/ext/icu For some references on why it's worth storing usernames this way, see <https://www.b-list.org/weblog/2018/nov/26/case/> and the refernced talk, as well as <https://www.b-list.org/weblog/2018/feb/11/usernames/>. Bennett's treatment of this issue is, to my eye, much more readable than the referenced Unicode technical reports, and I'm inclined to trust his opinion given that he maintains a widely-used, internet-facing user registration library for Django.
*	Unicode normalization on input.	Owen Jacobson	2024-10-21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This normalizes the following values: * login names * passwords * channel names * message bodies, because why not The goal here is to have a canonical representation of these values, so that, for example, the service does not inadvertently host two channels whose names are semantically identical but differ in the specifics of how diacritics are encoded, or two users whose names are identical. Normalization is done on input from the wire, using Serde hooks, and when reading from the database. The `crate::nfc::String` type implements these normalizations (as well as normalizing whenever converted from a `std::string::String` generally). This change does not cover: * Trying to cope with passwords that were created as non-normalized strings, which are now non-verifiable as all the paths to verify passwords normalize the input. * Trying to ensure that non-normalized data in the database compares reasonably to normalized data. Fortunately, we don't _do_ very many string comparisons (I think only login names), so this isn't a huge deal at this stage. Login names will probably have to Get Fixed later on, when we figure out how to handle case folding for login name verification.
*	Dependency upgrades (Rust)	Owen Jacobson	2024-10-18
\|
*	Merge branch 'wip/ui'	Owen Jacobson	2024-10-05
\|\
\| *	Render the UI at /.	Owen Jacobson	2024-10-05
\| \|
* \|	Dependency update.	Owen Jacobson	2024-10-05
\| \|
* \|	Replace `unsafe` impl of backups with `rusqlite`.	Owen Jacobson	2024-10-05
\| \| \| \| \| \| \| \|	The unsafe code still exists, but I have more faith in the rusqlite authors than in myself to ensure that the code is correct.
* \|	Use sqlx's API, not SQL groveling, to find unwanted migrations.	Owen Jacobson	2024-10-05
\| \|
* \|	Make a backup of the `.hi` database before applying migrations.	Owen Jacobson	2024-10-05
\|/ \| \| \|	This was motivated by Kit and I both independently discovering that sqlite3 will happily partially apply migrations, leaving the DB in a broken state.
*	Represent channels and messages using a split "History" and "Snapshot" model.	Owen Jacobson	2024-10-03
\| \| \| \| \| \|	This separates the code that figures out what happened to an entity from the code that represents it to a user, and makes it easier to compute a snapshot at a point in time (for things like bootstrap). It also makes the internal logic a bit easier to follow, since it's easier to tell whether you're working with a point in time or with the whole recorded history. This hefty.
*	Package upgrades	Owen Jacobson	2024-09-25
\|
*	Write tests.	Owen Jacobson	2024-09-20
\|
*	Remove the HTML client, and expose a JSON API.	Owen Jacobson	2024-09-20
\| \| \| \| \| \| \| \| \| \| \| \| \|	This API structure fell out of a conversation with Kit. Described loosely: kit: ok kit: Here's what I'm picturing in a client kit: list channels, make-new-channel, zero to one active channels, post-to-active. kit: login/sign-up, logout owen: you will likely also want "am I logged in" here kit: sure, whoami
*	Blanket dependency upgrades, yolo edition	Owen Jacobson	2024-09-18
\|
*	App methods now return errors that allow not-found cases to be distinguished.	Owen Jacobson	2024-09-18
\|
*	Consolidate channel events into a single stream endpoint.	Owen Jacobson	2024-09-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	While reviewing [MDN], I noticed this note: > SSE suffers from a limitation to the maximum number of open connections, which can be specially painful when opening various tabs as the limit is per browser and set to a very low number (6). […] This limit is per browser + domain, so that means that you can open 6 SSE connections across all of the tabs to www.example1.com and another 6 SSE connections to www.example2.com. I tested it in Safari; this is true, and once six streams are open, _no_ more requests can be made - in any tab, even a fresh one. Since the design _was_ that each channel had its own events endpoint, this is an obvious operations risk. Any client that tries to read multiple channels' streams will hit this limit quickly. This change consolidates all channel events into a single endpoint: `/events`. This takes a list of channel IDs (as query parameters, one `channel=` param per channel), and streams back events from all listed channels. The previous `/:channel/events` endpoint has been removed. Clients can selectively request events for the channels they're interested in. [MDN]: https://developer.mozilla.org/en-US/docs/Web/API/EventSource
*	Placeholder UX, probably	Owen Jacobson	2024-09-14
\|
*	Support Last-Event-Id as a method of resuming channel events after a disconnect	Owen Jacobson	2024-09-13
\|
*	Tolerate panics in channel::app where they can only be triggered by ↵	Owen Jacobson	2024-09-13
\| \| \| \|	implementation errors.
*	Transmit messages via `/:chan/send` and `/:chan/events`.	Owen Jacobson	2024-09-13
\|
*	Add logout support.	Owen Jacobson	2024-09-03
\|
*	Allow login creation and authentication.	Owen Jacobson	2024-09-03
\| \| \| \| \| \| \| \| \| \|	This is a beefy change, as it adds a TON of smaller pieces needed to make this all function: * A database migration. * A ton of new crates for things like password validation, timekeeping, and HTML generation. * A first cut at a module structure for routes, templates, repositories. * A family of ID types, for identifying various kinds of domain thing. * AppError, which _doesn't_ implement Error but can be sent to clients.
*	Store state in sqlite. Default to .hi in the cwd.	Owen Jacobson	2024-08-30
\|
*	Make it an HTTP server	Owen Jacobson	2024-08-30
\|
*	Empty bin crate	Owen Jacobson	2024-08-30