Previously there were mainly two attack vectors:
- for raw keys, the owner <-> key mapping wasn't verified at all
- keys were retrieved with refetching allowed, yet only the
  top-level ID was sanitised even though keys are usually
  just a subobject
This reintroduces public key checks in the user actor,
previously removed in 9728e2f8f7
but now adapted to account for the new mapping mechanism.
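Roughly, the property being enforced is that a key is only trusted if it
points back to the actor it was resolved for and lives on the same host.
A minimal sketch of such a check; module name and field handling here are
illustrative, not the actual Akkoma code:

```elixir
# Illustrative only; this is not the actual Akkoma implementation.
defmodule KeyOwnershipCheck do
  @doc """
  Only trust a fetched key document if its declared owner matches the
  actor's AP id and the key id lives on the same host as the owner.
  """
  def verify(%{"id" => key_id, "owner" => owner}, %{"id" => actor_ap_id}) do
    with true <- owner == actor_ap_id,
         true <- URI.parse(key_id).host == URI.parse(owner).host do
      :ok
    else
      _ -> {:error, :key_owner_mismatch}
    end
  end

  def verify(_key, _actor), do: {:error, :invalid_key_document}
end
```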
Notably at least two instances were not properly guarded against path
traversal attacks before and are only now fixed by using SafeZip:
- frontend installation never checked for malicious paths.
  But given a malicious frontend could already, e.g., steal
  all user tokens even without this, in the real world
  admins should only use frontends from trusted sources
  and the practical implications are minimal
- the emoji pack update/upload API taking a ZIP file
did not protect against path traversal. While atm
only admins can use these emoji endpoints, emoji
packs are typically considered "harmless" and used
without prior verification from various sources.
Thus this appears more concerning.
This will replace all the slightly different safety workarounds at
different ZIP handling sites and ensure safety is actually consistently
enforced everywhere, while also making the code cleaner and easier to
follow.
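The core property such a wrapper has to enforce is that every entry,
after path expansion, still lands inside the target directory. A minimal
sketch of that check, assuming nothing about the real SafeZip API:

```elixir
# Sketch of the underlying idea only; not the actual SafeZip module.
defmodule PathContainment do
  @doc "Accept `entry_name` only if it stays inside `target_dir` after expansion."
  def check(target_dir, entry_name) do
    target = Path.expand(target_dir)
    destination = Path.expand(Path.join(target_dir, entry_name))

    if destination == target or String.starts_with?(destination, target <> "/") do
      :ok
    else
      {:error, :path_traversal}
    end
  end
end

# PathContainment.check("frontends/pleroma", "static/index.html") #=> :ok
# PathContainment.check("frontends/pleroma", "../../etc/passwd")  #=> {:error, :path_traversal}
```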
The first collection page is (sometimes?) inlined,
which caused crashes when attempting to log the fetch failure.
But there’s no need to fetch an inlined page and we can treat it
just like other inline collections.
Also, for now treat partial fetches as a success, since for all
current users partial collection data is better than no data at all.
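For illustration, the distinction is between a "first" that is already a
page object and a "first" that is merely a URL; only the latter needs a
fetch. Sketch only, with field names per ActivityStreams rather than the
actual Akkoma code:

```elixir
# Sketch only; field names follow ActivityStreams, not the actual Akkoma code.
defmodule FirstPage do
  # Inlined first page: the items are already present, nothing to fetch.
  def items(%{"first" => %{"orderedItems" => items}}) when is_list(items), do: {:ok, items}
  def items(%{"first" => %{"items" => items}}) when is_list(items), do: {:ok, items}

  # Linked first page: just a URL, so it would need a separate fetch.
  def items(%{"first" => url}) when is_binary(url), do: {:fetch, url}

  def items(_collection), do: {:ok, []}
end
```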
If an error occurred while fetching a page, this previously
returned a bogus {:ok, {:error, _}} success, causing the error
to be attached to the object as a reply list, subsequently
leading to the whole post getting rejected during validation.
Also, the pinned collection caller did not actually handle
the preexisting error case, resulting in process crashes.
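The shape of the bug, reduced to a toy example (this is not the real
collection fetcher): blindly wrapping the page result converts a failure
into a nominal success.

```elixir
# Toy example of the bug's shape; not the real collection fetcher.
defmodule PageResult do
  # before: whatever comes back gets wrapped, so a failure turns into
  # a nominal {:ok, {:error, reason}} success
  def buggy(fetch_page, url), do: {:ok, fetch_page.(url)}

  # after: propagate the inner error instead of wrapping it
  def fixed(fetch_page, url) do
    case fetch_page.(url) do
      {:ok, page} -> {:ok, page}
      {:error, _} = error -> error
    end
  end
end
```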
Oban catches crashes to handle job failure and retry; thus a crash
never bubbles up all the way and nothing is logged by default.
For better debugging, catch and log any crashes.
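A sketch of the catch-and-log pattern meant here, assuming a generic Oban
worker; queue name and job handling are made up, and real crashes can
also be exits or throws, which a plain rescue does not cover:

```elixir
# Simplified sketch; not the real worker.
defmodule ExampleWorker do
  use Oban.Worker, queue: :federator_incoming

  require Logger

  @impl Oban.Worker
  def perform(%Oban.Job{} = job) do
    do_perform(job)
  rescue
    e ->
      Logger.error("Job crashed: #{Exception.format(:error, e, __STACKTRACE__)}")
      # reraise so Oban still records the failure and schedules retries
      reraise e, __STACKTRACE__
  end

  defp do_perform(_job), do: :ok
end
```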
The object lookup is later repeated in the validator, but due to
caching it shouldn’t incur any noticeable performance impact.
It’s actually preferable to check here, since it avoids the user
lookup that would otherwise occur and the overhead of starting and
aborting a transaction.
Most of them actually only accept either activities or a
non-activity object later; querying both is then a waste
of resources and may create false positives.
The only thing this does is change the updated_at field of the user.
Afaict this came to be because, prior to pins federating, this was split
into two functions, one of which created a changeset while the other
applied a given changeset. When they were merged the bits were just
copied into place.
User.get_or_fetch_by_(apid|nickname) are the only external users of
fetch_and_prepare_user_from_ap_id, thus there’s no point in duplicating
logging, especially not at error level.
Currently (duplicated) _not_found errors for users make up the bulk of my
logs and are created almost every second. Deleted users are a common
occurrence and not worth logging outside of debug.
To facilitate this, ObjectValidator.fetch_actor_and_object is adapted to
return an informative error. Otherwise we’d be unable to make an
informed decision on retrying or not later. There’s no point in
retrying to fetch MRF-blocked stuff or private posts, for example.
This is the only user of fetch_actor_and_object which previously just
always pretended to be successful. For all the activity types handled
here, we absolutely need the referenced object to be able to process it
(other than for Announce, whether processing those activity types for
unknown remote objects is desirable in the first place is up for debate).
All other users of the similar fetch_actor already properly check success.
Note, this currently lumps all resolve failure reasons together, so even
e.g. boosts of MRF-rejected posts will still exhaust all retries.
The following commit improves on this.
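Reduced to a toy example with stand-in functions (this is not the actual
transmogrifier code), the change is from ignoring the fetch result to
propagating its error:

```elixir
# Toy example with stand-in functions; not the actual transmogrifier code.
defmodule IncomingSketch do
  defp fetch_actor_and_object(%{"object" => nil}), do: {:error, :not_found}
  defp fetch_actor_and_object(_data), do: {:ok, :object}
  defp process(data), do: {:ok, data}

  # before: the fetch result was discarded and processing continued regardless
  def handle_before(data) do
    _ = fetch_actor_and_object(data)
    process(data)
  end

  # after: the informative error is propagated, so the caller (e.g. the
  # receiver worker) can decide whether retrying the job makes sense
  def handle_after(data) do
    with {:ok, _} <- fetch_actor_and_object(data) do
      process(data)
    end
  end
end
```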
It makes decisions based on error sources harder, since all possible
nesting levels need to be checked for. As shown by the return values
handled in the receiver worker, something else still nests those,
but this is a first step.
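For illustration only, the kind of unwrapping callers otherwise end up
having to write:

```elixir
# Illustration only: with nested errors, callers have to unwrap an unknown
# number of layers before they can look at the actual reason.
defmodule NestedErrors do
  def reason({:error, {:error, _} = inner}), do: reason(inner)
  def reason({:error, reason}), do: reason
end

# NestedErrors.reason({:error, {:error, :not_found}}) #=> :not_found
```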
This value is currently only used by Prometheus metrics,
but is (after optimising the peer query in the preceding commit)
the most costly part of instance stats.
This query is one of the top cost offenders during an instance’s
lifetime. For small instances it was shown to take up 30-50% of
the total database query time, while for bigger instances it still held
a spot in the top 3, almost as or even more expensive overall than
timeline queries!
The good news is, there’s a cheaper way using the instance table:
no need to process each entry, no need to filter NULLs
and no need to dedupe. EXPLAIN estimates the cost of the
old query as 13272.39 and the cost of the new query as 395.74
for me; i.e. a 33-fold reduction.
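In Ecto terms the difference looks roughly like the following; table and
column names follow the Pleroma/Akkoma schema as I understand it and may
not match the real queries exactly:

```elixir
import Ecto.Query

# old idea: derive the peer count from every known remote user, which means
# scanning the users table, splitting out the host and deduplicating
old_query =
  from(u in "users",
    where: u.local == false,
    select: fragment("count(DISTINCT split_part(?, '/', 3))", u.ap_id)
  )

# new idea: the instance table already holds one row per known host,
# so a plain count is enough
new_query = from(i in "instances", select: count(i.host))
```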
Results can slightly differ. E.g. we might have an old user
predating the instance table’s existence with no interaction since,
or no instance table entry due to a failure to query nodeinfo.
Conversely, we might have an instance entry but all known users got
deleted since.
However, this seems unproblematic in practice
and well worth the perf improvement.
Given the previous query didn’t exclude unreachable instances,
neither does the new query.
Ideally we’d like to split this up more and count most invalid documents
as an error, but silently drop e.g. Deletes for unknown objects.
However, this is hard to extract from the changeset, jobs canceled
with :discard don’t count as exceptions, and I’m not aware of an idiomatic
way to cancel further retries while retaining the exception status.
Thus at least keep a log, but since superfluous "Delete"s
seem kinda frequent, don't log at error, only at info level.
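Sketch of the intended handling with stand-in names (not the real
pipeline code); returning a discard tuple from the Oban job is one way to
drop it without further retries:

```elixir
# Stand-in names; not the real pipeline code.
defmodule IncomingDocumentHandling do
  require Logger

  def handle_validation_result({:ok, activity}, _raw), do: {:ok, activity}

  def handle_validation_result({:error, reason}, raw) do
    # superfluous "Delete"s and similar are frequent and expected,
    # so log at info rather than error and drop the job without retries
    Logger.info("Rejected incoming document (#{inspect(reason)}): #{inspect(raw)}")
    {:discard, reason}
  end
end
```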
There were two issues leading to needless effort:
Most importantly, the use of AP IDs as "source_url" meant multiple
simultaneous jobs got scheduled for the same instance even with the
default unique settings.
Also, jobs were scheduled unconditionally for each processed AP object,
meaning we incurred overhead from managing Oban jobs even if we knew it
wasn't necessary. By comparison, the single query to check if an update
is needed should be cheaper overall.
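A sketch of both fixes with illustrative names (not the actual Akkoma
worker): dedupe on a normalised per-instance value and only enqueue when
an update is actually needed.

```elixir
# Illustrative names; not the actual Akkoma worker.
defmodule InstanceRefreshWorker do
  # dedupe on a normalised per-instance value, so at most one job per
  # instance can sit in the queue regardless of which object triggered it
  use Oban.Worker,
    queue: :background,
    unique: [keys: [:source_url]]

  @impl Oban.Worker
  def perform(%Oban.Job{args: %{"source_url" => _host}}), do: :ok

  # callers normalise the AP id down to its host and only enqueue when the
  # stored data is actually stale; needs_update? is passed in as a stand-in
  def maybe_schedule(ap_id, needs_update?) do
    host = URI.parse(ap_id).host

    if needs_update?.(host) do
      %{"source_url" => host} |> new() |> Oban.insert()
    else
      :ok
    end
  end
end
```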
The return value is never used here; later stages which actually need it
fetch the user themselves and it doesn't matter whether we wait for the
fetch here or later (if needed at all).
What's more, this early fetch always fails if the user was already deleted
or never known to begin with but we get something referencing it, e.g.
the very Delete action carrying out the user deletion.
This prevents processing of the Delete, but before that it will be
reattempted several times, each time attempting to fetch the
non-existent profile, wasting resources.
It was used to migrate OStatus connections to ActivityPub if possible,
but support for OStatus was dropped long ago, all new actors are always AP,
and if anything wasn't migrated before, its instance is already marked
as unreachable anyway.
The associated logic was also buggy in several ways, and deleted users
got set to ap_enabled=false, also causing some issues.
This patch is a pretty direct port of the original Pleroma MR;
follow-up commits will further fix and clean up remaining issues.
Changes made (other than trivial merge conflict resolutions):
- converted CHANGELOG format
- adapted migration id for Akkoma’s timeline
- removed ap_enabled from additional tests
Ported-from: https://git.pleroma.social/pleroma/pleroma/-/merge_requests/3880
Otherwise attachments have a high chance of disappearing with akkoma-fe’s
“delete & redraft” feature when cleanup is enabled in the backend. Since
we don't know whether a deletion was intended to be part of a redraft
process, or, even if it was, whether the redraft was later abandoned, we
still have to delete attachments eventually.
A thirty minute delay should provide sufficient time for redrafting.
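Sketch of the scheduling change; worker name, queue and argument shape
are stand-ins and only the delayed insert (presumably via something like
Oban's schedule_in) reflects the actual change:

```elixir
# Stand-in worker; only the delayed insert reflects the actual change.
defmodule CleanupWorkerSketch do
  use Oban.Worker, queue: :attachments_cleanup

  # thirty minutes between deletion and attachment removal,
  # leaving a window for "delete & redraft" to re-use the files
  @redraft_grace_period 30 * 60

  @impl Oban.Worker
  def perform(%Oban.Job{args: %{"object_id" => _id}}), do: :ok

  def schedule_cleanup(object_id) do
    %{"object_id" => object_id}
    |> new(schedule_in: @redraft_grace_period)
    |> Oban.insert()
  end
end
```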
Fixes: https://akkoma.dev/AkkomaGang/akkoma/issues/775