akkoma

Author	SHA1	Message	Date
Oneric	2eadc4b513	mrf/normalize_markup: also scrub contentMap Only scrubbing "content" leads to differences between "content" and "contentMap" eventhough the latter should ideally match the former exactly for the primary language’s entry. While ideally, for locally generated posts there should be no difference between applying the scrubber or not, as it turns out automatically generated attachment links didn't match the form expected by our default scrubber. Currently Akkoma never uses nor exposes the value of contentMap entries, thus this oversight was harmless wrt to safety and at most pertubed the language detection for our posts perfomed by remote servers. Fixes: https://akkoma.dev/AkkomaGang/akkoma/issues/928	2025-05-16 21:30:26 +02:00
Oneric	c749df62a1	cosmetic: replace deprecated Tuple.append Everytime a tuple is append a new one is allocated and all data copied over. Keeping it a list until all entries are converted avoids this.	2025-05-15 23:40:57 +02:00
Oneric	88a6a9d964	cosmetic: replace deprecated comment syntax in eex The replacement <!-- --> is available since elixir 1.14.0 which matches our minimal version.	2025-05-15 23:07:43 +02:00
Oneric	40fef8e632	api/masto/instance: use WebFinger domain for URI Despite its name this property is not supposed to be a full URI, but just a bare domain witout protocol. Furthermore, it’s supposed to be the WebFinger domain used in userhandles and NOT the domain used for API and ActivityPub objects (which every caller will already know anyway). Not following this caused issues for Pachli and Tusky. Reported-by: nikclayton	2025-05-15 21:10:17 +02:00
Oneric	295e4a4da3	api/masto/instance: add short_description field Added in Mastodon 2.9.2 (June 2019) this is plain-text-only and supposed to be shorter compared to the older description field. Some clients were reported to require this field to properly function. Reported-by: https://akkoma.dev/paulyd	2025-05-15 20:41:55 +02:00
Oneric	f576807f1b	worker/receiver: don't retry unsupported actions Observed for e.g. user delete Undos and Bite activities	2025-05-09 22:29:49 +02:00
Oneric	487473cd75	Merge pull request 'web/metadata: provide alternate link for ActivityPub' (#905 ) from Oneric/akkoma:metadata_aplink into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/905	2025-05-09 20:20:04 +00:00
Oneric	c3d163d34d	Merge pull request 'mediaproxy: proxy network-path references' (#903 ) from Oneric/akkoma:mediaproxy_networkpathref into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/903	2025-05-09 20:13:55 +00:00
Oneric	8cdfbf872d	Merge pull request 'federation/out: tweak publish retry backoff' (#884 ) from Oneric/akkoma:publish_backoff into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/884	2025-05-09 20:12:56 +00:00
Oneric	13940a558a	Merge pull request 'Expose stats about finally failed AP deliveries in prometheus' (#882 ) from Oneric/akkoma:telemetry-failed-deliveries into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/882	2025-05-09 20:12:01 +00:00
Oneric	aac5493dd5	Merge pull request 'Don’t pretend internal actors have follow(er\|ing) collections' (#856 ) from Oneric/akkoma:fetch-actor-follow-collections into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/856	2025-05-09 20:10:41 +00:00
Oneric	d6f5f4db18	Merge pull request 'receiver_worker: prevent duplicate jobs' (#886 ) from Oneric/akkoma:receive_dedupe into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/886	2025-05-09 19:13:14 +00:00
Oneric	a80444041c	federation: always prefer the shared inbox In theory a pedantic reading of the spec indeed suggests DMs must only be delivered to personal inboxes. However, in practice the normative force of real-world implementations disagrees. Mastodon, Iceshrimp.NET and GtS (the latter notably has a config option to never use sharedInboxes) all unconditionally prefer sharedInbox for everything without ill effect. This saves on duplicate deliveries on the sending and processing on the receiving end. (Typically the receiving side ends up rejecting all but the first copy as duplicates) Furthermore current determine_inbox logic also actually needs up forcing personal inboxes for follower-only posts, unless they additionally explicitly address at least one specific actor. This is even much wasteful and directly contradicts the explicit intent of the spec. There’s one part where the use of sharedInbox falls apart, namely spec-compliant bcc and bto addressing. AP spec requires bcc/bto fields to be stripped before delivery and then implicitly reconstructed by the receiver based on the addressed personal inbox. In practice however, this addressing mode is almost unused. Neither of the three implementations brought up above supports it and while *oma does use bcc for list addressing, it does not use it in a spec-compliant way and even copies same-host recipients into cc before delivery. Messages with bcc addressing are handled in another function clause, always force personal inboxes for every recipient and not affected by this commit. In theory it would be beneficial to use sharedInbox there too for all but bcc recipients. But in practice list addressing has been broken for quite some time already and is not actually exposed in any frontend, as discussed in https://akkoma.dev/AkkomaGang/akkoma/issues/812. Therefore any changes here have virtually no effect anyway and all code concerning it may just be outright removed.	2025-05-06 17:38:24 +02:00
Oneric	0d38385d6f	publisher: don't mangle between string and atom Oban jobs only can have string args and there’s no reason to insist on atoms here. Plus this used unchecked string_to_atom	2025-05-06 17:38:18 +02:00
floatingghost	f0653efe13	Merge pull request 'Fix Pleroma’s unlisted posts' (#885 ) from Oneric/akkoma:pleroma_unlisted into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/885	2025-05-02 22:26:59 +00:00
Oneric	ed03eaade9	web/metadata: provide alternate link for ActivityPub This allows discovering a page represents an ActivityPub object and also where to find the underlying representation. Other servers already implement this and some tools came to rely or profit from it. The alternate link is provided both with the "application/activity+json" format as used by Mastodon and the standard-compliant media type. Just like the feed provider, ActivityPub links are always enabled unless access to local posts is restricted and not configurable. The commit is based on earlier work by Charlotte 🦝 Deleńkec but with fixes and some tweaks. Co-authored-by: Charlotte 🦝 Deleńkec <lotte@chir.rs>	2025-04-27 00:35:02 +02:00
Oneric	bbf974adc8	mediaproxy: proxy network-path references Discovered-by: snow Fixes: https://akkoma.dev/AkkomaGang/akkoma/issues/900	2025-04-18 01:50:27 +02:00
Oneric	1f6f5edf85	telemetry: expose stats about failed deliveries And also log about it which we so far didn't do	2025-04-15 19:41:12 +02:00
Oneric	2fee79e1f5	Use apropriate cancellation type for oban jobs :discard marks jobs as "discarded", i.e. jobs which permanently failed due to e.g. exhausting all retries or explicitly being discared due to a fatal error. :cancel marks jobs as "cancelled" which does not imply failure. While neither method counts as a job "exception" in the set of telemetries we currently export via Prometheus, the different state is visible in the (not-exported) metadata of oban job telemetry. We can use handlers of those events to build bespoke statistics. Ideally we'd like to distinguish in the receiver worker between "invalid" and "already present or delete of unknown" documents, but this is cumbersome to get get right with a list of free-form, human-readable descriptions oof the violated constraints. For now, just count both as an fatal error. # but that is cumbersome to get right with a list of string error descriptions	2025-04-15 19:40:26 +02:00
Oneric	984e5a121a	api/statuses: allow expires_in to override user-level status_ttl_default Fixes: https://akkoma.dev/AkkomaGang/akkoma/issues/898	2025-04-08 23:43:59 +02:00
ilja space	c9a36e4340	Support htmlMfm term key for FEP-c16b compliance We now add the htmlMfm key when relevant, store this in the database, and we see it when fetching using e.g. curl -L -H 'Accept: application/activity+json' "$ap_id" The `@context` of the Activity Pub message now also contains `htmlMfm: https://w3id.org/fep/c16b#htmlMfm`. When an incomming post has `htmlMfm: true`, we will not re-parse the content. FEDERATION.md is adapted to show the `htmlMfm` term is used.	2025-04-06 19:56:56 +02:00
Oneric	caf6b4606f	Fix representaton of internal actors CUrrently internal actors are supposed to be identified in the database by either a NULL nickname or a nickname prefixed by "internal.". For old installations this is true, but only if they were created over five years ago before `70410dfafd`. Newer installations will use "relay" as the nickname of the realy actor causing ii to be treated as a regular user. In particular this means all installations in the last five years never made use of the reduced endpoint case, thus it is dropped. Simplify this distinction by properly marking internal actors asa an Application type in the database. This was already implemented before by ilja in https://akkoma.dev/AkkomaGang/akkoma/pulls/457 but accidentally reverted during a translation update in `eba3cce77b`. This commit effectively restores this patch together with further changes. Also service actors unconditionally expose follow* collections atm, eventhough the internal fetch actor doesn't actually implement them. Since they are optional per spec and with Mastodon omitting them too for its instance actor proving the practical viability, we should just omit them. The relay actor however should continue to expose such collections and they are properly implemented here. Here too we now just use the values or their absence in the database. We do not have any other internal.* actors besides fetch atm. Fixes: https://akkoma.dev/AkkomaGang/akkoma/issues/855 Co-authored-by: ilja space <git@ilja.space>	2025-03-26 17:14:28 +01:00
Oneric	b58b6af3ba	cosmetic: adapt software name in internal actor descriptions	2025-03-26 05:03:18 +01:00
Oneric	195042bdc9	receiver_worker: prevent duplicate jobs E.g. \*oma federates (most) follower-only posts multiple times to each personal inbox. This commonly leads to race conditions with jobs of several copies running at the same time and getting past the initial "already known" check but then later all but one will crash with an exception from the unique db index. Since the only special thing we do with copies anyway is to discard them, just don't create such duplicate jobs in the first place. For the same reason and since failed jobs don't count towards duplicates, this should have virtually no effect on federation.	2025-03-18 03:46:33 +01:00
Oneric	7ffbe2ad26	upload/filter/exiftool/strip: hide warnings from log	2025-03-18 01:01:47 +01:00
Oneric	0abe01be2e	federation/in: always copy object addressing into its Create activity Since we later only consider the Create activity for access permission checks, but the semantically more sensible set of fields are the object’s. Changing the check itself to use the object may have unintended consequences on already existing legacy posts as the old code which processed it when it arrived may have never considered effects on the objects addressing fields.	2025-03-17 23:08:27 +01:00
Oneric	cdf576b951	federation/in: fix activity addressing of Pleroma unlisted While the object itself has the expected adressing for an "unlisted" post, we always use the Create activity’s adressing fields for permission checks. To avoid unintended effects on legacy objects we will continue to use the activity for access perm checks, but fix its addressing fields based on its object data. Ref: https://git.pleroma.social/pleroma/pleroma/-/issues/3323	2025-03-17 23:06:16 +01:00
Oneric	4011d20dbe	federation/out: tweak publish retry backoff With the current strategy the individual and cumulative backoff looks like this (the + part denotes max extra random delay): attempt backoff_single cumulative 1 16+30 16+30 2 47+60 63+90 3 243+90 ≈ 4min 321+180 4 1024+120 ≈17min 1360+300 ≈23+5min 5 3125+150 ≈20min 4500+450 ≈75+8min 6 7776+180 ≈ 2.1h 12291+630 ≈3.4h 7 16807+210 ≈ 4.6h 29113+840 ≈8h 8 32768+240 ≈ 9.1h 61896+1080 ≈17h 9 59049+270 ≈16.4h 120960+1350 ≈33h 10 100000+300 ≈27.7h 220975+1650 ≈61h We default to 5 retries meaning the least backoff runs with attempt=4. Therefore outgoing activiities might already be permanently dropped by a downtime of only 23 minutes which doesn't seem too implausible to occur. Furthermore it seems excessive to retry this quickly this often at the beginning. At the same time, we’d like to have at least one quick'ish retry to deal with transient issues and maintain reasonable federation responsiveness. If an admin wants to tolerate one -day downtime of remotes, retries need to be almost doubled. The new backoff strategy implemented in this commit instead switches to an exponetial after a few initial attempts: attempt backoff_single cumulative 1 16+30 16+30 2 143+60 159+90 3 2202+90 ≈37min 2361+180 ≈40min 4 8160+120 ≈ 2.3h 10521+300 ≈ 3h 5 77393+150 ≈21.5h 87914+450 ≈24h Initial retries are still fast, but the same amount of retries now allows a remote downtime of at least 40 minutes. Customising the retry count to 5 allows for whole-day downtimes.	2025-03-17 19:37:54 +01:00
floatingghost	0a9cf8fa8b	Merge pull request 'Test lowest and highest language versions, elixir 1.18 support' (#875 ) from ci-testing-all-versions into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/875	2025-03-11 20:47:54 +00:00
Oneric	066d5b48ed	Fix Content-Type sanitisation for emoji and local uploads This was accidentally broken in `c8e0f7848b` due to a one-letter mistake in the plug option name and an absence of tests. Therefore it was once again possible to serve e.g. Javascript or CSS payloads via uploads and emoji. However due to other protections it was still NOT possible for anyone to serve any payload with an ActivityPub Content-Type. With the CSP policy hardening from previous JS payload exloits predating the Content-Type sanitisation, there is currently no known way of abusing this weakened Content-Type sanitisation, but should be fixed regardless. This commit fixes the option name and adds tests to ensure such a regression doesn't occur again in the future. Reported-by: Lain Soykaf <lain@lain.com>	2025-03-10 19:45:26 +01:00
Floatingghost	f176294d6d	elixir 1.18 formatting	2025-03-02 11:54:00 +00:00
Floatingghost	522a168af6	force signatures for pinned posts	2025-03-01 17:27:45 +00:00
Floatingghost	d62808e4b6	move /outbox to signed pipeline	2025-03-01 16:28:12 +00:00
Floatingghost	a47b02cb69	Merge remote-tracking branch 'oneric-sec/sec-2024-12' into develop	2025-03-01 12:13:17 +00:00
Floatingghost	a49f04bb4e	Merge branch 'develop' into oban_web	2025-02-23 16:16:48 +00:00
Floatingghost	da7998e89e	put oban route under a known prefix	2025-02-23 16:16:17 +00:00
Oneric	8243fc0ef4	federation: strip internal fields from incoming updates and history When note editing support was added, it was omitted to strip internal fields from edited notes and their history. This was uncovered due to Mastodon inlining the like count as a "likes" collection conflicting with our internal "likes" list causing validation failures. In a spot check with likes/like_count it was not possible to inject those internal fields into the local db via Update, but this was not extensively tested for all fields and avenues. Similarly address normalisation did not normalise addressing in the object history, although this was never at risk of being exploitable. The revision history of the Pleroma MR adding edit support reveals recusrive stripping was intentionally avoided, since it will end up removing e.g. emoji from outgoing activities. This appears to still be true. However, all current internal fields ("pleroma_interal" appears to be unused) contain data already publicised otherwise anyway. In the interest of fixing a federation bug (and at worst potential data injection) quickly outgoing stripping is left non-recursive for now. Of course the ultimate fix here is to not mix remote and internal data into the same map in the first place, but unfortunately having a single map of all truth is a core assumption of *oma's AP doc processing. Changing this is a masive undertaking and not suitable for providing a short-term fix.	2025-02-21 19:37:27 +01:00
Oneric	11ad4711eb	signing_key: don't retrieve superfluous fields when loading ap_id	2025-02-21 19:37:27 +01:00
Oneric	d8e40173bf	http_signatures: tweak order of route aliases We expect most requests to be made for the actual canonical ID, so check this one first (starting without query headers matching the predominant albeit spec-breaking version). Also avoid unnecessary rerewrites of the digest header on each route alias by just setting it once before iterating through aliases.	2025-02-21 19:37:27 +01:00
Oneric	9cc5fe9a5f	signature: refetch key upon verification failure This matches behaviour prioir to the SigningKey migration and the expected semantics of the http_signatures lib. Additionally add a min interval paramter, to avoid refetch floods on bugs causing incompatible signatures (like e.g. currently with Bridgy)	2025-02-21 19:37:27 +01:00
floatingghost	355263858c	Merge pull request 'Expose Port IO stats via Prometheus' (#869 ) from Oneric/akkoma:io-telemetry into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/869	2025-02-21 15:28:09 +00:00
Oneric	a7b4e4bfd9	signature: distinguish error sources and log fetch issues	2025-02-14 22:10:25 +01:00
Oneric	51642a90c5	signature: drop unecessary round trip over user We already got the key.	2025-02-14 22:10:25 +01:00
Oneric	898b98e5dd	db: drop legacy key fields in users table	2025-02-14 22:10:25 +01:00
Oneric	ea2de1f28a	signing_key: ensure only one key per user exists Fixes: AkkomaGang/akkoma issue 858	2025-02-14 22:10:25 +01:00
Oneric	3460f41776	Fix user updates User updates broke with the migration to separate signing keys since user data carries signing keys but we didn't allow the association data to be updated.	2025-02-14 22:10:25 +01:00
Oneric	cc5c1bb10c	signing_key: cleanup code In particular this avoids an unecessary roundtrip over user_id when searching a key via its primary key_id	2025-02-14 22:10:25 +01:00
Oneric	70fe99d196	Prevent key-actor mapping poisoning and key take overs Previously there were mainly two attack vectors: - for raw keys the owner <-> key mapping wasn't verified at all - keys were retrieved with refetching allowed and only the top-level ID was sanitised while usually keys are but a subobject This reintroduces public key checks in the user actor, previously removed in `9728e2f8f7` but now adapted to account for the new mapping mechanism.	2025-02-14 22:10:25 +01:00
Oneric	366065c0f6	fetcher: split out core object fetch validation To allow reuse for adapted key validation logic	2025-02-14 22:10:25 +01:00
Oneric	d68a5f6c56	Protected against counterfeit local docs being posted Only possible if actor keys leaked first thus log with alert level	2025-02-14 22:10:25 +01:00
Oneric	4231345f4e	cosmetic/emoji/pack: fix spelling There might be further debate about "emoji" vs "emojis" for the plural but a grep shows the latter is already widely used in our codebase.	2025-02-14 22:10:25 +01:00
Oneric	96fe080e6e	Convert all raw :zip usage to SafeZip Notably at least two instances were not properly guarded from path traversal attack before and are only now fixed by using SafeZip: - frontend installation did never check for malicious paths. But given a malicious froontend could already, e.g. steal all user tokens even without this, in the real world admins should only use frontends from trusted sources and the practical implications are minimal - the emoji pack update/upload API taking a ZIP file did not protect against path traversal. While atm only admins can use these emoji endpoints, emoji packs are typically considered "harmless" and used without prior verification from various sources. Thus this appears more concerning.	2025-02-14 22:10:25 +01:00
Oneric	7151ef4718	Add SafeZip module This will replace all the slightly different safety workarounds at different ZIP handling sites and ensure safety is actually consistently enforced everywhere while also making code cleaner and easiert to follow.	2025-02-14 22:10:25 +01:00
Oneric	c8e0f7848b	Migrate back to upstream Plug.Static Commit `a924e117fd` bumped the plug package to 1.16.0 which includes our upstream patch. Resolves: https://akkoma.dev/AkkomaGang/akkoma/issues/734	2025-02-14 22:10:25 +01:00
Oneric	1c2eb4d799	cosmetic/object: drop is_ prefix from is_tombstone_object? The question mark suffix already implies it being an indicator function	2025-02-14 22:10:25 +01:00
Oneric	7998a00346	cosmetic/rich_media/parser: fix typo	2025-02-14 22:10:25 +01:00
Oneric	f0a99b4595	article_note_validator: fix handling of Mastodon-style replies collections The first collection page is (sometimes?) inlined which caused crashes when attempting to log the fetch failure. But there’s no need to fetch and we can treat it like the other inline collection	2025-02-14 18:49:51 +01:00
Oneric	1b09b9fc22	static_fe: fix HTML quotation for upload alt text Reported by riley on IRC	2025-02-14 18:49:51 +01:00
Oneric	46148c0825	Don't return garbage on failed collection fetches And for now treat partial fetches as a success, since for all current users partial collection data is better than no data at all. If an error occurred while fetching a page, this previously returned a bogus {:ok, {:error, _}} success, causing the error to be attached to the object as an reply list subsequently leading to the whole post getting rejected during validation. Also the pinned collection caller did not actually handle the preexisting error case resulting in process crashes.	2025-02-14 18:49:51 +01:00
Oneric	4701aa2a38	receiver_worker: log processes crashes Oban cataches crashes to handle job failure and retry, thus it never bubbles up all the way and nothing is logged by default. For better debugging, catch and log any crashes.	2025-02-14 18:46:19 +01:00
Oneric	fb3de8045a	Expose Port IO stats via Prometheus	2025-01-27 20:00:30 +01:00
Floatingghost	b2b63ad89f	add oban web	2025-01-17 09:38:09 +00:00
Oneric	2ddff7e386	transmogrifier: gracefully ignore Delete of unknown objects It's quite common to receive spurious Deletes, so we neither want to waste resources on retrying nor spam "invalid AP" logs	2025-01-07 20:27:28 +01:00
Oneric	cd8e6a4235	transmogrifier: gracefully ignore duplicated object deletes The object lookup is later repeated in the validator, but due to caching shouldn't incur any noticeable performance impact. It’s actually preferable to check here, since it avoids the otherwise occuring user lookup and overhead from starting and aborting a transaction	2025-01-07 20:27:28 +01:00
Oneric	ac2327c8fc	transmogrfier: be more selective about Delete retry If something else renders the Delete invalid, there’s no point in retrying anyway	2025-01-07 20:27:28 +01:00
Oneric	92bf93a4f7	transmogrifier: avoid crashes on non-validation Delte errors Happens e.g. for duplicated Deletes. The remaining tombstone object no longer has an actor, leading to an error response during side-effect handling.	2025-01-07 20:27:28 +01:00
Oneric	7ad5f8d3c0	object_validators: only query relevant table for object Most of them actually only accept either activities or a non-activity object later; querying both is then a waste of resources and may create false positives.	2025-01-07 20:27:28 +01:00
Oneric	b0387dee14	Gracefully ignore Undo activities referring to unknown objects	2025-01-07 20:27:28 +01:00
Oneric	caa4fbe326	user: avoid database work on superfluous pin The only thing this does is changing the updated_at field of the user. Afaict this came to be because prior to pins federating this was split into two functions, one of which created a changeset, the other applying a given changeset. When this was merged the bits were just copied into place.	2025-01-07 20:27:28 +01:00
Oneric	09736431e0	Don't spam logs about deleted users User.get_or_fetch_by_(apid\|nickname) are the only external users of fetch_and_prepare_user_from_ap_id, thus there’s no point in duplicating logging, expecially not at error level. Currently (duplicated) _not_found errors for users make up the bulk of my logs and are created almost every second. Deleted users are a common occurence and not worth logging outside of debug	2025-01-07 20:27:28 +01:00
Oneric	bcf3e101f6	rich_media: lower log level of update	2025-01-07 20:27:28 +01:00
Oneric	05bbdbf388	nodeinfo: lower log level of regular actions to debug	2025-01-07 20:27:28 +01:00
Oneric	2c75600532	federation/incoming: improve link_resolve retry decision To facilitate this ObjectValidator.fetch_actor_and_object is adapted to return an informative error. Otherwise we’d be unable to make an informed decision on retrying or not later. There’s no point in retrying to fetch MRF-blocked stuff or private posts for example.	2025-01-07 20:27:28 +01:00
Oneric	0cd4040db6	Error out earlier on missing mandatory reference This is the only user of fetch_actor_and_object which previously just always preteneded to be successful. For all the activity types handled here, we absolutely need the referenced object to be able to process it (other than Announce whether or not processing those activity types for unknown remote objects is desirable in the first place is up for debate) All other users of the similar fetch_actor already properly check success. Note, this currently lumps all reolv failure reasons together, so even e.g. boosts of MRF rejected posts will still exhaust all retries. The following commit improves on this.	2025-01-07 20:27:28 +01:00
Oneric	0ba5c3649d	federator: don't nest {:error, _} tuples It makes decisions based on error sources harder since all possible nesting levels need to be checked for. As shown by the return values handled in the receiver worker something else still nests those, but this is a first start.	2025-01-07 20:27:28 +01:00
Oneric	8e5defe6ca	stats: estimate remote user count This value is currently only used by Prometheus metrics but (after optimisng the peer query inthe preceeding commit) the most costly part of instance stats.	2025-01-07 20:27:28 +01:00
Oneric	138b1aea2f	stats: use cheaper peers query This query is one of the top cost offenders during an instances lifetime. For small instances it was shown to take up 30-50% percent of the total database query time, while for bigger isntaces it still held a spot in the top 3 — alost as or even more expensive overall than timeline queries! The good news is, there’s a cheaper way using the instance table: no need to process each entry, no need to filter NULLs and no need to dedupe. EXPLAIN estimates the cost of the old query as 13272.39 and the cost of the new query as 395.74 for me; i.e. a 33-fold reduction. Results can slightly differ. E.g. we might have an old user predating the instance tables existence and no interaction with since or no instance table entry due to failure to query nodeinfo. Conversely, we might have an instance entry but all known users got deleted since. However, this seems unproblematic in practice and well worth the perf improvment. Given the previous query didn’t exclude unreachable instances neither does the new query.	2025-01-07 20:27:28 +01:00
Oneric	8b5183cb74	stats: fix stat spec	2025-01-07 20:27:28 +01:00
Oneric	cbb0d4b0a8	receiver_worker: log unecpected errors This can't handle process crash errors but i hope those get a stacktrace logged by default	2025-01-07 20:27:28 +01:00
Oneric	be2c857845	receiver_worker: don't reattempt invalid documents Ideally we’d like to split this up more and count most invalid documents as an error, but silently drop e.g. Deletes for unknown objects. However, this is hard to extract from the changeset and jobs canceled with :discard don’t count as exceptions and I’m not aware of a idiomatic way to cancel further retries while retaining the exception status. Thus at least keep a log, but since superfluous "Delete"s seem kinda frequent, don't log at error, only info level.	2025-01-07 20:27:28 +01:00
Oneric	9f4d3a936f	cosmetic/receiver_worker: reformat error cases The next commit adds a multi-statement case and then mix format will enforce this anyway	2025-01-07 20:27:28 +01:00
Oneric	f9724b5879	Don’t reattempt insertion of already known objects Might happen if we receive e.g. a Like before the Note arrives in our inbox and we thus already queried the Note ourselves.	2025-01-07 20:27:27 +01:00
Oneric	280652651c	rich_media: don't reattempt parsing on rejected URLs	2025-01-07 20:27:27 +01:00
Oneric	92544e8f99	Don't enqueue a plethora of unnecessary NodeInfoFetcher jobs There were two issues leading to needles effort: Most importnatly, the use of AP IDs as "source_url" meant multiple simultaneous jobs got scheduled for the same instance even with the default unique settings. Also jobs were scheduled uncontionally for each processed AP object meaning we incured oberhead from managing Oban jobs even if we knew it wasn't necessary. By comparison the single query to check if an update is needed should be cheaper overall.	2025-01-07 20:27:27 +01:00
Oneric	d283ac52c3	Don't create noop SearchIndexingWorker jobs for passive index	2025-01-07 20:27:27 +01:00
Oneric	ed4019e7a3	workers: make custom filtering ahead of enqueue possible	2025-01-07 20:27:27 +01:00
Oneric	25d24cc5f6	validators/add_remove: don't crash on failure to resolve reference It allows for informed error handling and retry/discard job decisions lateron which a future commit will add.	2025-01-07 20:27:27 +01:00
Oneric	ead44c6671	federator: don't fetch the user for no reason The return value is never used here; later stages which actually need it fetch the user themselves and it doesn't matter wheter we wait for the fech here or later (if needed at all). Even more, this early fetch always fails if the user was already deleted or never known to begin with, but we get something referencing it; e.g. the very Delete action carrying out the user deletion. This prevents processing of the Delete, but before that it will be reattempted several times, each time attempring to fetch the non-existing profile, wasting resources.	2025-01-07 20:27:27 +01:00
Oneric	4859f38624	add_remove_validator: limit refetch rate to 1 per 5s This matches the maximum_age used when processing Move activities	2025-01-07 20:27:27 +01:00
Oneric	0f4a7a185f	Drop ap_enabled indicator from atom feeds	2025-01-07 20:27:27 +01:00
Haelwenn (lanodan) Monnier	c17681ae1e	Purge obsolete ap_enabled indicator It was used to migrate OStatus connections to ActivityPub if possible, but support for OStatus was long since dropped, all new actors always AP and if anything wasn't migrated before, their instance is already marked as unreachable anyway. The associated logic was also buggy in several ways and deleted users got set to ap_enabled=false also causing some issues. This patch is a pretty direct port of the original Pleroma MR; follow-up commits will further fix and clean up remaining issues. Changes made (other than trivial merge conflict resolutions): - converted CHANGELOG format - adapted migration id for Akkoma’s timeline - removed ap_enabled from additional tests Ported-from: https://git.pleroma.social/pleroma/pleroma/-/merge_requests/3880	2025-01-07 20:27:26 +01:00
Floatingghost	1ffbaa2924	don't allow a nil inbox to obliterate federation	2025-01-06 11:43:41 +00:00
floatingghost	39cef8b8d2	Merge pull request 'Set customize_hostname_check for Swoosh.Adapters.SMTP' (#861 ) from norm/akkoma:smtp-defaults-fix into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/861	2025-01-05 15:43:16 +00:00
floatingghost	8de373fa24	Merge pull request 'Fix various attachment cleanup issues' (#789 ) from Oneric/akkoma:attachcleanup-overeager into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/789	2025-01-05 15:39:48 +00:00
floatingghost	7c095a6b70	Merge pull request 'do not fetch if :limit_to_local_content is :all or :unauthenticated' (#582 ) from beerriot/akkoma:develop-no-fetch-with-local-limit into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/582	2025-01-05 15:39:13 +00:00
Oneric	e8bf4422ff	Delay attachment deletion Otherwise attachments have a high chance to disappear with akkoma-fe’s “delete & redraft” feature when cleanup is enabled in the backend. Since we don't know whether a deletion was intended to be part of a redraft process or even if whether the redraft was abandoned we still have to delete attachments eventually. A thirty minute delay should provide sufficient time for redrafting. Fixes: https://akkoma.dev/AkkomaGang/akkoma/issues/775	2025-01-03 20:49:11 +01:00
Oneric	bcfbfbcff5	Don't try to cleanup remote attachments The cleanup attachment worker was run for every deleted post, even if it’s a remote post whose attachments we don't even store. This was especially bad due to attachment cleanup involving a particularly heavy query wasting a bunch of database perf for nil. This was uncovered by comparing statistics from https://akkoma.dev/AkkomaGang/akkoma/issues/784 and https://akkoma.dev/AkkomaGang/akkoma/issues/765#issuecomment-12256	2025-01-03 20:48:46 +01:00
floatingghost	e3c8c4f24f	Merge pull request 'mrf/object_age: fix handling of non-public objects' (#851 ) from Oneric/akkoma:mrf-fix-oage into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/851	2025-01-03 15:26:11 +00:00
floatingghost	67cdc38296	Merge pull request 'Only proxy HTTP and HTTP urls via Media Proxy' (#860 ) from nopjmp/akkoma:media-proxy-only-http into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/860	2025-01-03 15:25:14 +00:00
floatingghost	89d209f486	Merge pull request 'Fix NodeInfo content-type' (#853 ) from Oneric/akkoma:nodeinfo-contenttype into develop Reviewed-on: https://akkoma.dev/AkkomaGang/akkoma/pulls/853	2025-01-03 15:24:25 +00:00

1 2 3 4 5 ...

9126 commits