
Only update the ChordCounter.count field when saving #347

Merged
1 commit merged into celery:master on Oct 21, 2022

Conversation

orf (Contributor) commented Oct 20, 2022

With the current implementation, the following SQL query is executed for every task in the chord:

UPDATE "django_celery_results_chordcounter" 
SET "group_id" = 'f81ac317-eb61-42a6-a750-7cc1feb78b8c', 
"sub_tasks" = '[[["6a2f07d8-6b6f-4286-9d71-dbead46e9b1c", null], null], ..', 
"count" = 122 
WHERE "django_celery_results_chordcounter"."id" = 118

If you are using a large number of sub-tasks, in the thousands or tens of thousands, then re-sending the full sub_tasks payload in every update can be very expensive.

If we use update_fields, we can skip this.
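
A minimal sketch of the idea (update_fields is the standard Django Model.save() API; on_subtask_done is a hypothetical call site for illustration, not the actual code path touched by this PR):

from django_celery_results.models import ChordCounter

def on_subtask_done(counter: ChordCounter) -> None:
    # Hypothetical call site: one sub-task of the chord has completed.
    counter.count -= 1
    # Before this change, a plain counter.save() re-sent every column,
    # including the large serialized sub_tasks JSON (the UPDATE above).
    # With update_fields, Django emits only:
    #   UPDATE "django_celery_results_chordcounter"
    #   SET "count" = ... WHERE "id" = ...
    counter.save(update_fields=["count"])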

orf (Contributor, Author) commented Oct 20, 2022

FYI: we deployed this change and saw an immediate drop in our write throughput:

[Screenshot: write throughput graph, 2022-10-20 15:15]

network traffic:

[Screenshot: network traffic graph, 2022-10-20 15:16]

and write IOPs.

auvipy (Member) commented Oct 20, 2022

can you confirm this won't create any regression?

orf (Contributor, Author) commented Oct 20, 2022

Is it ever possible to confirm that it won’t cause any regression?

All I can say is that the current code doesn’t need to update anything other than the count column, and that I’ve validated this on complex real-world workflows and it works as expected.

One of our larger chords results in a ~7 MB JSON string being updated every time save() is called, which may be multiple times per second. Our internal monitoring showed Postgres performing roughly 4,700 internal row updates and deletes to complete this, which puts significant stress on the various IO subsystems within PG.

Throughout the execution of all of these queries, the JSON string remained the same.
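
For anyone wanting to observe this kind of churn themselves, a generic sketch against Postgres's standard pg_stat_user_tables statistics view (this is not the internal monitoring referenced above, which isn't described in this thread):

from django.db import connection

# Cumulative row-update and dead-tuple counters for the chord counter table.
with connection.cursor() as cursor:
    cursor.execute(
        "SELECT n_tup_upd, n_tup_hot_upd, n_dead_tup "
        "FROM pg_stat_user_tables "
        "WHERE relname = 'django_celery_results_chordcounter'"
    )
    row = cursor.fetchone()
    if row:
        n_tup_upd, n_tup_hot_upd, n_dead_tup = row
        print(f"updates={n_tup_upd} hot={n_tup_hot_upd} dead={n_dead_tup}")

Sampling these counters before and after the update_fields change gives a rough picture of row-update churn; note that the heaviest cost of rewriting a ~7 MB value sits in TOAST and WAL traffic, which these per-table counters only partially capture.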

auvipy merged commit 945c009 into celery:master on Oct 21, 2022

auvipy (Member) commented Oct 21, 2022

yup legit, makes sense. thanks a lot

orf deleted the patch-1 branch on October 21, 2022 at 17:18
orf restored the patch-1 branch on October 21, 2022 at 17:18