[IMPROVED] NumPending calculations and subject index memory in filestore #4960

derekcollison · 2024-01-16T14:11:07Z

Swapped out psim, previously a Go hash map, for our stree implementation.

Stree is an adaptive radix tree implementation used for storing and retrieving literal subjects. It also allows quick matching against wildcard subjects, which is its major design goal, along with using less memory in high subject cardinality situations.

This will be used in the filestore implementation to replace the PSIM hash map, which was fast at insert and lookup but suffered when filtering on wildcard subjects.

This is used specifically in NumPending calculations with a wildcard filter. Given that we push folks toward larger muxed streams with down-filtered consumers and/or mirrors, this was becoming a performance issue.
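To illustrate why a tree helps with wildcard filters, here is a minimal sketch. It is not the stree code from this PR: it uses a plain token-level trie rather than an adaptive radix tree, and `trieNode`, `insert`, `total`, `match` and `numPending` are hypothetical names chosen for the example. A flat hash map keyed by subject has to visit every key to answer a wildcard count, whereas a tree only descends into the branches the filter can match.

```go
package main

import (
	"fmt"
	"strings"
)

// trieNode is a hypothetical, simplified stand-in for a subject tree node:
// one level per subject token, not an adaptive radix layout.
type trieNode struct {
	children map[string]*trieNode
	count    uint64 // messages stored for the literal subject ending here
}

func newTrieNode() *trieNode {
	return &trieNode{children: make(map[string]*trieNode)}
}

// insert records one message for a literal subject such as "orders.eu.new".
func (n *trieNode) insert(subject string) {
	for _, tok := range strings.Split(subject, ".") {
		next, ok := n.children[tok]
		if !ok {
			next = newTrieNode()
			n.children[tok] = next
		}
		n = next
	}
	n.count++
}

// total sums the counts of n and everything below it.
func (n *trieNode) total() uint64 {
	sum := n.count
	for _, c := range n.children {
		sum += c.total()
	}
	return sum
}

// numPending counts messages whose subject matches filter, where "*" matches
// exactly one token and ">" matches one or more trailing tokens.
func (n *trieNode) numPending(filter string) uint64 {
	return n.match(strings.Split(filter, "."))
}

func (n *trieNode) match(filter []string) uint64 {
	if len(filter) == 0 {
		return n.count
	}
	var sum uint64
	switch filter[0] {
	case ">":
		// Everything strictly below this node matches.
		for _, c := range n.children {
			sum += c.total()
		}
	case "*":
		// Any single token at this level, then keep matching.
		for _, c := range n.children {
			sum += c.match(filter[1:])
		}
	default:
		// Literal token: only one branch is visited.
		if c, ok := n.children[filter[0]]; ok {
			sum = c.match(filter[1:])
		}
	}
	return sum
}

func main() {
	root := newTrieNode()
	for _, s := range []string{
		"orders.eu.new", "orders.eu.new", "orders.us.new",
		"orders.us.cancelled", "shipments.eu",
	} {
		root.insert(s)
	}
	fmt.Println(root.numPending("orders.*.new")) // 3
	fmt.Println(root.numPending("orders.>"))     // 4
	fmt.Println(root.numPending("shipments.eu")) // 1
}
```

In this sketch the `shipments` branch is never touched when counting `orders.>`, which is the property a flat map cannot offer.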

Signed-off-by: Derek Collison <derek@nats.io>


neilalexander

Overall looks good! A few minor things I noticed.

server/stree/node16.go

server/stree/node4.go

Signed-off-by: Derek Collison <derek@nats.io>

neilalexander

LGTM

Jarema

Still doing a deeper dive into some parts of the codebase and benchmarking it, but the code itself looks good!

LGTM!

server/stree/dump.go

Signed-off-by: Derek Collison <derek@nats.io>

Signed-off-by: Neil Twigg <neil@nats.io>

The `_pre` and `cparts` copies are more often than not unnecessary and result in potentially gigabytes of allocations, which can slow down functions like `NumPending`. Passing `pre` and `nparts` down recursively without copies is usually safe, even where they are appended to, because there is no concurrency and those appends do not modify the slice length back at the callsite. Instead we now reallocate only when `append` does so naturally (we've run out of capacity) or when we know specifically that we're modifying something in place (like in `matchParts`). This improves performance of things like `NumPending` with high subject cardinality. Signed-off-by: Neil Twigg <neil@nats.io>
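The point about appends not changing the slice length at the callsite can be seen in a small sketch. This is not the stree code: `walk` and `subjects` are hypothetical helpers, and the two-token alphabet is made up for the example. Each recursion level appends to `pre` without copying it first, and the caller's view of `pre` keeps its original length.

```go
package main

import "fmt"

// walk is a hypothetical recursive visitor. It appends to pre at every level
// without copying it first. append returns a new slice header, so the
// caller's pre keeps its own length even though the backing array is shared.
func walk(pre []byte, depth int, visit func(subject []byte)) {
	if depth == 0 {
		visit(pre)
		return
	}
	for _, tok := range []string{"a", "b"} {
		next := append(pre, tok...)
		if depth > 1 {
			next = append(next, '.')
		}
		walk(next, depth-1, visit)
	}
}

// subjects collects every subject produced below prefix. The callback copies
// the bytes (via string conversion) because the buffer is overwritten later.
func subjects(prefix string, depth int) []string {
	pre := make([]byte, 0, 16) // one allocation for the whole walk
	pre = append(pre, prefix...)
	var out []string
	walk(pre, depth, func(subject []byte) {
		out = append(out, string(subject))
	})
	if len(pre) != len(prefix) {
		panic("callsite length changed") // never happens
	}
	return out
}

func main() {
	fmt.Println(subjects("foo.", 2)) // [foo.a.a foo.a.b foo.b.a foo.b.b]
}
```

The sibling iterations overwrite the bytes past `len(pre)` in the shared backing array, which is safe here only because nothing runs concurrently and the callback does not retain the slice.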

When using `make(x, y, z)`, there is a heap escape due to the non-constant size/capacity. Try to stay on the stack instead, reducing GC pressure. Signed-off-by: Neil Twigg <neil@nats.io>
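A sketch of the two allocation styles, under the assumption stated in the commit message that a non-constant capacity forces the buffer onto the heap. The function names and the 256-byte size are hypothetical and not taken from the PR. Escape decisions depend on the compiler version and can be inspected with `go build -gcflags=-m`.

```go
package main

import "fmt"

// appendSubject joins tokens with '.' into buf and returns the result.
func appendSubject(buf []byte, tokens []string) []byte {
	for i, t := range tokens {
		if i > 0 {
			buf = append(buf, '.')
		}
		buf = append(buf, t...)
	}
	return buf
}

// subjectLenDynamic sizes its buffer at run time. The capacity is not a
// compile-time constant, which is the case the commit message describes
// as escaping to the heap.
func subjectLenDynamic(tokens []string, capHint int) int {
	buf := make([]byte, 0, capHint)
	buf = appendSubject(buf, tokens)
	return len(buf)
}

// subjectLenFixed slices a fixed-size array instead. The size is constant,
// so the compiler is free to keep the array on the stack. If a subject
// outgrows it, append reallocates naturally.
func subjectLenFixed(tokens []string) int {
	var backing [256]byte // hypothetical size chosen for this example
	buf := appendSubject(backing[:0], tokens)
	return len(buf)
}

func main() {
	tokens := []string{"orders", "eu", "new"}
	fmt.Println(subjectLenDynamic(tokens, 64)) // 13
	fmt.Println(subjectLenFixed(tokens))       // 13
}
```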

In #4974 I removed the preallocated buffers for `pre`, as the copies were unnecessary on every recursion. However, it turns out that a preallocation up front removes quite a few unnecessary allocations from subject construction, because the underlying memory gets reused throughout the iteration or match process, relieving further pressure on the GC. Signed-off-by: Neil Twigg <neil@nats.io>
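A rough sketch of what reusing one up-front buffer across an iteration can look like. `iter` and `collect` are hypothetical helpers, the entries are made up, and the 64-byte size is arbitrary; the real iteration code in the PR may be structured differently.

```go
package main

import "fmt"

// iter is a hypothetical iterator. It rebuilds every subject into a single
// buffer set up before the loop, instead of allocating one per entry.
// The slice passed to cb is only valid for the duration of the callback.
func iter(entries [][]string, cb func(subject []byte)) {
	var backing [64]byte // preallocated once, up front
	pre := backing[:0]
	for _, parts := range entries {
		pre = pre[:0] // reset the length, keep the memory
		for i, p := range parts {
			if i > 0 {
				pre = append(pre, '.')
			}
			pre = append(pre, p...)
		}
		cb(pre)
	}
}

// collect copies each subject out of the shared buffer so it can be retained.
func collect(entries [][]string) []string {
	var out []string
	iter(entries, func(subject []byte) {
		out = append(out, string(subject)) // string conversion copies
	})
	return out
}

func main() {
	entries := [][]string{{"orders", "eu"}, {"orders", "us"}, {"shipments"}}
	fmt.Println(collect(entries)) // [orders.eu orders.us shipments]
}
```

The trade-off is that a callback which wants to keep a subject must copy it, since the next entry overwrites the same bytes.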

Signed-off-by: Neil Twigg <neil@nats.io>

neilalexander

Otherwise LGTM!

server/stree/bench_test.go

…ing memory for the GC. The solution was to have a node return its children as a `[]node`. Since node256 is sparse, the upper layers need to check for nil, but this improved performance. Signed-off-by: Derek Collison <derek@nats.io>
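A minimal sketch of the shape described here, with a node handing back its children as a `[]node` and the caller skipping nil slots. The `node` interface, the `children` method name and `countLeaves` are assumptions made for the example; only the `node4` and `node256` names come from the PR.

```go
package main

import "fmt"

// node is a hypothetical interface for this sketch.
type node interface {
	isLeaf() bool
	children() []node
}

type leaf struct{ subject string }

func (l *leaf) isLeaf() bool     { return true }
func (l *leaf) children() []node { return nil }

// node4 keeps its children densely packed, so it can return a short slice.
type node4 struct {
	child [4]node
	size  int
}

func (n *node4) isLeaf() bool     { return false }
func (n *node4) children() []node { return n.child[:n.size] }

// node256 is indexed directly by byte value, so most slots are nil.
type node256 struct {
	child [256]node
}

func (n *node256) isLeaf() bool     { return false }
func (n *node256) children() []node { return n.child[:] }

// countLeaves walks the tree through children(). Slicing an array that the
// node already owns does not allocate; the caller handles the nil check
// that sparse nodes require.
func countLeaves(n node) int {
	if n.isLeaf() {
		return 1
	}
	total := 0
	for _, c := range n.children() {
		if c == nil {
			continue // empty slot in a sparse node
		}
		total += countLeaves(c)
	}
	return total
}

func main() {
	n4 := &node4{}
	n4.child[0] = &leaf{"foo.a"}
	n4.child[1] = &leaf{"foo.b"}
	n4.size = 2

	root := &node256{}
	root.child['f'] = n4
	root.child['z'] = &leaf{"z"}

	fmt.Println(countLeaves(root)) // 3
}
```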

wallyqs

LGTM

This updates the memory store to use the new subject tree from #4960 for per-subject tracking. Signed-off-by: Neil Twigg <neil@nats.io>

…ore (#4960) Swapped out psim, previously a Go hash map, for our stree implementation. Stree is an adaptive radix tree implementation used for storing and retrieving literal subjects. It also allows quick matching against wildcard subjects, which is its major design goal, along with using less memory in high subject cardinality situations. This will be used in the filestore implementation to replace the PSIM hash map, which was fast at insert and lookup but suffered when filtering on wildcard subjects. This is used specifically in NumPending calculations with a wildcard filter. Given that we push folks toward larger muxed streams with down-filtered consumers and/or mirrors, this was becoming a performance issue. Signed-off-by: Derek Collison <derek@nats.io> --------- Signed-off-by: Derek Collison <derek@nats.io> Signed-off-by: Neil Twigg <neil@nats.io> Co-authored-by: Neil Twigg <neil@nats.io>

derekcollison added 2 commits January 16, 2024 05:33

Swapped out psim as a hashmap for our stree impl.

a5290b3

Signed-off-by: Derek Collison <derek@nats.io>

derekcollison requested a review from a team as a code owner January 16, 2024 14:11

neilalexander reviewed Jan 17, 2024

Updates based on PR feedback

334206f

Signed-off-by: Derek Collison <derek@nats.io>

neilalexander approved these changes Jan 17, 2024

Jarema approved these changes Jan 18, 2024

server/stree/dump.go (outdated, resolved)

derekcollison and others added 8 commits January 18, 2024 02:59

Remove comment that is no longer applicable

c0c41be

Signed-off-by: Derek Collison <derek@nats.io>

stree: Reduce allocations in iter and match

8a0cbc6

Signed-off-by: Neil Twigg <neil@nats.io>

stree: Reduce heap escapes in iter

f40e99b

stree: Reduce heap escapes in iter (#4977)

6de76a4

Add BenchmarkSubjectTreeMatch

30baeba

Signed-off-by: Neil Twigg <neil@nats.io>

neilalexander approved these changes Jan 20, 2024

server/stree/bench_test.go (outdated, resolved)

derekcollison force-pushed the stree branch from 0770a03 to 90a8897 Compare January 20, 2024 18:26

derekcollison merged commit d9235ab into main Jan 20, 2024
4 checks passed

derekcollison deleted the stree branch January 20, 2024 18:59

wallyqs reviewed Jan 20, 2024

neilalexander mentioned this pull request Jan 22, 2024

Use stree for per-subject tracking in memory store #4983

Merged

derekcollison added a commit that referenced this pull request Jan 22, 2024

Use stree for per-subject tracking in memory store (#4983)

fd1e1e6

david-wakeo mentioned this pull request Feb 13, 2024

Replaying messages on a topic stream with a lot of message is very slow if they are at the end of the stream #5072

Closed
