
feat: contentTypeParser cache, prefer lru to fifo #5340

Closed

Conversation

gurgunday
Member

Why Fifo?

If there is a cache match, we should make that entry fresh again, as the chances of it being used again are now higher

@gurgunday gurgunday requested a review from kibertoad March 1, 2024 08:34
@mcollina
Member

mcollina commented Mar 1, 2024

Because you changed it: #5331

@kibertoad
Member

@gurgunday lru is only useful when you expect eviction due to exceeding the cache size; otherwise you are just paying overhead on every get for no reason.
Do we expect evictions here? From my understanding, we get a fairly limited number of entries

@gurgunday
Member Author

gurgunday commented Mar 1, 2024

No, I indeed changed it from Fifo to FifoMap, but here I'm proposing changing FifoMap to LruMap

For this part of the code, an Lru could be better, since a match would bump that contentType in the cache so that it stays longer before getting evicted
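
To make the proposal concrete, here is a minimal sketch of the bump-on-hit behavior using a plain `Map` (illustrative only, not Fastify's actual `FifoMap`/`LruMap` implementations):

```js
// Minimal LRU sketch: a Map preserves insertion order, so "oldest"
// is simply the first key. Names and sizes here are illustrative.
class LruMap {
  constructor (maxSize = 100) {
    this.maxSize = maxSize
    this.map = new Map()
  }

  get (key) {
    if (!this.map.has(key)) return undefined
    const value = this.map.get(key)
    // The bump: delete + set moves the entry to the "newest" end,
    // so it is evicted last. A FIFO would skip these two lines.
    this.map.delete(key)
    this.map.set(key, value)
    return value
  }

  set (key, value) {
    if (this.map.size >= this.maxSize && !this.map.has(key)) {
      // Evict the oldest entry (first in insertion order)
      this.map.delete(this.map.keys().next().value)
    }
    this.map.set(key, value)
  }
}
```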

@kibertoad
Member

@gurgunday lru doesn't affect ttl, it only switches eviction order. do we expect evictions?

@gurgunday
Member Author

> @gurgunday lru doesn't affect ttl, it only switches eviction order. do we expect evictions?

Yes, when I said it would stay longer, I meant longer before eviction

> @gurgunday lru is only useful when you expect eviction due to exceeding the cache size; otherwise you are just paying overhead on every get for no reason. Do we expect evictions here? From my understanding, we get a fairly limited number of entries

The cache can fill up fast for RegExp matches, but for string matches it actually doesn't do much, so maybe the cache should only be for RegExp. I'll take a look and see if I can come up with something better

@gurgunday gurgunday marked this pull request as draft March 1, 2024 10:06
@kibertoad
Member

@gurgunday we can have two separate caches for these cases: lru for regex and fifo for strings. That would have optimal runtime characteristics
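
A rough sketch of that split (hypothetical structure; `FifoMap` here would be the `LruMap` sketch above minus the bump in `get()`):

```js
// Hypothetical two-cache split, not actual Fastify code:
const stringCache = new FifoMap(100) // string prefixes: few and stable
const regexpCache = new LruMap(100)  // regexp hits: more varied, keep hot ones

function getCachedParser (contentType) {
  // check the cheap string cache first, then fall back to the regexp one
  return stringCache.get(contentType) ?? regexpCache.get(contentType)
}
```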

@gurgunday
Member Author

Actually, after a second look, there could be evictions because of string matches too:

```js
for (var i = 0; i !== this.parserList.length; ++i) {
  const parserListItem = this.parserList[i]
  // prefix match: the incoming Content-Type starts with a registered type
  if (contentType.indexOf(parserListItem) === 0) {
    const parser = this.customParsers.get(parserListItem)
    // cache under the full raw header string, not the registered prefix
    this.cache.set(contentType, parser)
    return parser
  }
}
```

Consider the following:

A client sends a POST request with the Content-Type header `application/json --;`; this will be matched to `application/json` and cached. Another client can then send `application/json ***` or `application/json; charset=utf-8`, and these will all be cached as separate entries
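
In code, the effect looks like this (a runnable sketch; `jsonParser` is a stand-in for the registered parser):

```js
const cache = new Map() // stand-in for the FifoMap/LruMap
const jsonParser = (req, body, done) => done(null, JSON.parse(body)) // illustrative

// Each raw header string becomes its own cache key, even though all three
// prefix-match the same registered 'application/json' parser:
for (const ct of [
  'application/json --;',
  'application/json ***',
  'application/json; charset=utf-8'
]) {
  cache.set(ct, jsonParser)
}
console.log(cache.size) // 3: three slots consumed for one logical parser
```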

Combined with the RegExp matches, I feel like bumping fits better, no?

Is bumping that expensive?

@kibertoad
Member

Not super expensive, but state mutation is state mutation.
How many different permutations do you expect there? I wonder if we can bump the cache size to 1000 entries (which is still not much memory) and expect not to get any evictions.

@gurgunday
Member Author

> How many different permutations do you expect there?

No idea 😁

This is a client-dependent area; in theory I could spam a server with different variations of `application/json` to cause evictions, but in a normal scenario inputs should be pretty consistent

I'll think about what the best way to handle this would be

@kibertoad
Member

We don't need to consider the malicious-agent scenario here; an eviction event is not expensive enough to qualify as a DDoS attack surface.

@gurgunday
Member Author

Should I bench?

In any case, this is on the path where the server receives requests, which is much rarer than sending responses anyway, so neither LRU nor FIFO will cause any meaningful bottleneck

Looking at the nature of this cache, I think an LRU makes more sense: why not push back the eviction of a Content-Type that was matched recently? An alternative is to increase the FIFO size, as @kibertoad said, but my problem isn't necessarily with the size

What does @fastify/core think?

@gurgunday
Member Author

I read a few things, looked at some practical benchmarks, and concluded that a FIFO is simply better unless there is a strong interest in bumping recent hits, as in rate-limit
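
For reference, this is the kind of quick check that points that way (an illustrative micro-benchmark of the per-hit bump cost, not the actual numbers behind the decision):

```js
// Compare a plain Map lookup (FIFO hit) with lookup + delete + set (LRU hit).
// Results are machine-dependent; run with Node.
const map = new Map()
for (let i = 0; i < 100; i++) map.set(`type/${i}`, i)

let start = process.hrtime.bigint()
for (let i = 0; i < 1e6; i++) {
  map.get(`type/${i % 100}`) // FIFO hit: lookup only
}
console.log('fifo get:', process.hrtime.bigint() - start, 'ns')

start = process.hrtime.bigint()
for (let i = 0; i < 1e6; i++) {
  const key = `type/${i % 100}`
  const value = map.get(key)
  map.delete(key) // LRU hit: lookup + bump
  map.set(key, value)
}
console.log('lru get:', process.hrtime.bigint() - start, 'ns')
```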

@gurgunday gurgunday closed this Mar 9, 2024
@kibertoad
Member

@gurgunday Do you think we may still benefit from increasing the cache size?

@gurgunday
Member Author

100 sounds reasonable to me... but maybe it is too little; some real-world usage data would be pretty helpful here

@gurgunday gurgunday deleted the use-map branch March 9, 2024 15:40