
community[patch]: Graceful handling of redis errors in RedisCache and AsyncRedisCache #17171

Merged 2 commits into langchain-ai:master on Feb 21, 2024

Conversation

snsten
Contributor

@snsten snsten commented Feb 7, 2024

  • Description:
    The existing `RedisCache` implementation lacks proper handling for redis client failures, such as `ConnectionRefusedError`, leading to subsequent failures in pipeline components like LLM calls. This pull request improves error handling for redis client issues, ensuring more robust and graceful handling of such errors.

  • Issue: Fixes *RedisCache doesn't handle errors from redis* #16866

  • Dependencies: No new dependency

  • Twitter handle: N/A
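The approach the description outlines, catching redis client errors during a cache lookup so the LLM call proceeds uncached, can be sketched roughly as below. This is a simplified illustration, not the PR's actual code: the class name, key scheme, and return shape are assumptions for the sketch.

```python
import logging

logger = logging.getLogger(__name__)


class GracefulRedisCache:
    """Illustrative sketch: a cache lookup that degrades gracefully.

    Any redis client failure (e.g. ConnectionRefusedError) is logged
    instead of propagating, so the caller sees a cache miss and the
    LLM call proceeds uncached.
    """

    def __init__(self, redis_client):
        self.redis = redis_client

    def lookup(self, prompt: str, llm_string: str):
        key = f"{prompt}:{llm_string}"  # assumed key scheme for the sketch
        try:
            results = self.redis.hgetall(key)
        except Exception as e:  # redis down, timeout, connection refused...
            logger.error("Redis lookup failed: %s", e)
            return None  # treated as a cache miss
        if not results:
            return None
        return [text for _, text in sorted(results.items())]
```

The point of the sketch is the except branch: a broken cache backend is reported loudly but never propagated up the pipeline.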


@dosubot dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. Ɑ: memory Related to memory module 🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature 🔌: redis Primarily related to Redis integrations labels Feb 7, 2024
```python
                generations.append(Generation(text=text))
        # Gotta catch'em all
        except Exception as e:
            logger.warning(f"Redis lookup failed: {e}")
```
Collaborator

Isn't this an error-level issue rather than a warning?

Contributor Author

Yes, I'll change the log level to error.

```python
            # In a previous life we stored the raw text directly
            # in the table, so assume it's in that format.
            generations.append(Generation(text=text))
        try:
```
Collaborator

There are two other reasonable behaviors.

  1. Retry once prior to giving up
  2. Circuit breaker pattern (out of scope)

What are your thoughts on these?

Contributor Author

  1. Adding a retry with some timeout makes sense.
  2. About a circuit breaker for the long-term failure scenario:
    • We keep logging errors while redis is down, until the circuit opens.
    • When the circuit is open, do the error logs continue for redis failures during that period?
      Or
    • Once the circuit is open, do we log an error saying the circuit is open and redis is not being used?

Ideally it would be better to continue showing error logs, since API costs are incurred without the cache; the service user should be aware of the redis failure even during the open-circuit state caused by continued failures.

I can start with the retry, close this, and open a new PR for the circuit breaker considering all the scenarios, or do it here. Let me know what you suggest.

Collaborator

I think users would appreciate being able to configure the retry behavior since a retry involves some additional latency and isn't guaranteed to pay off.

Would you be able to suggest a parameterization for the initializer that you think would make sense to configure the retry behavior?
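One possible parameterization, purely a sketch in response to the question above (none of these names appear in the PR), would expose the number of retries and a backoff delay, defaulting to no retry at all:

```python
import logging
import time

logger = logging.getLogger(__name__)


def lookup_with_retry(fetch, *, retries: int = 0, backoff_seconds: float = 0.1):
    """Illustrative retry helper: call `fetch()` up to `retries + 1` times.

    `retries=0` (the suggested default) means a single attempt with no
    retry. Each failed attempt is logged at error level; a fixed backoff
    separates attempts. If every attempt fails, behave like a cache miss.
    """
    attempts = retries + 1
    for attempt in range(1, attempts + 1):
        try:
            return fetch()
        except Exception as e:
            logger.error("Redis attempt %d/%d failed: %s", attempt, attempts, e)
            if attempt < attempts:
                time.sleep(backoff_seconds)
    return None  # all attempts failed: caller treats this as a cache miss
```

In a cache initializer, `retries` and `backoff_seconds` would become constructor arguments, which keeps the added latency opt-in as discussed above.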


For a circuit breaker, I think it probably makes sense to log periodically, but not on every request. We should probably figure out a separate way to tackle this. I'd appreciate it if we could design the circuit breaker as a higher-level primitive that accepts a cache instance and dresses it up with circuit-breaker behavior.


cc @cbornet / @dzmitry-kankalovich tagging you in case this is of interest for you
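The higher-level primitive suggested here, a circuit breaker that wraps any cache instance, might look roughly like the following. This is a hypothetical sketch, not part of the PR; the class name, thresholds, and `lookup` signature are all assumptions.

```python
import logging
import time

logger = logging.getLogger(__name__)


class CircuitBreakerCache:
    """Hypothetical wrapper: opens the circuit after repeated failures.

    While the circuit is open, lookups short-circuit to a cache miss
    without touching the wrapped cache; after `reset_seconds` the
    circuit half-opens and the underlying cache is tried again.
    """

    def __init__(self, inner, failure_threshold: int = 3, reset_seconds: float = 30.0):
        self.inner = inner
        self.failure_threshold = failure_threshold
        self.reset_seconds = reset_seconds
        self._failures = 0
        self._opened_at = None

    def lookup(self, prompt: str, llm_string: str):
        if self._opened_at is not None:
            if time.monotonic() - self._opened_at < self.reset_seconds:
                logger.error("Circuit open: skipping redis lookup")
                return None
            # Half-open: allow one attempt against the backend again.
            self._opened_at = None
            self._failures = 0
        try:
            result = self.inner.lookup(prompt, llm_string)
        except Exception as e:
            self._failures += 1
            logger.error("Cache failure %d: %s", self._failures, e)
            if self._failures >= self.failure_threshold:
                self._opened_at = time.monotonic()
            return None
        self._failures = 0  # any success closes the failure streak
        return result
```

Because the wrapper only needs `lookup`, it composes with any cache implementation rather than being tied to redis, which matches the "higher-level primitive" framing above.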

Contributor

I think even as-is the current change set is great, because cache-layer IO should not break LLM calls, in my opinion.

Taking it further, I do agree that more configuration around it would be nice, like whether it should error out or not (default "not"?). And if a retry is to be added, then it definitely needs configuration as well (which I'd probably set to no retry by default; as you rightly mentioned, with cache retries you need to be sure that's what you want, since they can defeat the purpose of a cache in the first place).

And yes, a general circuit breaker for the cache interface would be great, but that is probably a substantial changeset, and I don't know if @snsten would go for it.

Collaborator

Agree on all points!

@snsten do you want to proceed with the PR as is and just update `logger.warning` -> `logger.error`? This will probably spam someone's logs at some point, but at least their service will stay up.

And if you want to tackle more configuration separately that will be great (or a circuit breaker).

Contributor Author

OK, I'll change only the log level to error in this PR, for the two classes mentioned.

Collaborator

Thank you!

@dzmitry-kankalovich
Contributor

I'd also add it to the `AsyncRedisCache` class (it should pop up in your branch once you rebase onto the latest master).

@dosubot dosubot bot removed the size:M This PR changes 30-99 lines, ignoring generated files. label Feb 15, 2024
@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Feb 15, 2024
@efriis efriis self-assigned this Feb 15, 2024
@snsten snsten closed this Feb 15, 2024
@dosubot dosubot bot added size:XS This PR changes 0-9 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Feb 15, 2024
@snsten
Contributor Author

snsten commented Feb 15, 2024

Added the error logs for RedisCache and AsyncRedisCache classes.

@snsten snsten reopened this Feb 15, 2024
@dosubot dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Feb 15, 2024
@dzmitry-kankalovich
Contributor

LGTM 👍

@eyurtsev eyurtsev changed the title community: Graceful handling of redis errors community[patch]: Graceful handling of redis errors in RedisCache and AsyncRedisCache Feb 21, 2024
```python
            # In a previous life we stored the raw text directly
            # in the table, so assume it's in that format.
            generations.append(Generation(text=text))
        try:
```
Collaborator

Thank you!

@dosubot dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Feb 21, 2024
@eyurtsev eyurtsev removed the Ɑ: memory, 🤖:bug, and partner labels Feb 21, 2024
@eyurtsev eyurtsev removed the template label Feb 21, 2024
@eyurtsev eyurtsev merged commit 8381f85 into langchain-ai:master Feb 21, 2024
58 checks passed
k8si pushed a commit to Mozilla-Ocho/langchain that referenced this pull request Feb 22, 2024
… AsyncRedisCache (langchain-ai#17171)

- **Description:**
The existing `RedisCache` implementation lacks proper handling for redis
client failures, such as `ConnectionRefusedError`, leading to subsequent
failures in pipeline components like LLM calls. This pull request aims
to improve error handling for redis client issues, ensuring a more
robust and graceful handling of such errors.

  - **Issue:**  Fixes langchain-ai#16866
  - **Dependencies:** No new dependency
  - **Twitter handle:** N/A

Co-authored-by: snsten <>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
al1p pushed a commit to al1p/langchain that referenced this pull request Feb 27, 2024
… AsyncRedisCache (langchain-ai#17171)

- **Description:**
The existing `RedisCache` implementation lacks proper handling for redis
client failures, such as `ConnectionRefusedError`, leading to subsequent
failures in pipeline components like LLM calls. This pull request aims
to improve error handling for redis client issues, ensuring a more
robust and graceful handling of such errors.

  - **Issue:**  Fixes langchain-ai#16866
  - **Dependencies:** No new dependency
  - **Twitter handle:** N/A

Co-authored-by: snsten <>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
haydeniw pushed a commit to haydeniw/langchain that referenced this pull request Feb 27, 2024
… AsyncRedisCache (langchain-ai#17171)

- **Description:**
The existing `RedisCache` implementation lacks proper handling for redis
client failures, such as `ConnectionRefusedError`, leading to subsequent
failures in pipeline components like LLM calls. This pull request aims
to improve error handling for redis client issues, ensuring a more
robust and graceful handling of such errors.

  - **Issue:**  Fixes langchain-ai#16866
  - **Dependencies:** No new dependency
  - **Twitter handle:** N/A

Co-authored-by: snsten <>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
Labels
lgtm PR looks good. Use to confirm that a PR is ready for merging. 🔌: redis Primarily related to Redis integrations size:M This PR changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

RedisCache doesn't handle errors from redis.
4 participants