Fix context propagation in Executor#execute() for non-capturing lambdas #9179

mateuszrzeszutek · 2023-08-10T10:48:22Z

This is an interesting issue that only occurs when using a non-capturing lambda (or any other singleton/cached Runnable instance), and only when Executor#execute() is used (since all other ExecutorService methods typically wrap the passed task in a new FutureTask and delegate to execute() anyway).
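To illustrate what "non-capturing" means here, the following is a minimal sketch (class and method names are made up for illustration). It assumes a HotSpot-style JVM, where a lambda that captures nothing is typically linked once per call site and the same object is handed out on every evaluation. The language specification does not guarantee either behavior.

```java
class LambdaIdentityDemo {
    // Non-capturing: the body references no locals, parameters, or 'this'.
    static Runnable nonCapturing() {
        return () -> System.out.println("hello");
    }

    // Capturing: the body closes over the 'name' parameter.
    static Runnable capturing(String name) {
        return () -> System.out.println("hello " + name);
    }

    public static void main(String[] args) {
        // On HotSpot the same call site typically yields the same instance.
        System.out.println(nonCapturing() == nonCapturing());
        // A capturing lambda is typically allocated anew on each evaluation.
        System.out.println(capturing("a") == capturing("a"));
    }
}
```

Passing the result of `nonCapturing()` to `Executor#execute()` several times therefore hands the executor the very same object each time, which is the situation described above.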

Because a singleton Runnable instance was passed, the whole machinery in ExecutorAdviceHelper and TaskAdviceHelper that handles a stateful virtual field completely broke when trying to correctly update the PropagatedContext, and sometimes overwrote the propagated context with a null.
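The failure mode can be modeled roughly as follows. This is a deliberately simplified, single-threaded sketch and not the actual ExecutorAdviceHelper or TaskAdviceHelper code: an identity map stands in for the virtual field, a plain String stands in for the propagated context, and all names are hypothetical.

```java
import java.util.ArrayList;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;

class SharedTaskStateDemo {
    // Stand-in for a per-task virtual field: one slot per Runnable instance.
    static final Map<Runnable, String> propagated = new IdentityHashMap<>();
    static final List<String> seen = new ArrayList<>();
    // Stand-in for "the current context" while a task runs.
    static String current;

    // One shared task instance, as with a cached non-capturing lambda.
    static final Runnable TASK = () -> seen.add(String.valueOf(current));

    // Models the submit side: remember the caller's context for this task.
    static void submit(Runnable task, String context) {
        propagated.put(task, context);
    }

    // Models the run side: take the stored context, clear the slot, run.
    static void run(Runnable task) {
        current = propagated.remove(task);
        try {
            task.run();
        } finally {
            current = null;
        }
    }

    static List<String> demo() {
        seen.clear();
        propagated.clear();
        submit(TASK, "request-A");
        submit(TASK, "request-B"); // same key: overwrites request-A
        run(TASK);                 // sees request-B
        run(TASK);                 // slot already cleared: sees null
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints [request-B, null]
    }
}
```

Because both submissions share one slot, the first context is lost and the second execution runs with no context at all.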

While this is a surgical fix that handles precisely the issue posted by the user, I believe we should rethink our executors instrumentation as a whole: I think instead of the complex and buggy state management we have right now we should switch to decoration whenever it's possible. While this will add a little bit of extra memory overhead, I think it's much safer that way.
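As a rough idea of what decoration means, here is a minimal sketch. It is not the agent's implementation: a ThreadLocal<String> stands in for the real context storage, and `ContextCarryingRunnable` is a hypothetical name. Each submission gets its own wrapper object, so the captured context no longer depends on the identity of the task.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

class DecorationSketch {
    // Stand-in for the real context storage.
    static final ThreadLocal<String> CONTEXT = new ThreadLocal<>();

    // Hypothetical wrapper: captures the context when created, restores it around run().
    static final class ContextCarryingRunnable implements Runnable {
        private final Runnable delegate;
        private final String captured;

        ContextCarryingRunnable(Runnable delegate) {
            this.delegate = delegate;
            this.captured = CONTEXT.get();
        }

        @Override
        public void run() {
            String previous = CONTEXT.get();
            CONTEXT.set(captured);
            try {
                delegate.run();
            } finally {
                CONTEXT.set(previous);
            }
        }
    }

    static List<String> demo() {
        List<String> seen = new CopyOnWriteArrayList<>();
        Runnable shared = () -> seen.add(CONTEXT.get()); // one instance, submitted twice
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            CONTEXT.set("request-A");
            pool.execute(new ContextCarryingRunnable(shared));
            CONTEXT.set("request-B");
            pool.execute(new ContextCarryingRunnable(shared));
        } finally {
            CONTEXT.remove();
            pool.shutdown();
        }
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return seen;
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints [request-A, request-B]
    }
}
```

The extra memory overhead mentioned above is the one wrapper allocation per submitted task.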

laurit · 2023-08-10T15:44:27Z

> While this is a surgical fix that handles precisely the issue posted by the user, I believe we should rethink our executors instrumentation as a whole: I think instead of the complex and buggy state management we have right now we should switch to decoration whenever it's possible. While this will add a little bit of extra memory overhead, I think it's much safer that way.

I think something failed with the decorator approach: Slick or some other Scala tests. If I remember correctly, the issue was that it is possible to supply a custom work queue to ThreadPoolExecutor, and they had used something that blew up on the wrapper types. I also believe that we should give wrappers another go; perhaps we can detect when there is a custom queue and just not instrument those executors by default.
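The comment does not say which queue was involved, so the following is only one plausible illustration of how a custom work queue can reject wrapper types. It assumes a PriorityBlockingQueue without a comparator, which requires its elements to be Comparable. The class names are hypothetical, and the queue is exercised directly rather than through a ThreadPoolExecutor to keep the result deterministic.

```java
import java.util.concurrent.PriorityBlockingQueue;

class CustomQueueDemo {
    // A task type that the priority queue knows how to order.
    static final class PrioritizedTask implements Runnable, Comparable<PrioritizedTask> {
        final int priority;

        PrioritizedTask(int priority) {
            this.priority = priority;
        }

        @Override
        public void run() {}

        @Override
        public int compareTo(PrioritizedTask other) {
            return Integer.compare(priority, other.priority);
        }
    }

    // Hypothetical plain wrapper: it hides the delegate's Comparable implementation.
    static final class Wrapper implements Runnable {
        final Runnable delegate;

        Wrapper(Runnable delegate) {
            this.delegate = delegate;
        }

        @Override
        public void run() {
            delegate.run();
        }
    }

    // Returns true if a comparator-less PriorityBlockingQueue accepts both tasks.
    static boolean queueAccepts(Runnable first, Runnable second) {
        PriorityBlockingQueue<Runnable> queue = new PriorityBlockingQueue<>();
        try {
            queue.offer(first);
            queue.offer(second);
            return true;
        } catch (ClassCastException e) {
            // The queue casts elements to Comparable; the wrapper is not one.
            return false;
        }
    }

    static boolean unwrappedAccepted() {
        return queueAccepts(new PrioritizedTask(1), new PrioritizedTask(2));
    }

    static boolean wrappedAccepted() {
        return queueAccepts(new Wrapper(new PrioritizedTask(1)), new Wrapper(new PrioritizedTask(2)));
    }

    public static void main(String[] args) {
        System.out.println(unwrappedAccepted()); // prints true
        System.out.println(wrappedAccepted());   // prints false
    }
}
```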

xiangtianyu · 2023-08-11T02:15:43Z

...src/main/java/io/opentelemetry/javaagent/bootstrap/executors/ContextPropagatingRunnable.java

+    // We wrap only lambdas' anonymous classes and if given object has not already been wrapped.
+    // Anonymous classes have '/' in class name which is not allowed in 'normal' classes.
+    // note: it is always safe to decorate lambdas since downstream code cannot be expecting a specific runnable implementation anyways
+    return task.getClass().getName().contains("/") && !(task instanceof ContextPropagatingRunnable);


This can handle the "lambda" situation, but how about the singleton task?

mateuszrzeszutek · 2023-08-11

This PR does not fix that. We're planning to redesign the executors instrumentation and fix that issue across the board, but that might take a while and will not be included in the next release (which is in about 5 days).
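The exchange above can be made concrete with a small sketch of the class-name check from the diff (without the `instanceof` guard). The class and method names are hypothetical. It assumes a HotSpot-style JVM, where the generated class of a lambda has a '/' in its name. Anonymous classes and named singleton tasks do not, so a check like this leaves them unwrapped.

```java
class WrapCheckDemo {
    // Same name-based predicate as in the diff above, minus the instanceof guard.
    static boolean looksLikeLambda(Runnable task) {
        return task.getClass().getName().contains("/");
    }

    // A singleton task implemented as a named class (an enum singleton here).
    enum SingletonTask implements Runnable {
        INSTANCE;

        @Override
        public void run() {}
    }

    static Runnable lambda() {
        return () -> {};
    }

    static Runnable anonymous() {
        return new Runnable() {
            @Override
            public void run() {}
        };
    }

    public static void main(String[] args) {
        System.out.println(lambda().getClass().getName());     // contains '/' on HotSpot
        System.out.println(anonymous().getClass().getName());  // WrapCheckDemo$1
        System.out.println(looksLikeLambda(lambda()));               // true on HotSpot
        System.out.println(looksLikeLambda(anonymous()));            // false
        System.out.println(looksLikeLambda(SingletonTask.INSTANCE)); // false
    }
}
```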

Fix context propagation in Executor#execute() for non-capturing lambdas

b87e3a3

mateuszrzeszutek requested a review from a team as a code owner August 10, 2023 10:48

laurit approved these changes Aug 10, 2023

trask approved these changes Aug 10, 2023

Update instrumentation/executors/bootstrap/src/main/java/io/opentelem…

09baafe

…etry/javaagent/bootstrap/executors/ContextPropagatingRunnable.java Co-authored-by: Trask Stalnaker <trask.stalnaker@gmail.com>

tylerbenson approved these changes Aug 10, 2023

xiangtianyu reviewed Aug 11, 2023

spotless

6d61f6e

trask merged commit 32c5d4c into open-telemetry:main Aug 11, 2023
45 checks passed

breedx-splk pushed a commit to breedx-splk/opentelemetry-java-instrumentation that referenced this pull request Aug 15, 2023

Fix context propagation in Executor#execute() for non-capturing lambd…

e46ff18

…as (open-telemetry#9179) Co-authored-by: Trask Stalnaker <trask.stalnaker@gmail.com>

mateuszrzeszutek deleted the fix-non-capturing-lambda-execute branch August 28, 2023 12:52

mateuszrzeszutek mentioned this pull request Aug 28, 2023

Rewrite parts of the executors instrumentation to wrap Runnables #9324

Open
