
Reduce excessive CPU usage when serializing breadcrumbs to disk #4181

Merged (20 commits from rz/fix/persisting-scope-observer into main) on Mar 14, 2025

Conversation

@romtsn (Member) commented Feb 19, 2025

📜 Description

  • Introduce QueueFile (vendored from https://github.com/square/tape), which lets us write breadcrumbs atomically one-by-one using RandomAccessFile under the hood
    • It already works as a ring buffer, but based on file size: the file grows automatically up to a certain limit, after which writes wrap around to the beginning of the file
    • I also modified it to support maxElements, so it works as a ring buffer based on the number of elements in the queue
  • Add a new resetCache method that cleans up the disk cache on every init (but after ANR processing is done), ensuring a clean state so we don't enrich events with outdated scope values
  • Do not check for file existence before deleting/writing files; the check was redundant and caused unnecessary I/O
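The element-capped ring-buffer behavior described above can be sketched in memory. This is a hypothetical class for illustration, not the vendored QueueFile (which persists entries to a file via RandomAccessFile); it only shows the "evict oldest when full" logic that the maxElements modification adds:

```java
import java.util.ArrayDeque;
import java.util.Deque;

/**
 * Minimal sketch of a FIFO capped by element count: once maxElements is
 * reached, adding a new entry evicts the oldest one, so the queue behaves
 * like a ring buffer over elements rather than over file bytes.
 */
final class ElementCappedQueue<T> {
  private final Deque<T> entries = new ArrayDeque<>();
  private final int maxElements;

  ElementCappedQueue(final int maxElements) {
    this.maxElements = maxElements;
  }

  void add(final T element) {
    if (entries.size() == maxElements) {
      entries.removeFirst(); // wrap around: drop the oldest element
    }
    entries.addLast(element);
  }

  int size() {
    return entries.size();
  }

  T oldest() {
    return entries.peekFirst();
  }
}
```

With a capacity of 2, adding "a", "b", "c" leaves ["b", "c"]: the queue never exceeds its element cap, matching the bounded-breadcrumbs behavior the PR wants.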

Before

[Screenshot 2025-02-17 at 16:51:15]

After

[Screenshot 2025-02-17 at 16:03:39]

Also posting a Perfetto trace query with ~50 addBreadcrumb calls to confirm that time spent in addBreadcrumb does not grow with the number of breadcrumbs added:

[Screenshot: Slack 2025-03-14 11:25:58]

💡 Motivation and Context

Closes #3168

💚 How did you test it?

manually + automated

📝 Checklist

  • I added tests to verify the changes.
  • No new PII added or SDK only sends newly added PII if sendDefaultPII is enabled.
  • I updated the docs if needed.
  • I updated the wizard if needed.
  • Review from the native team if needed.
  • No breaking change or entry added to the changelog.
  • No breaking change for hybrid SDKs or communicated to hybrid SDKs.

🔮 Next steps

Another potential improvement would be to batch breadcrumbs in memory first and flush them to disk every 1-2 seconds, reducing I/O even further. The trade-off is that we could lose some important breadcrumbs if an error occurs right within that window. For now I'd like to keep it as-is, but if we receive further reports about excessive I/O we can implement that improvement too. I'm leaving the code snippet here so we can come back to it later and just copy-paste it:

Code snippet for batched writes:

```java
  private @Nullable Future<?> breadcrumbsSerializerTask;
  private volatile @NotNull Queue<Breadcrumb> inMemoryCrumbs;

  public PersistingScopeObserver(final @NotNull SentryOptions options) {
    this.options = options;
    this.inMemoryCrumbs = createBreadcrumbsList(options.getMaxBreadcrumbs());
  }

  private static @NotNull Queue<Breadcrumb> createBreadcrumbsList(final int maxBreadcrumbs) {
    return maxBreadcrumbs > 0
        ? SynchronizedQueue.synchronizedQueue(new CircularFifoQueue<>(maxBreadcrumbs))
        : SynchronizedQueue.synchronizedQueue(new DisabledQueue<>());
  }

  @Override
  public void addBreadcrumb(final @NotNull Breadcrumb crumb) {
    inMemoryCrumbs.offer(crumb);
    // schedule a flush only if no flush is currently pending
    if (breadcrumbsSerializerTask == null
        || breadcrumbsSerializerTask.isCancelled()
        || breadcrumbsSerializerTask.isDone()) {
      breadcrumbsSerializerTask =
          serializeToDiskScheduled(
              () -> {
                try {
                  // drain the in-memory queue into the on-disk queue in one batch
                  Breadcrumb breadcrumb = inMemoryCrumbs.poll();
                  while (breadcrumb != null) {
                    diskCrumbs.getValue().add(breadcrumb);
                    breadcrumb = inMemoryCrumbs.poll();
                  }
                } catch (IOException e) {
                  options.getLogger().log(ERROR, "Failed to add breadcrumb to file queue", e);
                }
              });
    }
  }

  @Override
  public void setBreadcrumbs(final @NotNull Collection<Breadcrumb> breadcrumbs) {
    if (breadcrumbs.isEmpty()) {
      final Date now = DateUtils.getCurrentDateTime();
      serializeToDisk(
          () -> {
            try {
              // drop in-memory crumbs older than the clear request, then clear disk
              final Iterator<Breadcrumb> iterator = inMemoryCrumbs.iterator();
              while (iterator.hasNext()) {
                if (iterator.next().getTimestamp().before(now)) {
                  iterator.remove();
                }
              }
              diskCrumbs.getValue().clear();
            } catch (IOException e) {
              options.getLogger().log(ERROR, "Failed to clear breadcrumbs from file queue", e);
            }
          });
    }
  }

  private @Nullable Future<?> serializeToDiskScheduled(final @NotNull Runnable task) {
    if (!options.isEnableScopePersistence()) {
      return null;
    }
    if (Thread.currentThread().getName().contains("SentryExecutor")) {
      // we're already on the sentry executor thread, so we can just execute it directly
      try {
        task.run();
      } catch (Throwable e) {
        options.getLogger().log(ERROR, "Serialization task failed", e);
      }
      return null;
    }

    try {
      return options
          .getExecutorService()
          .schedule(
              () -> {
                try {
                  task.run();
                } catch (Throwable e) {
                  options.getLogger().log(ERROR, "Serialization task failed", e);
                }
              },
              1000);
    } catch (Throwable e) {
      options.getLogger().log(ERROR, "Serialization task could not be scheduled", e);
    }
    return null;
  }
```

As a result, we had just 2 I/O writes:
[Screenshot 2025-02-17 at 17:44:37]


@romtsn romtsn changed the title Rz/fix/persisting scope observer Reduce excessive CPU usage when serializing breadcrumbs to disk Feb 19, 2025
github-actions bot (Contributor) commented Feb 19, 2025

Performance metrics 🚀

|              | Plain     | With Sentry | Diff       |
|--------------|-----------|-------------|------------|
| Startup time | 399.11 ms | 422.08 ms   | 22.98 ms   |
| Size         | 1.58 MiB  | 2.21 MiB    | 646.66 KiB |

Previous results on branch: rz/fix/persisting-scope-observer

Startup times

| Revision | Plain     | With Sentry | Diff      |
|----------|-----------|-------------|-----------|
| 473dcc6  | 406.02 ms | 455.16 ms   | 49.14 ms  |
| d877760  | 672.94 ms | 788.46 ms   | 115.52 ms |
| ca480b4  | 389.82 ms | 456.52 ms   | 66.70 ms  |
| 247dd70  | 396.72 ms | 419.14 ms   | 22.42 ms  |

App size

| Revision | Plain    | With Sentry | Diff       |
|----------|----------|-------------|------------|
| 473dcc6  | 1.58 MiB | 2.21 MiB    | 645.19 KiB |
| d877760  | 1.58 MiB | 2.21 MiB    | 646.64 KiB |
| ca480b4  | 1.58 MiB | 2.21 MiB    | 645.20 KiB |
| 247dd70  | 1.58 MiB | 2.21 MiB    | 646.61 KiB |

@stefanosiano (Member) left a comment:

Just a question regarding setBreadcrumbs(), but it looks good.

@markushi (Member) left a comment:

Looks good to me! Am I right to assume that if someone upgrades the SDK and the device has pending ANRs, those ANRs won't be enriched, since the scope data is only available in the old format?

```diff
 if (breadcrumbs == null) {
   return;
 }
 if (event.getBreadcrumbs() == null) {
-  event.setBreadcrumbs(new ArrayList<>(breadcrumbs));
+  event.setBreadcrumbs(breadcrumbs);
```
Member:

Well spotted, nice!

@romtsn (Member, Author) commented Mar 14, 2025

> Looks good to me! Am I right to assume that if someone upgrades the SDK and the device has pending ANRs, those ANRs won't be enriched, since the scope data is only available in the old format?

Yes, that's right (only breadcrumbs would be missing, though)! But I think it's fine to accept that trade-off (and it's probably not a very common scenario to have an ANR in between app updates). I also left a comment:

```java
try {
  queueFile = new QueueFile.Builder(file).size(options.getMaxBreadcrumbs()).build();
} catch (IOException e) {
  // If the file is corrupted we simply delete it and try to create it again. We accept
  // the trade-off of losing breadcrumbs for ANRs that happened right before the app
  // received an update where the new format was introduced.
  file.delete();

  queueFile = new QueueFile.Builder(file).size(options.getMaxBreadcrumbs()).build();
}
```
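The delete-and-recreate recovery pattern above can be exercised generically. This is a hypothetical stand-in, not the real QueueFile: it treats any file missing a made-up "MAGIC" header as corrupted, which mimics what happens when a pre-upgrade cache file in the old format is opened by the new code:

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

final class RecreateOnCorruption {
  // Stand-in for QueueFile construction: "valid" files must start with a magic header.
  static String open(final File file) throws IOException {
    final String contents =
        new String(Files.readAllBytes(file.toPath()), StandardCharsets.UTF_8);
    if (!contents.startsWith("MAGIC")) {
      throw new IOException("corrupted header");
    }
    return contents;
  }

  // Stand-in for creating a fresh, empty queue file in the new format.
  static String create(final File file) throws IOException {
    Files.write(file.toPath(), "MAGIC".getBytes(StandardCharsets.UTF_8));
    return "MAGIC";
  }

  /** Try to open; on corruption (or old format), delete and recreate from scratch. */
  static String openOrRecreate(final File file) throws IOException {
    try {
      return open(file);
    } catch (IOException e) {
      // Accept losing the old contents in exchange for a working queue file.
      file.delete();
      return create(file);
    }
  }

  // Helper for demonstration: a temp file holding old-format (unreadable) data.
  static File corruptedTempFile() throws IOException {
    final File file = File.createTempFile("crumbs", ".bin");
    Files.write(file.toPath(), "old-format".getBytes(StandardCharsets.UTF_8));
    return file;
  }
}
```

Opening a file written in the old format fails the header check, so the file is deleted and recreated empty; this is the same trade-off the PR comment describes, applied to a toy format.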

@romtsn romtsn enabled auto-merge (squash) March 14, 2025 11:16
@romtsn romtsn merged commit b61429a into main Mar 14, 2025
34 checks passed
@romtsn romtsn deleted the rz/fix/persisting-scope-observer branch March 14, 2025 11:31
romtsn added a commit that referenced this pull request Mar 14, 2025
* WIP

* WIP

* Remove redundant line

* Add Tests

* api dump

* Formatting

* REset scope cache on new init

* Clean up

* Comment

* Changelog

* Workaround square/tape#173

* Add a comment to setBreadcrumbs

* Address PR review

* Update CHANGELOG.md
romtsn added a commit that referenced this pull request Mar 17, 2025
… (#4260)

Successfully merging this pull request may close these issues.

Reducing writes performed by the PersistingScopeObserver
3 participants