Upgrade compare-metadata function #778

KaraMelih · 2023-11-27T15:12:47Z

What is the problem / what does the code in this PR do

The previous version of st.compare_metadata() accepted one runid+target pair and one already existing metadata JSON file, as pointed out in #770 by @WenzDaniel.
This upgrade accepts two inputs for the comparison, each of which is either a runid+target pair or existing metadata, given as a path to a JSON file or as an already loaded dictionary. If the data is not stored anywhere (e.g. peak positions), it compares only the lineages instead of doing a full metadata comparison.
For convenience, it also returns the extracted metadata and lineage of both inputs if return_results=True is passed.

Can you briefly describe how it works?

You get your favorite context and call the method:

context.compare_metadata(("053877", "peak_basics"), ("053877", "peak_positions"))
context.compare_metadata(("053875", "peak_basics"), ("053877", "peak_basics"))
context.compare_metadata(("053877", "peak_basics"), "./my_path_to/JSONfile.json")
first_metadata = context.get_metadata(run_id, "events")
context.compare_metadata(("053877", "peak_basics"), first_metadata)
context.compare_metadata(("053877", "records"), ("053899", "records"))
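
A minimal sketch of how the three accepted input forms could be told apart. This is not the code merged in this PR: `resolve_metadata` and the `load_stored` callback are hypothetical names, and `load_stored` only stands in for the context lookup, returning None when nothing is stored for the pair.

```python
import json
from typing import Callable, Optional, Union

# Hypothetical alias: a (run_id, target) pair, a path to a JSON file, or a loaded dict.
MetadataInput = Union[tuple, list, str, dict]


def resolve_metadata(
    data: MetadataInput,
    load_stored: Callable[[str, str], Optional[dict]],
) -> Optional[dict]:
    """Turn one compare_metadata-style argument into a metadata dict.

    `load_stored` is an assumed stand-in for the context lookup; it returns
    None when the metadata for (run_id, target) is not stored anywhere.
    """
    if isinstance(data, dict):
        # Already loaded metadata: use it as-is.
        return data
    if isinstance(data, str):
        # A string is treated as a path to a metadata JSON file.
        with open(data) as f:
            return json.load(f)
    if isinstance(data, (tuple, list)) and len(data) == 2:
        # A (run_id, target) pair: delegate to the lookup.
        run_id, target = data
        return load_stored(run_id, target)
    raise ValueError(f"Cannot interpret {data!r} as a metadata source")


# Usage with a fake in-memory store instead of a real context.
fake_store = {("053877", "peak_basics"): {"lineage": {}}}


def lookup(run_id, target):
    return fake_store.get((run_id, target))


print(resolve_metadata(("053877", "peak_basics"), lookup))
print(resolve_metadata(("053877", "peak_positions"), lookup))
```

When the lookup returns None, a caller would fall back to comparing lineages only, as described above.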

Can you give a minimal working example (or illustrate with a figure)?

import cutax
st = cutax.contexts.xenonnt_online(_rucio_local_path='/project/lgrandi/rucio', include_rucio_local=True, include_rucio_remote=True)
st.compare_metadata(("053875", "peak_basics"), ("053877", "peak_basics"))

Please include the following if applicable:

Update the docstring(s)
Update the documentation
Tests to check the (new) code is working as desired.
Does it solve one of the open issues on github? Solves Extend compare_metadata #770

Please make sure that all automated tests have passed before asking for a review (you can save the PR as a draft otherwise).

coveralls · 2023-11-27T15:36:01Z

coverage: 91.35% (-0.2%) from 91.586%
when pulling 816f26d on KaraMelih:master
into 70f69c8 on AxFoundation:master.

KaraMelih · 2023-11-27T16:25:02Z

@WenzDaniel could you help me understand this failing test?
https://www.codefactor.io/repository/github/axfoundation/strax/pull/778
It refers to two definitions that do not come from this PR but already existed.

WenzDaniel · 2023-12-14T08:56:33Z

Hej @KaraMelih, thank you very much for the update. I think your PR goes in a very different direction than I had in mind. The issue we had is that when you compare the metadata of the currently used context with some old stored data, the comparison only works if the metadata for your current context is also stored somewhere. However, I think most of the time the use case is the other way around: I have a context that cannot load the data anymore, and I would like to understand why my context differs from the one used for the stored data. So simply adding a try/except would already have been sufficient. Sorry for all the extra work. I will look into your PR now.
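
To illustrate the use case, here is a rough sketch of finding which lineage entries differ between a current context and stored data. It assumes a lineage is a plain dict mapping each data type to a list of plugin name, version and config; `differing_plugins` is a hypothetical helper, not part of strax, and the example values are made up.

```python
def differing_plugins(lineage_a: dict, lineage_b: dict) -> dict:
    """Return {data_type: (entry_a, entry_b)} for every lineage entry that differs.

    Entries present on only one side are reported with None on the other.
    """
    differences = {}
    for data_type in sorted(set(lineage_a) | set(lineage_b)):
        entry_a = lineage_a.get(data_type)
        entry_b = lineage_b.get(data_type)
        if entry_a != entry_b:
            differences[data_type] = (entry_a, entry_b)
    return differences


# Made-up lineages: only the "records" config differs.
current = {
    "records": ["Records", "0.1.0", {"threshold": 7}],
    "peaks": ["Peaks", "0.2.0", {}],
}
stored = {
    "records": ["Records", "0.1.0", {"threshold": 5}],
    "peaks": ["Peaks", "0.2.0", {}],
}

for data_type, (mine, theirs) in differing_plugins(current, stored).items():
    print(data_type, "current:", mine, "stored:", theirs)
```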

WenzDaniel

Thanks @KaraMelih, works like a charm. Maybe you can extend the docstring a little and then we can merge.

WenzDaniel · 2023-12-14T09:36:05Z

strax/context.py

@@ -1844,30 +1844,89 @@ def get_meta(self, run_id, target) -> dict:

    get_metadata = get_meta

-    def compare_metadata(self, run_id, target, old_metadata):
+    def compare_metadata(self, data1, data2, return_results=False):
        """Compare the metadata between two strax data.


Can you add an explanation of the comparison printout to the docstring? I was a little confused about the direction in which things are compared: is red the first and green the second file?
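
For reference, a small sketch of the usual diff convention using only the standard library. It does not reproduce the printout of compare_metadata and does not answer the colour question for this PR; it only shows that in a unified diff, lines starting with "-" exist only in the first input and lines starting with "+" only in the second. `lineage_diff` and the labels `data1`/`data2` are names chosen for illustration.

```python
import difflib
import json


def lineage_diff(first: dict, second: dict) -> list:
    """Return unified-diff lines between two dicts, serialised as sorted JSON."""
    first_lines = json.dumps(first, indent=2, sort_keys=True).splitlines()
    second_lines = json.dumps(second, indent=2, sort_keys=True).splitlines()
    return list(
        difflib.unified_diff(
            first_lines,
            second_lines,
            fromfile="data1",  # label for the first input
            tofile="data2",  # label for the second input
            lineterm="",
        )
    )


for line in lineage_diff({"version": "0.1.0"}, {"version": "0.2.0"}):
    print(line)
```

Stating the direction explicitly in the docstring, in whatever form the real printout uses, would remove the ambiguity raised here.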

WenzDaniel · 2023-12-14T09:38:45Z

strax/context.py

-        :param target: data type to get
-        :param old_metadata: path to metadata to compare, or a dictionary, or a tuple with another
-            run_id, target to compare against the metadata of the first id-target pair
+        :param data1, data2: either a list (tuple) of runid + target pair, or path to metadata to


Can you add the examples you provided here on GitHub to your docstring? Then I think it becomes clearer, like:

"""
doc-string so far

Examples:
    example code 1
    example code 2
"""

WenzDaniel · 2023-12-14T09:39:50Z

strax/context.py

+                    metadata["lineage"] if _is_stored else self.key_for(run_id, target).lineage
+                )
+                _lineage_hash = str(self.key_for(run_id, target))
+                print(_lineage_hash)


Remove print statement?

WenzDaniel · 2023-12-14T09:41:25Z

strax/context.py

+                _is_stored = self.is_stored(run_id, target)
+                metadata = self.get_metadata(run_id, target) if _is_stored else None


Here you could have used a

try:
    self.get_metadata(...)
except strax.DataNotStored:
    ...

(or whatever exception we raise in case it is not stored). Just as information for the future.
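
A self-contained sketch of that pattern, for illustration only. The exception class below is a hypothetical stand-in defined locally, since the comment leaves open which exception is actually raised; `get_metadata_or_none` and `fake_get_metadata` are also made-up names.

```python
class DataNotStored(Exception):
    """Hypothetical stand-in for whatever is raised when data is not stored."""


def get_metadata_or_none(get_metadata, run_id, target):
    """Return the metadata, or None if the lookup reports that nothing is stored.

    Asking for the metadata directly and handling the failure avoids a separate
    is_stored() check before the lookup.
    """
    try:
        return get_metadata(run_id, target)
    except DataNotStored:
        return None


# Fake lookup used only to exercise the helper.
_fake_store = {("053877", "peak_basics"): {"lineage": {}}}


def fake_get_metadata(run_id, target):
    try:
        return _fake_store[(run_id, target)]
    except KeyError:
        raise DataNotStored(f"{run_id}/{target} is not stored") from None


print(get_metadata_or_none(fake_get_metadata, "053877", "peak_basics"))
print(get_metadata_or_none(fake_get_metadata, "053877", "peak_positions"))
```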

WenzDaniel · 2023-12-14T11:32:37Z

Btw, you can ignore the failing CodeFactor check.

KaraMelih and others added 15 commits November 25, 2022 13:02

add a metadata comparison method

ed3eb07

Merge branch 'master' into master

c641c0d

Merge branch 'master' into master

e70bcb9

Merge branch 'master' into master

2005b92

Add test_compare_metadata

b2b76ea

Minor change

f46af0f

Add deepdiff to requirements-tests.txt

82e8828

update metadata comparison

aa995c0

Merge remote-tracking branch 'origin/master'

64783c4

# Conflicts: # extra_requirements/requirements-tests.txt # strax/context.py # strax/utils.py # tests/test_context.py

update metadata comparison

7e62038

update metadata comp test

da5a6a8

[pre-commit.ci] auto fixes from pre-commit.com hooks

5ad3efe

for more information, see https://pre-commit.ci

fix a typo

25d2ff7

Merge remote-tracking branch 'origin/master'

c2a9267

fix external function call

b955df9

codefactor improvements

c65585a

WenzDaniel previously approved these changes Dec 14, 2023

Merge branch 'master' into master

61aeaf4

update docstring

b2522b1

KaraMelih dismissed WenzDaniel’s stale review via b2522b1 December 19, 2023 08:14

Merge branch 'master' into master

1d37198

WenzDaniel reviewed Dec 19, 2023

WenzDaniel and others added 2 commits December 19, 2023 10:06

Update strax/context.py

957164e

[pre-commit.ci] auto fixes from pre-commit.com hooks

816f26d

for more information, see https://pre-commit.ci

WenzDaniel approved these changes Dec 19, 2023

WenzDaniel merged commit f7e0dd3 into AxFoundation:master Dec 19, 2023
