Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Compare Policies #184

Draft
wants to merge 7 commits into
base: main
Choose a base branch
from

Conversation

aliberts
Copy link
Collaborator

@aliberts aliberts commented May 15, 2024

What this does

Adds compare_policies.py script to compare performances between different versions of a policy using statistical tests and help assess performance gains/loss based on the info_eval.json files produced during eval.

How it was tested

TODO

How to checkout & try? (for the reviewer)

TODO


This change is Reviewable

@aliberts aliberts self-assigned this May 15, 2024
@aliberts aliberts changed the title WIP add score tests [WIP] Compare Policies May 15, 2024
@aliberts aliberts added ✨ Enhancement New feature or request 🧠 Policies Something policies-related ✅ Tests Adds or modifies testing labels May 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
✨ Enhancement New feature or request 🧠 Policies Something policies-related ✅ Tests Adds or modifies testing
Projects
Status: Todo next
Development

Successfully merging this pull request may close these issues.

None yet

1 participant