Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: Add environment setup and notebook execution guides for Presidio + Spark in Fabric #1529

Merged
merged 11 commits into from
Mar 6, 2025

Conversation

ShakutaiGit
Copy link
Collaborator

@ShakutaiGit ShakutaiGit commented Feb 23, 2025

Description

This PR adds concise documentation for setting up a custom Fabric environment and executing the presidio_and_spark.ipynb notebook for PII detection and anonymization. It covers:

  • Creating a custom environment with Presidio and SpaCy dependencies.
  • Installing/using both small (en_core_web_md) and large (en_core_web_lg) models.
  • Configuring parameters, detecting/anonymizing PII, and writing results to Delta.

Checklist

Sorry, something went wrong.

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
@ShakutaiGit ShakutaiGit changed the title docs: add environment setup and notebook execution samples for Fabric docs: Add environment setup and notebook execution guides for Presidio + Spark in Fabric Feb 25, 2025
@ShakutaiGit ShakutaiGit marked this pull request as ready for review February 25, 2025 12:40
@ShakutaiGit ShakutaiGit self-assigned this Feb 25, 2025
Copy link
Contributor

@omri374 omri374 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work!!! This is very helpful for fabric users.

@ShakutaiGit ShakutaiGit requested a review from omri374 March 4, 2025 14:39
@ShakutaiGit ShakutaiGit requested a review from SharonHart March 5, 2025 15:40
SharonHart
SharonHart previously approved these changes Mar 5, 2025
@ShakutaiGit ShakutaiGit removed the request for review from navalev March 6, 2025 08:05
@omri374 omri374 merged commit 8b288fa into main Mar 6, 2025
10 checks passed
@omri374 omri374 deleted the fabric_sample branch March 6, 2025 11:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants