Show HN: MaskWise: Redact, mask, and anonymize data in training files for LLMs https://ift.tt/NU4zJPK
Show HN: MaskWise: Redact, mask, and anonymize data in training files for LLMs If you’re working with LLM training data (like I often am), you’ll know how tricky it can be to scrub out PII without breaking the dataset. I have been using MS Presidio for some time and decided to build a UI on top of it. This is a tool that scans and recognizes sensitive bits in text (eg names, emails, addresses etc), processes images to mask whats sensitive and handles structured data. Everything is written in ts + nodejs, with great help from Claude Code :) It's still early so feedback & contributions are more than welcome. https://ift.tt/BjtQcs1 August 27, 2025 at 01:26AM
No comments