{ "cells": [ { "cell_type": "markdown", "id": "ffc7c8ae05ec26fa", "metadata": {}, "source": [ "# PII Scrubbing\n", "\n", "In this recipe, we go over how to detect Personally Identifiable Information (PII) and redact it from your source. Whether your source is a database, a document, or a spreadsheet, it is important to prevent PII from leaving your system. We will use Ollama to run the model locally, which keeps the data private.\n", "\n", "
Mirascope Concepts Used
\n", "Background
\n", "\n", "Before Natural Language Processing (NLP) and Named Entity Recognition (NER) techniques, scrubbing or redacting sensitive information was a time-consuming manual task. LLMs improve on these techniques because they can understand the context surrounding sensitive information.\n", "
\n", "Additional Real-World Applications
\n", "