Tork


PII Redaction

Automatically detect and redact 50+ types of personally identifiable information (PII) before it reaches your AI models. Protect user privacy and support compliance with GDPR, CCPA, HIPAA, and other privacy regulations.

See It In Action

Before (Unsafe)

"Hi, I'm John Smith. You can reach me at john.smith@company.com or call me at (555) 123-4567. My SSN is 123-45-6789 and my credit card is 4532-1234-5678-9012."

After (Safe)

"Hi, I'm [NAME]. You can reach me at [EMAIL] or call me at [PHONE]. My SSN is [SSN] and my credit card is [CREDIT_CARD]."

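To make the transformation above concrete, here is a minimal regex-based sketch of token replacement. It is an illustration only, not Tork's detection logic: the `PATTERNS` table and `redact_with_tokens` helper are hypothetical, the patterns cover just the formats in the sample message, and names are left out because they generally cannot be found by pattern matching alone.

```python
import re

# Hypothetical patterns for illustration; they only cover the sample formats.
# Order matters: longer digit patterns (credit card) run before shorter ones (SSN).
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "CREDIT_CARD": re.compile(r"\b\d{4}-\d{4}-\d{4}-\d{4}\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\(\d{3}\) \d{3}-\d{4}"),
}


def redact_with_tokens(text: str) -> str:
    """Replace every match of each pattern with a [TYPE] token."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


message = (
    "You can reach me at john.smith@company.com or call me at (555) 123-4567. "
    "My SSN is 123-45-6789 and my credit card is 4532-1234-5678-9012."
)
redacted = redact_with_tokens(message)
print(redacted)
# You can reach me at [EMAIL] or call me at [PHONE]. My SSN is [SSN] and my credit card is [CREDIT_CARD].
```
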
50+ PII Types Detected

Contact Info

  • Email addresses
  • Phone numbers (global)
  • Physical addresses
  • IP addresses

Identity

  • Full names
  • Social Security Numbers
  • Passport numbers
  • Driver's license numbers

Financial

  • Credit card numbers
  • Bank account numbers
  • Tax IDs
  • Financial account numbers

Healthcare

  • Medical record numbers
  • Health plan IDs
  • Prescription info
  • Patient identifiers

Digital

  • API keys
  • Passwords
  • OAuth tokens
  • Private keys

Regional

  • AU TFN/ABN
  • UK NI numbers
  • EU VAT numbers
  • Canadian SIN
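
Several of the types listed above are numeric identifiers, and a bare digit pattern tends to flag order numbers or tracking codes as well. One common way to reduce false positives for credit card numbers is a Luhn checksum. The sketch below assumes nothing about how Tork validates matches; `luhn_valid` is a hypothetical helper.

```python
def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    if len(digits) < 12:  # too short to be a payment card number
        return False
    total = 0
    # Walk from the rightmost digit, doubling every second one.
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0


print(luhn_valid("4111 1111 1111 1111"))  # True  (widely used test number)
print(luhn_valid("4532-1234-5678-9012"))  # False (placeholder from the example)
```

The trade-off is worth noting: a strict checksum would skip placeholder numbers like the one in the example message, so a detector might reasonably redact any card-shaped number and use the checksum only as a confidence signal.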

Simple Integration

Python

from tork import PIIDetector

detector = PIIDetector()

# Scan text for PII
text = "Contact john@example.com or call 555-123-4567"
results = detector.scan(text)

print(results)
# [
#   {"type": "email", "value": "john@example.com", "start": 8, "end": 24},
#   {"type": "phone", "value": "555-123-4567", "start": 33, "end": 45}
# ]

# Redact PII
safe_text = detector.redact(text)
print(safe_text)
# "Contact [EMAIL] or call [PHONE]"

# Restrict detection to selected PII types and register a custom pattern
detector = PIIDetector(
    enabled_types=["email", "phone", "ssn", "credit_card"],
    custom_patterns={
        "employee_id": r"EMP-\d{6}"
    }
)
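
A custom pattern such as `EMP-\d{6}` is an ordinary regular expression, so it can be checked with Python's standard `re` module before it is passed to the detector. The sketch below is an assumption-laden stand-in: `find_matches` is a hypothetical helper that only mirrors the result shape shown in the sample output, and it does not call Tork.

```python
import re

# Same expression as the custom_patterns entry in the sample configuration.
EMPLOYEE_ID = re.compile(r"EMP-\d{6}")


def find_matches(text: str, label: str, pattern: re.Pattern) -> list:
    """Return one dict per match, with half-open [start, end) character offsets."""
    return [
        {"type": label, "value": m.group(), "start": m.start(), "end": m.end()}
        for m in pattern.finditer(text)
    ]


sample = "Ticket opened by EMP-004217 and escalated to EMP-99."
matches = find_matches(sample, "employee_id", EMPLOYEE_ID)
print(matches)
# [{'type': 'employee_id', 'value': 'EMP-004217', 'start': 17, 'end': 27}]
```

Without word boundaries, the pattern would also match the first six digits of a longer run such as `EMP-1234567`. Writing it as `r"\bEMP-\d{6}\b"` avoids that if IDs are always exactly six digits.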

Flexible Redaction Options

Token Replacement

john@email.com → [EMAIL]

Replace with type indicator

Masked

john@email.com → j***@e***.com

Partial masking for context

Hashed

john@email.com → a3f2c1...

Consistent hash for correlation

Custom

john@email.com → <REDACTED>

Your own replacement text
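
The four options above can be sketched as plain functions. This is a hedged illustration rather than Tork's API: every function name here is hypothetical, `mask_email` assumes a well-formed address, and the hashed variant uses a keyed HMAC-SHA256, which is one reasonable choice among several.

```python
import hashlib
import hmac


def token_replace(value: str, pii_type: str) -> str:
    """Token replacement: swap the value for its type indicator."""
    return f"[{pii_type.upper()}]"


def mask_email(value: str) -> str:
    """Partial masking: keep the first character of the local part and domain."""
    local, _, domain = value.partition("@")
    name, _, tld = domain.rpartition(".")
    return f"{local[:1]}***@{name[:1]}***.{tld}"


def hash_value(value: str, key: bytes, length: int = 12) -> str:
    """Keyed hash: the same input and key always give the same short digest."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:length]


def custom_replace(value: str, replacement: str = "<REDACTED>") -> str:
    """Custom: substitute caller-supplied replacement text."""
    return replacement


email = "john@email.com"
key = b"demo-key-change-me"  # placeholder; load a real secret from configuration

print(token_replace(email, "email"))  # [EMAIL]
print(mask_email(email))              # j***@e***.com
print(hash_value(email, key))         # 12 hex characters, stable for this key
print(custom_replace(email))          # <REDACTED>
```

A keyed hash is used here because an unkeyed hash of a low-entropy value, such as a phone number, can be reversed by hashing every candidate value. Keeping the key secret preserves correlation across records without making that attack easy.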

Compliance Ready

GDPR
CCPA
HIPAA
SOC 2
PCI DSS
FERPA

Tork's PII detection helps you meet data minimization requirements across major privacy regulations. Combined with our audit logging, you have the documentation needed for compliance audits.

Protect User Privacy Today

Start detecting and redacting PII in minutes.
