Inspect the request
The gateway scans prompts, messages, and extracted file text before an LLM request is created.
Globesword is a locally deployable filtering gateway that detects configured PII, financial identifiers, credentials, national IDs, and customer-specific sensitive data before an LLM request is sent.
Sensitive values are replaced with request-scoped tokens. Approved values can be restored after inference through a controlled mapping layer.
Request inspection
Globesword Masking Gateway
Original request
Contact maya@example.com and transfer funds to IBAN DE89 3704 0044 0532 0130 00.
Payload sent to the LLM
Contact [EMAIL_1] and transfer funds to IBAN [IBAN_1].
Exact span
MOD-97 valid
Restoration is limited to authorized tokens associated with the current request context.
Baseline functional validation
These results demonstrate functional correctness for the tested examples. They are not presented as universal production accuracy.
13
Baseline scenarios
End-to-end functional tests
33
Expected entities
All detected with exact spans
0
Observed leaks
Within the baseline suite
0.154 ms
Deterministic p95
For tested short-text inputs
Client PoVs use larger customer-specific datasets containing positive, negative, malformed, overlap, multilingual, and document-specific test cases.
How it works
Globesword is designed to sit in the request path before data is sent to an external or internally hosted LLM.
The gateway scans prompts, messages, and extracted file text before an LLM request is created.
Deterministic patterns, checksum validators, contextual detection, and tenant rules identify configured data classes.
Detected values are replaced with request-scoped typed tokens before the content reaches the target model.
Authorized tokens can be restored after inference through the separate controlled mapping layer.
Detection coverage
The platform includes reusable detectors, but each deployment activates only what is relevant to the customer.
A German manufacturer, a US financial institution, and an Indian healthcare provider should not run the same national identifier policy.
Detect common personal identifiers before prompt content leaves the application boundary.
Use structural and checksum-aware detection for common financial data.
Enable only the country-specific identifiers relevant to each deployment.
Identify provider-specific credentials and explicitly labelled secrets.
Protect technical identifiers that may expose private systems or environments.
Add organization-specific patterns without retraining the detection model.
Technical approach
Structured identifiers, contextual entities, and ambiguous data classes require different detection and validation strategies.
Only the detectors relevant to the customer, geography, document type, and workflow need to be enabled.
Where supported, regex candidates are validated using algorithms such as Luhn, MOD-97, Verhoeff, and identifier-specific checks.
Confidence thresholds, custom identifiers, token prefixes, severity, and active policies can be configured per deployment.
The detection layer is designed to run locally, in a private cloud, or inside an enterprise-controlled container environment.
The system tracks character offsets so only the identified value is replaced while surrounding text remains intact.
PoV results are reported using precision, recall, F1, exact-span accuracy, leakage, restoration accuracy, and latency.
Where it fits
The gateway can be integrated into applications that send user input, documents, retrieved context, logs, or tool output to an LLM.
Mask sensitive content extracted from PDFs, DOCX files, spreadsheets, emails, tickets, and knowledge-base records.
Detect credentials, private keys, tokens, network identifiers, and labelled secrets before code or logs reach an LLM.
Apply department-specific policies for HR, finance, customer support, legal, healthcare, and operational workflows.
Place a masking layer before retrieval context, agent messages, tool calls, or external model requests.
Proof of Value
Every PoV is scoped around the customer’s countries, departments, document types, identifiers, model workflows, and acceptable risk.
Scope
Selected workflows and data classes
Policy
Customer-specific detector profile
Evaluation
Positive and negative test datasets
Outcome
Measured results and limitations
The product is evaluated against defined policies and datasets. Results are reported with their scope and known limitations.
Start with one workflow, one policy profile, and a measurable customer-specific dataset.