Privacy-Preserving KYC: Verify Without Exposing
The premise of Know Your Customer (KYC) regulation is that financial institutions must verify the identity of their customers to prevent money laundering, terrorist financing, and other financial crimes. The implementation of this premise has produced a global system where billions of people's passports, driver's licenses, utility bills, and bank statements are copied, transmitted, stored, and eventually breached across thousands of institutions, each maintaining their own database of identity documents with their own security posture and their own breach history.
The privacy problem with KYC is not a policy failure. It is an architectural one. The system was designed around a model where verification requires custody of the data being verified. That model made sense when verification was manual and the data was physical. It does not make sense when verification can be performed cryptographically and the data can be represented as mathematical proofs.
The Custody Problem
When a financial institution performs KYC, it creates a copy of the customer's identity documents. That copy becomes the institution's responsibility to protect. The institution must encrypt the data, control access to it, retain it for the regulatory retention period (typically 5-7 years after the customer relationship ends), and dispose of it securely when the retention period expires.
Every copy creates a new attack surface. Every institution with a copy is a potential breach point. Every breach exposes not just the breached institution but every customer whose identity documents were stored there. The cumulative effect of this architecture is that the average person's identity documents exist in dozens of databases operated by dozens of organizations with varying security capabilities. The probability that at least one of those databases is breached during the customer's lifetime approaches certainty.
The cost of this architecture is staggering. Identity fraud losses in the United States alone exceeded $23 billion in 2023. A substantial portion of this fraud is enabled by data breaches of identity documents stored for KYC compliance. The institutions storing the data bear the direct costs of breaches (remediation, regulatory fines, litigation). The customers bear the indirect costs (credit monitoring, account freezes, identity restoration). Society bears the systemic costs (reduced trust in financial institutions, increased fraud prevention expenditure throughout the economy).
Data Minimization as a Design Principle
The GDPR introduced data minimization as a legal requirement: personal data must be adequate, relevant, and limited to what is necessary for the purposes for which it is processed. The EU AI Act reinforces this principle for AI systems that process personal data. Yet the traditional KYC model is maximally non-minimal: it requires the institution to receive and store complete identity documents when all it needs is a binary answer to the question "has this person been verified?"
Zero-knowledge KYC implements data minimization at the cryptographic level. The institution never receives the identity documents. It receives a mathematical proof that the documents exist, are valid, and satisfy the requirements. The proof reveals nothing about the documents beyond the specific claims it attests to. This is not data minimization through policy ("we promise not to look at the data we do not need"). It is data minimization through mathematics ("we structurally cannot access data that is not included in the proof").
The STARK Proof Architecture
A STARK (Scalable Transparent Argument of Knowledge) proof for KYC operates as follows. The prover (the identity verifier) takes as input the customer's identity documents and the verification checks (document validity, sanctions screening, PEP screening, jurisdiction checks). The prover executes the verification computation and produces a proof that the computation was performed correctly and that the results satisfy the required conditions. The proof is a compact artifact (typically 200-800 bytes for KYC use cases) that can be verified in constant time by any verifier.
The key property of a STARK proof is zero-knowledge: the proof reveals nothing about the inputs (the identity documents) beyond the truth of the claimed outputs (the verification results). A verifier who receives the proof learns that the customer passed identity verification, that the verification used valid documents, that the customer is not on any sanctions list, and that the customer's jurisdiction is permitted. The verifier does not learn the customer's name, date of birth, document number, nationality, or any other personal information.
STARKs provide two additional properties critical for KYC. Transparency means there is no trusted setup: the proof system does not require a ceremony or trusted third party during initialization, which eliminates a potential point of compromise. Post-quantum security means the proof remains valid even after quantum computers can break elliptic curve cryptography, which is important for KYC proofs that may be verified years or decades after generation.
Attestation Epochs
Identity verification is not static. A person who passes sanctions screening today might appear on a sanctions list tomorrow. A document that is valid today might expire next month. A jurisdiction that is permitted today might become restricted due to a regulatory change. KYC proofs must account for this temporal dimension.
Attestation epochs solve this problem. An attestation epoch is a defined time period, configurable by the relying institution, during which a proof's claims are considered current. Common epoch lengths are 24 hours for sanctions screening, 30 days for general identity verification, and 12 months for document validity.
At the beginning of each epoch, the trusted verifier re-executes the relevant screening checks for all active proofs. Proofs that pass the updated checks are renewed with a new epoch timestamp. Proofs where the underlying data has changed (the customer appears on a newly updated sanctions list, the document has expired) are not renewed, and the customer is notified that a re-verification is required.
The epoch mechanism ensures that proofs remain current without requiring the customer to re-submit documents for routine updates. The customer's initial identity documents are verified once; ongoing screening is automated by the trusted verifier. Relying institutions verify that the proof is within the current epoch, which takes milliseconds and requires no PII exchange.
Selective Disclosure
Some use cases require more information than a simple pass/fail verification. A bank opening a high-value account might need to know that the customer is over 18 but does not need to know their exact date of birth. An exchange operating in a specific jurisdiction might need to confirm that the customer is not a resident of a sanctioned country but does not need to know their exact address. An investment platform might need to verify accredited investor status but does not need to see the customer's tax returns.
Selective disclosure allows the ZK-KYC proof to include specific claims without revealing the underlying data. The proof can attest: "the customer is over 18" without revealing their date of birth; "the customer is not a resident of any OFAC-sanctioned jurisdiction" without revealing their address or nationality; "the customer has a net worth exceeding $1 million" without revealing their financial statements; "the customer's identity document expires after a specific date" without revealing the document number or type.
Each selective disclosure claim is independently verifiable. The relying institution can choose which claims it needs and verify only those claims. Claims that are not needed are not included in the verification request, and the proof does not reveal them.
Cross-Border Interoperability
KYC requirements vary by jurisdiction. A customer verified under EU's fourth Anti-Money Laundering Directive (AMLD4) satisfies different requirements than a customer verified under the U.S. Bank Secrecy Act (BSA). A ZK-KYC proof must accommodate these differences without requiring separate proofs for each jurisdiction.
The approach is claim-based rather than jurisdiction-based. The proof attests to specific factual claims about the verification that was performed: the identity document type and issuing country, the databases checked, the screening lists used, and the verification date. The relying institution in each jurisdiction evaluates whether those claims satisfy their local requirements. A proof generated under EU verification standards might be accepted by a U.S. institution if the claims in the proof satisfy BSA requirements, even though the verification was not performed under BSA procedures.
This claim-based model supports progressive adoption. Institutions in jurisdictions with explicit digital KYC frameworks can accept ZK-KYC proofs immediately. Institutions in jurisdictions without such frameworks can evaluate the claims in the proof against their local requirements and make acceptance decisions on a case-by-case basis while regulatory clarity evolves.
The Trusted Verifier Model
The security and regulatory viability of ZK-KYC depends on the trusted verifier. The verifier is the entity that performs the initial identity verification, generates the STARK proof, and manages attestation epochs. The verifier holds the customer's identity documents (or has verified them and retained cryptographic commitments to them). The verifier's integrity is the foundation of the system.
Trusted verifier requirements include: regulatory licensing in the jurisdictions where they operate; regular independent audits of verification procedures; hardware security module (HSM) protection for proof generation keys; data retention policies that minimize the duration and scope of PII storage; incident response procedures for compromised proofs; and transparency reporting on verification volumes, rejection rates, and epoch renewal statistics.
The market for trusted verifiers will likely develop similarly to the certificate authority (CA) market for TLS certificates: a relatively small number of audited, regulated entities that issue credentials accepted across a large number of relying institutions. This concentration is a feature, not a bug. It reduces the total number of entities that custody PII, concentrates security investment in specialized infrastructure, and creates accountability through market reputation and regulatory oversight.
Integration with Existing Infrastructure
ZK-KYC does not require relying institutions to rebuild their compliance infrastructure. The proof verification step integrates into existing onboarding workflows as an additional verification method. Institutions can accept ZK-KYC proofs alongside traditional document-based KYC during a transition period, gradually shifting volume as regulatory comfort increases.
The technical integration surface is minimal: an API call that submits the proof and receives a verification result. The verification result includes the claims that were verified, the epoch timestamp, the trusted verifier identity, and a signature over the entire result. The institution's existing compliance system receives this result and maps it to its internal verification requirements, the same way it maps the results of traditional KYC checks.
For the full technical specification of ZK-KYC proofs, see the ZK-KYC product page. For STARK proof architecture, see the ZK Proofs overview. For related compliance topics, see KYC Verification Guide and AML Screening Best Practices.
Verify Without Exposing
Zero-knowledge KYC eliminates PII custody liability while satisfying compliance requirements.
ZK-KYC Product Schedule Demo