Question 1

What is the plaintext gap in a RAG system?

Accepted Answer

The plaintext gap is the point where encrypted storage and encrypted transport end and plaintext computation begins. Even a RAG system with strong access controls, encrypted vectors at rest, and TLS in transit will decrypt data before the AI provider processes it. The inference server sees every query, every retrieved document, and every generated response in plaintext. Access controls prevent unauthorised users from reaching the data but do not prevent the provider infrastructure from seeing it during processing.

Question 2

How does VectaX implement encrypted inference for RAG?

Accepted Answer

VectaX combines three layers: FHE (Fully Homomorphic Encryption) to compute similarity on ciphertext without decrypting it; similarity-preserving vector encryption for storage, so the vector database never holds plaintext embeddings; and RBAC, so that only vectors matching the policy bound to the user's secret key can be decrypted, even after retrieval. The client encrypts the query vector before it leaves the application, the vector database runs similarity search on ciphertext, and the application decrypts only the results that match the user's role, group, and department policy.
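To make the idea of searching without seeing plaintext concrete, here is a toy sketch in Python. It is not VectaX's scheme and it is not secure. It uses a secret signed permutation only because that transform preserves dot products, so a server can rank the transformed vectors without holding the originals. All names (`make_key`, `protect`, `search`) are hypothetical and do not come from the mirror-sdk.

```python
import random

def make_key(dim, seed):
    """Derive a secret signed permutation from a seed (toy key, NOT secure)."""
    rng = random.Random(seed)
    order = list(range(dim))
    rng.shuffle(order)
    signs = [rng.choice((-1.0, 1.0)) for _ in range(dim)]
    return order, signs

def protect(vector, key):
    """Client side: permute and sign-flip the coordinates with the secret key."""
    order, signs = key
    return [signs[i] * vector[order[i]] for i in range(len(vector))]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def search(protected_query, protected_index, top_k=1):
    """Server side: rank stored vectors using only protected values."""
    ranked = sorted(
        protected_index.items(),
        key=lambda item: dot(protected_query, item[1]),
        reverse=True,
    )
    return [doc_id for doc_id, _ in ranked[:top_k]]

# Illustrative data: the client protects documents and the query with the same key.
key = make_key(dim=4, seed=2024)
documents = {
    "hr-policy": [0.9, 0.1, 0.0, 0.1],
    "q3-revenue": [0.0, 0.8, 0.6, 0.0],
}
protected_index = {doc_id: protect(vec, key) for doc_id, vec in documents.items()}
protected_query = protect([0.1, 0.7, 0.7, 0.0], key)
best_match = search(protected_query, protected_index)
```

Because the same signed permutation is applied to both sides, every dot product between protected vectors equals the dot product of the originals, so the ranking is unchanged. A production scheme needs far stronger guarantees than this toy transform provides.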

Question 3

What is encrypted vector memory for AI agents?

Accepted Answer

Encrypted vector memory stores an AI agent's operating memory in encrypted form. This includes conversation history, system prompts, cached embeddings of retrieved documents, and intermediate reasoning stored across turns. Without it, the memory provider holds plaintext records of everything sensitive the agent has processed. With VectaX encrypted vector memory, every memory entry is encrypted at creation time, before it reaches any storage system. The agent can still retrieve and reason over its memory because FHE enables similarity search on ciphertext.

Question 4

How do I connect VectaX to Claude Desktop using MCP?

Accepted Answer

Clone the VectaX MCP server from github.com/mirrorsecai/mirror-vectax-mcp-server and run the setup script for your platform (setup_claude_config.sh on macOS and Linux, setup_claude_config.bat on Windows). The script installs the mirror-sdk dependency, configures VectaX with your MIRROR_API_KEY, and registers the MCP server with Claude Desktop. After setup Claude Desktop can query your encrypted vector store through MCP with all RBAC policies enforced at query time.
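After running the setup script, one way to confirm the registration is to read the Claude Desktop configuration file and list the servers under its `mcpServers` key. The sketch below assumes the typical macOS config location. The helper name `list_mcp_servers` is hypothetical, and the name under which the VectaX server is registered is not stated here, so the code only lists whatever it finds.

```python
import json
from pathlib import Path

def list_mcp_servers(config_path):
    """Return the server names registered under "mcpServers" in a Claude Desktop config."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    return sorted(config.get("mcpServers", {}))

# Assumed default location on macOS; adjust the path for your platform.
DEFAULT_CONFIG = (
    Path.home() / "Library" / "Application Support" / "Claude" / "claude_desktop_config.json"
)

if __name__ == "__main__":
    if DEFAULT_CONFIG.exists():
        print(list_mcp_servers(DEFAULT_CONFIG))
    else:
        print(f"No config found at {DEFAULT_CONFIG}")
```

If the list is empty or the file is missing, rerun the setup script and restart Claude Desktop before trying again.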

Question 5

What compliance claims does encrypted inference enable?

Accepted Answer

GDPR Article 32 requires appropriate technical and organisational measures to secure personal data and names encryption as one such measure. Encrypted inference supports this because the provider infrastructure never processes plaintext data. The HIPAA Security Rule treats encryption of protected health information at rest and in transit as an addressable safeguard. Encrypted inference extends that protection to processing, meaning the AI provider never sees PHI in plaintext at any stage. PCI DSS Requirements 3 and 4 cover protection of stored cardholder data and encryption of cardholder data in transit over open, public networks. Encrypted inference carries the same protection into AI workloads that process payment data. These are cryptographic guarantees, not contractual assurances.

Question 6

What is the difference between encrypted vector storage and encrypted inference?

Accepted Answer

Encrypted vector storage (covered in A3 and A4) protects embeddings while they sit in the database and while they travel over the network. Encrypted inference goes further: it keeps the data encrypted during the computation itself. This means the similarity search, the retrieval, and the AI model processing all happen on ciphertext. A system with only encrypted storage decrypts before processing. A system with encrypted inference never decrypts on provider infrastructure at any stage.

Question 7

Which vector databases does VectaX work with?

Accepted Answer

VectaX works with Qdrant, Pinecone, ChromaDB, MongoDB Atlas Vector Search, and pgvector. The ciphertext is stored as the vector payload in whichever database you choose. The VectaX SDK handles encryption and decryption on the client side, so the vector database itself never needs to understand the encryption scheme.

Question 8

Does encrypted inference slow down RAG queries?

Accepted Answer

There is a latency cost. FHE operations on ciphertext are more expensive than plaintext similarity search. VectaX provides noise control parameters through the SDK that let teams tune the trade-off between accuracy, latency, and security. For most enterprise RAG workloads the latency is acceptable and the compliance benefit justifies it. For latency-critical applications the recommendation is to profile with the VectaX playground before deploying to production.

Question 9

Where can I learn about the cryptographic foundations of FHE?

Accepted Answer

This module covers encrypted inference at the practitioner level: how to configure and use it. The cryptographic foundations of FHE, including PHE vs SHE vs FHE, bootstrapping, noise management, and the CKKS, BFV, and BGV schemes, are covered in Track 3D Privacy-Preserving AI, along with differential privacy.

Question 10

Can I use encrypted inference with an existing RAG system?

Accepted Answer

Yes. VectaX is designed as a drop-in addition to existing RAG stacks. You add the mirror-sdk dependency, replace plaintext embed-and-store calls with encrypted equivalents using VectorData or RBACVectorData, and replace plaintext query calls with encrypted query calls. The vector database API calls remain the same. The main change is that vectors stored and queried are now ciphertext rather than plaintext floats.

Question 11

What happens to vectors outside the user policy scope?

Accepted Answer

They come back from the vector database as ciphertext that the user's key cannot decrypt. The decrypt call raises a MirrorError exception, which the application catches before excluding the result. From the user's perspective, these vectors simply do not appear in results. From the perspective of an attacker who has bypassed application-layer access controls, the vectors are undecryptable ciphertext with no practical path to recovery.
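The catch-and-exclude pattern described above can be sketched as follows. This is an illustration under stated assumptions, not the SDK's API: `MirrorError` is redefined locally as a stand-in for the SDK exception, and `decrypt_for_user` is a hypothetical placeholder that simulates a key policy with a plain role check. The real enforcement is cryptographic.

```python
class MirrorError(Exception):
    """Local stand-in for the SDK exception named in the answer above."""

def decrypt_for_user(hit, user_roles):
    """Hypothetical decrypt: succeeds only when the user's roles cover the hit.

    A real implementation fails because the key cannot decrypt the ciphertext,
    not because of an explicit role comparison like this one.
    """
    if hit["role"] not in user_roles:
        raise MirrorError("key policy does not cover this vector")
    return hit["payload"]

def visible_results(hits, user_roles):
    """Keep only the hits that decrypt; silently drop everything else."""
    results = []
    for hit in hits:
        try:
            results.append(decrypt_for_user(hit, user_roles))
        except MirrorError:
            continue  # out-of-scope vector: exclude it from the results
    return results

# Illustrative search hits, each tagged with the role required to read it.
hits = [
    {"role": "finance", "payload": "Q3 revenue summary"},
    {"role": "hr", "payload": "Salary bands"},
]
finance_view = visible_results(hits, user_roles={"finance"})
```

Catching only the specific exception keeps unrelated failures, such as network errors, visible instead of hiding them as missing results.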

Question 12

How does encrypted inference relate to the access controls covered in A4?

Accepted Answer

A4 covers RBAC and ABAC as application-layer and database-layer controls that determine who can query which vectors. Encrypted inference adds a cryptographic enforcement layer beneath those controls. If the application-layer controls fail because of a bug or an injection attack, the RBAC policy is still enforced at decryption time, because the user's key cannot decrypt vectors outside its policy scope. Encrypted inference does not replace access controls. It provides a safety net that holds even when other controls fail.

Encrypted Inference & Vector Memory

The plaintext gap that access controls cannot close

How encrypted inference works in a RAG pipeline

Encrypting the query vector before it leaves the client

Running encrypted similarity search

Decrypting only within policy scope

Encrypted vector memory for RAG agents

MCP integration with Claude Desktop

What you can claim for compliance

Go deeper on the cryptography

Encrypted inference for your RAG stack
