First deployment ready · EU-only data plane · v1.0
Private AI · for Polish bailiff offices

The AI you can keep
on your shelf.

Lexindex indexes your case archive, incoming scans, and the Polish legal corpus on an edge server controlled by your office. Standard inference runs in the EU. Enterprise keeps inference on-prem when policy requires it.

RODO / GDPR by design · EU-only infrastructure · Office-local index · No training on your data
KM/2024/0123 · skarga (complaint) · termin (deadline)
Cited answer

The system shows the source document, page, and cited passage behind the answer.

PDF · p. 12
SQL · case status
OCR · incoming scan
§ 03 / Architecture

Two zones.
One narrow channel.

Your archive stays on the edge. Generative work crosses one narrow, EU-only channel. Enterprise keeps inference on-prem when policy requires that no request leaves the building.

[Diagram: request loop between the two zones, about 12 s per cycle. Legend: Local · stays on-prem; Transit · EU GPU, ephemeral.]
Your office · on-prem: Files & matters (SQL · PDF · DOC · scans); Polish law corpus (statutes · case law); Local index (case-scoped retrieval); Audit log (users · audit events); User query (browser · desktop).
Channel: ↑ retrieval context · encrypted · payload not stored; ↓ inference result · ephemeral.
EU GPU · stateless: OCR (scanned pages → text); LLM inference (EU · stateless). No client data persisted · memory wiped · no training.
01
Ingest

Office documents and Polish law are indexed locally. Updates are incremental after that.

02
Query

Search the local index. Most answers never leave the office.

03
Inference

Only the minimum tokens transit when generative help is needed.

04
Return

The result returns to your office; nothing is retained centrally.

05
Audit

Every hop is logged on your edge server. Exportable.
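
As an illustration only, steps 02 to 05 above can be sketched as a small loop: search the local index first, build an outbound payload only when generative help is requested, and record every hop in a local audit log. All names here (`AuditLog`, `search_local_index`, `answer`, the hop labels) are hypothetical and are not Lexindex's actual API; the keyword match stands in for real retrieval.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable, Optional


@dataclass
class AuditLog:
    """Append-only hop log kept on the edge server (hypothetical structure)."""
    events: list = field(default_factory=list)

    def record(self, hop: str, detail: str) -> None:
        self.events.append(
            {"at": datetime.now(timezone.utc).isoformat(), "hop": hop, "detail": detail}
        )

    def hops(self) -> list:
        return [event["hop"] for event in self.events]


def search_local_index(index: list, question: str) -> list:
    """Toy retrieval: keep passages that share at least one word with the question."""
    words = set(question.lower().split())
    return [p for p in index if words & set(p["text"].lower().split())]


def answer(question: str, index: list, log: AuditLog,
           infer: Optional[Callable[[dict], str]] = None) -> dict:
    """Answer from the local index; call `infer` only when it is supplied."""
    log.record("query", f"chars={len(question)}")
    hits = search_local_index(index, question)
    log.record("retrieval", f"hits={len(hits)}")
    citations = [{"source": h["source"], "page": h["page"]} for h in hits]
    if infer is None:
        # Local-only answer: nothing leaves the office.
        return {"text": None, "citations": citations}
    # Only the question and the cited snippets are placed in the payload.
    payload = {"question": question, "snippets": [h["text"] for h in hits]}
    log.record("inference_out", f"snippets={len(hits)}")
    text = infer(payload)
    log.record("return", f"chars={len(text)}")
    return {"text": text, "citations": citations}
```

Passing `infer=None` models the local-only case; passing a callable models the transit hop, and the two extra log entries make that hop visible in the audit trail.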

100% · EU residency
0 bytes · retained on EU GPU
~80 ms · median local query
1 week · install to go-live
§ 04 / Transit channel

What is sent.
What stays.
What is never sent.

Stays in your office · always
  • Full case files (PDF, DOC, scans)
  • SQL records · debtor / creditor data
  • The local index & embeddings
  • Encryption keys for the edge store
  • User identities & the audit log
Transits · per-query, ephemeral
  • Minimum prompt tokens (the question + cited snippets)
  • Page images for OCR, only when explicitly invoked
  • The inference result, on the way back

Held only for the duration of a single request. TLS 1.3. No payload retained on the GPU.

Never sent · ever
  • Whole case files in bulk
  • Debtor / creditor PII as a dataset
  • Anything used for model training
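
One way to picture the three lists above is an allowlist applied before anything leaves the edge server. The sketch below is an assumption about how such a guard could look, not a description of the shipped product: the field names, the `MAX_SNIPPETS` cap, and both functions are hypothetical.

```python
# Hypothetical allowlist: the only fields permitted to cross the channel.
ALLOWED_KEYS = {"question", "snippets", "ocr_page_images"}
MAX_SNIPPETS = 5  # assumed cap, chosen here for illustration only


def build_transit_payload(question, cited_snippets, ocr_page_images=None,
                          max_snippets=MAX_SNIPPETS):
    """Build the per-query payload: the question plus a bounded set of cited snippets."""
    payload = {
        "question": question.strip(),
        "snippets": [s.strip() for s in cited_snippets[:max_snippets]],
    }
    # Page images are attached only when OCR was explicitly requested.
    if ocr_page_images:
        payload["ocr_page_images"] = list(ocr_page_images)
    return payload


def check_transit_payload(payload):
    """Reject any payload carrying a field outside the allowlist."""
    unexpected = set(payload) - ALLOWED_KEYS
    if unexpected:
        raise ValueError(f"fields not allowed in transit: {sorted(unexpected)}")
    return payload
```

A payload that tried to carry, say, a whole case file under a `case_file` key would be rejected by `check_transit_payload` before it reached the network.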
§ 05 / Product

Chat with
your case archive.

It feels like ChatGPT for your office, but it answers from your files and cites where it found the answer.

You

Have we handled a similar case before?

Lexindex

Yes. I found three similar matters and the exact pages that show what happened next.

You

Summarize this case for tomorrow.

Lexindex

Here is the short brief: parties, amounts, deadlines, key documents, and open questions.

Ask about a case, document, or deadline...

Ask naturally.

Write like you would to a careful assistant: what changed, what should I read first, show me similar cases.

Get cited answers.

Every answer points back to the document, page, or record it used.

Stay in control.

Staff review, edit, accept, or reject generated text before it leaves the system.

§ 06 / The unspoken question

"Why not just
use ChatGPT?"

A general assistant

Your prompt can become someone else's training data.

It can route confidential prompts into provider-controlled infrastructure, retention rules, and model-improvement policies that were not designed around a Polish bailiff office. It does not know your archive, your sygn. akt (case reference) conventions, or the register of Polish enforcement work. Useful for general questions; the wrong tool for case files.

Lexindex

Your archive is not uploaded to someone else's cloud.

The EU GPU sees only the minimum payload for a single answer or OCR request, retains no payload, and is forbidden — contractually and architecturally — from training on what passes through it. The only model that ever "learns" your archive is the one inside your walls.

§ 07 / Deployment

What gets installed.
Who does what.
How long it takes.

About one week, end-to-end, for the first office deployment. Your IT team does about two hours of work; we do the rest through an outbound-only operations path. Enterprise is quoted separately when inference must stay on-prem.

Step 01

Discovery · 30 min

We map your archive structure. You map your concerns. No NDA is needed for the first call; a mutual NDA is signed before we discuss office-specific details.

Step 02 · Day 1

Edge server arrives

One 1U appliance, sealed and inventoried, included in the subscription. Your IT team racks it. On-prem inference adds a dedicated server or a validated GPU host.

Step 03 · Days 2–4

Index + validation

OCR and embedding jobs run on the edge server. Data stays in place. We validate retrieval quality against representative questions before expanding usage.

Result · Week 1

Done. Your office works faster.

Similar cases, full-case briefs, and cited answers are ready in minutes instead of buried in folders.


Your IT does

About two hours, end-to-end.

Rack the appliance, connect power and a network port, and allow the outbound operations and inference path. With on-prem inference, the inference path stays local.

We do

The rest, remotely.

Configuration, ingest, audit, first retrieval validation, user training, and two weeks of support. Sessions are recorded, with your consent, for your records.

If something fails

Priority support on premium.

A spare-appliance plan and recovery runbook are agreed before go-live. Your data stays on the edge either way.

Package option

Lexindex Enterprise.

Optional on-prem inference for offices where generative requests cannot leave the building.

§ 08 / Pricing model

Pricing

Shape: Monthly subscription (unlimited monthly usage)
What's included: 3-month discounted trial · edge appliance · provisioning · first index build · software updates · model updates · EU inference · standard support · no query meter

Shape: Enterprise (full on-prem inference)
What's included: Dedicated inference server in your office · local GPU validation · inference runbook · no generative request ever leaves your premises
§ 10 / Questions

The questions you're
already drafting.

Plain answers, written in a buyer's language. If yours is missing, we will answer it on the demo call — and write it into a future version of this section.

What happens if the EU GPU server is unreachable?
Local features keep working — search, file access, audit log, and previously generated outputs. Generative answers queue and notify you when the channel is restored.
How do you keep one office separate from another?
Each office has its own edge deployment, local store, local index, and audit trail. Client case data is not pooled in a central case database. The shared inference layer processes only transient request context and does not durably store payloads.
Can inference run on our premises?
Yes. Lexindex Enterprise keeps the inference server inside your environment, so generative requests do not need the EU GPU hop. See § 07.
Who at Lexindex can access our data?
Nobody, by default. Support sessions are opt-in, time-boxed, and logged on your edge server. Every action a support engineer takes appears in your audit log under their name.
What does cancellation look like?
A 30-day offboarding. We retrieve the appliance and wipe it using industry-standard media sanitization (certificate provided). Before retrieval, you export anything you need to keep — audit log copies, signed reports, summaries. The hardware is leased as part of the subscription, not bought. After offboarding we retain none of your data on our side; the contract spells this out in writing.
How is this priced and billed?
No setup fee. One monthly subscription with unlimited usage. The first three months are discounted, and the edge appliance, provisioning, and first index build are included. Lexindex Enterprise is optional when the full inference path must stay on-prem. No metered query charge. See § 08.
Can the on-prem appliance run air-gapped?
Yes. The local index, search, and reading work fully offline. Generative features pause until the channel is restored — which can be on your schedule, not ours.
Can our DPO inspect the configuration?
Yes. Configuration, retention choices, sub-processor list, audit events, and the DPIA worksheet are reviewable with your DPO before go-live.
What if the EU GPU provider changes?
Sub-processor changes require 30-day notice. If you object, the contract gives you a clean exit on terms specified in writing — without penalty.
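
The fallback described in the first answer above (generative requests queue while the EU GPU is unreachable, then notify you once the channel is restored) could be sketched roughly as follows. This is a minimal illustration under assumed names: `GenerativeQueue`, `send`, and `notify` are hypothetical, and a real deployment would also need persistence and retry handling.

```python
from collections import deque


class GenerativeQueue:
    """Hold generative requests locally while the inference channel is down."""

    def __init__(self, send, notify):
        self.send = send        # callable that performs the inference request
        self.notify = notify    # callable that tells the user a result is ready
        self.pending = deque()  # requests waiting for the channel, oldest first

    def submit(self, request, channel_up):
        """Send immediately if the channel is up; otherwise queue and return None."""
        if channel_up:
            return self.send(request)
        self.pending.append(request)
        return None

    def drain(self):
        """Call once the channel is restored: send queued requests in order."""
        results = []
        while self.pending:
            request = self.pending.popleft()
            result = self.send(request)
            self.notify(request, result)
            results.append(result)
        return results
```

Local search, file access, and the audit log do not pass through this queue, which is why they keep working during an outage.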
§ 11 · Book a demo

Thirty minutes.
Your archive structure.
Real answers.

We will demo on a synthetic archive, then talk through your archive structure and answer the security question you bring. Mutual NDA before office-specific details — within the same hour, electronically. The first call is with someone who has done this before, not a sales rep reading a script.

Tell us about your office.
Leave your contact details and we'll get back within one business day.