Mid-size law and compliance firms — 50 to 500 people — burn 30-45 minutes per query making associates hunt through PDF graveyards for answers that already exist in their own files. Harvey and Casetext won't help them: those tools are trained on public databases, priced at $100-200/seat for BigLaw, and structurally can't ingest a firm's proprietary precedents. Partners don't trust the outputs anyway because citations hallucinate.
There are roughly 8,000 mid-market legal/compliance firms in EU+UK fitting our profile. At 1% capture and €18k blended ACV (€8k build + €1.5k/mo), that's ~€1.4M ARR — small but real, and the underserved segment is genuinely empty between Notion AI and Harvey.
The wedge is narrow but defensible short-term: own-document ingestion plus source-authority-weighted citations plus a senior-annotation feedback loop. Generic RAG players (Glean, Guru) will commoditize retrieval in 18-24 months, so the moat is the trust layer, not the pipes.
Why now: GPT-4-class models finally pass the 95% citation-accuracy bar on domain documents; 12 months ago they didn't.
Honest on edge: I have no specific operator advantage here. This is a cold-start enterprise sale into a sector I don't own.
The path: one paid pilot, €2,700 build fee, 14 days to first answers. Kill at day 60 if we don't have 2 signed POs and €2k MRR, or if InfoSec blocks ingestion at 2 of 5 prospects without a self-hosted answer.
Small check, fast read, clean kill. Let's run it.
The detail behind the pitch
Problem
Professional services firms (law, consulting, accounting) with unstructured document repositories spend 30-45 minutes per query manually searching PDFs for client answers, creating labor inefficiency.
Proposed solution
AI research assistant that ingests firm's documents, answers plain-language questions with exact citations, weighted by source authority, and learns from senior annotations.
Target market
Mid-size legal, compliance, and consulting teams (50-500 person firms) with 100+ documents to search; willingness to pay €2,700-15,000+ for build + €1,000-2,000/month maintenance.
First test
Build for 1 law firm or compliance team; measure if system cuts query-resolution time from 30-45 min to <2 min with 95%+ accuracy; target 5+ queries/week adoption.
Kill criteria
<2 paying clients contracted (PO signed, not verbal) AND <€2,000 MRR by day 60 → kill; OR pilot firm logs <5 queries/week for any 2 consecutive weeks within days 14-45 → pivot ingestion model; OR data-security objection raised by 2 of first 5 prospects without a documented resolution path by day 30 → kill or hard-pivot to on-premise/self-hosted architecture before spending further on sales
Incumbents: Casetext CoCounsel (Thomson Reuters), Harvey AI, Luminance, iManage RAVN, Relativity aiR, Kira Systems (Litera), Ironclad AI, Lexis+ AI (LexisNexis), Westlaw AI (Thomson Reuters), Notion AI / Guru (generic KB players)
Pricing: $50-$200/seat/mo for SaaS incumbents; Harvey AI ~$100-$200/seat/mo enterprise; Casetext ~$100/seat/mo; Kira Systems ~$1,500-$3,000/mo flat; custom/enterprise deals common above 50 seats
Saturation: medium
Wedge: Own-document ingestion with verifiable, source-authority-weighted citations and a senior-annotation feedback loop directly addresses the trust and proprietary-knowledge gaps that public-database-only incumbents structurally cannot close.
User complaints: Hallucinated citations that look plausible but reference non-existent document sections — partners don't trust outputs without manual verification; No ingestion of firm's own proprietary documents; tools are trained on public legal databases only, missing internal precedents and client memos; Black-box answers with no source authority weighting — junior and senior sources treated identically; Steep per-seat SaaS pricing makes firm-wide rollout expensive for mid-size firms (50-500 people); Onboarding requires IT/vendor involvement; no self-serve ingestion pipeline for unstructured legacy PDFs; No annotation/feedback loop — senior partner corrections are discarded, not learned from; Generic RAG pipelines fail on domain-specific document structures (e.g., legal schedules, annexes, exhibit cross-references)
Notes: The large incumbents (Harvey, Casetext, Lexis+ AI) dominate BigLaw and are priced/positioned for it; the mid-market (50-500 person firms) is meaningfully underserved because per-seat SaaS economics don't justify adoption and self-hosted or build-plus-retainer models are rare. The proposed pricing structure (€2,700-15,000 build + €1,000-2,000/month) maps well to mid-market budget cycles and avoids the per-seat trap. The authority-weighting and annotation loop are genuine differentiators absent from all major incumbents reviewed. Key risk: well-funded generic RAG startups (e.g., Glean, Guru) are moving down-market and could commoditize the retrieval layer within 18-24 months, making the annotation/trust layer the only durable moat.
Skeptic + judge rationale
Death modes:
- The single pilot law firm's IT/compliance department blocks document ingestion due to client confidentiality clauses and GDPR/data-residency concerns, delaying go-live by 8-12 weeks; the founder burns runway waiting for security sign-off and never reaches the 5 queries/week adoption threshold within the 90-day window
- The senior-annotation feedback loop requires a named partner to spend 2-3 hours/week labeling corrections, but partners bill at €400-800/hr and treat annotation as unbillable overhead — adoption stalls at 1-2 queries/week from a single curious associate, never reaching firm-wide use, making the 'learning' differentiator a paper feature that cannot be demonstrated to a second prospect
- The build fee (€2,700-15,000) closes with the managing partner verbally, but contract execution requires sign-off from procurement/finance who reclassify it as a software vendor relationship requiring a 60-90 day legal review, InfoSec questionnaire, and DPA negotiation — the founder reaches day 90 with a signed NDA but no purchase order and zero MRR
# Judge rationale (score=65.0)
Strong on ARPU (€12-24k/yr/client), recurring revenue, and a real underserved mid-market wedge with differentiated authority-weighting. Loses heavily on human intervention: build-plus-retainer model means Lisandro is on InfoSec calls, DPA negotiations, custom ingestion tuning, and annotation onboarding for every client — antithetical to zero-human thesis. Sales cycles to law/compliance firms are 60-90 days with procurement gates, not 14, and defensibility is thin once Glean-class players move down-market. Operationally it's software plus active service delivery, not pure self-serve SaaS.