MVP Scope
This document defines what is in scope and what is explicitly out of scope for the Almathal MVP. The MVP is the first shippable version that proves the architecture and supports paying enterprise customers on a limited workload.
MVP Goals
- Prove the retrieval-first thesis with measurable economics: archetype-matched generations should cost meaningfully less than equivalent pure-LLM output.
- Establish trust guarantees that pure-LLM tools structurally cannot offer: per-component provenance, license verification, CVE scanning, audit trails.
- Ship two Archetypes end-to-end through the full pipeline, with deterministic gates and bounded LLM stitching.
- Build the curation infrastructure so the library can grow with the team rather than blocking on it.
What Ships in MVP
Archetypes (2)
Each Archetype takes generation end-to-end: Spec Resolution → Compatibility Validation → Build Approval → Seam Validation → Scaffolding → Stitching → Build Verification → Output Review → Delivery.
Component Library
Approximately 150 Adapters at MVP, across four slices:
- Java slice (~50 adapters): Spring Boot, JPA, Security, Kafka, JHipster integration, Micrometer, etc.
- TypeScript/JS slice (~35 adapters): React, Next.js, Tailwind, shadcn/ui, TanStack Query, Zod, etc.
- Python slice (~30 adapters): FastAPI, pandas, polars, pydantic, SQLAlchemy, etc.
- AI/LLM slice (~25 adapters): Anthropic SDK, OpenAI SDK, pgvector, LangChain, sentence-transformers, etc.
- Cross-cutting infrastructure (~15 containers/services): PostgreSQL, Valkey, Kafka, MinIO, Keycloak, Prometheus, Grafana, Nginx, OpenAPI tooling.
Detailed inventory at Component Library Seed.
Module Library
Approximately 5–8 Modules at MVP, covering only what the two MVP Archetypes need:
module:auth/jwt-with-rbac— authentication for the CRUD admin Archetypemodule:ingestion/standard-corpus— document ingestion for the RAG Archetypemodule:rag/chat-with-citations— core RAG loop with citation renderingmodule:observability/standard-stack— Prometheus + Grafana + structured loggingmodule:audit-log/event-stream— audit trail emission for compliance- Possibly 2–3 more as Archetype design evolves
Composer
Full implementation of all pipeline stages:
- Spec Resolution (Resolver subcomponent) with hybrid form + conversation input
- Compatibility Validator (deterministic matrix walk)
- Briefing Generator (LLM summary for Build Approval)
- Seam Validator (static analysis)
- Scaffolder Runner (JHipster, Spring Initializr, create-t3-app)
- Build Validator (compile, lint, test runners)
- Handoff to Stitcher
Stitcher
Bounded LLM layer with:
- Prompt templates per Seam type
- Multi-model support (configurable per stage)
- Retry logic on Build Verification failure
- AI Reviewer (using a different model from the Stitcher)
- Token accounting and cost reporting per generation
Trust Infrastructure
- Central Manifest Registry with full audit trail
- License verification (SPDX-based)
- CVE scanning integrated into the Curation Pipeline
- SBOM emission with each generated app
- Build records (UUIDs, slug-at-build-time, version locks) attached to every generation
- Customer-accessible audit trail
Curation Pipeline
- Discovery automation for upstream releases and CVEs
- Stage 1 (LLM-drafted, human-directed) Manifest authoring workflow
- Stage 2 (LLM-drafted, human-reviewed) workflow for non-AI slices
- Mechanical evaluation checks
- Weekly patch / monthly minor release cadence
- Decision log
Customer Surface
- Web UI for Archetype selection, form-based Variation Point input, conversation refinement, Build Approval
- Generated app delivery (code, containers, tests, audit trail)
- Saved Spec templates (per ADR-0014)
- Audit trail viewer
- Release notes and changelog
What Is Explicitly Out of Scope for MVP
The following are real future work but not in MVP:
Archetypes deferred
- Microservices Archetypes (deferred to v1 per ADR-0015)
- E-commerce, marketplace, POS, billing Archetypes
- AI agent / multi-tool workflow Archetypes
- Document processing pipeline Archetypes
- Embedded analytics Archetypes
- Compliance reporting tools
Component Library deferred
- Cloud-vendor-specific SDKs (AWS, GCP, Azure) — added in v1.5 “cloud pack”
- Heavyweight enterprise integrations (IBM MQ, Oracle DB, SAP) — added on paying-customer demand
- Mobile frontend (React Native, Flutter) — out of enterprise web focus
- Additional language slices (Go, Rust, C#, .NET) — post-v1 expansion
- Legacy framework support (Spring 5, React 17) — only current stable versions in MVP
Module Library deferred
- Payments Module (Stripe, Adyen wrappers) — v1
- Billing Module (Lago wrapper or platform-composed) — v1
- Email/notification Modules — v1
- File upload and storage Modules — v1
- Search Modules (full-text search beyond what RAG covers) — v1
- Workflow / approval engine Modules — v1+
Pipeline features deferred
- Exploratory archetype path (replaces pure-LLM fallback) — v1
- Cross-build re-generation with customization preservation — v1
- Multi-app orchestration — v1 (alongside microservices Archetypes)
- Live editing of generated apps within the platform — possibly never (out of focus)
Trust infrastructure deferred
- Optional verified artifact mirror (per ADR-0017) — v1 enterprise tier
- On-prem / air-gapped deployment — v1 enterprise tier
- Customer-specific Manifest extensions — v1+
- SOC 2 Type II audit (Type I is acceptable for MVP launch) — v1
- HIPAA, PCI-DSS specific compliance packs — v1+
Customer surface deferred
- Marketplace for community Adapters / Modules — post-v1
- Multi-user team collaboration on a Spec — v1
- CI/CD integration for auto-regeneration — v1
- API access (machine-driven generations) — v1
- IDE plugins — v1+
MVP Success Criteria
The MVP is judged on:
- Both Archetypes produce working generated apps. Compile, deploy, smoke tests pass.
- Token cost ratio. Archetype-matched generations cost at least 2× less than comparable pure-LLM tools producing the same app (target: 5–10×).
- Provenance completeness. Every Adapter and Module in every generated app has full provenance recorded in the build’s audit trail.
- Build reproducibility. Re-running the same Spec produces functionally equivalent output (mechanical verification passes identically).
- Curation cadence. The Curation Pipeline can sustain weekly patch releases without falling behind on upstream CVEs.
- First enterprise pilot. At least one paying enterprise customer running real generated apps in production by MVP+90 days.
Timeline
A rough estimate, contingent on team size and AI-assisted-development efficiency:
| Phase | Duration | Output |
|---|---|---|
| Foundation | 4–6 weeks | Component Library MVP corpus (Stage 1 authoring), Manifest registry running, schema validation in place |
| Composer | 4–6 weeks | Pipeline stages implemented end-to-end for one Archetype (CRUD admin first, as the simpler verification surface) |
| Stitcher | 4–6 weeks | Seam-driven generation, multi-model support, AI Reviewer, verification harness |
| Second Archetype | 3–4 weeks | RAG chatbot Archetype on top of foundations |
| Polish & Trust | 3–4 weeks | Audit trail, license/security infrastructure, customer UI refinement |
| Enterprise readiness | 3–4 weeks | SSO, deployment options, SOC 2 Type I preparation |
Total: roughly 5–7 months from start of focused implementation to MVP-ready, with a small team of 2–3 senior engineers using AI-assisted development aggressively.
Related
- Roadmap → Component Library Seed
- Roadmap → Version Roadmap
- ADR-0003: Target Enterprise for V1
- ADR-0022: RAG Chatbot as MVP Archetype #1
- ADR-0023: CRUD Admin Tool as MVP Archetype #2