Skip to content

Roadmap — Research & Analysis

Archived from roadmap.md. External research findings and Nooplex paper alignment tracking.


External Research Survey — March 2026

19 projects evaluated (11 deep research + 8 Scout-identified). 49 patterns absorbed. 0 new dependencies. ProbOS core architecture validated.

Project Stars License Layer Relationship Key Patterns Absorbed
LangGraph 27K MIT Workflow orchestration Study/Absorb Typed channels, checkpoint interrupt/resume, version-based triggers, durability modes, Send fan-out, blob dedup
LangChain 131K MIT Agent framework Study Tool calling patterns, chain orchestration (patterns largely overlap with LangGraph)
Mem0 51K Apache-2.0 Memory layer Absorb Contradiction resolution, dual-track memory, search reranking, memory audit trail, procedural memory summarization
PydanticAI 16K MIT Type-safe agents Absorb Structured output validation + auto-retry, validate-then-execute, dynamic tool visibility, deferred tool execution, RunContext DI
Google ADK 19K Apache-2.0 Agent dev toolkit Absorb Behavioral eval framework, LLM-as-judge multi-sampling, user simulator, agent optimizer Pareto front, hallucination detection
IBM ContextForge 3.5K Apache-2.0 MCP/A2A gateway Study/Absorb Per-tool execution metrics, typed hook middleware, gateway federation
Letta 22K Apache-2.0 Stateful agents Absorb Self-editing memory, three-tier architecture, memory-as-tool, sleeptime agents
MS Agent Framework 8K MIT Enterprise orchestration Study Task ledger fact classification, declarative YAML agents
AgentScope 18K Apache-2.0 Multi-modal agents Absorb MsgHub broadcast groups, agent-initiated memory with reasoning
AG2 4K Apache-2.0 Agent-to-agent Study Tree-of-thought MCTS reasoning
OASIS 4K Apache-2.0 Agent simulation Study Social network graph, simulation time dilation

Scout-Identified Candidates (2026-03-22):

Project Stars License Layer Relationship Key Patterns Absorbed
Serena 1.8K Apache-2.0 MCP code intelligence Visiting Officer LSP-backed symbol retrieval (definitions, references, callers across 30+ languages)
Composio 25K ELv2 Auth/tool platform Visiting Officer Managed auth delegation (OAuth for 1000+ services), sandboxed tool execution
Firecrawl 45K AGPL-3.0 Web scraping Visiting Officer (API only) Pre-extraction actions, change tracking. AGPL: API consumption only
Browser Use 90K MIT Browser automation Partial (primitives only) DOM accessibility tree, vision+DOM dual-mode. Use primitives, not Agent class
Stripe AI Proprietary Payment tools Visiting Officer Permission-scoped tools, token metering. Commercial phase (Nooplex)
Gemini CLI 50K+ Apache-2.0 Agentic CLI Competing Captain Study only: free-tier Gemini access (60 req/min), Google Search grounding
Chroma 18K Apache-2.0 Vector DB Already integrated ProbOS already uses ChromaDB for episodic memory and vector storage
Ruflo 22.5K MIT Agent orchestration Competing Captain Study only: WASM deterministic transforms (subsumed by Procedural Learning), three-tier cost routing (validates Cognitive Division of Labor). 504MB repo, single author, claims exceed verifiable depth

Cross-cutting findings: - Structured output validation is industry consensus — ProbOS is behind here (Top 5 #1) - Everyone has checkpointing — ProbOS explicitly deferred to Phase 25, which is validated (Top 5 #3) - Agent behavioral testing is a gap — only Google ADK has a formal framework (Top 5 #2) - Memory needs active curation — stores degrade without contradiction resolution and fact distillation (Top 5 #4) - ProbOS's core architecture is validated — no project has trust-driven routing, probabilistic governance, biological memory, self-modification, and federation all together. The absorbed patterns are incremental enhancements, not architectural corrections - Visiting Officer Subordination Principle validated again: Gemini CLI is a competing captain (own orchestration loop), Browser Use's Agent class is too, but its browser primitives pass the litmus test


Nooplex Paper Alignment — Principle Gaps

"The Nooplex: A Planetary Cognitive Ecosystem for Emergent General Intelligence" (Galliher, Feb 2026). ProbOS should only improve on the paper, never regress. Full tracker in commercial repo

Last checked: 2026-03-22 (AD-396). 25 principles extracted, 12 well-covered, 13 gaps identified.

Principles already well-covered by ProbOS: Cooperative Emergence (trust/consensus/Hebbian), Decentralization (federation), Long-horizon Cognition (episodic/dreaming), "Brains are brains" (Ward Room unified bus), Shared Memory (Shared Cognitive Fabric AD-393), Self-Assessment (SystemSelfModel AD-318, EmergentDetector), Meta-Cognitive (Model-of-Models roadmap, dream cycles), Trust & Federation (TrustNetwork, Bayesian trust), Minimal Authority (Earned Agency roadmap), Unit Cell Completeness (federation is additive), Independent Brains/Shared Memory (federation design), Zero Corporate Dependencies (no cloud required)

Critical finding (AD-434 research, 2026-03-25): The "Shared Memory" principle was listed as well-covered, but deeper analysis reveals the shared knowledge fabric is not implemented. KnowledgeStore was intended to be the "shared library" but evolved into operational state persistence (trust snapshots, routing weights, agent source code). No agent writes knowledge to it. Dream consolidation modifies in-memory weights but does not promote distilled insights. AD-434 (Ship's Records) is designed to be the actual implementation of the Nooplex shared knowledge fabric — Git-backed institutional memory where agents document research, log duty output, and publish findings. Updates Gaps #1, #4, and #10 below.

Gaps — principles not yet in ProbOS architecture:

# Principle Paper Section Gap ProbOS Target Priority
1 Provenance Tagging §3.3 (Transparency) Every knowledge entry must carry source, confidence, timestamp, derivation chain. KnowledgeStore has some but no systematic provenance on all operations. AD-434 update: Ship's Records YAML frontmatter provides systematic provenance (author, classification, status, topic, tags, timestamps) on every document. Partially addresses this gap for authored knowledge. KnowledgeStore + EpisodicMemory + CognitiveJournal + AD-434 (Ship's Records) High
2 Safety Budget §4.3.4 (Governance) Every action carries implicit risk cost. Low-risk proceeds; higher-risk requires proportionally stronger consensus; destructive actions always require collective agreement. Currently implicit in trust tiers but not formalized as per-action risk accounting Earned Agency (Phase 33), SIF High
3 Reversibility Preference §4.3.4 (Governance) When multiple strategies achieve a goal, prefer the most reversible. Read before write. Backup before delete. Partially in Standing Orders but not a systematic architectural constraint Standing Orders, SIF invariant High
4 Precedent Store §6.4 (Self-Stabilization) Resolved conflicts recorded as "case law" for future consistency. No equivalent exists. Pairs with Mem0 contradiction resolution pattern. AD-434 update: Ship's Records reports/ directory provides a natural home for precedent documents — formal published findings that serve as case law. Git history provides immutability. AD-434 (Ship's Records reports/) + dream cycle Medium
5 Four-Stage Conflict Reconciliation §6.4 (Self-Stabilization) Formal pipeline: confidence comparison → independent verification → structured argumentation → human escalation. Memory contradiction resolution (Mem0 pattern) is on roadmap but not this formal Consensus + KnowledgeStore Medium
6 Semantic Coherence §3.3 (Core Design) Shared ontologies, schema registries, aligned embedding spaces across meshes. IntentDescriptors exist but no formal schema registry with versioning AD-429 (Vessel Ontology), Schema Registry Medium
7 Anti-fragility §3.3 (Core Design) System grows stronger through stress. Implicit in trust/Hebbian (failures lower trust, successes raise it) but not explicitly architected as a property Trust + Hebbian + dream cycle (already emergent, needs formalization) Low
8 Five-Capability AGI Criteria §1.3 (Definition) Cross-domain transfer, long-horizon planning, self-correction, cumulative learning, novel problem solving. Not used as evaluation criteria for ProbOS Agent behavioral eval framework (Google ADK pattern) Medium
9 Four Emergence Criteria §3.4 (Emergence) Cross-domain synthesis, TC_N > 0, novel coordination patterns, cumulative capability growth. EmergentDetector has TC_N proxy but not the other three EmergentDetector enhancement Medium
10 Human-Agent Knowledge Feedback Loop §6.2 (Feedback) Four-phase loop: human contributes → agents amplify → human refines → substrate evolves. Not explicitly modeled as a cycle. AD-434 update: Ship's Records completes this loop: Captain's Log (human contributes) → agent notebooks (agents amplify & document) → Ward Room discussion (human refines) → dream consolidation → notebook entries (substrate evolves). The four phases map to the SECI knowledge cycle. AD-434 (Ship's Records) + Ward Room + dream consolidation Low → Medium
11 Honesty About Limitations Vision Doc Lead with what ProbOS cannot do. Messaging/docs principle, not technical probos.dev docs, README Low
12 Falsifiability Commitment §3.4, §10 (Methodology) Testable predictions for emergence. If the four emergence criteria aren't met, the hypothesis is disconfirmed. Not operationalized into automated measurement EmergentDetector + reporting Low
13 Moral Status Assessment Protocol §9.4 (Ethics) Pre-deployment criteria for ethics review if system exhibits self-referential states or goal-directed resistance to shutdown Long Horizon — relevant when emergence is measurable Low