Hermes Agent Cognitive Architecture Blueprint
Confidential · Hermes Agent Internal
System State
| Component | Value | Notes |
|---|---|---|
| Runtime | Hermes v0.16.0 | 2026.6.5 · upstream c0689809 |
| Python | 3.11.2 | PEP 668 managed |
| Node.js | v22.23.0 | |
| OpenAI SDK | 2.24.0 | |
| Primary Model | mimo-v2.5 | Provider: opencode-go |
| Base URL | https://opencode.ai/zen/go/v1 | |
| Disk Usage | /dev/vda1 | 33G / 40G (84%), 6.6G available |
Configuration: Post-Optimization
| Parameter | Previous | Current | Change |
|---|---|---|---|
| memory.memory_char_limit | 2,200 | 4,000 | +82% |
| memory.user_char_limit | 1,375 | 2,000 | +45% |
| compression.threshold | 0.91 | 0.70 | -0.21 |
| compression.target_ratio | 0.20 | 0.30 | +0.10 |
| tool_output.max_bytes | 50,000 | 100,000 | +100% |
| tool_output.max_lines | 2,000 | 4,000 | +100% |
| context_file_max_chars | null | 50,000 | new |
| delegation.model | inherit | kimi-k2.6 | pinned |
| smart_model_routing | - | enabled | new |
| MCP tools (total) | 48 | 33 | -31% |
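The Change column can be reproduced from the Previous and Current values. The sketch below assumes the percentages are relative to the previous value and rounded to a whole percent; the helper name `percent_change` is illustrative and not part of Hermes. The two compression rows are reported as absolute differences rather than percentages, so they are left out.

```python
def percent_change(previous: float, current: float) -> int:
    """Relative change from previous to current, rounded to a whole percent."""
    return round((current - previous) / previous * 100)


# Previous and current values copied from the configuration table.
changes = {
    "memory.memory_char_limit": percent_change(2_200, 4_000),
    "memory.user_char_limit": percent_change(1_375, 2_000),
    "tool_output.max_bytes": percent_change(50_000, 100_000),
    "tool_output.max_lines": percent_change(2_000, 4_000),
    "MCP tools (total)": percent_change(48, 33),
}

for name, pct in changes.items():
    print(f"{name}: {pct:+d}%")
```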
Profiles
| Profile | Model | Provider | Status |
|---|---|---|---|
| default | mimo-v2.5 | opencode-go | RUNNING |
| falcon | kimi-k2.5 | opencode-go | STOPPED |
| nova | deepseek-v4-pro | opencode-go | STOPPED |
MCP Servers
| Server | Transport | Tools | Status |
|---|---|---|---|
| deep-thinker | stdio | 8 / 17 filtered | ACTIVE |
| siyuan_bundinha | stdio | 17 / 17 all | ACTIVE |
| stitch | HTTP | 8 / 14 filtered | ACTIVE |
| elenhub | - | - | REMOVED |
deep-thinker: Tool Registry
think
evaluate
metacog
optimize_prompt
simulate_devils_advocate
conclude
reset
help
siyuan_bundinha: Tool Registry
unified_search
get_document_content
create_document
append_to_document
update_document
append_to_daily_note
move_documents
get_document_tree
list_notebooks
get_recently_updated_documents
create_snapshot
list_snapshots
rollback_to_snapshot
list_all_tags
batch_replace_tag
list_prompts
get_prompt
stitch: Tool Registry
create_project
get_project
list_screens
generate_screen_from_text
edit_screens
generate_variants
create_design_system
apply_design_system
elenhub: Removed
DATABASE CORRUPTION
elenhub was removed from the MCP registry after repeated connection failures. The underlying database file was missing or corrupted.
Credential Pool: opencode-go
| Entry | Key Identifier | Status | Role |
|---|---|---|---|
| 1 | OPENCODE_GO_API_KEY | ACTIVE | Primary |
| 2 | api-key-3 | ACTIVE | Failover |
| 3 | api-key-4 | ACTIVE | Failover |
| - | api-key-2 | REMOVED | Stale (401 every session) |
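The failover behaviour implied by this table can be sketched as follows. This is a minimal illustration, not the Hermes implementation: the `CredentialPool` class and its method names are hypothetical, and the policy of falling back to the first non-exhausted key in priority order is an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class CredentialPool:
    """Hypothetical pool: keys are held in priority order, primary first."""
    keys: list[str]
    current: str
    exhausted: set[str] = field(default_factory=set)

    def report_status(self, status: int) -> str:
        """On HTTP 401, mark the current key exhausted and rotate."""
        if status == 401:
            self.exhausted.add(self.current)
            usable = [k for k in self.keys if k not in self.exhausted]
            if not usable:
                raise RuntimeError("all credentials exhausted")
            # Assumed policy: fall back to the highest-priority usable key.
            self.current = usable[0]
        return self.current


# Pool as it looked before api-key-2 was removed, with api-key-2 in use.
pool = CredentialPool(
    keys=["OPENCODE_GO_API_KEY", "api-key-2", "api-key-3", "api-key-4"],
    current="api-key-2",
)
active = pool.report_status(401)
print(active)
```

Under these assumptions a 401 on api-key-2 moves the pool to OPENCODE_GO_API_KEY, and a key that fails on every session keeps triggering the same rotation until it is removed from the pool.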
Available Models (20)
minimax-m3
minimax-m2.7
minimax-m2.5
kimi-k2.7-code
kimi-k2.6
kimi-k2.5
glm-5.2
glm-5.1
glm-5
deepseek-v4-pro
deepseek-v4-flash
qwen3.7-max
qwen3.7-plus
qwen3.6-plus
qwen3.5-plus
mimo-v2-pro
mimo-v2-omni
mimo-v2.5-pro
mimo-v2.5
hy3-preview
Auxiliary Models
| Role | Model | Provider | Usage |
|---|---|---|---|
| Approval | deepseek-v4-flash | opencode-go | User action approval gates |
| Compression | deepseek-v4-flash | opencode-go | Session history compression |
| Curator | kimi-k2.6 | opencode-go | Content curation & ranking |
| Kanban Decomposer | kimi-k2.6 | opencode-go | Task decomposition |
| MCP | minimax-m3 | opencode-go | MCP tool orchestration |
| Profile Describer | deepseek-v4-flash | opencode-go | Profile metadata generation |
| Skills Hub | kimi-k2.6 | opencode-go | Skill registry operations |
Honcho Status
ENABLED
Workspace: hermes
| Parameter | Value |
|---|---|
| API Key | ***REDACTED*** |
| Peer Card | 34 facts |
| Recall Mode | hybrid |
| Dialectic Cadence | every 1 turn |
| Reasoning | base=low, cap=high |
| Observation | user+ai, directional |
| Write Freq | async |
| Context Budget | uncapped |
| Hosts | hermes, hermes.falcon, hermes_nova (reduced from 5 after cleanup) |
Architecture Diagrams
1. Memory Hierarchy: 5-Layer Retrieval Stack
L0 → L4 priority order, highest context density at top
MEMORY HIERARCHY

L0: SOUL.md (Minimal Kernel)
    ~2000 chars, byte-stable, identity & core rules
    Always injected, never compressed, never truncated
      │
      ▼
L1: USER.md (User Preferences & Profile)
    memory.user_char_limit: 2000 chars
    Injected per-session, updated via observation
      │
      ▼
L2: Honcho (Persistent Peer Memory)
    Peer card: 34 facts, hybrid recall
    Auto-injected, dialectic every 1 turn, async writes
    Hosts: hermes, hermes.falcon, hermes_nova
      │
      ▼
L3: Session Search (Runtime Memory)
    memory.memory_char_limit: 4000 chars
    Compression threshold: 0.70, target ratio: 0.30
    Session-local, compressed on threshold
      │
      ▼
L4: SiYuan (Persistent Knowledge Base)
    17 tools, unified_search, snapshots, tags
    Long-term document storage, notebook organization

Flow: User message → L0+L1 injected → Honcho recall → L3 context → L4 on-demand search (unified_search)
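One way to read the two compression settings in the L3 layer is sketched below. The semantics are an assumption, since this document does not define them: `threshold` is taken as the fraction of the context window in use at which compression fires, and `target_ratio` as the fraction of the current history that should remain afterwards. The function name `compression_plan` is hypothetical.

```python
from typing import Optional


def compression_plan(
    used_tokens: int,
    context_window: int,
    threshold: float = 0.70,
    target_ratio: float = 0.30,
) -> Optional[int]:
    """Return the target history size in tokens, or None if no compression is due."""
    usage = used_tokens / context_window
    if usage < threshold:
        return None  # still below the trigger point
    # Assumed meaning of target_ratio: share of the history to keep.
    return round(used_tokens * target_ratio)


# The same 80% usage under the previous and the current settings.
before = compression_plan(80_000, 100_000, threshold=0.91, target_ratio=0.20)
after = compression_plan(80_000, 100_000)
print(before, after)
```

Read this way, lowering the threshold from 0.91 to 0.70 makes compression start earlier, while raising the target ratio from 0.20 to 0.30 keeps more of the history each time it runs.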
2. MCP Tool Flow
Request routing from user intent to MCP tool execution
MCP TOOL FLOW

USER INTENT ──▶ ROUTER (smart_model_routing) ──▶ MCP REGISTRY (33 tools)
                  │
                  ├──▶ deep-thinker (8 tools): think, evaluate, metacog, ...
                  ├──▶ siyuan_bundinha (17 tools): search, docs, snapshot, tags
                  └──▶ stitch (8 tools): create project, screens, design

Tool Filtering:
  deep-thinker: 17 → 8 (53% filtered)
  stitch: 14 → 8 (43% filtered)
  siyuan: 17 → 17 (0% filtered)
  Total: 48 → 33 (31% reduction)
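The per-server filtering summarised above can be sketched as an include/exclude pass over the tools a server advertises. This is an illustration only: `filter_tools` is a hypothetical helper, and because the names of the filtered-out deep-thinker tools do not appear in this document, placeholder names stand in for them.

```python
from typing import Iterable, Optional


def filter_tools(
    advertised: Iterable[str],
    include: Optional[Iterable[str]] = None,
    exclude: Optional[Iterable[str]] = None,
) -> list[str]:
    """Keep tools named in `include` (all if None), then drop any in `exclude`."""
    include_set = set(include) if include is not None else None
    exclude_set = set(exclude or ())
    return [
        name
        for name in advertised
        if (include_set is None or name in include_set) and name not in exclude_set
    ]


# The 8 deep-thinker tools that remain registered, as listed in the tool registry.
DEEP_THINKER_KEPT = [
    "think", "evaluate", "metacog", "optimize_prompt",
    "simulate_devils_advocate", "conclude", "reset", "help",
]
# Placeholder names for the 9 tools that were filtered out (real names unknown).
advertised = DEEP_THINKER_KEPT + [f"unlisted_tool_{i}" for i in range(1, 10)]

registered = filter_tools(advertised, include=DEEP_THINKER_KEPT)
filtered_pct = round((len(advertised) - len(registered)) / len(advertised) * 100)
print(len(advertised), len(registered), filtered_pct)
```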
3. Model Selection Decision Tree
Smart routing: cheap model for trivial, default for complex
MODEL SELECTION DECISION TREE

User Request
    │
    ▼
Classify Complexity
    │
    ├──▶ TRIVIAL (approval, compression, routing)
    │        ▼
    │    CHEAP MODEL: deepseek-v4-flash @opencode-go
    │
    └──▶ COMPLEX (code gen, analysis, reasoning)
             ▼
         DEFAULT MODEL: mimo-v2.5 @opencode-go

Auxiliary Model Assignments:
  approval          ──▶ deepseek-v4-flash
  compression       ──▶ deepseek-v4-flash
  curator           ──▶ kimi-k2.6
  kanban decomposer ──▶ kimi-k2.6
  mcp               ──▶ minimax-m3
  profile describer ──▶ deepseek-v4-flash
  skills_hub        ──▶ kimi-k2.6

Delegation: kimi-k2.6 (pinned, was inherit)
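The decision tree above reduces to a two-way lookup plus a table of auxiliary roles. The sketch below assumes the complexity classification has already produced a task kind; the names `select_model`, `auxiliary_model`, and the key spellings are illustrative, not Hermes configuration keys.

```python
CHEAP_MODEL = "deepseek-v4-flash"
DEFAULT_MODEL = "mimo-v2.5"

# Task kinds the diagram lists under TRIVIAL.
TRIVIAL_TASKS = {"approval", "compression", "routing"}

# Auxiliary role assignments copied from the diagram.
AUXILIARY_MODELS = {
    "approval": "deepseek-v4-flash",
    "compression": "deepseek-v4-flash",
    "curator": "kimi-k2.6",
    "kanban_decomposer": "kimi-k2.6",
    "mcp": "minimax-m3",
    "profile_describer": "deepseek-v4-flash",
    "skills_hub": "kimi-k2.6",
}


def select_model(task_kind: str) -> str:
    """Route trivial task kinds to the cheap model, everything else to the default."""
    return CHEAP_MODEL if task_kind in TRIVIAL_TASKS else DEFAULT_MODEL


def auxiliary_model(role: str) -> str:
    """Look up a pinned auxiliary model, falling back to the default (assumed)."""
    return AUXILIARY_MODELS.get(role, DEFAULT_MODEL)


print(select_model("approval"), select_model("code gen"), auxiliary_model("mcp"))
```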
4. Skills Lifecycle
Full pipeline from observation to promoted skill
SKILLS LIFECYCLE

OBSERVE ──▶ PATTERN ──▶ DRAFT ──▶ REPLAY ──▶ VERIFY ──▶ GATE ──▶ PROMOTE

  OBSERVE  Watch user behavior
  PATTERN  Detect repeatable actions
  DRAFT    Write skill skeleton
  REPLAY   Re-run on new input to validate
  VERIFY   Assert output matches expected
  GATE     LLM review: safety, quality, utility
  PROMOTE  Write to skill_manage CRUD registry

Tools: skill_manage (CRUD) · class-level umbrella · skill hub
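The lifecycle above is a linear pipeline, which can be modelled as an ordered enumeration with a single transition function. This is a sketch under one assumption the diagram does not state: a candidate skill that fails a stage's check stays at that stage rather than being discarded. `Stage` and `advance` are hypothetical names.

```python
from enum import Enum


class Stage(Enum):
    OBSERVE = "observe"
    PATTERN = "pattern"
    DRAFT = "draft"
    REPLAY = "replay"
    VERIFY = "verify"
    GATE = "gate"
    PROMOTE = "promote"


STAGES = list(Stage)  # Enum iteration preserves definition order


def advance(stage: Stage, passed: bool) -> Stage:
    """Move to the next stage when the current check passes."""
    if not passed or stage is Stage.PROMOTE:
        return stage  # assumed: hold on failure; PROMOTE is terminal
    return STAGES[STAGES.index(stage) + 1]


# Walk a candidate skill through every stage with all checks passing.
stage = Stage.OBSERVE
path = [stage.name]
while stage is not Stage.PROMOTE:
    stage = advance(stage, passed=True)
    path.append(stage.name)
print(" -> ".join(path))
```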
Diagnosis Logs
Credential Rotation
2026-06-20 00:21:07,076 credential pool: marking api-key-2 exhausted (status=401), rotating
2026-06-20 00:21:07,076 credential pool: rotated to OPENCODE_GO_API_KEY
MCP Server Registration
2026-06-20 02:11:08,433 MCP server 'deep-thinker' (stdio): registered 8 tool(s)
2026-06-20 02:11:09,484 MCP server 'stitch' (HTTP): registered 8 tool(s)
2026-06-20 02:11:10,776 MCP server 'siyuan_bundinha' (stdio): registered 17 tool(s)
2026-06-20 02:11:10,777 MCP: registered 33 tool(s) from 3 server(s)
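The registration entries can be cross-checked against the summary entry with a short script. This is a minimal sketch: the regular expressions are written against the log text quoted in this section and may not match other Hermes log formats.

```python
import re

# The four registration entries from this section, one per line.
LOG = """\
2026-06-20 02:11:08,433 MCP server 'deep-thinker' (stdio): registered 8 tool(s)
2026-06-20 02:11:09,484 MCP server 'stitch' (HTTP): registered 8 tool(s)
2026-06-20 02:11:10,776 MCP server 'siyuan_bundinha' (stdio): registered 17 tool(s)
2026-06-20 02:11:10,777 MCP: registered 33 tool(s) from 3 server(s)
"""

SERVER_RE = re.compile(r"MCP server '([^']+)' \((\w+)\): registered (\d+) tool\(s\)")
SUMMARY_RE = re.compile(r"MCP: registered (\d+) tool\(s\) from (\d+) server\(s\)")

# Per-server tool counts, keyed by server name.
per_server = {name: int(count) for name, _transport, count in SERVER_RE.findall(LOG)}

summary = SUMMARY_RE.search(LOG)
total_tools, total_servers = int(summary.group(1)), int(summary.group(2))

print(per_server, total_tools, total_servers)
```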
elenhub Database Failure (pre-removal)
Error: error returned from database: (code: 14) unable to open database file
Memory Overflow (pre-fix)
Tool memory returned error: Memory at 7,764/2,200 chars. Adding this entry would exceed the limit.
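The overflow message is consistent with a simple size check before each write. The sketch below reproduces that check under the assumption that the limit applies to the total stored characters; `check_memory_write` is a hypothetical helper, not the Hermes memory tool.

```python
from typing import Optional


def check_memory_write(current_chars: int, entry_chars: int, limit: int) -> Optional[str]:
    """Return an error message if the write would exceed the limit, else None."""
    if current_chars + entry_chars > limit:
        return (
            f"Memory at {current_chars:,}/{limit:,} chars. "
            "Adding this entry would exceed the limit."
        )
    return None


# The pre-fix situation from the log: 7,764 chars stored against a 2,200 limit.
# The 100-char entry size is an arbitrary example value.
error = check_memory_write(current_chars=7_764, entry_chars=100, limit=2_200)
print(error)
```

With `limit=4_000` the same call still returns an error, because 7,764 stored characters already exceed the raised limit.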
Gap Analysis
| Feature | Status | Notes |
|---|---|---|
| SOUL.md minimal kernel | Implemented | L0-L2, ~2000 chars, byte-stable |
| Skills system | Implemented | skill_manage CRUD, class-level umbrella |
| MCP tool filtering | Applied | tools.include/exclude, 48 → 33 tools |
| Memory hierarchy | Implemented | 5 layers, Honcho auto-injection |
| Credential pools | Working | 3 entries, auto-rotation |
| Smart model routing | Enabled | deepseek-v4-flash for trivial tasks |
| Compression tuning | Applied | threshold 0.70, target 0.30 |
| soul.d/ loader | Not recommended | Would break prompt caching |
| Intent → plugin routing | Requires upstream | Needs NLP classification |
| Skills CI/CD gates | Requires upstream | Needs test harness |
| Complexity-aware routing | Partial | Beyond cheap/default split |
Deep Thinker State
| Field | Value | Notes |
|---|---|---|
| Binary | /usr/local/bin/deep-thinker | |
| Profile | hermes-bundinha | |
| Saved Sessions | 3 | All with 0 nodes (were reset) |
| Autosave | 14 KB | AIGuaratuba civic dashboard audit |
| Current Session Usage | 9 nodes, 0.87 confidence | Used for gap analysis: 9 reasoning nodes generated at 0.87 confidence |
Available Models: Full Registry
| # | Model | Provider | Use |
|---|---|---|---|
| 1 | minimax-m3 | opencode-go | MCP orchestration |
| 2 | minimax-m2.7 | opencode-go | General |
| 3 | minimax-m2.5 | opencode-go | General |
| 4 | kimi-k2.7-code | opencode-go | Code tasks |
| 5 | kimi-k2.6 | opencode-go | Delegation, curator, skills |
| 6 | kimi-k2.5 | opencode-go | falcon profile |
| 7 | glm-5.2 | opencode-go | General |
| 8 | glm-5.1 | opencode-go | General |
| 9 | glm-5 | opencode-go | General |
| 10 | deepseek-v4-pro | opencode-go | nova profile |
| 11 | deepseek-v4-flash | opencode-go | Cheap model (routing) |
| 12 | qwen3.7-max | opencode-go | General |
| 13 | qwen3.7-plus | opencode-go | General |
| 14 | qwen3.6-plus | opencode-go | General |
| 15 | qwen3.5-plus | opencode-go | General |
| 16 | mimo-v2-pro | opencode-go | General |
| 17 | mimo-v2-omni | opencode-go | General |
| 18 | mimo-v2.5-pro | opencode-go | General |
| 19 | mimo-v2.5 | opencode-go | DEFAULT |
| 20 | hy3-preview | opencode-go | General |