Author an agent in plain language, or bring your own code.
The agents deployed on your workspace. Open the code, or run one live on the harness.
Every agent your teams have deployed on the engine, and the two metered counters that price it. Counts and signatures leave the perimeter; data never does.
| Agent | Entity | Status | Turns / 7d | Entities | Eval vs SLA | Actions |
|---|---|---|---|---|---|---|
| Sign in to see the agents on your workspace. | ||||||
Every entity your workspace knows, the typed + sourced facts about it, and its interaction timeline. This is what your agents read before they act.
Bring data into your Memory Fabric. Upload a file to ingest it as facts now, or connect a managed source. Programmatic ingestion uses your API key.
State a goal; the Conductor plans, routes to sub-agents, runs, and verifies, all grounded in the entity's memory. The latency strip tracks read-time p95 against the 150 ms SLA. The outcome routes to one-click approval and is fully replayable in Audit.
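As a rough illustration of what the latency strip compares, the sketch below computes a nearest-rank p95 over a window of read times and checks it against the SLA. Only the 150 ms target comes from the text above; the function names, the nearest-rank method, and the sample values are assumptions.

```python
SLA_MS = 150  # read-time p95 target stated in the panel text


def p95(samples_ms):
    """Nearest-rank 95th percentile of a non-empty list of latencies (ms)."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    # Integer ceiling of 0.95 * n, giving a 1-based rank without float error.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def within_sla(samples_ms, sla_ms=SLA_MS):
    """True when the window's p95 is at or under the SLA."""
    return p95(samples_ms) <= sla_ms


# Hypothetical window: nine fast reads and one slow outlier.
reads = [40, 55, 61, 70, 72, 80, 95, 110, 120, 400]
print(p95(reads), within_sla(reads))
```

With only ten samples, the single slow read is the p95, which is why a small window can breach the SLA even when most reads are fast.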
Open entities that need a human, ranked by a composite severity + SLA-remaining score. A low-severity item near breach still surfaces. Rows carry only triage essentials — full timeline and evidence are one keypress away.
| Sev | Entity | What's happening | Owner | Waiting | Assignee |
|---|---|---|---|---|---|
| No open work items for your workspace. | |||||
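The composite ranking described for this queue can be sketched as a weighted sum of severity and SLA urgency. The 1 to 4 severity scale, the equal weights, the `WorkItem` fields, and the entity names are all hypothetical; the text only states that both signals contribute and that a low-severity item near breach still surfaces.

```python
from dataclasses import dataclass


@dataclass
class WorkItem:
    entity: str
    severity: int              # assumed scale: 1 (low) .. 4 (critical)
    sla_remaining_frac: float  # 1.0 = full window left, 0.0 = at breach


def triage_score(item, w_sev=0.5, w_sla=0.5):
    """Higher score = more urgent. The weights are illustrative only."""
    sev_part = (item.severity - 1) / 3
    remaining = max(0.0, min(1.0, item.sla_remaining_frac))
    urgency = 1.0 - remaining
    return w_sev * sev_part + w_sla * urgency


def rank(items):
    """Most urgent first."""
    return sorted(items, key=triage_score, reverse=True)


queue = rank([
    WorkItem("acme-co", severity=3, sla_remaining_frac=0.9),
    WorkItem("globex", severity=1, sla_remaining_frac=0.05),
    WorkItem("initech", severity=4, sla_remaining_frac=0.5),
])
print([item.entity for item in queue])
```

In this example the severity-1 item with 5% of its SLA window left outranks the severity-3 item that still has 90% of its window, which is the behavior the panel text describes.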
One glance → one decision. Every card carries its evidence and recommendation, so the human is verifying, not researching. Confidence is a calibrated word-label, never a bare percentage. Irreversible actions require visible sign-off.
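One way to keep bare percentages off the card is to map an already-calibrated probability to a fixed vocabulary before rendering. The band thresholds and label wording below are assumptions for illustration, not the product's actual vocabulary.

```python
# Hypothetical bands: (inclusive lower bound, label), highest first.
BANDS = [
    (0.90, "very likely"),
    (0.70, "likely"),
    (0.40, "uncertain"),
    (0.00, "unlikely"),
]


def confidence_label(p):
    """Map a calibrated probability in [0, 1] to a word-label."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("probability must be between 0 and 1")
    for floor, label in BANDS:
        if p >= floor:
            return label
    return BANDS[-1][1]  # unreachable given the 0.00 band; kept as a guard


print(confidence_label(0.95), "|", confidence_label(0.55))
```

The mapping is only as good as the calibration behind it: the sketch assumes `p` has already been calibrated upstream.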
Candidate pairs the engine flagged as possibly one entity — reviewed side-by-side with a per-attribute algorithm table (exact · Jaro-Winkler · phonetic · embedding) and a weighted consensus. Merges are reversible and audited; a wrong link is a correction, never data loss.
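The weighted consensus step can be sketched on its own, taking the per-algorithm similarity scores as given. The weights, the score values, and the function name are hypothetical; computing the Jaro-Winkler, phonetic, and embedding scores themselves is out of scope here.

```python
# Hypothetical weights for one attribute (for example, a name field).
WEIGHTS = {"exact": 0.4, "jaro_winkler": 0.3, "phonetic": 0.1, "embedding": 0.2}


def consensus(scores, weights=WEIGHTS):
    """Weighted mean of similarity scores in [0, 1].

    Algorithms missing from `scores` are skipped and the remaining
    weights are renormalized, so a partial table still yields a value.
    """
    used = {name: w for name, w in weights.items() if name in scores}
    total = sum(used.values())
    if total == 0:
        raise ValueError("no weighted algorithm produced a score")
    return sum(scores[name] * w for name, w in used.items()) / total


# Assumed scores for two spellings of the same name.
name_scores = {"exact": 0.0, "jaro_winkler": 0.94, "phonetic": 1.0, "embedding": 0.88}
print(round(consensus(name_scores), 3))
```

An exact-match miss pulls the consensus down even when the fuzzy algorithms agree, which is one reason to show the per-algorithm table next to the combined number rather than the number alone.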
Every deployed agent grouped by the lifecycle stage its template declared — with live status, last/next run, spend and a word-label standing. Read-only: a thin projection of real rows, not an execution canvas.
A streamed feed of real lifecycle events — runs completed, facts written, approvals raised, ship-gate decisions — straight from the append-only audit log. No canned feed; every entry is a real event for your workspace.
Incomplete intakes the agents flagged for follow-up — each shows the missing fields and the source so a reviewer can request them or mark the record complete. Driven by real request_more_information facts, not fixtures.
| Entity | Missing fields | Summary | Source | Flagged |
|---|---|---|---|---|
| No incomplete intakes — every record is complete. | ||||
What does the system actually know about this thing — resolved identity, the sources it was stitched from with confidence, a bitemporal timeline, and the freshness of every fact. Every fact drills to its source and last-verified time.
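A bitemporal timeline tracks two clocks per fact: when it became true in the world and when the system recorded it. The sketch below is a simplified model under that assumption; the `Fact` fields, the `as_of` helper, and the sample data are hypothetical, and a real store would likely also track validity end times.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Fact:
    attribute: str
    value: str
    valid_from: date   # when the value became true in the world
    recorded_at: date  # when the system learned it
    source: str


def as_of(facts, attribute, valid_on, known_on):
    """Latest fact valid on `valid_on`, using only what was recorded by `known_on`."""
    candidates = [
        f for f in facts
        if f.attribute == attribute
        and f.valid_from <= valid_on
        and f.recorded_at <= known_on
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda f: (f.valid_from, f.recorded_at))


facts = [
    Fact("address", "12 Oak St", date(2023, 1, 1), date(2023, 1, 5), "crm"),
    # The move happened in March but was only recorded in June.
    Fact("address", "9 Elm Rd", date(2024, 3, 1), date(2024, 6, 10), "billing"),
]

before = as_of(facts, "address", date(2024, 4, 1), known_on=date(2024, 5, 1))
after = as_of(facts, "address", date(2024, 4, 1), known_on=date(2024, 7, 1))
print(before.value, "->", after.value)
```

The same question about April gets different answers depending on what the system knew at the time, which is what lets a replay reproduce a past decision faithfully.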
Every agent, its model, what it's doing right now, and live spend against caps. A soft alert (amber bar + bell) is observability; a hard cap (red bar + stop) is enforcement — the two are visually distinct. One bad loop must never become a $14,000 bill.
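The distinction between a soft alert and a hard cap can be sketched as a three-state check applied after every turn. The state names, the limits, and the per-turn cost are assumptions; amounts are integer cents to avoid floating-point drift.

```python
from enum import Enum


class SpendState(Enum):
    OK = "ok"
    SOFT_ALERT = "soft_alert"  # observability: notify, keep running
    HARD_STOP = "hard_stop"    # enforcement: halt the agent


def check_spend(spent_cents, soft_limit_cents, hard_cap_cents):
    if soft_limit_cents > hard_cap_cents:
        raise ValueError("soft limit must not exceed the hard cap")
    if spent_cents >= hard_cap_cents:
        return SpendState.HARD_STOP
    if spent_cents >= soft_limit_cents:
        return SpendState.SOFT_ALERT
    return SpendState.OK


def run_loop(turn_costs_cents, soft_limit_cents, hard_cap_cents):
    """Run turns until the costs run out or the hard cap trips."""
    spent, turns, state = 0, 0, SpendState.OK
    for cost in turn_costs_cents:
        spent += cost
        turns += 1
        state = check_spend(spent, soft_limit_cents, hard_cap_cents)
        if state is SpendState.HARD_STOP:
            break  # enforcement: no further turns are attempted
    return spent, turns, state


# A hypothetical runaway loop: 100 turns at 30 cents each, capped at $2.00.
result = run_loop([30] * 100, soft_limit_cents=100, hard_cap_cents=200)
print(result)
```

This sketch checks after each turn, so spend can overshoot the cap by at most one turn's cost; a stricter variant would estimate the next turn's cost and refuse it up front.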
Static benchmarks lie — frozen test sets get memorized while the real workload doesn't improve. The eval set is the tenant's own data, fresh; cases the agent got wrong are promoted to the top. Pass-rate by intent against SLA tells a supervisor which intents are safe to push toward auto.
Live, anonymized interactions are continuously sampled and replayed as eval cases — the test set is the tenant's own fresh data, not a frozen benchmark.
Each trajectory is graded against the vertical's verifier (resolution, recurrence, reconciliation, replay-completeness) and scrubbed of PII before any signal leaves the perimeter.
Cases the agent got wrong rise to the top; skills below threshold are flagged as new, amplified, or thinning so regressions are caught early.
Graded trajectories feed per-tenant retrieval, routing, and optional fine-tunes. The moat: 10,000 in-tenant graded trajectories a competitor never had access to.
A release that regresses pass-rate on a tenant's own promoted-failure set does not advance (§16.2). An intent moves up the ladder escalate → one-click → auto only when its eval pass-rate clears the SLA.
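The two gates above can be sketched as a pair of small predicates. The function names, the one-step-at-a-time promotion, and the reading of "clears SLA" as "pass-rate at or above the SLA threshold" are assumptions, not the rule defined in §16.2.

```python
TIERS = ["escalate", "one-click", "auto"]  # ladder named in the text


def release_may_advance(baseline_pass_rate, candidate_pass_rate):
    """Ship gate: block any release that regresses on the promoted-failure set."""
    return candidate_pass_rate >= baseline_pass_rate


def next_tier(current, pass_rate, sla_threshold):
    """Promote an intent by at most one tier, and only when eval clears the SLA."""
    index = TIERS.index(current)  # raises ValueError for an unknown tier
    if pass_rate >= sla_threshold and index < len(TIERS) - 1:
        return TIERS[index + 1]
    return current


print(release_may_advance(0.92, 0.89))         # regression on the promoted set
print(next_tier("escalate", 0.97, 0.95))       # clears the SLA, moves up one step
print(next_tier("one-click", 0.90, 0.95))      # below the SLA, stays put
```

Promoting one tier per evaluation keeps an intent from jumping straight from escalate to auto on a single good run; whether the product enforces that is not stated in the text.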
Two jobs: time-travel / replay, and rights fulfillment. Render any past run as a read-only DAG — goal → sub-goals → skill calls → outcomes — each node click-to-evidence. The graph is something you read, never edit. An audit is a query, not a three-week project.
The skills and tools available to agents, with a dependency graph (nodes = skills, edges = dependsOn). You author the vocabulary; the Conductor authors the sentence. There is no place to draw execution order — if a draggable canvas ever appears here, it's a regression (§4 non-goal).
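To illustrate why no drawn execution order is needed, the sketch below derives a valid order from `dependsOn` edges alone, using Python's standard `graphlib`. The skill names are hypothetical, and this is not a description of how the Conductor actually plans; it only shows that declared dependencies already constrain the order.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical registry: each skill maps to the skills it dependsOn.
skills = {
    "lookup_entity": [],
    "fetch_invoices": ["lookup_entity"],
    "reconcile": ["fetch_invoices", "lookup_entity"],
    "draft_reply": ["reconcile"],
}


def derive_order(depends_on):
    """Return one order in which every skill follows its dependencies."""
    return list(TopologicalSorter(depends_on).static_order())


def has_cycle(depends_on):
    """A cyclic registry has no valid order and should be rejected."""
    try:
        derive_order(depends_on)
    except CycleError:
        return True
    return False


order = derive_order(skills)
print(order)
```

A registry check like `has_cycle` is a natural validation step when a new skill is added, since a cycle in `dependsOn` would make every plan that touches it unsatisfiable.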
A source catalog with sync health, schema-drift alerts that propose remappings, and the policy editor. On a breaking change the connector pauses and requires manual review rather than silently propagating downstream.
| Agent | $ cap / turn | Bound entity | HITL tier (click to change · reversible) |
|---|---|---|---|
| No agents deployed yet. | |||
Owner-only · logging · metrics · system health
| Time | Agent | Model | Status | In | Out | Cost | Latency | Trace |
|---|---|---|---|---|---|---|---|---|
| Loading… | ||||||||
| Time | Tenant | Actor | Event | Entity | Trace |
|---|---|---|---|---|---|
| Loading… | |||||
| Time | Subject | Method | Outcome | Route | IP |
|---|---|---|---|---|---|
| Loading… | |||||
Your workspace · recent activity, spend, and errors — your tenant only
| Time | Agent | Model | Status | Tokens | Cost | Trace |
|---|---|---|---|---|---|---|
| Sign in to view. | ||||||
| Run | Status | Prompt | Cost | When |
|---|---|---|---|---|
| — | ||||
| Where | Detail |
|---|---|
| No errors. | |