History

Jiayuan 14bcebee2a feat(agent): add real-time streaming updates to exec tool Utilize the onUpdate callback from pi-agent-core's AgentTool interface to stream output updates in real-time while commands are executing. - Accept onUpdate as 4th parameter in execute function - Emit tailBuffer updates on each stdout/stderr data event - Stop emitting updates once command is backgrounded (yielded) This enables UI to show live command output progress via the tool_execution_update agent event. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>		2026-01-31 18:31:50 +08:00
..
memory	fix(memory): handle exactOptionalPropertyTypes for optional fields	2026-01-31 02:25:42 +08:00
web	fix(tools): resolve exactOptionalPropertyTypes errors	2026-01-31 02:25:42 +08:00
exec.ts	feat(agent): add real-time streaming updates to exec tool	2026-01-31 18:31:50 +08:00
glob.test.ts	fix(tools): resolve exactOptionalPropertyTypes errors	2026-01-31 02:25:42 +08:00
glob.ts	feat(agent): add glob tool for file pattern matching (#18 )	2026-01-30 04:33:01 +08:00
groups.ts	feat(tools): register memory tools and add group:memory	2026-01-31 02:25:42 +08:00
index.ts	feat(tools): add tool policy system with 4-layer filtering	2026-01-31 02:25:22 +08:00
policy.test.ts	test(tools): add policy system unit tests	2026-01-31 02:25:22 +08:00
policy.ts	feat(tools): integrate Profile tools config with runner	2026-01-31 02:25:42 +08:00
process-registry.ts	fix(agent): return collected output when exec auto-backgrounds	2026-01-31 18:31:50 +08:00
process.ts	fix(agent): return collected output when exec auto-backgrounds	2026-01-31 18:31:50 +08:00
README.md	docs(tools): simplify roadmap and add Chinese translation	2026-01-31 02:25:42 +08:00
README.zh-CN.md	docs(tools): simplify roadmap and add Chinese translation	2026-01-31 02:25:42 +08:00

README.md

Tools System

中文文档

The tools system provides LLM agents with capabilities to interact with the external world. Tools are the "hands and feet" of an agent - without tools, an LLM can only generate text responses.

Architecture Overview

┌─────────────────────────────────────────────────────────────────┐
│                        Tool Definition                          │
│  (AgentTool from @mariozechner/pi-agent-core)                  │
│                                                                 │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐             │
│  │    name     │  │ description │  │ parameters  │             │
│  │   label     │  │   execute   │  │  (TypeBox)  │             │
│  └─────────────┘  └─────────────┘  └─────────────┘             │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│                    4-Layer Policy Filter                        │
│                                                                 │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │ Layer 1: Profile                                         │  │
│  │ Base tool set: minimal | coding | web | full             │  │
│  └──────────────────────────────────────────────────────────┘  │
│                              │                                  │
│                              ▼                                  │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │ Layer 2: Global Allow/Deny                               │  │
│  │ User customization via CLI or config                     │  │
│  └──────────────────────────────────────────────────────────┘  │
│                              │                                  │
│                              ▼                                  │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │ Layer 3: Provider-Specific                               │  │
│  │ Different rules for different LLM providers              │  │
│  └──────────────────────────────────────────────────────────┘  │
│                              │                                  │
│                              ▼                                  │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │ Layer 4: Subagent Restrictions                           │  │
│  │ Limited tools for spawned child agents                   │  │
│  └──────────────────────────────────────────────────────────┘  │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│                      Filtered Tools                             │
│              (passed to pi-agent-core)                          │
└─────────────────────────────────────────────────────────────────┘

Available Tools

Tool	Name	Description
Read	`read`	Read file contents
Write	`write`	Write content to files
Edit	`edit`	Edit existing files
Glob	`glob`	Find files by pattern
Exec	`exec`	Execute shell commands
Process	`process`	Manage long-running processes
Web Fetch	`web_fetch`	Fetch and extract content from URLs
Web Search	`web_search`	Search the web (requires API key)
Memory Get	`memory_get`	Retrieve a value from persistent memory
Memory Set	`memory_set`	Store a value in persistent memory
Memory Delete	`memory_delete`	Delete a value from persistent memory
Memory List	`memory_list`	List all keys in persistent memory

Note

: Memory tools require a profileId to be specified. They store data in the profile's memory directory.

Tool Groups

Groups provide shortcuts for allowing/denying multiple tools at once:

Group	Tools
`group:fs`	read, write, edit, glob
`group:runtime`	exec, process
`group:web`	web_search, web_fetch
`group:memory`	memory_get, memory_set, memory_delete, memory_list
`group:core`	All of the above (excluding memory)

Tool Profiles

Profiles are predefined tool sets for common use cases:

Profile	Description	Tools
`minimal`	No tools (chat-only)	None
`coding`	File system + execution	group:fs, group:runtime
`web`	Coding + web access	group:fs, group:runtime, group:web
`full`	No restrictions	All tools

Usage

CLI Usage

# Use a specific profile
pnpm agent:cli --tools-profile coding "list files"

# Minimal profile with specific tools allowed
pnpm agent:cli --tools-profile minimal --tools-allow exec "run ls"

# Deny specific tools
pnpm agent:cli --tools-deny exec,process "read file.txt"

# Use tool groups
pnpm agent:cli --tools-allow group:fs "read config.json"

Programmatic Usage

import { Agent } from './runner.js';

const agent = new Agent({
   tools: {
      // Layer 1: Base profile
      profile: 'coding',

      // Layer 2: Global customization
      allow: ['web_fetch'], // Add web_fetch to coding profile
      deny: ['exec'], // But deny exec

      // Layer 3: Provider-specific rules
      byProvider: {
         google: {
            deny: ['exec', 'process'], // Google models can't use runtime tools
         },
      },
   },

   // Layer 4: Subagent mode
   isSubagent: false,
});

Inspecting Tool Configuration

Use the tools CLI to inspect and test configurations:

# List all available tools
pnpm tools:cli list

# List tools after applying a profile
pnpm tools:cli list --profile coding

# List tools with deny rules
pnpm tools:cli list --profile coding --deny exec

# Show all tool groups
pnpm tools:cli groups

# Show all profiles
pnpm tools:cli profiles

Policy System Details

Layer 1: Profile

The profile determines the base set of available tools. If not specified, all tools are available.

// In groups.ts
export const TOOL_PROFILES = {
   minimal: { allow: [] }, // No tools
   coding: { allow: ['group:fs', 'group:runtime'] }, // FS + execution
   web: { allow: ['group:fs', 'group:runtime', 'group:web'] }, // + web
   full: {}, // No restrictions
};

Layer 2: Global Allow/Deny

User-specified allow/deny lists that modify the profile's tool set:

allow: Only these tools are available (additive to profile)
deny: These tools are blocked (takes precedence over allow)

Layer 3: Provider-Specific

Different LLM providers may have different capabilities or restrictions:

{
  byProvider: {
    google: { deny: ["exec"] },      // Gemini can't execute commands
    anthropic: { allow: ["*"] },     // Claude has full access
  }
}

Layer 4: Subagent Restrictions

When isSubagent: true, additional restrictions are applied to prevent spawned agents from accessing sensitive tools like session management.

Adding New Tools

Create a new file in src/agent/tools/ (e.g., my-tool.ts)
Define the tool using TypeBox for the schema:

import { Type } from '@sinclair/typebox';
import type { AgentTool } from '@mariozechner/pi-agent-core';

const MyToolSchema = Type.Object({
   param1: Type.String({ description: 'Parameter description' }),
   param2: Type.Optional(Type.Number()),
});

export function createMyTool(): AgentTool<typeof MyToolSchema> {
   return {
      name: 'my_tool',
      label: 'My Tool',
      description: 'What this tool does',
      parameters: MyToolSchema,
      execute: async (toolCallId, args) => {
         // Implementation
         return { result: 'success' };
      },
   };
}

import { createMyTool } from './tools/my-tool.js';

export function createAllTools(cwd: string): AgentTool<any>[] {
   // ... existing tools
   const myTool = createMyTool();

   return [
      ...baseTools,
      myTool as AgentTool<any>,
      // ...
   ];
}

Add the tool to appropriate groups in groups.ts:

export const TOOL_GROUPS: Record<string, string[]> = {
   'group:my_category': ['my_tool', 'other_tool'],
   // ...
};

Testing

Run the policy system tests:

npx tsx src/agent/tools/policy.test.ts

Agent Profile Integration

Tools configuration can be defined in Agent Profile's config.json, allowing different agents to have different tool capabilities:

┌─────────────────────────────────────────────────────────────────┐
│                      Super Multica Hub                          │
│                                                                 │
│   ┌───────────┐    ┌───────────┐    ┌───────────┐              │
│   │  Agent A  │    │  Agent B  │    │  Agent C  │              │
│   │  Profile: │    │  Profile: │    │  Profile: │              │
│   │  coder    │    │  reviewer │    │  devops   │              │
│   │           │    │           │    │           │              │
│   │  tools:   │    │  tools:   │    │  tools:   │              │
│   │  coding   │    │  minimal  │    │  full     │              │
│   └─────┬─────┘    └─────┬─────┘    └─────┬─────┘              │
│         │                │                │                     │
└─────────┼────────────────┼────────────────┼─────────────────────┘
          │                │                │
          ▼                ▼                ▼
    ┌──────────┐     ┌──────────┐     ┌──────────┐
    │  Client  │     │  Client  │     │  Client  │
    └──────────┘     └──────────┘     └──────────┘

Each Agent's Profile can define its own tools configuration in config.json:

{
   "tools": {
      "profile": "coding",
      "deny": ["exec"]
   },
   "provider": "anthropic",
   "model": "claude-sonnet-4-20250514"
}

See Profile README for full documentation.

Config Priority

When both Profile config and CLI options are provided:

Profile config.json - Base configuration
CLI options - Override/extend profile settings

# Profile has tools.profile = "coding"
# CLI adds --tools-deny exec
# Result: coding profile without exec tool
pnpm agent:cli --profile my-agent --tools-deny exec "list files"

Future Tools

The following tools are planned for future implementation:

Browser - Simplified web automation (screenshot, click, type)
Session Management - sessions_list, sessions_history, sessions_send, sessions_spawn, session_status
Image - Image generation and manipulation
Cron - Scheduled task execution
Message - Inter-agent communication
Canvas - Visual output generation