goclaw

mirror of https://github.com/tiennm99/goclaw.git synced 2026-07-25 14:22:57 +00:00

Author	SHA1	Message	Date
Goon	a8afed2a81	fix(channels): announce tools before execution	2026-05-31 19:29:49 +07:00
Duy /zuey/andGitHub	269e2618ae	feat: add archived run timeline Squash merge PR #113 after resolving the project changelog conflict with current dev. Local checks passed: Go store/http/gateway/agent/pipeline tests, SQLite-tagged tests, both Go builds, web Vitest, and web build. PR CI run 26705098712 passed release-versioning, go, and web.	2026-05-31 13:22:07 +07:00
Duy /zuey/andGitHub	f0f39ce31f	feat(channels): add human-like chat behavior Squash merge PR #99 after resolving conflicts with current dev. PR CI run 26703381807 passed release-versioning, go, and web.	2026-05-31 11:44:56 +07:00
Duy /zuey/andGitHub	ef829c1ab4	fix(bitrix24): resolve PR 8-10 stack findings Resolve the PR #8/#9/#10 stack on current dev, including Bitrix24 install callback hardening, migration renumbering, duplicate-domain fail-closed routing, UI textarea/mobile cleanup, and review hardening.	2026-05-22 20:53:01 +07:00
Duy Nguyen	f1277af8c9	style(protocol): format method constants	2026-05-17 17:20:50 +07:00
Duy Nguyen	536ab4ac6c	feat(permissions): add agent channel permission matrix	2026-05-17 15:09:49 +07:00
Duy /zuey/andGitHub	4472c607b8	feat(workstation): Remote Workstation Runtime — SSH exec + security + audit (#4 ) * feat(packages): add update flow for GitHub binaries (#900) Closes #900. Proactive update-check + atomic swap for GitHub-installed binaries on the Runtime & Packages page. Interfaces prepared for pip/npm/apk extension in Phase 2. - UpdateCache + UpdateRegistry + PackageLocker (ctx-aware keyed mutex) - GitHubUpdateChecker: ETag-aware, distinct /latest vs /list ETag keys, semver-correct ordering via golang.org/x/mod/semver, non-semver fallback that refuses to downgrade, pre-release + stable candidate fusion for the v1.0.0-rc.1 -> v1.0.0 transition - GitHubUpdateExecutor: two-phase .bak swap with hadBackup-aware rollback, manifest save retry (3x, 100ms/500ms/1s backoff), nil-safe meta access, explicit ScratchDir, 0755 set pre-rename - HTTP: GET /v1/packages/updates (SWR), POST /v1/packages/updates/refresh, POST /v1/packages/update, POST /v1/packages/updates/apply-all (always 200, failed[] is error source). Master-scope gated. - WS events package.update.{checked,started,succeeded,failed} forwarded to owner clients via event_filter.go - Frontend: useUpdates hook + 3 components (summary bar, update-all modal, row button), master-scope-gated disabled state - i18n: 8 backend keys + 17 frontend keys x en/vi/zh - Config: packages.github_token (reserved), updates_check_ttl, scratch_dir - 45+ new tests, race-clean, BenchmarkCheckAll10Packages ~1.1ms/op warm * docs(packages): document update flow + Phase 1 completion - packages-github.md: "Updating Installed Packages" section with UI + API contract, troubleshooting runbook (corrupt cache, rate-limit, scratch dir, mid-swap recovery) - 17-changelog.md + CHANGELOG.md: Phase 1 entry - 14-skills-runtime.md: cross-ref to update flow - journal entry capturing CRIT fixes (double-write, lock-key mismatch, rollback false-alarm) + design wins (keyed locks, red-team pre-flight) * feat(workstation): remote workstation runtime — SSH exec + security + audit Adds generic Remote Workstation Runtime enabling agents to execute commands on user-owned SSH workstations. Includes registry (DB + API + UI), SSH backend with connection pool and circuit breaker, workstation.exec + claude_remote tools, NFKC + binary-name allowlist security, and audit logging. Standard edition only. Closes #941. * fix(workstation): address 3 critical + 5 important code review findings - C1: Add json:"-" to Metadata/DefaultEnv fields; use SanitizedView() in all API responses to prevent SSH private key leakage - C2: Wire CheckEnv into PermCheckFn; LD_PRELOAD/PATH injection now blocked - C3: SSH Setenv fallback — prepend `export K=V;` when server rejects Setenv - I1: BackendCache sync.RWMutex → sync.Mutex (fix data race on lastUsed) - I2: Validate metadata shape in handleUpdate before store write - I3: Include command in exec-done event; activity sink uses actual cmd hash - I4: Wrap pool release in sync.Once (idempotent double-call safety) - I5: Verify workstation tenant ownership before adding permissions * fix(packages): bypass HTTPS+IP validation in update executor tests Test httptest servers bind to http://127.0.0.1 which fails both the HTTPS scheme check and literal-IP SSRF guard. Add testSkipDownloadValidation flag (same pattern as existing withTestDownloadHosts) to skip full URL validation in test context. * fix(workstation): address Claude review findings — tenant isolation + pool leak + dead code - Activity list: add workstation ownership check before listing (prevents cross-tenant activity enumeration via known UUID) - SSH pool: clean up p.sem + p.circuits maps in CloseWorkstation, prune, and Close to prevent unbounded map growth - RPC handlers: return ErrInvalidRequest on JSON unmarshal failure instead of silently using zero-value params - Remove unused containsControlChars function in normalize.go - HTTP tests: add 10s context timeout to prevent CI package timeout * fix(workstation): DefaultEnv JSON parse, backend cache leak, perm ownership check - DefaultEnv: replace KEY=VALUE text parse with json.Unmarshal (stored as JSON by HTTP handler, was silently ignored) - BackendCache: close losing backend on concurrent cache miss to prevent pruneLoop goroutine leak - Backend interface: add Close() error method; SSHBackend delegates to pool.Close() - handlePermList: add wsStore.GetByID ownership check (prevents cross-tenant UUID enumeration returning empty array vs 404) - scanRows: log scan errors instead of silently skipping * fix(workstation): wire activity sink shutdown + remove misleading comment - WireActivitySink: capture cleanup func, register in gateway shutdown (was discarded → retention goroutine leaked + buffered rows lost) - Add Stop() to WorkstationActivityStore interface (PG+SQLite already had it) - wireWorkstationTools returns cleanup func; gateway.go defers it - Remove misleading "re-validate env" comment in allowlist.go Check() * ci: bump unit test timeout from 90s to 120s hooks/handlers package (goja script tests) consumes ~85s on cold CI runners, leaving insufficient headroom for HTTP retry tests with 1s backoff. 120s provides adequate breathing room without masking real deadlocks. * fix: compile errors in integration tests + allowlist docstring - packages_update_test: add missing lockKey arg to registry.Apply - mcp_grant_revoke_test: remove unused fakeMCPClient struct - allowlist.go: fix Check() docstring to match actual 3-step pipeline * fix(test): relax mcp grant revoke assertion for pre-Phase02 state Execute-time grant checking not yet wired — test correctly gets an error but the message is "no active client" (nil clientPtr) rather than "grant revoked". Accept any error as valid regression guard. * chore: trigger CI on digitopvn/goclaw fork * ci: retrigger workflows * fix(permissions): classify workstation methods in RBAC policy	2026-05-11 14:58:19 +07:00
viettranx	a7962d182a	feat(pipeline): session compaction overflow recovery (#958 ) - Add ZAI/GLM context overflow patterns to error_classify.go - ThinkStage detects overflow, triggers emergency compaction + 1 retry - Wire ReserveTokensFloor config to pipeline budget calculation - Send user-friendly error on channel RunFailed events - Add sessions.compact WebSocket method for manual truncation	2026-04-20 08:49:38 +07:00
viettranx	aa1b2f026a	feat(hooks): WS RPC surface + tenant-scope guards + shared test runner Exposes agent hooks over WebSocket (hooks.list/create/update/delete/ toggle/test/history) behind existing auth layer. Additions: - Gateway wiring: buildHookHandlers() factory reused by dispatcher and hooks.test runner, so UI test panel exercises production code paths - GetByID tenant-scope guard: non-master callers only see own tenant + global rows (matches List() behavior on both PG and SQLite) - i18n keys + catalogs (en/vi/zh) for hooks error messages - Protocol method constants for hooks.* and config.defaults	2026-04-16 14:17:47 +07:00
viettranx	c7f2a260e8	feat(gateway): /v1/voices HTTP + WS RPC endpoints Add ListVoices and RefreshVoices methods to RPC protocol. Implement HTTP /v1/voices endpoint with provider-aware voice listing and filtering.	2026-04-15 11:24:56 +07:00
viettranx	1ac08155b0	feat(trace): reliable stop/abort with ctx-aware streams and 2-phase router Makes the Stop button on the traces page actually stop running traces. Seven-phase implementation across provider HTTP, agent router, trace persistence, WS events, tool exec, i18n, and integration tests. - Provider HTTP+SSE ctx-aware: close socket on cancel via CtxBody wrapper - Router 2-phase abort: CAS state machine, 3s grace, force-mark fallback - Trace retry: 3 inline retries + 10-max retry queue, stale recovery 10min - trace.status WS event: real-time UI updates (invalidates query on receive) - Tool exec: process-group kill (SIGTERM→3s→SIGKILL), Rod page ctx watch - i18n: 6 abort toast variants in en/vi/zh - Integration: 9 scenarios, -race clean Fixes tenant-ctx loss in forceMarkTraceAborted and retry worker broadcast (caught by code-reviewer: C1/C2). Stale threshold intentionally 10min because start_time-based; last_span_at migration is a follow-up.	2026-04-14 18:28:31 +07:00
viettranx	1a471cbde3	feat(bgalert): surface non-retryable background worker errors to admin UI - Add bgalert package: classify provider errors (auth, billing, model_not_found), store alert in system_configs, broadcast WS event - Wire AlertDeps into consolidation (episodic, semantic, dreaming) and vault enrich workers to report failures after retry exhaustion - Auto-clear alert when admin changes provider-related system configs - Add BackgroundErrorBanner component with dismiss + "Fix in Settings" - Lift settings modal state to AppLayout so banner can open it - Add EventBackgroundError to admin-only WS event filter - Add i18n translations (en/vi/zh) for alert messages and reasons	2026-04-13 19:19:51 +07:00
henkedk GitHub Jens Henke viettranx	ad893908a5	fix(telegram): propagate local_key for forum topic routing in team notifications (#800 ) * fix(telegram): propagate local_key for forum topic routing in team notifications Team task status messages (dispatched, completed, progress) were always delivered to the General topic in Telegram forum groups because the notification pipeline had no access to the originating topic's local_key. Root cause: wireTeamProgressNotifySubscriber in gateway_events.go published OutboundMessage with no Metadata, so the Telegram adapter had no message_thread_id to route to the correct forum topic. Fix has two parts: 1. Team notify path (root cause): - Add LocalKey field to TeamTaskEventPayload (protocol) - Extract local_key from tool context in WithContextInfo() - Add LocalKey to NotifyRoutingMeta - Pass LocalKey through to OutboundMessage Metadata in both leader mode (InboundMessage) and direct mode (OutboundMessage) 2. MCP bridge context (supporting): - Propagate local_key and session_key through bridge HTTP headers - Add X-Local-Key and X-Session-Key to BridgeContext - Extract and inject into tool context in gateway middleware - Include in HMAC signature for integrity Closes #798 * fix(telegram): pass LocalKey from task metadata in all dispatch/fail broadcast sites The initial fix added LocalKey to the event payload and WithContextInfo(), but 4 broadcast call sites use individual With* options instead of WithContextInfo — so LocalKey was never populated for: - fallback_dispatch (team_tasks_create.go) - dispatch_unblocked (team_tool_dispatch.go) - post_turn dispatch (team_tool_validation.go) - blocker/fail (team_tasks_blocker.go) Add WithLocalKey() option function and extract TaskMetaLocalKey from task metadata at each site, matching the existing TaskMetaPeerKind pattern. * test(mcp): add HMAC verification tests for extra params (localKey, sessionKey) - Add tests for SignBridgeContext/VerifyBridgeContext with extra params - Test backward compat fallback for pre-localKey sessions - Test that param order matters in signature - Add clarifying comment for routing context injection security model --------- Co-authored-by: Jens Henke <jens@henke.dk> Co-authored-by: viettranx <viettranx@gmail.com>	2026-04-12 19:03:47 +07:00
viettranx	49fced695f	feat(vault): cross-agent API, WS enrichment progress, nullable agent_id Backend: - Add cross-agent routes for all vault CRUD (/v1/vault/documents/, /v1/vault/links/) alongside existing per-agent routes for backward compat. - All handlers skip agent ownership check when agentID is empty. - Path traversal validation (reject .. and leading /) on create. - Scope defaults to "shared" when creating without agent. - Enrichment progress broadcast via WS event (vault.enrich.progress) using existing MessageBus infrastructure. - GET /v1/vault/enrichment/status for polling fallback. Frontend: - Fix VaultDocument.agent_id type to optional/nullable — prevents runtime crash on shared/team docs without agent. - All vault hooks use cross-agent URLs (/v1/vault/...) — no more skipping docs with null agent_id in graph/links/mutations. - Remove dead agentId params from useUpdateDocument, useDeleteDocument, useCreateLink, useDeleteLink, useVaultLinks, LinkBadge. - Pass team_id from selected team filter to create dialog. - WS-based enrichment progress bar on vault page (useWsEvent hook). - Disable rescan button during enrichment, auto-clear after 3s.	2026-04-10 18:25:08 +07:00
Duc NguyenandGitHub	0db1e93abf	feat(whatsapp): add native WhatsApp channel with whatsmeow (#720 ) Replace Node.js Baileys bridge with native go.mau.fi/whatsmeow — zero external dependencies. QR auth, media support, markdown formatting, typing indicators, dual JID/LID identity, group policies, pairing. Resolves #703	2026-04-07 12:12:44 +07:00
viettranx	014f74ec15	fix(agent): group session unresponsive during team task execution (#266 ) Two fixes: 1. Remove assistant prefill from team task reminders. The injected [user]+[assistant]+[user] pattern caused LLMs to treat the canned ack as "turn complete", returning NO_REPLY for every user message in group sessions with active tasks. Reminders are now merged into the user message as prefix tags. 2. Add PeerKind propagation to team notification routing. TaskTicker and progress notifications were missing PeerKind on InboundMessage, causing them to route to phantom DM sessions instead of the correct group session. PeerKind is now carried through event payloads, notify queue metadata, and all inbound message publications.	2026-03-30 15:20:01 +07:00
viettranx	8b75e095e9	feat(cancel): add run.cancelled event and fix stop across web UI, desktop, and channels - Add distinct `run.cancelled` agent event to differentiate user-initiated cancellation from real errors (previously both emitted `run.failed`) - Fix desktop stop button: wire onStop callback to InputBar, add abort() function in useChat hook, restore isRunning state on session switch - Fix web UI: handle run.cancelled without showing error message, preserve partial streamed content on cancel - Fix hanging chat.send promise: send {cancelled: true} OK response instead of silently dropping (was hanging 600s until timeout) - Fix channel stream cleanup: handle run.cancelled for stream stop + run context deletion (prevents memory leak) + reaction status clear - Fix child trace status: cancelled subagent runs now correctly get TraceStatusCancelled instead of TraceStatusError - Add run.cancelled to all event consumers: query invalidation, session detail refresh, trace dialog refresh, gateway activity clearing	2026-03-29 09:00:06 +07:00
viettranx	f94fc1528e	feat(ui): conversation-wide image gallery, layout fix, and session state restore - Fix chat layout overflow when TaskPanel sidebar is open by adding min-w-0 to flex containers in AppLayout and ChatPage - Add conversation-wide image gallery with prev/next navigation (ArrowKeys) and image counter — collects images from both mediaItems and markdown - Rename task panel header from "Tasks" to "Running Tasks" - Add chat.session.status RPC to restore isRunning and activity state on session switch, preventing lost running indicators - Add cancelled flag to session restore effect to prevent race conditions on rapid session switching	2026-03-25 19:41:53 +07:00
viettranx	f793bba0e0	feat(teams): restore active tasks on session switch via teams.tasks.active-by-session RPC	2026-03-25 18:26:31 +07:00
viettranx	7ad15a375a	feat(chat): add realtime comment/attachment counters with toggleable task panel Subscribe to team.task.commented and new team.task.attachment_added events in chat view. Show inline notifications and increment counters on active task cards. Add toggleable sidebar panel (auto-opens on first task dispatch, auto-closes when all tasks complete). Mobile renders as slide-over with backdrop overlay.	2026-03-25 12:06:58 +07:00
Viet TranandGitHub	cd022699f6	feat: multi-tenant isolation — complete implementation (#359 ) * feat(security): multi-tenant user data isolation (Plan 1) Comprehensive user data isolation for non-owner system users: - API key identity binding: owner_id column forces user_id on auth, prevents spoofing via X-GoClaw-User-Id header - Sessions: ownership checks on list/preview/patch/delete/reset, non-admin users see only their own sessions - Cron: user_id filtering on list, ownership checks on mutations - Server-side WS event filtering: agent/chat/session/cron/team events scoped per-user instead of broadcast to all clients - Web UI role guards: RequireAdmin on 15 admin-only pages, role propagated from WS connect response to auth store - Tracing/activity: user_id enforcement for non-admin HTTP callers - Teams: HasTeamAccess membership checks on get/delete/list - Skills: fail-closed ownership check (deny non-admin if store doesn't support owner lookup) - HTTP auth: requireAuthBearer now enforces owner_id + user context for file/media downloads (was missing) - Dead code: removed delegation_history, handoff_routes tables and all related handlers/store code - New: team_user_grants table for user-to-team access control Migration 000026: api_keys.owner_id + team_user_grants + DROP legacy tables * feat(security): multi-tenant foundation — tenants table, tenant_id propagation, permission cache (Plan 2) Add tenant isolation infrastructure across the entire gateway: Schema (migration 000027): - Create tenants + tenant_users tables with master tenant seed - Add tenant_id column to 30 user-scoped tables (NOT NULL DEFAULT master) - api_keys.tenant_id nullable (NULL = system-level cross-tenant key) - Create builtin_tool_tenant_configs + skill_tenant_configs for per-tenant overrides - Drop custom_tools table (agent loop integration never wired) Store layer: - TenantStore interface + PGTenantStore (CRUD tenants + tenant_users) - TenantID field on AgentData + APIKeyData - tenant_id in agents/api_keys/skills SQL (Create, Get, List) Context propagation: - WithTenantID/TenantIDFromContext (uuid.Nil = fail-closed) - WithCrossTenant/IsCrossTenant (owner/system admin flag) Auth tenant resolution: - HTTP: resolveAuthBearer sets TenantID/CrossTenant on all 5 auth paths - WS: handleConnect sets tenantID/crossTenant on Client - API key 2-tier: NULL = cross-tenant (system), set = tenant-scoped Runtime isolation: - Event bus: TenantID field on Event, fail-closed filter in event_filter.go - Cron: tenant context injected in RunJob handler - Subagent: tenant validation prevents cross-tenant spawn - Security logging: tenant_id in auth resolution logs Tenant management: - WS RPC: 7 methods (tenants.list/get/create/update, tenants.users.) - HTTP: 7 endpoints (/v1/tenants/) - Slug validation + path traversal prevention - Role validation (owner/admin/operator/member/viewer) Infrastructure: - PermissionCache: 4 sub-caches (tenant resolve, role, agent access, team access) - tenant_paths.go: filesystem path helpers with master-tenant backward compat - i18n: MsgInvalidRole key + translations (en/vi/zh) Dead code removed: custom_tools store, HTTP handler, DynamicToolLoader (-828 lines) * feat(security): tenant query filtering + workspace isolation (Plan 3) Add WHERE tenant_id filtering to all 30+ tenant-scoped store queries, wire workspace filesystem isolation, and harden restrict_to_workspace. Store query filtering: - Add tenantClauseN/tenantIDForInsert/requireTenantID helpers - Filter all SELECT/INSERT/UPDATE/DELETE by tenant_id for non-cross-tenant - Refactor SessionStore.GetOrCreate and CronStore.AddJob/ListJobs to accept context.Context for tenant propagation - System skills (is_system=true) bypass tenant filter for all tenants - Special cases: GetByKey (channels), GetByHash (auth) skip filter Workspace isolation: - Resolver computes tenant-scoped workspace + dataDir for non-master tenants - Add WithTenantSlug/TenantSlugFromContext to context propagation - Add TenantStore + Workspace to ResolverDeps - Force effectiveRestrict() to always return true (multi-tenant security) - Remove restrict_to_workspace from agentAllowedFields UI cleanup: - Remove custom-tools pages, types, routes, constants (backend removed in Plan 2) - Clean tool-name-select component of custom tools references * feat(security): session ctx propagation + execMapUpdate tenant guard (Plan 4) Session store: - Add ctx to AddMessage, SetSessionMetadata, SetAgentInfo, List, Save - List now filters by tenant_id for non-cross-tenant callers - Save uses ExecContext for cancellation support - All ~15 callers updated to pass ctx execMapUpdate tenant guard: - Remove deleted_at IS NULL from execMapUpdateWhereTenant (only agents has soft-delete) - Migrate 8 callers to execMapUpdateWhereTenant: agent_links, channel_instances, mcp_servers, secure_cli, tracing, teams, skills_crud, cron_update - Add ctx to UpdateSkill, UpdateJob interfaces + all callers Deferred: cron scheduler global cache (correct by design — system process), browser per-tenant isolation (separate plan). * refactor(store): add context.Context to all SessionStore interface methods Complete ctx propagation across all 24 SessionStore methods for: - Future tenant-aware DB operations - Request cancellation/timeout support - Distributed tracing capability Updated ~15 files including all callers in agent loop, gateway methods, heartbeat ticker, tools, and CLI commands. * fix(security): remove context.Background() shadowing in gateway handlers Critical fix from code review: gateway agent handlers (create, update, delete, identity, files, links, teams) were creating ctx := context.Background() which shadowed the handler's ctx that carries tenant_id. This breaks tenant-scoped agent queries for non-master tenants. - Remove ctx shadowing in 7 agent handler files - Add ctx param to resolveAgentUUID/resolveAgentInfo helpers - Use store.WithCrossTenant in resolver (system-level operation) * feat(security): tenant-scoped UNIQUE constraints for multi-tenant isolation Update UNIQUE indexes to include tenant_id, allowing same names across tenants: - agents: (agent_key) → (tenant_id, agent_key) WHERE deleted_at IS NULL - sessions: (session_key) → (tenant_id, session_key) - skills: (slug) → (tenant_id, slug) - mcp_servers: (name) → (tenant_id, name) - channel_contacts: (channel_type, sender_id) → (tenant_id, channel_type, sender_id) Code changes: - GetByKey now filters by tenant_id (same pattern as GetByID) - ON CONFLICT clauses updated for sessions and skills - Channel consumer uses WithCrossTenant for agent resolution - Down migration restores original constraints * fix(security): close remaining tenant isolation gaps from final audit Critical fixes: - gateway_setup: WithCrossTenant for default agent lookup at startup (C6) - channel_contacts: ON CONFLICT updated to (tenant_id, channel_type, sender_id) (Q15) - agents.Delete: tenant filter on DELETE (Q1) High priority fixes: - agents: List, GetDefault, ShareAgent, RevokeShare, ListShares, CanAccess, ListAccessible, Update unset-default — all now tenant-scoped - skills_crud: DeleteSkill now takes ctx, verifies tenant ownership - mcp_servers, channel_instances, secure_cli: Delete methods tenant-scoped - WithCrossTenant added to: gateway team notifications, team_tool_cache, pending_messages GetDefault * fix(migration): add tenant_id to usage_snapshots unique index Update idx_usage_snapshots_unique to include tenant_id, preventing cross-tenant upsert collisions when different tenants have agents with same provider/model/channel combination. * feat(security): cron tenant guard + browser per-tenant isolation Phase 3 — Cron API tenant guard: - Add ctx to 5 CronStore methods (GetJob, RemoveJob, EnableJob, RunJob, GetRunLog) - All API-facing cron ops now filter by tenant_id (prevents cross-tenant CRUD) - RemoveJob/EnableJob return "not found" on tenant mismatch (no enumeration) - GetRunLog JOINs cron_jobs for tenant filtering - UpdateJob internal reads scoped by tenant (defense-in-depth) - Scheduler-internal methods (GetDueJobs, refreshJobCache) unchanged (system-level) Phase 4 — Browser per-tenant isolation: - Per-tenant incognito browser contexts via rod Incognito() (separate cookie jars) - All page access (Snapshot, Screenshot, Navigate, Click, Type, etc.) validated via getPageForTenant — blocks cross-tenant access by targetID - OpenTab creates pages in tenant's incognito context - ListTabs scoped to tenant's incognito context - ConsoleMessages validates page ownership - Stop/reconnect properly cleans up incognito contexts * feat(security): isolation gaps + per-tenant config (Plan 5) Part A — Isolation Gap Fixes: - Merge migration 028 into 027: add tenant_id to llm_providers + config_secrets, fix UNIQUE constraints for paired_devices + channel_instances - providers.go: tenant filtering on all CRUD queries - config_secrets.go: ON CONFLICT (key, tenant_id) - pairing_store: add ctx to all 7 interface methods, remove hardcoded MasterTenantID, update ~15 channel caller files - Session cache: prefix keys with tenantID to prevent cross-tenant collision. DB queries (loadFromDB, Save, Delete, LastUsedChannel) add tenant filter - config_permissions cache: prefix keys with tenantID - Cron ListJobs: fail-closed when tenant context missing Part B — Per-Tenant Configuration: - Provider Registry: compound key tenantID/name with fallback to master tenant. GetForTenant/ListForTenant/RegisterForTenant - Resolver: uses tenant-aware provider lookup + disabled tools query - Agent loop: filter disabled tools from LLM tool definitions - Builtin tool tenant configs: store interface + PG implementation + PUT/DELETE HTTP endpoints - Skill tenant configs: store interface + PG + ListAccessible LEFT JOIN to exclude disabled skills per tenant - OAuth: DBTokenSource with tenantID field for tenant-scoped token refresh - All HTTP provider handlers use RegisterForTenant/UnregisterForTenant * feat(security): channel tenant propagation + MCP per-user credentials (Plan 6) - Propagate tenant_id from channel_instances through BaseChannel → InboundMessage → agent loop context (fixes 5-point break in tenant flow) - Inject tenant context in WS router dispatch for all gateway methods - Add MCP per-user credential overrides (api_key, headers, env) with AES-256-GCM encryption and HTTP API endpoints - Rewrite MCP pool with tenant-scoped keys, slot semaphore, idle eviction, and credential rotation support (Evict per tenant+server) - Bypass pool for users with custom credentials (separate connections) - Fix MCP APIKey never passed to connections (inject as Authorization header) * fix(security): close remaining tenant isolation gaps from Plan 1-6 audit - Add tenant_id to 6 missing tables: agent_context_files, skill_agent_grants, mcp_agent_grants, team_tasks, spans, embedding_cache (migration 027) - Fix tid==uuid.Nil fallback to fail-closed (return error) in 8 update methods: agent_links, teams, skills, channel_instances, secure_cli, cron, mcp_servers, tracing - Add tenant filter to bare DELETEs: DeleteLink, DeleteTeam - Add tenant filter to queries: ListChildTraces, GetMonthlyAgentCost, CountAgentGrantsByServer, ListAccessible (MCP), ReviewRequest, ResolveGroupTitles, buildTraceWhere - Fix missing tenant_id in INSERTs: CreateSkill, GrantToUser, ReviewRequest grant INSERTs - Add tenant filter to api_keys: List, Revoke, Delete - Fix cron scanJob/RemoveJob/EnableJob fallthrough patterns * fix(security): inject tenant context into channel handler entry points Channel handlers used context.Background() which lost tenant context, causing store operations to either fail-closed or default to master tenant. Now all 10 handler entry points inject tenant from BaseChannel. * fix(security): tenant filters for teams, tasks, skills (Plan 6b audit) - Teams: add tenant filter to GetTeamForAgent, ListMembers, ListIdleMembers, KnownUserIDs (JOIN agent_teams for tenant check) - Teams: add tenant_id to GrantTeamAccess INSERT, tenant filter to RevokeTeamAccess, ListTeamGrants, HasTeamAccess - Team tasks: add tenant_id to CreateTask INSERT, fail-closed UpdateTask, tenant filter on all 7 query/delete methods - Skills: add tenant filter to RevokeFromAgent, ListAgentGrants - Skills: add ctx param + tenant filter to ToggleSkill - History: annotate context.Background() locations with TODOs for future tenant injection (requires PendingHistory struct refactor) * fix(security): add tenant_id to 4 missing team tables + fix INSERTs Add tenant_id column to: agent_team_members, team_task_comments, team_task_events, team_task_attachments (migration 027). Fix INSERT statements to include tenant_id: AddMember, AddTaskComment, RecordTaskEvent, AttachFileToTask. * fix(migration): cast UUID literals in tenant_users seed + usage_snapshots index PostgreSQL doesn't auto-cast string to UUID in SELECT and expression index contexts. Add explicit ::uuid casts to prevent migration failure. * docs: add multi-tenant architecture guide for integrators Comprehensive solution doc covering auth model, WS protocol, event system, data isolation, API reference, and integration patterns. Target audience: developers building custom frontends or SaaS on GoClaw. * feat(ui): multi-tenant awareness + tenant admin page (Plan 7) Backend: - Enrich WS connect response with tenant_name, tenant_slug, cross_tenant - Add tenants.mine WS method (any user, returns own memberships) - Parse tenant_hint in connect params for browser pairing multi-tenant - Wire tenantStore to MethodRouter for connect-time tenant lookup Frontend: - Auth store: tenantId, tenantName, tenantSlug, isCrossTenant, availableTenants - WS client: capture tenant fields from connect, send tenant_hint - WS provider: auto-fetch tenants.mine on connect - useTenants() shared hook for all tenant-aware components - Tenant indicator in sidebar connection status - Tenant admin page (/admin/tenants) with list + create dialog - Tenants nav in sidebar (cross-tenant admin only) - i18n: tenants namespace (en/vi/zh) - Type updates: tenant_id on AgentData, ApiKeyData * refactor(ui): move tenant selector into user menu dropdown in topbar Replace simple logout button with a Radix Popover user menu showing: - User ID display - Tenant selector (when multi-tenant: list all tenants with check mark) - Logout button Remove tenant indicator from connection-status.tsx (now in topbar). Tenant switch saves slug to localStorage and reloads for reconnect. * feat(ui): add logout confirmation dialog Show destructive confirm dialog before logout via ConfirmDialog component. Added logoutConfirm i18n key for en/vi/zh. * fix(ui): security hardening — hide admin nav, fix route guard, fix refresh - Hide System nav group for non-admin roles in sidebar (was visible to all) - Replace RequireAdmin with RequireCrossTenant guard on /admin/tenants route - Add RequireCrossTenant component to require-role.tsx - Fix refresh button animation: use isFetching instead of isLoading - Clean up connection-status.tsx (remove tenant indicator, now in topbar) * feat: cross-tenant admin tenant scope selector Backend: add tenant_scope connect param. Cross-tenant clients can narrow their scope to a specific tenant (slug). applyTenantScope() sets client.tenantID and clears crossTenant flag. UI: user menu shows "All Tenants" option for cross-tenant admins. Selecting a tenant saves slug to localStorage as tenant_scope, reload reconnects with narrowed scope. "All Tenants" clears scope. * feat: provisioning API key scope + tenant detail page (Plan 8) Backend: - Add operator.provision scope for limited tenant management - Add HasScope() method to gateway Client - Allow provision-scoped keys to create tenants + add users - Allow provision-scoped keys to create tenant-bound API keys Frontend: - Tenant detail page with user management (list, add, remove) - Clickable tenant list rows navigate to detail - i18n: tenant detail keys (en/vi/zh) - Route /admin/tenants/:id with RequireCrossTenant guard * fix: tenant scope keeps admin privileges + UI pattern fixes Backend: - applyTenantScope keeps crossTenant=true (retains admin features) - Router: scoped cross-tenant injects WithTenantID (filters data) while keeping admin role for method access UI: - Fix "All Tenants" check mark (compare against nil UUID string) - Fix tenant label when scope active (show selected tenant name) - Use ConfirmDialog for user removal (was hand-rolled) - Add DialogDescription to add-user dialog (Radix a11y) - Fix table min-w-[600px] consistency - Fix column header mismatch (was "role", should be "created") * fix(ui): clean up tenant detail header — remove redundant info panel Remove duplicate slug/status/created panel. Info now shown in PageHeader description (slug + date). Status badge removed (redundant with description). Cleaner, consistent with other admin pages. * fix(ui): redesign tenant detail with info cards + user cards * feat(ui): tenant selection gate — require tenant before app access - Add tenantSelected flag to auth store (persisted via localStorage) - WS provider auto-selects: single-tenant user auto, cross-tenant admin defaults to "All Tenants", zero-tenant user blocked - RequireAuth gate: redirect to /select-tenant when connected but no tenant selected - New TenantSelectorPage: centered card layout matching login page, "All Tenants" amber card for cross-tenant admin, per-tenant cards with role badges, no-access state with logout button - i18n: selectTenant, noAccess keys (en/vi/zh) * fix(security): scope events for cross-tenant admin with tenant_scope Event filter was checking !crossTenant before filtering — scoped cross-tenant admins (crossTenant=true + tenantID set) bypassed tenant event filtering. Now checks tenantID != Nil regardless of crossTenant flag, ensuring scoped admins only see their chosen tenant's events. * fix(security): HTTP API now respects tenant_scope for gateway token Root cause: UI uses HTTP API (/v1/agents, /v1/mcp/servers, etc.) for data fetching. HTTP auth middleware with gateway token always set CrossTenant=true with no tenant filtering. tenant_scope only worked for WS connection, not HTTP requests. Fix: - HTTP client sends X-GoClaw-Tenant-Scope header from localStorage - HTTP auth resolves header slug → tenant UUID via tenantStore - requireAuth: CrossTenant + TenantID → WithTenantID (scoped) - Wire InitTenantStore(pgStores.Tenants) in gateway startup * feat(security): tenant-aware provider registry, event filter, and membership validation - Refactor providers.Registry: Get(ctx, name) / List(ctx) extract tenant from context via injected TenantFromCtx func (avoids circular import) - Event filter: fail-closed 3-mode tenant filtering Mode 1: unscoped admin sees all Mode 2: scoped admin sees tenant events + system events Mode 3: regular user sees only own tenant (fail-closed) - WS connect: resolveTenantHint validates membership via GetUserRole with PermissionCache (30s TTL, bus invalidation) - BroadcastForTenant helper for tenant-scoped event emission - Session list: add TenantID to SessionListOpts from context - Cron handleRun: preserve tenant in background goroutine context - GOCLAW_LOG_LEVEL env var (debug\|info\|warn\|error) for Docker/K8s - Cache debug logging: tenant_cache, permission_cache, api_key_cache - Friendly verify error: timeout → user-readable message - Verify timeout: 15s → 30s * feat(ui): setup wizard improvements + agent preset enrichment - Setup: skip link with confirm dialog, language selector (en/vi/zh) - Setup: card padding fix (py-0 gap-0 on Card, py-5 on CardContent) - Setup: remove duplicate skip link from layout - Step Model: verify countdown timer (30s), stops on result - Step Agent: default Fox Spirit preset, selected state styling, hide agent key/name inputs, auto-derive from preset, emoji in config - Summoning modal: elapsed timer (m:ss format) - Agent presets: enriched prompts with human-like quirks Fox Spirit: playful personality, care reminders Artisan: portrait/banner/ads/logo expertise Astrologer: reference sites (astro.com, cafeastrology, labyrinthos) - i18n: "triệu hồi linh hồn" fix, all 3 locales updated * feat(ui): API Key tenant support + card layout + provider chain fix - API Key create: tenant selector for cross-tenant admin, provision scope - API Key create: redesigned dialog with scope cards, Radix Select, icons - API Key list: card layout with badges (status, tenant, scopes) - API Key: shortcut in user menu (topbar) - API Key: keep "API Key" untranslated across all locales - Provider chain: empty state fix — skip legacy entry when provider not found in current tenant - i18n: form.cancel key added to all 3 locales * fix(ui): add bottom padding to all page layouts + misc improvements - Add pb-10 to all 24 page containers to prevent content touching bottom edge of viewport - Various UI polish from user modifications (summoning colors, layout icon, agent cards, sidebar adjustments) * feat(ui): MCP user credentials dialog + builtin tool tenant toggle - MCP: per-user credentials dialog (api_key, headers, env KV editor) with status badges, delete all, save - MCP: "My Credentials" button on each server row - Builtin Tools: per-tenant enable/disable override toggle with "Using default" / "Enabled/Disabled for tenant" badges and reset-to-default button - Setup: larger logo (h-16) and bolder title (text-4xl font-bold) - i18n: all keys added to en/vi/zh for both features * fix(ui): API key card spacing + remove pagination border - Card padding: px-4 py-3.5 (was px-3 py-2), rows spaced with gap-2 - Scopes on separate row from dates for readability - Card gap: space-y-2.5 between cards - Pagination: add className prop, remove border-t on API keys page - Badge/icon sizes bumped to text-xs / h-3.5 (was text-[10px] / h-3) * fix(security): comprehensive tenant isolation audit — SQL, events, cache, skills, files Defense-in-depth hardening across 12 audit phases: - SQL: add tenant_id WHERE to teams_tasks lifecycle/activity/followup/progress/embedding (~30 functions) - Events: broadcastTeamEvent + task_ticker + subagent announce now carry TenantID - Cache: agentKeyCache scoped by tenant (agent keys per-tenant, not globally unique) - Skills: SkillStore interface accepts ctx, SQL filter (is_system OR tenant_id=$N), per-tenant list cache, GrantToAgent includes tenant_id, tenant-scoped file storage - Files: StorageHandler/FilesHandler/TeamAttachments/teamWorkspaceDir use config.TenantDataDir/TenantTeamDir - Security: HMAC signed file tokens (file_token.go) replace gateway token in URLs - Audit: AuditEventPayload carries TenantID for async subscriber tenant scoping - InboundMessage: subagent/dispatch/validation/session_send propagate TenantID - Pending messages: DeleteStale scoped by tenant * fix(security): skip gateway token in URLs with signed file tokens toFileUrl() now skips appending ?token=GATEWAY_TOKEN when the URL already contains ?ft= (HMAC signed file token). Prevents gateway token exposure via browser history, logs, and referrer headers. * fix(security): stop persisting auth tokens in session media URLs mediaToMarkdown() now stores clean paths (/v1/files/path) without any auth tokens. Previously embedded ?token=GATEWAY_TOKEN (or ?ft=) into markdown which gets persisted in session messages DB. Frontend toFileUrl() adds auth at render time — tokens never stored. * fix(security): migration 027 strips leaked gateway tokens from session URLs Adds cleanup step to tenant foundation migration: removes ?token=xxx from persisted media URLs in session messages. Old code embedded the gateway token; new code stores clean paths only. * fix(security): sign file URLs at delivery time, not persist time Add SignFileURLs() utility that finds /v1/files/ and /v1/media/ URLs in content and appends HMAC signed ?ft= tokens before delivery. Applied at 4 delivery points: - WS agent events (OnEvent callback in gateway_managed.go) - WS chat.history response - WS sessions.preview response - HTTP /v1/chat/completions response Sessions store clean paths only. Tokens are generated per-delivery with 1h TTL — never persisted in DB. Frontend toFileUrl() skips appending gateway token when ?ft= is already present. * fix: file token verify path must match signed path (/v1/files/ prefix) SignFileURLs() signs the full URL path "/v1/files/{path}" but the verify in files.go auth() was using "/{path}" (without prefix). HMAC mismatch caused all signed file tokens to return 401. * fix(security): scope storage size cache per-tenant sizeCache was a single global entry — all tenants shared one cached size. Changed to sync.Map keyed by tenantBaseDir so each tenant gets its own cached size calculation. * feat(ui): redesign API keys page — table layout + code snippet dialog Replace card-based API keys list with table layout matching MCP Servers pattern. Add "API Key Usage" dialog with tabbed code snippets (cURL, TypeScript, Go) showing gateway connection examples with syntax highlighting and copy-to-clipboard. * fix(builtin-tools): seed media tools disabled, fix tenant toggle, add unconfigured warning - Seed media tools with Enabled=false and no default provider settings (user must configure provider chain before enabling) - Fix provider chain form ghost entries: validate provider exists in tenant before showing (parseInitialEntries new-format path) - Fix double toggle: show only tenant override OR global toggle, not both - Fix list API: merge tenant_enabled from builtin_tool_tenant_configs into response when tenant-scoped (was always null) - Add ListAll() to BuiltinToolTenantConfigStore for full override map - Add amber warning banner for enabled media tools missing provider config * feat(mcp): require_user_credentials setting + KeyValueEditor for user creds - Add require_user_credentials setting in mcp_servers.settings JSONB - Backend: skip MCP server in LoadForAgent when user lacks credentials - Frontend: toggle in MCP form dialog, persisted in settings field - Redesign MCP user credentials dialog: replace raw Textarea with KeyValueEditor (sensitive key masking for auth/token/secret fields) - Add settings to mcpServerAllowedFields for HTTP update * fix(security): restrict cross-tenant to owner IDs, config to owners only - Gateway token + non-owner user ID: admin role but tenant-scoped (no cross-tenant access). Fallback: only "system" is owner when GOCLAW_OWNER_IDS not configured (fail-closed). - Config page (WS config.* methods): wrapped with requireCrossTenant middleware — non-owner admins get permission denied - Config sidebar link: hidden for non-cross-tenant users - Logout: clear tenant_id and tenant_hint from localStorage (prevents tenant scope leak to next user session) - Refactor: LOCAL_STORAGE_KEYS.TENANT_ID/TENANT_HINT constants * fix(ui): chat bubble contrast, login logo, tenant no-access UX - Chat bubble: use --chat-bubble-user CSS var (darker orange, L=0.50/0.52) with text-white for WCAG AA contrast (~5.5:1) - Login page: logo h-20 w-20, title text-3xl font-bold - Tenant selector no-access: shield icon + hint text explaining user needs admin to add them to a tenant - Sidebar: GoClaw text uses text-sidebar-primary (brand color) * feat(contacts): merge/unmerge contacts to tenant users Add API and UI for linking channel contacts to tenant_users identity, enabling cross-channel user identification within a tenant. Backend: - POST /v1/contacts/merge — link contacts to existing or new tenant_user - POST /v1/contacts/unmerge — remove merged_id from contacts - GET /v1/contacts/merged/{id} — list contacts by tenant_user - GET /v1/tenant-users — list users for current tenant - Add display_name + metadata columns to tenant_users (migration 27) - All endpoints enforce tenant isolation via context tenant_id Frontend: - Checkbox multi-select on contacts table - Selection toolbar with Merge/Unmerge buttons - Merge dialog: link to existing user or create new - Link2 icon indicator for merged contacts - i18n: en/vi/zh translations for merge section * fix(security): add tenant_id to span and embedding_cache inserts SpanData struct was missing TenantID field — all span inserts failed with NOT NULL constraint violation after migration 027 dropped defaults. Fix captures tenant_id from context at emit time (6 call sites in loop_tracing.go + subagent_tracing.go), then includes it in both CreateSpan() and BatchCreateSpans() SQL (25→26 columns). Also fixes embedding_cache writeEmbeddingCache() which was missing tenant_id in its batch INSERT — same class of bug. Both use MasterTenantID fallback for backward compatibility. * feat: Introduce tenant switcher UI and enhance multi-tenant architecture documentation. * fix(security): enforce tenant scoping, fix session isolation and UI cleanup - Force cross-tenant admins to always have a concrete tenant_id (default MasterTenantID) instead of unscoped WithCrossTenant — prevents mismatch between session listing (no filter) and writes (MasterTenantID fallback) - Make agent router tenant-aware: Get(ctx, agentID) resolves agent for the caller's tenant, preventing cross-tenant agent cache collisions - Fix context.Background() in title goroutine and summarization — now uses tenant-aware context (WithoutCancel) so titles and compaction persist to the correct tenant - Add read-only SessionStore.Get() method; replace GetOrCreate in auth checks (preview/patch/delete/reset) to prevent phantom session creation - Inject tenant from channel instance into inbound message processing - Remove "All Tenants" option from tenant selector, topbar switcher, and ws-provider auto-select — admin must always operate within a tenant - Fix contacts page selection toolbar layout shift (always rendered) - Widen MCP credentials sensitive header regex to catch API_KEY etc. * fix(security): propagate tenant_id in consumer handlers and background ops - InjectTeamDispatch: use context.WithoutCancel instead of context.Background to preserve tenant_id while avoiding cancel propagation from HTTP/WS handlers - handleTeammateMessage/handleSubagentAnnounce: inject tenant_id from msg - Add nil guard for outcome.Result to prevent panic on agent-not-found - Use BroadcastForTenant for EventTeamTaskFailed/Completed/LeaderProcessing - Remove unnecessary WithCrossTenant in autoSetFollowup (ctx already scoped) - resolveAgentByKey: accept ctx param for tenant-scoped agent lookup - pending_messages: use request ctx instead of cross-tenant for GetDefault * fix(security): tenant-scope EnsureContact and PendingHistory DB operations - All channel EnsureContact calls now use tenant-scoped ctx instead of context.Background (whatsapp, slack, discord, telegram, feishu, zalo) - PendingHistory: add tenantID field, thread through constructors - All PendingHistory DB ops (load, flush, compact, delete) use tenantCtx() - Normalize timeouts: 10s for simple queries, 15s for batch writes * feat(teams): auto-attach media, retry completed tasks, improve tool messages - Auto-attach workspace media from any tool (create_image/audio/video) to team tasks via loop-level hook, not just write_file interceptor - Store absolute paths in team_task_attachments instead of relative - Extend retry action to support completed tasks (reopen for follow-up) - Context-aware comment result messages with next-action guidance for leader vs member roles and task status - All tool results include task_id for agent follow-up actions - Use #N "subject" format instead of raw UUIDs in tool messages * feat(multi-tenant): tenant isolation for media, events, providers and UI - Tenant-scoped media store, event filter, provider registry - Tenant header propagation in WS/HTTP clients - UI: tenant-aware chat messages, markdown renderer improvements - Protocol: tenant error codes and event definitions * docs: add multi-tenant architecture documentation * fix(teams): store absolute paths in team_task_attachments - AutoAttachWorkspaceFile: use cleanPath consistently instead of raw absPath - executeAttach: resolve relative paths to absolute via team workspace - AfterWrite interceptor already uses filepath.Clean (verified) * fix(teams): attachment download handles both absolute and relative paths filepath.Join with an absolute att.Path discards the teamBase prefix, causing path traversal check to fail and download to serve wrong file. Now checks IsAbs first — uses path directly for new absolute entries, falls back to legacy join for old relative entries. * fix(teams): attachment download validates against workspace root not tenant dir Absolute paths stored in DB don't match TenantTeamDir structure (master tenant has no tenants/ prefix). Now validates absolute paths against dataDir (workspace root) instead. Legacy relative paths still resolve via TenantTeamDir as before. IDOR check on att.TeamID ensures cross-team isolation. * fix(teams): attachment download uses workspace root, not data dir Files are stored under GOCLAW_WORKSPACE/teams/ but handler was passed dataDir (GOCLAW_DATA_DIR) — completely different directory. Now passes workspace. Legacy relative paths resolve via {workspace}/teams/{teamID}/{chatID}/{path}. * fix(security): use HMAC-signed file tokens for attachment downloads Replace gateway token exposure (?token=) with HMAC-signed short-lived file tokens (?ft=) for team task attachment downloads — same mechanism used by chat file URLs. Backend: - team_attachments auth: accept ?ft= signed token (priority 1), Bearer (priority 2) - teams_tasks RPC: sign download_url with HMAC at delivery time - Add fileTokenSecret to TeamsMethods, thread through wireChannelRPCMethods Frontend: - Use server-signed download_url from attachment data instead of ?token= - Remove useAuthStore dependency from task-detail-dialog * fix(security): decouple file token signing from gateway token - Generate random 256-bit HMAC key at startup (crypto/rand, memory-only) - All file signing/verification uses FileSigningKey() instead of gateway token - Remove ?token= query param fallback from /v1/files/, /v1/media/, attachments - Only ?ft= signed tokens and Bearer header accepted for file access - Reduce file token TTL from 1h to 5min - Frontend: remove gateway token from all file URLs and imports - Note: tokens invalidate on restart (acceptable for 5min TTL + WS reconnect) * fix(ui): use signed download_url for task attachments - Add download_url to TeamTaskAttachment type - Use a.download_url (server-signed ?ft=) instead of bare URL * fix(security): tenant-scope team workspace paths + show user/tenant in topbar - WorkspaceDir callers now use config.TenantWorkspace() to resolve tenant-scoped base dir (non-master tenants get workspace/tenants/{slug}/) - Fixes: all tenants previously wrote to global /app/workspace/teams/ without filesystem-level isolation - Affected: loop.go (agent run), team_tasks_mutations.go (task creation) - teams_workspace.go already correct (uses TenantTeamDir) - UI topbar: show "userId (tenantName)" in user menu * feat(ui): redesign task detail dialog with improved UX - Split monolithic 343-line component into 5 focused files - New header: subject as title, identifier + status badges above - Metadata grid with soft bg-muted/30 background, priority icons - Attachments: card-style with mime-type icons + proper download Button - Description/Result: markdown rendering via MarkdownRenderer - Comments: avatar circles + markdown rendering for content - All sections collapsible with chevron + count badge - Timeline: vertical dot-line pattern, collapsed by default - Fix kanban card hover layout shift (opacity instead of display toggle) * fix(security): tenant-scoped workspace paths and tool cache isolation - Scope team workspace paths to tenant directory - Add tenant isolation to tool cache and task reads - Shell deny pattern improvements - Agent resolver and context file tenant scoping - Sidebar tenant/user display fix - Add tests for workspace, boundary, and context file interceptor * fix(ui): tenant visibility fallback, merge coming-soon, task detail tweaks - Tenants page: try tenants.list (owner), fall back to tenants.mine for regular users; hide create button for non-owners - Merge contacts dialog: add coming-soon banner (i18n en/vi/zh), disable form and submit button - Task detail: collapse attachments/comments by default, guard download_url before rendering	2026-03-23 08:08:23 +07:00
viettranx	9ad3043fe9	feat(ui): team leader processing indicator and announce run streaming - Add team.leader.processing backend event (emitted from onDrain before announce run starts) so all WS clients show processing status - Capture announce run.started events to stream leader's summarization in real-time (thinking, tool calls, text) - Add leader_processing phase to ActivityIndicator and ChatTopBar - isBusy stays true during leader processing → stop button visible	2026-03-20 22:26:22 +07:00
Viet TranandGitHub	84650c5c14	feat(teams): attachments refactor, semantic search, improved prompting (#310 ) * feat(teams): refactor attachments, remove team_message, add task comments UI Major team system refactoring: - Drop team_workspace_files, team_workspace_file_versions, team_workspace_comments, team_messages tables; replace team_task_attachments with path-based schema - Add denormalized comment_count/attachment_count on team_tasks for dashboard perf - Auto-track file writes as task attachments via WorkspaceInterceptor - Remove team_message tool entirely (tool, store, i18n, builtin_tools, MCP bridge) - Members communicate via task comments; approve/reject use comments for audit trail - Add commented/new_task notification types to TeamNotifyConfig - Enrich task completion announce with member comments - User-created tasks stay pending (backlog) — no auto-assign to leader - Configurable member request tasks (member_requests.enabled in team settings) - Structured task description template in TEAM.md for v2 leads - HTTP attachment download endpoint with IDOR + path traversal protection - Web UI: count badges on task list, comments section with input, download button - Team settings UI: completed/commented/new_task toggles, member requests section * feat(teams): priority dispatch, compact prompting, realtime comments - Priority dispatch: DispatchUnblockedTasks dispatches only 1 task per owner per round (highest priority first). Fixes cancel bug where CancelSession killed innocent queued tasks. - Prompt rework: Replace verbose Task Decomposition (25 lines) with compact Task Planning (8 lines). Add explicit UUID warning and sequencing guidance for weak models (Qwen, MiniMax). - Recent comments in dispatch: buildRecentCommentsSummary appends 3 most recent comments to re-dispatched tasks (reject, retry, stale). - Enrich comment event payload with TaskNumber, Subject, CommentText (truncated 500 runes, UTF-8 safe). - UI: Board subscribes to TEAM_TASK_COMMENTED for realtime comment_count badge updates. Task detail dialog auto-refreshes comments on event. - Tool description hint: guide models to write self-contained task descriptions with clear objectives and context. * perf(teams): add ListRecentTaskComments with SQL LIMIT Dispatch only needs 3 most recent comments — avoid fetching all. New ListRecentTaskComments(ctx, taskID, limit) uses ORDER BY DESC LIMIT N then reverses to chronological order. * feat(teams): add subject embedding for semantic task search + improve prompting - Add vector(1536) embedding column to team_tasks with HNSW index - Implement hybrid search: FTS (0.3) + cosine similarity (0.7) with graceful fallback - Auto-generate embeddings on task create/update, backfill existing tasks on startup - Wire embedding provider into PGTeamStore via gateway_setup - Change FTS from OR to AND with prefix matching for precise keyword search - Reduce search page size from 30 to 5 to save tokens - Rename migration 000023 → 000024, bump RequiredSchemaVersion to 24 - Update TEAM.md hints: prefer search over list, batch task creation with blocked_by - Add anti-pattern examples to prevent sequential task creation	2026-03-20 17:51:40 +07:00
viettranx	4da85913a2	feat(chat): auto-generate conversation titles via lightweight LLM call After the first agent run completes, generate a short title from the user's message using a 10s/50-token LLM call. Title is stored in the session label column and pushed to clients via session.updated event.	2026-03-19 23:34:45 +07:00
viettranx	68a4ee39b4	feat: config permissions RPC + UI tab + contact search improvements Backend: - Add config.permissions.list/grant/revoke RPC methods - Auto-fill granted_by from caller's context identity - Fix Telegram contact display_name: FirstName + LastName (was FirstName only) - Change contact search to prefix match (term%) for B-tree index usage Frontend: - Add Permissions tab to agent detail with inline add row layout - Contact search combobox shows name, @username, sender_id, [channel] - Reduce contact search debounce from 300ms to 150ms - i18n keys for en/vi/zh	2026-03-19 13:35:57 +07:00
viettranx	96cfd1bf08	feat(heartbeat): improve prompting, suppression, delivery targets and session cleanup - Rewrite heartbeat prompt to instruct agent to EXECUTE checklist tasks, not echo them - Simplify suppression: HEARTBEAT_OK present = always suppress, absent = always deliver - Add delivery targets RPC (heartbeat.targets) for channel/chatId picker - Sanitize backend errors — never expose raw SQL to client - Add session cleanup for isolated heartbeat sessions after run - Cap StaggerOffset at 10% of interval to avoid user-visible delay - Fix Upsert to persist next_run_at correctly	2026-03-18 16:37:36 +07:00
viettranx	08a2d95c0c	feat: agent heartbeat system — periodic proactive check-ins (#245 ) Phase 1 (Core): - Migration 000022: agent_heartbeats, heartbeat_run_logs, agent_config_permissions tables - HeartbeatStore + ConfigPermissionStore interfaces with PG implementations - HeartbeatTicker: background poll → active hours filter → queue-aware skip → run → smart suppression → deliver/log - Heartbeat tool: status/get/set/toggle/set_checklist/get_checklist/test/logs actions - Permission check with wildcard scope matching + TTL cache (60s) - RPC methods: heartbeat.get/set/toggle/test/logs/checklist.get/checklist.set - HEARTBEAT.md routed via context file interceptor (read/write for both open + predefined agents) - Session keys: agent:{id}:heartbeat or agent:{id}💓{ts} (isolated) - PromptMinimal for heartbeat sessions (like cron/subagent) - Event broadcasting + cache invalidation via bus (heartbeat + config_perms) - Gateway wiring: ticker init, event wiring, graceful shutdown Phase 2 (Integration): - wakeMode: CronPayload.WakeHeartbeat triggers heartbeat after cron job completes - Queue-aware: Scheduler.HasActiveSessionsForAgent() skips busy agents - Stagger: deterministic FNV offset spreads heartbeats across interval - lightContext: RunRequest.LightContext skips context files, only injects checklist - System prompt distinguishes cron (user-scheduled tasks) vs heartbeat (autonomous monitoring)	2026-03-18 13:11:44 +07:00
viettranx	b231878a85	feat(teams): add limit param to ListTasks + lightweight get-light endpoint - Add limit parameter to ListTasks interface (dashboard=200, agent=30) - Add teams.tasks.get-light WS method returning task only (no comments/events) - Truncate dashboard response to exact limit (fix off-by-one from limit+1) - Update all 7 ListTasks callers with explicit limit values	2026-03-17 18:03:10 +07:00
viettranx	4678065887	refactor: remove dead quality gates / hook engine code The delegation system this depended on was previously removed, leaving internal/hooks/ as dead code with zero imports. Remove the entire hook engine, UI config section, protocol types, i18n keys, and all documentation references.	2026-03-17 18:00:09 +07:00
viettranx	ae3e5cebcf	feat(teams): add multi-select checkboxes and bulk delete to task list Add checkbox column to task list view for selecting terminal-status tasks (completed/failed/cancelled). Header checkbox supports select-all with indeterminate state. Bulk action bar appears on selection with delete button that opens ConfirmDeleteDialog requiring user to type "delete" to confirm. Backend: new teams.tasks.delete-bulk RPC method with DeleteTasks batch SQL (DELETE ... WHERE id = ANY($1) RETURNING id). Broadcasts delete event per task for real-time UI sync. i18n: added bulk action keys for en/vi/zh.	2026-03-17 15:10:43 +07:00
viettranx	eee79d111e	feat(teams): granular progress notifications with direct/leader mode - Replace progress_notifications toggle with granular config: dispatched (on), progress (on), failed (on) + delivery mode - Direct mode: outbound to channel, no AI processing - Leader mode: inject into leader session with NO-ACTION instructions - Add consumer.team-notify subscriber for event forwarding - Enrich TeamTaskEventPayload with TaskNumber, ProgressPercent/Step - Add auto-status system prompt section - UI: card-select for delivery mode (Zap/Bot icons), 3 toggles	2026-03-16 22:46:51 +07:00
viettranx	3f2b6e258e	chore(teams): remove deprecated delegation tools Remove delegate_search, evaluate_loop, handoff from: - Seed data, system prompt, i18n keys/catalogs, channel events - Consumer handler (handleHandoffAnnounce), handoff route lookup - HandoffRouteData struct + PG implementation - Protocol events, MCP bridge comment - Web UI locale files (en/vi/zh)	2026-03-16 22:46:18 +07:00
viettranx	950d5d5c94	feat(teams): add task delete + agent emoji + priority labels in board views - Backend: DeleteTask RPC handler with terminal-status guard (completed/failed/cancelled) - Frontend: delete button in kanban cards (top-right, hover), task list, and detail dialog - Show agent emoji from members lookup next to owner name in all task views - Replace priority colored dot with P-0/P-1/P-2/P-3 labels with hover tooltips - Auto-reload board on TEAM_TASK_DELETED WebSocket event - i18n: add delete task strings for en/vi/zh	2026-03-15 23:47:53 +07:00
GoonandGitHub	75c570e951	feat(security): credentialed exec + HTTP RBAC + API key cache (#197 ) - Secure CLI credential injection via AES-256-GCM encrypted env vars - API key management with fine-grained RBAC scopes - resolveAuth/requireAuth middleware across all 25+ HTTP handlers - In-memory API key cache with TTL, negative caching, pubsub invalidation - Sandbox-first execution (fails if unavailable, no silent fallback) - Credential scrubbing, constant-time token comparison, Admin-only CLI creds - SQL migration 000020: secure_cli_binaries + api_keys tables - 14 unit tests for cache and RBAC with race detector Closes #197	2026-03-15 20:13:18 +07:00
Viet TranandGitHub	9a9744077e	refactor(teams): v2 system cleanup — remove legacy tools, fix followup, add events API (#210 ) Major refactoring of the team system with multiple improvements: ## Removed legacy delegation tools - Delete `delegate.go`, `delegate_async.go`, `delegate_sync.go`, `delegate_events.go`, `delegate_policy.go`, `delegate_prep.go`, `delegate_state.go`, `delegate_search_tool.go` - Delete `evaluate_loop_tool.go`, `handoff_tool.go` - Remove all references and registrations from tool manager and policy - Clean up TEAM_PLAYBOOK_IDEAS.md and TEAM_SYSTEM.md (moved to docs) ## Rename await_reply → ask_user - Rename action `await_reply` → `ask_user`, `clear_followup` → `clear_ask_user` - Rename functions `executeAwaitReply` → `executeAskUser`, `executeClearFollowup` → `executeClearAskUser` - Update system prompt with stronger wording to prevent model misuse - Model was confusing "await_reply" with general waiting; "ask_user" is unambiguous ## Fix auto-followup false positives - Add `HasActiveMemberTasks(ctx, teamID, excludeAgentID)` store method - Guard `autoSetFollowup()` in consumer: skip when lead has active member tasks - Prevents auto-followup when lead is orchestrating teammates (not waiting for user) ## Task identifier zero-padding - Change format from `T-1-xxxx` → `T-001-xxxx` (3-digit minimum) ## Refactor workspace WS handlers to filesystem-only - Rewrite `teams.workspace.list/read/delete` to use pure filesystem (os.ReadDir/ReadFile/Remove) - Remove DB dependency from workspace WS handlers - Consistent with storage handler and workspace tools - Simplify TeamWorkspaceFile type and frontend hook ## Add team events listing API - New WS method `teams.events.list` with team_id, limit, offset params - New HTTP endpoint `GET /v1/teams/{id}/events` with bearer auth - New `ListTeamEvents(ctx, teamID, limit, offset)` store method - JOIN with team_tasks for team-wide event filtering ## Extract team access policy - New `team_access_policy.go` — centralized team tool access control ## Migration 000019: team_id columns - Add team_id foreign key columns to relevant tables ## Other improvements - Add team_id propagation through agent loop, tracing, sessions - Update i18n locale files (en/vi/zh) for new tool labels - Update frontend builtin-tools page and require-setup component - Bump RequiredSchemaVersion for migration 000019	2026-03-15 14:53:19 +07:00
Viet Tran GitHub Nam Nguyen Ngoc	1a42dc93a6	feat(teams): team system v2 with bug fixes, workspace scope, versioning, and prompt optimization (#183 ) * feat(workspace): add team shared workspace for file collaboration - Add workspace_write and workspace_read tools for agents to share files across team members - Create team_workspaces DB table with migration 000017 (file metadata, pinning, tags) - Implement PostgreSQL store layer for workspace CRUD operations - Add RPC handlers for workspace list/read/delete from web UI - Build React workspace tab with file listing, content preview, and delete - Propagate workspace channel/chatID scope through delegation chain - Auto-allow workspace tools in agent tool policy when agent belongs to a team - Inject team workspace guidance into system prompt for team agents - Add /reset command handler for clearing session history - Harden MCP bridge context middleware to reject headers when no gateway token - Add i18n strings for workspace UI in en/vi/zh locales * feat(teams): add comprehensive task management with followup reminders and recovery - Add task followup/reminder system with auto-set on lead agent reply and auto-clear when user responds on channel - Add task recovery ticker to re-dispatch stale/pending tasks periodically - Add task scopes, filtering by status/channel/chatID, and task events - Add WS RPC handlers for task CRUD, assignments, comments, events, and bulk operations (teams_tasks.go) - Add task detail dialog, settings UI for followup config, and scope filtering in web dashboard - Add migrations 000018 (team_tasks_v2) and 000019 (task_followup) - Extend team_tasks_tool with await_reply, clear_followup actions - Auto-complete/fail team tasks when delegate agent finishes - Add workspace file listing and team tool manager enhancements * docs(teams): add team system architecture and playbook ideas documentation - Add TEAM_SYSTEM.md with full architecture design covering task management, shared workspace, and delegation engine subsystems - Add TEAM_PLAYBOOK_IDEAS.md outlining future team coordination layers (playbook, member capabilities, auto-learned patterns) - Document data models, status flows, tool actions, followup reminder system, task ticker, execution locking, and workspace scope model * fix(teams): resolve 6 critical bugs in team task system - Fix unblock SQL: check array_length after array_remove (not before) - Enforce single-team leadership in team creation - Add requireLead() for approve/reject tool actions - Validate cross-team dependency references in blocked_by - Add team_id to handoff route for multi-team isolation - Set blocked_by DEFAULT '{}' to prevent NULL array issues * refactor(workspace): use stable userID as scope key instead of connection UUID Workspace scope changed from (team_id, channel, chat_id) to (team_id, userID). Fixes workspace fragmentation across WS tab refreshes and reconnections. * feat(teams): add V1/V2 versioning with feature gating and optimized prompts - IsTeamV2() helper gates advanced features (locking, followup, review, audit) - V2 tool actions rejected for V1 teams with clear error message - Ticker, gateway consumer, delegation hooks respect version flag - TEAM.md renders v1/v2 sections conditionally - Tool descriptions and params optimized (~38% token reduction) - UI: version toggle in settings, V2 Beta badge, conditional rendering - i18n: version modal keys for en/vi/zh * fix(migration): use VARCHAR(255) for user ID columns and add metadata JSONB - assignee_user_id, user_id, actor_id: TEXT → VARCHAR(255) - Add metadata JSONB to team_task_comments and team_task_attachments --------- Co-authored-by: Nam Nguyen Ngoc <namnn.0911@gmail.com>	2026-03-13 22:41:32 +07:00
viettranx	1a4c211539	feat(teams): add task approval workflow with approve/reject actions Tasks can be created with require_approval flag, starting in pending_approval status. Users approve/reject via tool actions or WS methods. Approval respects blocked_by dependencies — tasks with unresolved blockers transition to blocked instead of pending. Delegate agents are restricted from approving/rejecting.	2026-03-12 21:45:28 +07:00
viettranx	9115169c03	feat: expand audit logging via pub/sub event pattern Replace direct ActivityStore injection with event-driven audit system. Handlers emit audit events via msgBus.Broadcast(), a single subscriber with buffered channel persists to activity_logs table. Coverage expanded from 3 agent CRUD actions to ~65 audit points across all HTTP handlers and WebSocket RPC methods including agents, providers, skills, MCP servers, cron, sessions, teams, pairing, and more.	2026-03-12 18:34:56 +07:00
Viet Tran GitHub Goon	ace07509b7	feat(skills): system skills integration — toggle, dep checking, per-item install (#161 ) * feat(infra): add runtime package support for skills Install nodejs, npm, pandoc, github-cli + pre-install Python packages (openpyxl, pandas, python-pptx, markitdown) and Node packages (docx, pptxgenjs). Configure runtime dirs for agent pip/npm installs with PIP_TARGET, NPM_CONFIG_PREFIX, NODE_PATH to enable dynamic package installation in read-only container environment. * feat(infra): add bundled skills with runtime package support - Add 5 bundled skills: docx, pdf, pptx, xlsx, skill-creator from container skills-store - Wire GOCLAW_BUILTIN_SKILLS_DIR env var in gateway and CLI - Support optional runtime packages alongside dynamic skill loading - Update Dockerfile to COPY bundled-skills at /app/bundled-skills/ - Add PIP_CACHE_DIR in docker-entrypoint.sh for clean pip installs - Document bundled skills in 14-skills-runtime.md section 6 * feat(infra): remove ai-multimodal skill directory from bundled skills Remove the ai-multimodal skill package as part of consolidating runtime package support for bundled skills. This directory is no longer needed in the bundled skills structure. * feat(ci): add semantic release and Docker Hub publishing Add go-semantic-release workflow to auto-create semver tags on merge to main. Extend docker-publish to push all variants to both GHCR and Docker Hub (digitop/goclaw). * feat(skills): add system skills infrastructure with is_system column, dep scanning, and seeder - Migration 000017: add is_system boolean column with partial index - Store layer: UpsertSystemSkill, delete protection, IsSystemSkill - ListAccessible auto-includes system skills (no grants needed) - ListWithGrantStatus returns is_system field - Dependency scanner: auto-detect deps from scripts/ or skill-manifest.json - Dependency checker: verify system binaries, Python/Node packages - Seeder: seed bundled skills into DB on startup (idempotent via hash) - Gateway wiring: GOCLAW_BUNDLED_SKILLS_DIR env for bundled skills - HTTP: delete guard (403), slug conflict check (409), rescan-deps endpoint - UI: System badge, hide delete for system skills, rescan deps button - Agent skills tab: "Always available" for system skills - i18n: en/vi/zh keys for system skills, deps scanning * feat(skills): conditional system prompt, skill manifests, and Zip Slip fix - System prompt: only show package list when python3/node are available - Add skill-manifest.json for pdf, docx, xlsx, pptx bundled skills - Fix Zip Slip vulnerability in office/unpack.py (all 3 copies) * refactor(skills): extract shared office code to _shared/ and deduplicate Move office scripts (pack, unpack, validate, schemas, validators) from duplicated copies in docx/xlsx/pptx to skills/_shared/office/ with symlinks. Remove soffice.py (non-functional in containers) and update SKILL.md references to use soffice binary directly. Update seeder copyDir to follow symlinks. Removes ~45K lines of duplicate code across 3 skills. * fix(skills): address code review findings for system skills integration - H1: Remove dead symlink branch in copyDir (filepath.Walk follows symlinks) - H3: Fix rescan-deps to query ALL skills (including archived) and re-activate when deps become available; add ListAllSkills() + Status field to SkillInfo - H4: Add Status field to SkillCreateParams, stop overloading Visibility - M1: Batch Python/Node dep checks into single subprocess per runtime - M4: Add rows.Err() check in ListSkills to prevent caching partial results * feat(skills): async dep checking with realtime WS events Split Seed() into sync DB upsert + async CheckDepsAsync() goroutine. Gateway startup no longer blocks on Python/Node subprocess dep checks. - Seed() returns seeded skills list, all initially status="active" - CheckDepsAsync() runs in background, emits skill.deps.checked per-skill - skill.deps.complete event emitted when all checks finish - Each failed dep check: archives skill + BumpVersion() for immediate cache invalidation so next agent turn picks up the change - UI: use-query-invalidation listens to skill.deps.* events → auto-refresh skills list in realtime * feat(skills): system skills integration with toggle, dep checking, and per-item install - Add is_system, deps, enabled columns to skills table (migration 017) - Seed bundled core skills (pdf, docx, pptx, xlsx, skill-creator) on startup - PYTHONPATH-based dep detection — eliminates false positives from local modules - Per-item dep install UI with individual status (installing/success/error) - Enable/disable toggle for core and custom skills (independent of dep status) - Re-run dep check when skill is toggled back on - Inline skill thresholds: 40 skills / 5000 tokens before switching to search mode - Fix UpsertSystemSkill: backfill null file_hash without bumping DB version - Remove redundant skill-manifest.json files (replaced by deps JSONB column) - Show author from frontmatter in custom skills tab - Runtime checker for python3/pip3/node/npm availability - WS events for dep checking/installing progress - docs: add 15-core-skills-system.md, 16-skill-publishing.md --------- Co-authored-by: Goon <duy@wearetopgroup.com>	2026-03-12 09:20:41 +07:00
viettranx	bdb60de7ae	chore: upgrade Go 1.25 → 1.26 and apply go fix modernizations - Update go.mod and Dockerfile to Go 1.26 - Apply `go fix ./...` stdlib modernizations across 170+ files - Add `go fix` to post-implementation checklist in CLAUDE.md - Fix go fix misapplied rewrite in loop_history.go	2026-03-10 00:09:15 +07:00
viettranx	e593b9cf22	feat(channels): real-time agent activity status & intent classification - Add tool status display on channels during tool execution (streaming preview + reactions) - Emit agent.activity events at phase transitions (thinking, tool_exec, compacting) - Enrich delegation progress with per-member activity and tool info - Add LLM-based intent classifier for DM status queries when agent is busy - Keyword fast-path for cancel/status patterns (no LLM cost) - Falls back to LLM classification with 5s timeout - Supports status_query (immediate reply) and cancel (abort run) intents - Register/unregister runs in makeSchedulerRunFunc for channel inbound tracking - Add sessionRuns secondary index in Router for O(1) IsSessionBusy lookups - Add intent_classify config toggle (global default + per-agent override) - Add tool_status config toggle for channel tool status display - Add i18n keys and translations (en/vi/zh) for status messages - Add web UI config toggles for intent_classify and tool_status	2026-03-09 23:58:56 +07:00
viettranx	0f2737ce53	feat(media): persistent media storage, read_document tool, and pipeline refactor - Add persistent media storage (internal/media/) replacing temp file deletion - Add MediaRef type for lightweight media references in session messages - Refactor media pipeline to use bus.MediaFile{Path, MimeType} across all channels - Add read_document builtin tool for PDF/DOCX/XLSX analysis via Gemini native API - Move image sanitization from Telegram to shared agent/media layer - Add media reload for multi-turn conversations (images from last 5 messages) - Add reply-to-message media resolution for Telegram (re-download on reply) - Add media inventory to compaction summary to preserve awareness after truncation - Fix coreToolSummaries for read_image, read_document, create_image tools - Add real-time trace update events via WebSocket broadcast - Improve trace detail UI with media refs and tool result display	2026-03-08 14:00:34 +07:00
viettranx	faa47abfb6	feat(block-reply): deliver intermediate text during tool iterations with 2-tier config (#55 ) Add block.reply event that delivers intermediate assistant text to non-streaming channels during multi-tool iterations. Includes 2-tier config toggle: gateway-level default (disabled) + per-channel override (inherit/on/off). Backend: - Emit block.reply events from agent loop between tool iterations - Add BlockReply *bool to GatewayConfig and all 6 channel config structs - Add BlockReplyChannel interface with ResolveBlockReply() resolution - Guard delivery in HandleAgentEvent by RunContext.BlockReplyEnabled - Resolve config at RegisterRun time, pass to consumer goroutine - Conditional dedup: skip final message if identical to last block reply UI: - Gateway settings: Switch toggle for global default - Per-channel: tri-state select (Inherit from gateway / Enabled / Disabled) - Protocol: BLOCK_REPLY constant in AgentEventTypes - Form: coerceBoolSelects for proper JSON boolean serialization	2026-03-07 01:06:10 +07:00
Viet TranandGitHub	6895e369f6	refactor: remove standalone mode, consolidate to managed-only (PostgreSQL) (#70 ) - Remove standalone mode code: file-based stores, standalone gateway, heartbeat service, SQLite memory, standalone docker-compose - Rename docker-compose.managed.yml → docker-compose.postgres.yml - Clean up ~130 Go comments referencing "managed mode" qualifier - Simplify docker-compose.yml env vars (providers/channels via web UI) - Update .env.example to essential vars only (token + encryption key) - Add setup wizard UI (provider → agent → channel bootstrap flow) - Add logs.tail WebSocket handler for live log streaming - Add cursor-pointer to interactive UI components - Clean up config page (remove standalone-only sections) - Update README and docs for managed-only architecture	2026-03-06 18:51:11 +07:00
viettranxandClaude Opus 4.6	61da687ad1	feat(ws): comprehensive team & delegation WS events with typed payloads Add 16 new event constants and typed payload structs for full WS visibility into team agent operations. Enrich AgentEvent with delegation and routing context (delegationId, teamId, parentAgentId, userId, channel, chatId). Emit thinking/chunk events for non-streaming runs so delegate member agents also produce WS events. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-05 12:56:15 +07:00
viettranxandClaude Opus 4.6	c7d7c80100	feat(dashboard): redesign overview page with quota usage, recent traces, system info Add quota.usage RPC method backed by QuotaChecker.Usage() which queries per-user request counts (hour/day/week) with resolved limits and today's aggregate stats. Dashboard now shows summary cards, quota progress bars, recent requests table, system health, and cron jobs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-04 11:41:46 +07:00
Duc NguyenandGitHub	0f5dd08f76	feat(channels): introduce Zalo Personal channel integration (#32 ) * feat(channels): implement Zalo Personal Chat (ZCA) protocol layer Implement complete Zalo Personal Chat integration including: - Message protocol layer (request/response/event types) - Connection management with auth flow - Message sending/receiving with text and media support - User/group management and sync - Telegram-style contact and conversation handling - Comprehensive unit tests with 85%+ coverage Architecture follows existing channel patterns (Telegram, Feishu) with raw API calls for session management and message delivery. Includes error handling, rate limiting awareness, and logging. * feat(channels): add Zalo Personal channel integration layer Wire protocol package to GoClaw's channel system: - channel.go: Channel struct, Start/Stop/Send, listenLoop, message handlers - auth.go: credential resolution (preloaded > file > QR), persistence - policy.go: DM/group policy, @mention gating, pairing with debounce - factory.go: managed mode factory (requires credentials, no QR) - cmd/gateway.go: register standalone + managed factory * feat(ui): add Zalo Personal channel type to web dashboard Add zalo_personal to channel type dropdown, credential fields (IMEI, cookie, userAgent), and config schema (DM/group policy, require_mention, allow_from). * feat(channels): add WebSocket QR login for Zalo Personal channel Add real-time QR code login flow for zalo_personal channel instances in managed mode. Users create an instance without credentials, then trigger QR login from the web dashboard. Backend: - New RPC method zalo.personal.qr.start with per-instance mutex - QR PNG pushed via client-scoped WS events (not broadcast) - Credentials encrypted and saved to DB on successful scan - Cache invalidation triggers automatic channel reload/start - Factory returns nil,nil for missing credentials (skip, not error) - Instance loader handles nil-channel gracefully Frontend: - ZaloPersonalQRDialog with auto-start, retry, and auto-close - QR button in channel instances table for zalo_personal type - Credential fields no longer required (auto-populated via QR) * fix(channels): skip redundant LoginWithCredentials after QR login QR flow already validates session via qrCheckSession + qrGetUserInfo. Calling LoginWithCredentials again conflicts with the active QR session state, causing "empty response" errors. Credentials are validated when the channel starts instead. Also rename log prefix from "zca" to "Zalo Personal". * fix(channels): fix Zalo Personal cookie domain for login API BuildCookieJar only set cookies for chat.zalo.me but the login API uses wpa.chat.zalo.me. Cookies weren't sent to the subdomain, causing "empty response" on channel startup. Now sets cookies for both hosts. * fix(channels): move UTF-8 check after gzip decompression in Zalo listener The UTF-8 validity check in decryptAESGCMPayload ran on raw decrypted bytes before gzip decompression, causing all encType=2 (AES-GCM+gzip) messages to fail with "decrypted payload is not valid UTF-8". Move the check to decryptEventData so it runs after all processing (decryption + decompression) is complete. * feat(channels): add QR-only onboarding and contacts picker for Zalo Personal - Remove credential text fields for zalo_personal, show QR auth info banner - Add has_credentials boolean to HTTP and WS mask functions - Implement FetchFriends/FetchGroups protocol (encrypted Zalo API) - Add zalo.personal.contacts WS RPC method with parallel fetch - Create ZaloContactsPicker component with search, selection, manual entry - Integrate picker in channel instance edit dialog for allow_from config * refactor(channels): rename zca error prefix to zalo_personal across protocol package * fix(channels): unwrap inner response envelope in Zalo contacts decryption The Zalo API returns double-wrapped responses: outer envelope contains encrypted base64 data, which when decrypted yields another Response envelope with error_code and data fields. The decryptDataField helper was returning the raw decrypted bytes without unwrapping the inner envelope, causing json unmarshal failures when parsing friends/groups. * fix(channels): pass version 0 for group details to get full data The Zalo group info endpoint uses a version-based caching mechanism. Passing the actual version from step 1 causes the server to return the group in "unchangedsGroup" with empty "gridInfoMap". By passing version 0 for all groups, we force the server to return full group info including name, avatar, and member count. * fix(ui): auto-load contacts on modal reopen to resolve display names When the edit modal is reopened with already-selected contact IDs, contacts are now auto-fetched so badges show display names instead of raw numeric IDs. * fix(channels): handle gzip-compressed response in Zalo SendMessage SendMessage used io.ReadAll + json.Unmarshal directly but the response is gzip-compressed (Accept-Encoding: gzip header). Use readJSON() which handles gzip decompression, fixing "invalid character '\x1f'" errors. * fix(channels): decrypt encrypted send response in Zalo SendMessage The Zalo send message API response is encrypted like all other endpoints. Parse outer envelope, decrypt the data field, then extract msgId from the decrypted inner response. * feat(channels): improve Zalo listener reliability and UI channel wizard - Migrate WebSocket client from gorilla to coder/websocket, eliminating unsafe/reflect hacks for RSV1 decompression and buffer inspection - Add channel-level restart with exponential backoff (2s→60s cap, max 10) so channels auto-recover instead of stopping permanently - Reset listener retry counters after 60s stable connection to prevent long-lived connections from exhausting retry budget - Add code 3000 (duplicate session) recovery with 60s initial delay - Detect silent disconnects via read deadline (2.5x ping interval) - Fix Stop() to always cancel context, preventing reconnect timer leaks - Refactor UI channel form into wizard-based flow with registry pattern - Auto-refresh channel status after create/update dialog closes * refactor(channels): move Zalo RPC methods to zalomethods package Move Zalo personal channel RPC handlers from internal/gateway/methods to internal/channels/zalo/personal/zalomethods, improving code organization and removing prefix redundancy. Rename types: ZaloPersonalQRMethods → QRMethods, ZaloPersonalContactsMethods → ContactsMethods. - Move zalo_personal_qr.go → zalomethods/qr.go - Move zalo_personal_contacts.go → zalomethods/contacts.go - Update imports in cmd/gateway.go (2 call sites) - Update internal/channels/zalo/personal imports * feat(channels): add typing indicator to Zalo Personal channel Show "typing..." in Zalo while the LLM processes messages, matching the Telegram/Discord pattern. Uses the shared typing.Controller with 4s keepalive (Zalo typing expires ~5s) and 60s TTL safety net. * feat(channels): handle image attachments in Zalo Personal channel - Add Raw field to Content struct to preserve non-string JSON payloads - Add Attachment struct with IsImage() detection (ext + Zalo CDN paths) - Add AttachmentText() for human-readable placeholders (image/file/other) - Download image attachments to temp files for agent vision pipeline - Non-image files get text placeholder only (no download) - Fix URL query param stripping in file extension detection * fix(channels): switch Zalo WS client to gorilla/websocket with cookie jar fix coder/websocket did not propagate session cookies for wss:// URLs, causing Zalo backend to reject connections with "zpw_sek not found". Switch to gorilla/websocket which handles wss→https scheme conversion natively. Add wsJar safety wrapper and fix Close() mutex consistency. Also update Makefile `up` target to use --no-cache builds. * fix(channels): inject cookies manually for Zalo WS connection Replace wsJar wrapper with direct cookie injection from chat.zalo.me base domain. Fixes host-only cookies (zpw_sek) not matching WS subdomains (ws-msg.chat.zalo.me) due to Go cookiejar limitations. fix(channels): harden Zalo Personal channel security and concurrency - Add SSRF protection to downloadFile using CheckSSRF (URL validation, private IP blocking, DNS pinning) with context and 30s timeout - Protect c.sess/c.listener with sync.RWMutex to eliminate data races during restart; add thread-safe session()/getListener() accessors - Add stopped flag + reconnTimer to Listener to prevent zombie reconnects after Stop(); timer cancelled on Stop(), checked before Start() - Fix QR flow using context.Background() detached from WS client; now derives from parent ctx so flow cancels on client disconnect - Set initial 30s read deadline for cipher key handshake to prevent indefinite blocking before ping loop starts - Use defer in WSClient.Close() to prevent connection leak on panic - Document ReadMessage ctx limitation and two-layer reconnect design * chore: remove unused gobwas/ws dependency from go.mod gobwas/ws was a leftover from the previous coder/websocket usage, no longer imported by any Go source files. * fix(channels): align Zalo Personal policy defaults across UI and backend Policy defaults were inconsistent across three layers causing group/DM allowlist enforcement to silently fail. New() applied "allowlist" default to local vars but never wrote back to config; checkGroupPolicy() then read empty string and defaulted to "open", bypassing the allowlist. UI Select components displayed schema defaults visually without persisting them to configValues, so DB config never stored the policy.	2026-03-03 14:21:07 +07:00
viettranxandClaude Opus 4.6	4387f6b1ba	fix: improve spawn tool team_task_id validation and orphan detection When LLMs call team_tasks create + spawn in parallel, the spawn tool receives a hallucinated task_id that fails uuid.Parse, causing a misleading error and bypassing orphan detection. - Include pending task IDs in spawn error message so LLM can retry with the correct UUID - Move spawn counting to post-execution so failed spawns don't increment teamTaskSpawns, allowing orphan detection to fire Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-03 11:36:46 +07:00
viettranxandClaude Opus 4.6	6ed62b8506	feat: channel-isolated workspace, resolvePath fix, create_image workspace, summoner Expertise section, bus Topic constants - Fix resolvePath for nested non-existent dirs (use resolveThroughExistingAncestors) - Channel-isolated workspace: user_agent_profiles.workspace stores channel prefix, used as source of truth with backward compat for existing users - Loop caches workspace per-user with CacheKindUserWorkspace invalidation via pubsub - ContractHome/ExpandHome for portable ~-based paths in DB - create_image saves to workspace/generated/YYYY-MM-DD/ instead of OS temp dir - SOUL.md template: add ## Expertise section for domain knowledge - Summoner buildEditPrompt: section guide, complete file output, frontmatter update - Bus: Topic* constants for Subscribe/Broadcast keys, CacheKind* for payload kinds - Teams, delegates, sessions, agent links: various enhancements Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-02 10:52:32 +07:00
viettranxandClaude Opus 4.6	0e75c21e55	feat: overhaul team delegation, Telegram resilience, and artifact forwarding Team delegation: - Unify spawn/subagent/delegate into single spawn tool - Sibling-aware announce suppression with artifact accumulation - Fix auto-complete race (isLastDelegation guard) - Add team tasks list limit (20) with search guidance - Multi-round orchestration patterns in TEAM.md - Communication guidance for initial vs follow-up delegations Telegram resilience: - Add retrySend wrapper (3 attempts, escalating delay) for network errors - Fix HTML fallback: strip tags + unescape entities instead of showing raw HTML - Pre-process HTML tags in LLM output to markdown before conversion pipeline - Skip caption truncation entirely when > 1024 bytes, send text separately - Auto-send large images (>5MB) as documents to avoid compression Artifact forwarding: - Fix missing ContentType on forwarded media (mimeFromExt for result.Media/ForwardMedia) - Add deliver parameter to write_file for file attachment delivery - Extend mimeFromExt with document MIME types UI: fix regenerate dialog overflow, improve task list layout, delegation detail view Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 14:34:36 +07:00

1 2