goclaw

mirror of https://github.com/tiennm99/goclaw.git synced 2026-06-10 08:11:23 +00:00

Author	SHA1	Message	Date
Duc Nguyen	23d0b5eb0b	fix(providers): auto-clamp max_tokens on model rejection (#267 ) * fix(providers): auto-clamp max_tokens on model rejection + fix verify for reasoning models When OpenAI-compat models reject max_tokens as too large (e.g. gpt-3.5-turbo supports 4096 but we send 8192), parse the model's stated limit from the 400 error, clamp the value, and retry once. This fixes agent creation for models with lower output token limits without hardcoding model names. Also increase the provider verify endpoint's max_tokens from 1 to 50 so reasoning models (gpt-5, o-series) have enough headroom for internal reasoning during the check call. Closes #248, closes #245 * refactor(providers): extract chat retry closure + fix clamp log key - Extract duplicate retry closure into chatRequestFn() to follow DRY - Fix slog logging wrong key: body["max_tokens"] was nil for reasoning models that use max_completion_tokens — now uses clampedLimit() helper - Remove unnecessary _ = resp in provider verify endpoint --------- Co-authored-by: viettranx <viettranx@gmail.com>	2026-03-19 08:41:20 +07:00
badgerbees	365f41f81c	fix: pass custom name to DashScopeProvider for correct registry lookup (#228 )	2026-03-16 22:54:10 +07:00
badgerbees	5c0612a07f	fix(providers): prevent Gemini thought_signature from leaking to other providers and breaking validation (#230 ) * fix: prevent gemini thought_signature from leaking to other providers * test: refine gemini model detection for robust provider multiplexing	2026-03-16 22:53:21 +07:00
viettranx	50a42ad110	feat(agent): team workspace resolution for lead/member agents - Lead agents: auto-resolve team workspace as default (relative paths) - Dispatched members: team workspace as default via req.TeamWorkspace - Direct-chat members: own workspace default, team workspace accessible - Add dataDir field to Loop/LoopConfig for global workspace root - System prompt shows team workspace absolute path for model guidance - Remove orphan task detector (superseded by post-turn dispatch) - Log warning on OpenAI tool call argument parse failures	2026-03-16 20:06:01 +07:00
viettranx	0857321a6b	fix(providers): correct Anthropic prompt caching + add datetime tool - Move cache_control from request root (ignored by API) to per-block placement on last system block and last tool definition - Change system prompt time format to date-only for better cache stability - Add builtin datetime tool for precise timestamps (cron, memory, etc.) - Add atMs past-time validation in cron handleUpdate (was only in handleAdd) - Update cron description to guide model to use datetime tool first	2026-03-16 08:14:03 +07:00
viettranx	9468aae422	refactor(providers): simplify DashScope per-model thinking guard Remove ModelThinkingCapable interface and ChatRequest.ModelSupportsThinking hint field — DashScope handles per-model checks internally via its own whitelist. Fix double applyThinkingGuard on ChatStream→Chat tool fallback by calling OpenAIProvider.Chat directly.	2026-03-16 07:55:25 +07:00
hoangvinh14	a44dbf2ba4	feat(providers/dashscope): add Qwen 3.5 series support with per-model thinking capability (#215 ) * feat(providers/dashscope): guard enable_thinking injection by per-model capability check Introduces ModelThinkingCapable interface and ModelSupportsThinking field on ChatRequest so DashScope can skip thinking-param injection for models that do not support it (e.g. qwen3-plus, qwen3-turbo), preventing \"model not supported\" API errors. - types.go: add ModelThinkingCapable interface + ModelSupportsThinking bool on ChatRequest - dashscope.go: add dashscopeThinkingModels whitelist + ModelSupportsThinking(); honour pre-computed hint - loop.go: detect ModelThinkingCapable and set hint on ChatRequest before LLM call - provider_models.go: add qwen3.5-plus / qwen3.5-turbo to DashScope model list - dashscope_test.go: full test suite for whitelist, injection, hint override, budget mapping Fix review code. --------- Co-authored-by: Nguyen Gia Hoang Vinh <vinhngh@runsystem.net>	2026-03-16 07:43:08 +07:00
viettranx	84b1b07634	refactor(config): centralize hardcoded ~/.goclaw paths via config resolution Replace all hardcoded ~/.goclaw path constructions with configurable sources (cfg.ResolvedDataDir() for service dirs, cfg.Agents.Defaults.Workspace for agent workspaces). This fixes data persistence issues in Docker deployments where paths differ from local dev. - Add DataDir field to Config with ResolvedDataDir() resolver - Add ResolvedDataDirFromEnv() package-level helper for packages without Config - Populate StoreConfig.SkillsStorageDir (was never set, caused hardcoded fallback) - Agent workspaces now use subdirectory format (workspace/{key}) for volume compatibility - Remove dead GOCLAW_SESSIONS_STORAGE env/config (sessions moved to PostgreSQL) - Fix deploy-stg.sh trailing space after backslash + remove deprecated GOCLAW_MODE - Add GOCLAW_SKILLS_DIR override in docker-compose for volume persistence	2026-03-15 21:20:46 +07:00
HXD.VN	23eb42accb	feat(anthropic): add model alias resolution and update Chat methods (#204 )	2026-03-15 19:42:18 +07:00
Marcelo Emmerich	e5f8a2a7da	fix(openai): use max_completion_tokens and skip temperature for GPT-5/o-series models (#213 ) GPT-5 and o-series models reject the legacy max_tokens parameter and require max_completion_tokens instead. Additionally, gpt-5-mini, gpt-5-nano, and o-series models only support the default temperature (1). This patch adds model-aware branching in buildRequestBody() to: - Send max_completion_tokens instead of max_tokens for gpt-5*/o1/o3/o4 - Skip temperature parameter for gpt-5-mini/gpt-5-nano/o-series Co-authored-by: Marcelo Emmerich <marcelo@agenticsystems.de>	2026-03-15 19:39:39 +07:00
Goon	5e2fa395c7	feat(providers): add ACP provider for external coding agents (#190 ) * feat(providers): add ACP provider for orchestrating external coding agents (#189) Implement native Go ACP (Agent Client Protocol) client as a new Provider. Enables GoClaw to orchestrate any ACP-compatible agent (Claude Code, Codex CLI, Gemini CLI) as a subprocess via JSON-RPC 2.0 over stdio. - Add bidirectional JSON-RPC 2.0 transport over stdio pipes - Add subprocess process pool with idle TTL reaping and crash recovery - Add ACP session lifecycle (initialize, session/new, session/prompt) - Add tool bridge for agent-initiated fs/terminal/permission requests - Add workspace sandboxing, shell deny patterns, and env var filtering - Wire config-based and DB-based provider registration paths - Export DefaultDenyPatterns from tools package for reuse * feat(providers): add changelog entry for ACP provider integration * fix(tools): prevent workspace traversal bypass via /tmp/ fallback in resolveMediaPath Reject paths containing ".." in the isInTempDir fallback to prevent workspace escape where traversal path still resolves inside /tmp/. * fix(tools): block workspace-sibling paths in resolveMediaPath /tmp/ fallback When workspace is inside /tmp/, traversal paths like workspace/../X resolve to /tmp/ siblings that pass isInTempDir. Reject paths inside the workspace parent directory to prevent this escape. * feat(providers): add ACP provider web UI and live reload via pubsub Web UI for creating/editing ACP providers with dedicated form fields (binary, args, idle TTL, permission mode, work directory). ACP providers now update immediately without gateway restart via cache invalidation pubsub pattern. Frontend: - New ACPSection form component with i18n (en/vi/zh) - Provider form dialog integration with ACP state management - ACP type badge on providers list page - Settings field added to provider TypeScript types Backend: - ACP models handler (claude/codex/gemini) without API key requirement - Binary path validation + LookPath verification in verify handler - Provider CRUD emits cache.invalidate events via msgBus - Subscriber in gateway_managed.go re-registers ACP providers from DB - ACP core improvements from code review (helpers, jsonrpc, process, terminal, tool_bridge) --------- Co-authored-by: viettranx <viettranx@gmail.com>	2026-03-14 16:16:08 +07:00
Viet Tran	1a42dc93a6	feat(teams): team system v2 with bug fixes, workspace scope, versioning, and prompt optimization (#183 ) * feat(workspace): add team shared workspace for file collaboration - Add workspace_write and workspace_read tools for agents to share files across team members - Create team_workspaces DB table with migration 000017 (file metadata, pinning, tags) - Implement PostgreSQL store layer for workspace CRUD operations - Add RPC handlers for workspace list/read/delete from web UI - Build React workspace tab with file listing, content preview, and delete - Propagate workspace channel/chatID scope through delegation chain - Auto-allow workspace tools in agent tool policy when agent belongs to a team - Inject team workspace guidance into system prompt for team agents - Add /reset command handler for clearing session history - Harden MCP bridge context middleware to reject headers when no gateway token - Add i18n strings for workspace UI in en/vi/zh locales * feat(teams): add comprehensive task management with followup reminders and recovery - Add task followup/reminder system with auto-set on lead agent reply and auto-clear when user responds on channel - Add task recovery ticker to re-dispatch stale/pending tasks periodically - Add task scopes, filtering by status/channel/chatID, and task events - Add WS RPC handlers for task CRUD, assignments, comments, events, and bulk operations (teams_tasks.go) - Add task detail dialog, settings UI for followup config, and scope filtering in web dashboard - Add migrations 000018 (team_tasks_v2) and 000019 (task_followup) - Extend team_tasks_tool with await_reply, clear_followup actions - Auto-complete/fail team tasks when delegate agent finishes - Add workspace file listing and team tool manager enhancements * docs(teams): add team system architecture and playbook ideas documentation - Add TEAM_SYSTEM.md with full architecture design covering task management, shared workspace, and delegation engine subsystems - Add TEAM_PLAYBOOK_IDEAS.md outlining future team coordination layers (playbook, member capabilities, auto-learned patterns) - Document data models, status flows, tool actions, followup reminder system, task ticker, execution locking, and workspace scope model * fix(teams): resolve 6 critical bugs in team task system - Fix unblock SQL: check array_length after array_remove (not before) - Enforce single-team leadership in team creation - Add requireLead() for approve/reject tool actions - Validate cross-team dependency references in blocked_by - Add team_id to handoff route for multi-team isolation - Set blocked_by DEFAULT '{}' to prevent NULL array issues * refactor(workspace): use stable userID as scope key instead of connection UUID Workspace scope changed from (team_id, channel, chat_id) to (team_id, userID). Fixes workspace fragmentation across WS tab refreshes and reconnections. * feat(teams): add V1/V2 versioning with feature gating and optimized prompts - IsTeamV2() helper gates advanced features (locking, followup, review, audit) - V2 tool actions rejected for V1 teams with clear error message - Ticker, gateway consumer, delegation hooks respect version flag - TEAM.md renders v1/v2 sections conditionally - Tool descriptions and params optimized (~38% token reduction) - UI: version toggle in settings, V2 Beta badge, conditional rendering - i18n: version modal keys for en/vi/zh * fix(migration): use VARCHAR(255) for user ID columns and add metadata JSONB - assignee_user_id, user_id, actor_id: TEXT → VARCHAR(255) - Add metadata JSONB to team_task_comments and team_task_attachments --------- Co-authored-by: Nam Nguyen Ngoc <namnn.0911@gmail.com>	2026-03-13 22:41:32 +07:00
Luan Vu	405a753239	fix: resolve media provider type from DB instead of guessing from name (#154 ) Media tools (create_image, create_video, create_audio, read_audio, read_video, read_document) routed API calls based on provider name pattern matching (e.g. strings.HasPrefix(name, "gemini")). This breaks when users give custom names to DB providers — a Gemini provider named "chatgpt-sap-het" would be misrouted to the OpenAI-compat endpoint, causing 404 errors. Fix: carry the DB provider_type through OpenAIProvider, resolve it via typedProvider interface in ExecuteWithChain, and inject as _provider_type param for callProvider routing. Name-based heuristic kept as fallback for config-file providers that don't have a DB type. Co-authored-by: Luvu182 <208665161+Luvu182@users.noreply.github.com>	2026-03-11 18:32:51 +07:00
viettranx	bdb60de7ae	chore: upgrade Go 1.25 → 1.26 and apply go fix modernizations - Update go.mod and Dockerfile to Go 1.26 - Apply `go fix ./...` stdlib modernizations across 170+ files - Add `go fix` to post-implementation checklist in CLAUDE.md - Fix go fix misapplied rewrite in loop_history.go	2026-03-10 00:09:15 +07:00
Luan Vu	9249d9e358	fix(claude-cli): check scanner.Err() and increase stream buffer to 10MB (#96 ) Fixes #94. The stream-json scanner never checked scanner.Err() after the scan loop, so if a line exceeded the 1MB buffer limit the scanner would stop with bufio.ErrTooLong and the response would be silently truncated. - Check scanner.Err() after the loop and return an explicit error - Increase max buffer from 1MB to 10MB to handle large tool outputs Co-authored-by: Luvu182 <208665161+Luvu182@users.noreply.github.com>	2026-03-09 15:49:25 +07:00
Nam Nguyen Ngoc	11bed0cc01	fix(mcp-bridge): per-session security context + media forwarding (#91 ) * fix(mcp-bridge): add per-session agent context and HMAC verification - Add per-session MCP config with X-Agent-ID/X-User-ID headers instead of shared global config file - Sign bridge context headers with HMAC-SHA256 to prevent forgery - Add bridgeContextMiddleware to verify signatures on MCP bridge requests - Store MCP configs in ~/.goclaw/mcp-configs/ outside agent workDir - Use atomic writes (tmp + rename) for MCP config files - Fix provider rename leaving ghost registry entries - Remove provider_type from mutable fields on update - Tighten temp dir permissions from 0755 to 0700 * feat(mcp-bridge): propagate channel routing context through MCP bridge - Pass channel, chat_id, and peer_kind from agent loop to CLI provider options - Inject X-Channel, X-Chat-ID, X-Peer-Kind headers in bridge context middleware - Add BridgeContext struct to bundle per-call context for MCP config generation - Include channel routing headers in per-session MCP config files - Expose "message" tool via MCP bridge for cross-channel messaging - Add extract helpers for new option keys in claude_cli_session.go * feat(mcp-bridge): forward media attachments to outbound message bus - Wire MessageBus into gateway server and MCP bridge handler - Publish tool result media files to outbound bus for channel delivery - Extract channel/chatID/peerKind from tool context for proper routing - Add mimeFromExt helper for content-type detection on attachments * feat(mcp-bridge): inject per-agent DB-backed MCP servers into Claude CLI config - Add MCPServerLookup type to resolve agent-specific MCP servers from DB - Wire MCPServerStore through provider registration and HTTP handler - Extract mcpServerEntryToConfig helper to deduplicate transport config logic - Add JSON-to-Go helpers (jsonToStringSlice, jsonToStringMap) for DB fields - Merge per-agent MCP servers at config write time without overriding static entries * fix(mcp-bridge): use Media struct fields and prefer explicit MimeType - Map Media.Path to attachment URL instead of treating Media as string - Use Media.MimeType when available, fall back to extension-based detection * refactor(providers): deduplicate option extractors and extract bridge media forwarding - Replace per-field extractors (extractSessionKey, extractAgentID, etc.) with generic extractStringOpt/extractBoolOpt - Add bridgeContextFromOpts helper to build BridgeContext in one call - Extract forwardMediaToOutbound from inline block in makeToolHandler - Change NewBridgeServer msgBus param from variadic to explicit pointer * fix(providers): validate provider_type on update instead of silently dropping it - Add explicit validation against ValidProviderTypes with 400 response - Remove silent delete(updates, "provider_type") that hid invalid values - Caller now receives clear error when submitting unsupported provider_type * fix(providers): add header injection validation to MCP bridge headers - Extend CRLF/null-byte checks to agentID, channel, chatID, and peerKind - Previously only userID had header injection prevention - Prevents HTTP header injection via crafted values in MCP config * fix(mcp-bridge): sign all context fields in HMAC and remove legacy code - Sign all 5 bridge context fields (agentID\|userID\|channel\|chatID\|peerKind) in HMAC instead of only agentID\|userID to prevent channel routing forgery - Propagate context.Context into MCPServerLookup to respect request cancellation instead of using context.Background() - Remove legacy BuildCLIMCPConfig, WithClaudeCLIMCPConfig, mcpConfigPath, and mcpCleanup (dead code since system is PG-only) - Use mime.TypeByExtension before custom fallback in mimeFromExt - Add debug log when media forwarding is skipped due to missing context - Add thread-safety comment to SetMCPServerLookup --------- Co-authored-by: Nam Nguyen Ngoc <namnn.0911@gmail.com> Co-authored-by: viettranx <viettranx@gmail.com>	2026-03-09 15:23:56 +07:00
viettranx	0f2737ce53	feat(media): persistent media storage, read_document tool, and pipeline refactor - Add persistent media storage (internal/media/) replacing temp file deletion - Add MediaRef type for lightweight media references in session messages - Refactor media pipeline to use bus.MediaFile{Path, MimeType} across all channels - Add read_document builtin tool for PDF/DOCX/XLSX analysis via Gemini native API - Move image sanitization from Telegram to shared agent/media layer - Add media reload for multi-turn conversations (images from last 5 messages) - Add reply-to-message media resolution for Telegram (re-download on reply) - Add media inventory to compaction summary to preserve awareness after truncation - Fix coreToolSummaries for read_image, read_document, create_image tools - Add real-time trace update events via WebSocket broadcast - Improve trace detail UI with media refs and tool result display	2026-03-08 14:00:34 +07:00
viettranx	b2c4d543aa	feat(providers): add Claude CLI provider with MCP bridge (#61 ) Add Claude CLI as an LLM provider (subscription-based, no API key needed). The CLI manages session history, tool execution, and context while GoClaw forwards messages and streams responses. Key features: - Claude CLI provider with session persistence (--resume) - MCP bridge server exposing GoClaw tools to CLI via streamable-http - Security hooks (shell deny patterns, workspace path restrictions) - Per-session mutex preventing concurrent CLI calls - Onboard wizard for Claude CLI setup and auth verification - Web UI for adding/managing Claude CLI provider with auth status - Provider registry Close() for proper shutdown cleanup Security: - CLI path validation (only "claude" or absolute paths from DB) - Token auth middleware for MCP bridge endpoint - Shell injection prevention in hook scripts (single-quoted paths) - Relative path resolution before workspace boundary checks - Resource leak prevention on provider replace/unregister Co-authored-by: nhokboo <nhokboo@users.noreply.github.com>	2026-03-07 02:06:39 +07:00
Luan Vu	7d744eb4f2	refactor(oauth): DB-backed token storage, split codex.go, remove file-based artifacts (#65 ) Replace file-based OAuth token storage with DB-backed storage using llm_providers (access token) + config_secrets (refresh token). - Store: Add Settings JSONB field, chatgpt_oauth provider type - OAuth: DBTokenSource backed by provider + secrets stores - HTTP: oauth.go uses DB stores + registers provider in-memory - Providers: chatgpt_oauth support in registerInMemory/registerProvidersFromDB - Config: Remove HasOAuthToken, revert envFallback→envStr - CLI: auth commands call HTTP API on running gateway - Split codex.go (478→189 LOC) into codex.go + codex_build.go + codex_types.go - Frontend: Remove fake OAUTH_PROVIDER_ID, use real DB-backed providers - Tests: Rewrite with mock stores, fix SSE mock servers	2026-03-07 00:15:30 +07:00
Viet Tran	6041c683cb	fix(summoner): sequential file generation with progress + HTTP timeout (#74 ) Split agent summoning into two sequential LLM calls: - Step 1: Generate SOUL.md (personality) - Step 2: Generate IDENTITY.md + USER_PREDEFINED.md using SOUL.md as context This enables real-time progress updates in the UI as each file completes. On retry, previously generated files are detected by comparing against the default template and skipped, so only missing files are regenerated. Also increase HTTP client timeout from 120s to 300s for both OpenAI and Anthropic providers. Some models (e.g. Kimi) with deep reasoning can take 160s+ for non-streaming requests, which exceeded the 120s limit and caused cascading retry failures until the context deadline expired.	2026-03-06 22:39:33 +07:00
Viet Tran	3dcbc72ddd	fix(openai): increase SSE scanner buffer to 1MB and remove duplicate error check (#72 ) Default bufio.Scanner limit (64KB) can truncate large SSE lines from tool call arguments or thinking content, causing silent parse failures. Set buffer to 1MB to match the scale of extended thinking responses. Also remove a duplicate scanner.Err() check that was accidentally left in.	2026-03-06 22:39:21 +07:00
Luvu182	ab9f582de6	fix: OpenAI-compat SSE streaming drops usage data (tokens always 0) (#52 ) When stream_options.include_usage is enabled, the OpenAI API sends the final usage chunk with an empty choices array. The streaming parser skipped chunks with no choices before checking for usage data, causing token counts to always be 0 in traces. Fix: extract usage data before the empty-choices skip. Also fixes two related SSE parsing issues: - SSE data prefix now handles both "data: " and "data:" (some providers like Kimi omit the space after the colon, per SSE spec) - Added scanner.Err() check to surface stream read errors (timeout, connection reset) instead of silently returning partial results Co-authored-by: Luvu182 <208665161+Luvu182@users.noreply.github.com>	2026-03-04 23:20:27 +07:00
Luvu182	1c9e8db8e4	fix: OpenAI-compat SSE parsing, scanner error check, reasoning passback, and env var parity - SSE: handle "data:" without space (Kimi and other providers) - Add scanner.Err() check to detect stream read failures - Echo reasoning_content for thinking models (Kimi, DeepSeek) - Add Thinking field to Message struct for reasoning passback - Add GOCLAW_OPENAI_BASE_URL env var override (parity with Anthropic)	2026-03-04 09:38:38 +07:00
viettranx	50a90aa8c6	refactor: split large Go files (>350 lines) into smaller same-package files Pure file reorganization — no logic changes, no renames, no refactoring. Functions moved to new files in the same package for maintainability. Split 13 files across 6 packages into 25 new files: - store/pg: teams.go → teams_tasks/delegation/messaging.go; mcp_servers.go → mcp_servers_access.go - tools: delegate.go → delegate_state/policy/events.go; subagent.go → subagent_exec/config.go; web_search.go → web_search_brave/ddg.go; web_fetch.go → web_fetch_convert.go; sessions.go → sessions_history/send.go - providers: anthropic.go → anthropic_stream/request.go - mcp: manager.go → manager_connect/tools/util.go - channels/feishu: bot.go → bot_parse/policy.go; larkclient.go → larkclient_messaging.go - cmd: gateway_consumer.go → gateway_cron.go; agent_chat.go → agent_chat_client/standalone.go Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 17:27:28 +07:00
viettranx	73b46c3634	feat(security): apply upstream TS OpenClaw security and core engine fixes Port 8 fixes from upstream TypeScript OpenClaw (4 CRITICAL + 4 HIGH): CRITICAL: 1. Tool call name trimming — add strings.TrimSpace() to all provider response parsers (Anthropic 2 locations, OpenAI 3 locations) to prevent registry lookup failures from LLM whitespace-padded names 2. Shell env injection deny patterns — block GIT_EXTERNAL_DIFF, GIT_DIFF_OPTS, BASH_ENV, and ENV=.*sh to prevent code execution via environment variable injection during git/shell operations 3. Broken symlink escape — recursive target resolution via resolveThroughExistingAncestors() to catch chained symlinks that escape workspace (e.g. link1→link2→/etc/passwd) 4. Mutable parent-symlink TOCTOU check — hasMutableSymlinkParent() detects symlinks in writable parent dirs that could be rebound between path validation and file operation HIGH: 5. Model fallback thinking preservation — add ThinkingCapable interface to providers/types.go, implement on Anthropic/OpenAI/DashScope, check before injecting thinking_level in agent loop, warn on fallback when thinking is configured 6. Cron session-key double-prefix guard — prevent agent:X:cron:agent:X duplication in BuildCronSessionKey() 7. Webhook rate limiter — bounded WebhookRateLimiter (4096 keys max, 60s window, 30 hits/window) to prevent memory exhaustion DoS 8. DM policy allowlist validation — warn at startup when dmPolicy=allowlist with empty allowFrom (silent message drop) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 16:52:37 +07:00
viettranx	552ddc059e	feat(thinking): add per-agent extended thinking with Anthropic, OpenAI, and DashScope support Implement full thinking mode system: per-agent thinking_level config (off/low/medium/high) stored in other_config JSONB, per-provider param injection (Anthropic budget_tokens, OpenAI reasoning_effort, DashScope enable_thinking+thinking_budget), Anthropic extended thinking with streaming parse and tool use block preservation via RawAssistantContent, thinking token tracking in trace spans, and Web UI with agent config selector, chat thinking block rendering, and trace thinking token display. No DB migration needed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 15:45:34 +07:00
viettranx	d913f29c69	feat(providers): add DashScope and Bailian Coding providers with reasoning_content support - Add DashScope (Qwen) native provider with tools+streaming fallback - Add Bailian Coding provider with hardcoded model list (no /v1/models API) - Parse reasoning_content in OpenAI-compat streaming/non-streaming responses - Emit ChatEventThinking events in agent loop for thinking models - Add vision support for DashScope (qwen3-vl) - Fix provider form dialog not updating API base URL when switching types - Update README provider count from 11+ to 13+ Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 14:56:14 +07:00
Duc Nguyen	4c67dff24d	feat(providers): support custom base URL for Anthropic provider (#16 ) Allow overriding the Anthropic API base URL via GOCLAW_ANTHROPIC_BASE_URL env var, config JSON, or DB provider record. Enables use of Anthropic- compatible proxies and custom endpoints. Also adds Makefile shortcuts for docker compose (up/down/logs).	2026-02-28 11:50:00 +07:00
viettranx	42263e5cc5	feat: Add tool loop detection, negative context injection, and fix Docker workspace permissions - Add tool loop detection (toolloop.go): tracks repeated no-progress tool calls using SHA256 hashing of args+results. Warning at 5 identical calls, force stop at 10. Prevents Gemini models from burning tokens in infinite loops. - Inject AVAILABILITY.md negative context when agent has no team/delegation targets, so models don't waste iterations probing unavailable capabilities. - Fix Dockerfile: create /app/.goclaw directory so Docker volume initializes with correct goclaw:goclaw ownership instead of root:root. - Update collapseToolCallsWithoutSig comments for clarity. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 22:00:50 +07:00
viettranx	40425d3ad2	fix: Fold tool results as user messages instead of imitable text pattern Gemini 2.5 flash doesn't return thought_signature, causing collapseToolCallsWithoutSig to convert tool_calls to "(Used tool ...)" text that Gemini imitates instead of making real tool calls. Now strips tool_calls and folds results into plain user messages so the model retains context without an imitable pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 21:03:44 +07:00
viettranx	ef3c35a8b5	fix: Drop tool_calls without thought_signature instead of converting to text, fix trace detail word-wrap Gemini 2.5 flash doesn't return thought_signature in OpenAI-compat responses, causing collapseToolCallsWithoutSig to convert ALL tool_calls to "(Used tool ...)" text. Gemini then imitates this pattern instead of making actual tool calls. Now drops tool_call cycles entirely, preserving only the assistant's text content. Also fixes word-wrap in trace detail dialog (break-words instead of break-all) and uses <pre> with whitespace-pre-wrap for trace-level input/output previews. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 20:53:53 +07:00
viettranx	2ee39c8483	feat: Add pairing enforcement, group history, and display name resolution for Discord & WhatsApp channels. - Implement DM pairing flow for Discord and WhatsApp (checkDMPolicy, sendPairingReply with debounce) - Add "pairing" case to BaseChannel.CheckPolicy() to reject instead of falling through to "open" - Add group pending history tracking to Discord (record when not mentioned, prepend context when mentioned) - Resolve Discord display names with priority: server nickname > global name > username - Fix Gemini collapse format to prevent model from imitating tool call patterns - Fix formatTokens crash on null/undefined input	2026-02-27 19:59:51 +07:00
viettranx	0844969819	feat: Add `thought_signature` support for Gemini tool calls, including a compatibility function for old sessions and refactoring OpenAI types.	2026-02-27 18:46:01 +07:00
viettranx	66808850d4	feat: Implement LLM retry notifications in channels via placeholder updates.	2026-02-27 09:53:45 +07:00
viettranx	d5cc5a745d	feat: Implement vision capabilities and image generation tools, adding media handling, dedicated configurations, and trace optimization for image data.	2026-02-26 22:28:27 +07:00
viettranx	6066adc15a	feat: Implement agent delegation, quality gates, and a new hooks evaluation system.	2026-02-26 10:15:07 +07:00
viettranx	9b0dce2aa5	feat: Dynamically register/unregister providers in memory and recompute stale cron job next_run_at on startup.	2026-02-24 21:59:38 +07:00
Viet Tran	f3f4c67b36	Initial commit: GoClaw AI agent gateway Multi-agent AI gateway with WebSocket RPC, HTTP API, and messaging channel integrations. Go port of OpenClaw with multi-tenant PostgreSQL, per-user isolation, security hardening, and production observability. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 14:58:07 +07:00

38 Commits