orcs-code

Author	SHA1	Message	Date
root	2f4a06dd40	fix(theme): remove stale React Compiler memo wrappers from theme hooks Rebase on current main (includes #589 reconciler fix). The React Compiler memo caches (_c) in useTheme() and usePreviewTheme() use referential equality checks on destructured context values. These caches can return stale references when the ThemeProvider's useMemo recreates the context value object but the individual property references (setThemeSetting, setPreviewTheme, etc.) compare equal — the memo short-circuits and returns a cached tuple/object that still holds the old closure captures. This is a distinct bug from #589 (which fixed the ink reconciler's commitUpdate path for host prop updates). #589 ensures that when React _does_ re-render a component with new props, those props actually reach the DOM node. But the memo wrappers here prevent React from _even seeing_ the new context value in the first place — the hook returns the stale cached result. Removing the memo wrappers ensures useTheme() and usePreviewTheme() always read the current context value, eliminating the stale-reference path entirely.	2026-04-12 07:39:39 +00:00
lunamonke	4c50977f3c	Decouple and fix mistral (#595) * decouple and fix mistral * fix wrong variable for currentBaseUrl and buildAPIProviderProperties	2026-04-12 15:26:14 +08:00
euxaristia	b126e38b1a	fix: display selected model in startup screen instead of hardcoded sonnet 4.6 (#587)	2026-04-11 21:20:00 +08:00
Alina Lisova	6e94dd9136	fix(ink): restore host prop updates in React 19 reconciler (#589 ) React 19's react-reconciler@0.33 mutation path calls commitUpdate with (instance, type, oldProps, newProps, fiber), but our Ink host config still expected an updatePayload from prepareUpdate. That left mounted ink-* nodes with stale onKeyDown, tabIndex, and textStyles, making menu navigation and highlights appear stuck until remount. Diff old/new props directly inside commitUpdate and add regression tests covering in-place updates for ink-box handlers/attributes and ink-text styles.	2026-04-11 21:19:39 +08:00
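The #589 message describes diffing old and new props directly inside commitUpdate. A minimal sketch of such a diff follows; the helper name diffProps and the Props type are hypothetical and not taken from the repository's Ink host config.

```typescript
// Hypothetical helper: computes which host props changed between two renders.
// Not the repository's implementation; a sketch of the idea only.
type Props = Record<string, unknown>;

function diffProps(oldProps: Props, newProps: Props): Props | null {
  const changes: Props = {};
  let changed = false;

  // Props that disappeared are recorded as undefined so the node can clear them.
  for (const key of Object.keys(oldProps)) {
    if (!(key in newProps)) {
      changes[key] = undefined;
      changed = true;
    }
  }

  // Props that are new or whose identity changed are recorded with the new value.
  for (const key of Object.keys(newProps)) {
    if (!Object.is(oldProps[key], newProps[key])) {
      changes[key] = newProps[key];
      changed = true;
    }
  }

  return changed ? changes : null;
}
```

A commitUpdate written this way could apply the returned object to the mounted node and skip the work when the result is null.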
FluxLuFFy	91e4cfb15b	fix: WebSearch providers + MCPTool bugs (#593 ) * fix: WebSearch providers + MCPTool bugs WebSearchTool: - custom.ts: fix buildAuthHeadersForPreset WEB_AUTH_HEADER opt-out - custom.ts: fix WEB_AUTH_SCHEME empty string handling - custom.ts: fix walkJsonPath null safety for jsonPath parsing - duckduckgo.ts: use SafeSearchType enum instead of raw 0 - mojeek.ts: always send Accept: application/json header - README: fix timeout documentation (15s -> 120s to match code) - custom.test.ts: add tests for auth header behavior MCPTool: - MCPTool.ts: fix outputSchema to accept ContentBlockParam[] (not just string) - MCPTool.ts: fix isResultTruncated for array output (iterates text blocks) * fix: address PR #593 review feedback 1. Export buildAuthHeadersForPreset and add direct tests for: - WEB_AUTH_HEADER="" explicit opt-out behavior - WEB_AUTH_SCHEME="" stripping scheme prefix - Preset defaults (authHeader + authScheme) - No WEB_KEY returns empty headers 2. Add duckduckgo.test.ts verifying SafeSearchType.STRICT === 0, confirming the enum change is semantically identical to the previous raw value. Addresses review by @Vasanthdev2004 at pullrequestreview-4093533095 --------- Co-authored-by: FluxLuFFy <flux@openclaude.dev> Co-authored-by: Fix Bot <fix@openclaude.local>	2026-04-11 21:07:20 +08:00
Zartris	f4ac709fa6	fix: report cache reads in streaming and correct cost calculation (#577 ) * fix: report cache reads in streaming and correct cost calculation Fix two bugs in how the OpenAI-to-Anthropic shim handles cached tokens: 1. codexShim: streaming message_delta missing cache_read_input_tokens The codexStreamToAnthropic() function builds the final message_delta usage object inline (not through makeUsage()), and only included input_tokens and output_tokens. cache_read_input_tokens was always 0, so /cost never showed cache reads for Responses API models (GPT-5+). Also fix makeUsage() to read input_tokens_details.cached_tokens and prompt_tokens_details.cached_tokens for the non-streaming path. 2. Both shims: cost double-counting from convention mismatch OpenAI includes cached tokens in input_tokens/prompt_tokens (i.e., input_tokens = uncached + cached). Anthropic treats input_tokens as uncached only. The cost formula was: cost = input_tokens * inputRate + cache_read * cacheRate This double-counts cached tokens. Fix by subtracting cached from input during the conversion: input_tokens = prompt_tokens - cached_tokens In practice this was inflating reported costs by ~2x for sessions with high cache hit rates (which is most sessions, since Copilot auto-caches server-side). Fixes #515 * fix: omit zero cache read/write fields from /cost output Only show "cache read" and "cache write" in /cost per-model usage when the value is > 0. Providers like GitHub Copilot never report cache_creation_input_tokens (the server manages its own cache), so showing "0 cache write" on every line is misleading — it implies caching is not working when it actually is. Before: claude-haiku: 2.6k input, 151 output, 39.8k cache read, 0 cache write ($0.04) After: claude-haiku: 2.6k input, 151 output, 39.8k cache read ($0.04) --------- Co-authored-by: Zartris <14197299+Zartris@users.noreply.github.com>	2026-04-10 23:40:42 +08:00
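The convention mismatch described in #577 can be sketched as below. The field names prompt_tokens, completion_tokens, and prompt_tokens_details.cached_tokens follow the OpenAI Chat Completions usage object; the function names and the rate parameters are hypothetical and are not the shim's actual code.

```typescript
// Sketch only: converts OpenAI-style usage (cached tokens included in
// prompt_tokens) to Anthropic-style usage (input_tokens is uncached only).
interface OpenAIUsage {
  prompt_tokens: number;
  completion_tokens: number;
  prompt_tokens_details?: { cached_tokens?: number };
}

interface AnthropicStyleUsage {
  input_tokens: number;
  output_tokens: number;
  cache_read_input_tokens: number;
}

function toAnthropicUsage(usage: OpenAIUsage): AnthropicStyleUsage {
  const cached = usage.prompt_tokens_details?.cached_tokens ?? 0;
  return {
    // Subtract cached tokens so they are not billed twice.
    input_tokens: Math.max(0, usage.prompt_tokens - cached),
    output_tokens: usage.completion_tokens,
    cache_read_input_tokens: cached,
  };
}

// Hypothetical per-token rates; real pricing varies by model and provider.
function estimateInputCost(
  usage: AnthropicStyleUsage,
  inputRate: number,
  cacheReadRate: number,
): number {
  return usage.input_tokens * inputRate + usage.cache_read_input_tokens * cacheReadRate;
}
```

With 1000 prompt tokens of which 800 were cached, the converted usage carries 200 uncached input tokens and 800 cache reads, so the cached portion is charged only at the cache-read rate.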
Zartris	8aaa4f22ac	fix: add store:false to Chat Completions and /responses fallback (#578 ) Set store: false in the request body for both the Chat Completions path and the /responses fallback path in openaiShim.ts. The codexShim (Responses API primary path) already sets store: false. The Chat Completions path and the /responses fallback in openaiShim were missing it. store: false tells the API provider not to persist conversation data for model training, logging, or other non-operational purposes. This is a privacy measure — it does not affect caching or functionality. Note: Whether third-party proxies (e.g. GitHub Copilot) honour this parameter is provider-dependent, but setting it is a reasonable default for user privacy. Co-authored-by: Zartris <14197299+Zartris@users.noreply.github.com>	2026-04-10 23:40:09 +08:00
Zartris	a7f5982f64	fix: add GitHub Copilot model context windows and output limits (#576 ) Add context_window and max_output_tokens entries for all models available through the GitHub Copilot proxy (Claude, GPT, Gemini, Grok), sourced from https://api.githubcopilot.com/models. Models are namespaced as "github:copilot:<model>" to avoid collisions with the same model names served by other providers (which may have different limits). A new lookupByKey() helper and qualified-key lookup in lookupByModel() ensures the correct limits are selected when OPENAI_MODEL=github:copilot. Without this, Claude models on Copilot would use default context/output limits that may not match the proxy's actual constraints, causing 400 errors like "max_tokens is too large". Related: #515 Co-authored-by: Zartris <14197299+Zartris@users.noreply.github.com>	2026-04-10 22:00:26 +08:00
Juan Camilo Auriti	cb8f8b7ac2	fix: let saved provider profiles win on restart (#513 ) Treat profile-managed env as restart state rather than explicit user intent so saved OpenAI-compatible profiles can replace stale Ollama values on startup and persist correctly across restarts. Co-authored-by: Claude Opus 4.6 <noreply@openclaude.dev>	2026-04-10 21:58:33 +08:00
ibaaaaal	07621a6f8d	fix: scrub canonical Anthropic headers from 3P shim requests (#499 ) * Stop canonical Anthropic headers from leaking into 3P shim requests The remaining blocker from PR #268 was that canonical Anthropic headers such as `anthropic-version` and `anthropic-beta` could still ride through supported 3P paths even after the earlier x-anthropic/x-claude scrubber work. This tightens header filtering inside the shim itself so direct defaultHeaders, env-driven client setup, providerOverride routing, and per-request header injection all share the same scrubber. Constraint: Preserve non-Anthropic custom headers and provider auth while stripping only Anthropic/OpenClaude-internal headers from 3P requests Rejected: Rely on client.ts filtering alone \| direct shim construction and per-request headers would still leave gaps Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep header scrubbing centralized in the shim so new call paths do not reopen 3P leakage bugs Tested: bun test src/services/api/openaiShim.test.ts src/services/api/client.test.ts src/utils/context.test.ts Tested: bun run test:provider Tested: bun run build && node dist/cli.mjs --version Not-tested: bun run typecheck (repository baseline currently fails in many unrelated files) * Keep OpenAI client tests from restoring undefined env as strings The new header-leak regression tests in client.test.ts restored environment variables via direct assignment, which can leave literal "undefined" strings in process.env when the original value was unset. This switches the teardown over to the same restore helper pattern already used in openaiShim.test.ts. 
Constraint: Keep the fix limited to test hygiene without altering runtime behavior Rejected: Restore only the two env vars Copilot called out \| using one helper for all test env restores is simpler and less error-prone Confidence: high Scope-risk: narrow Reversibility: clean Directive: Use restore helpers for env teardown in tests so unset values stay deleted instead of becoming the string "undefined" Tested: bun test src/services/api/client.test.ts src/services/api/openaiShim.test.ts src/utils/context.test.ts Not-tested: Full provider suite (unchanged runtime path) * Prevent GitHub Codex requests from forwarding unsanitized Anthropic headers A base-sync with upstream exposed a separate GitHub+Codex transport branch that still merged per-request headers raw before adding Copilot headers. This keeps the filter aligned across Codex-family paths and adds explicit regression tests for GitHub Codex routing, including providerOverride. Constraint: Must not push or modify GitHub state while validating the reviewer concern Rejected: Leave the GitHub Codex path unchanged \| runtime repro showed anthropic-* headers still leaked after the upstream sync Confidence: high Scope-risk: narrow Directive: Keep header scrubbing consistent across every Codex-family transport branch when provider routing changes Tested: bun test src/services/api/openaiShim.test.ts Tested: bun test src/services/api/client.test.ts src/services/api/codexShim.test.ts src/services/api/providerConfig.github.test.ts Tested: bun run build Not-tested: Full repository test suite	2026-04-10 21:56:40 +08:00
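The env teardown pattern mentioned in #499 (unset values stay deleted instead of becoming the string "undefined") might look like the sketch below. The helper name restoreEnv and its signature are hypothetical; the repository's own restore helper may differ. The env object is passed in explicitly so the sketch does not depend on a particular runtime.

```typescript
// Hypothetical test helper: restores one environment variable to its
// original state, deleting the key when it was originally unset.
type EnvLike = Record<string, string | undefined>;

function restoreEnv(env: EnvLike, key: string, original: string | undefined): void {
  if (original === undefined) {
    // Direct assignment of undefined to process.env would store the
    // literal string "undefined" in Node, so delete the key instead.
    delete env[key];
  } else {
    env[key] = original;
  }
}

// Typical use in a test teardown (assumes a Node-like process.env):
//   const saved = process.env.OPENAI_MODEL;
//   ... test mutates process.env.OPENAI_MODEL ...
//   restoreEnv(process.env, "OPENAI_MODEL", saved);
```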
Anandan	692471850f	fix: update theme preview on focus change (#562) Treat default select focus as initial state so /theme and first-run previews follow keyboard navigation again. Co-authored-by: anandh8x <test@example.com>	2026-04-10 21:55:15 +08:00
Anandan	68c296833d	fix: restore Ollama auto-detect in first-run setup (#561) Co-authored-by: anandh8x <test@example.com>	2026-04-10 21:53:30 +08:00
Zartris	9ccaa7a675	feat: add /cache-probe diagnostic command (#580 ) Add a /cache-probe slash command for debugging prompt caching behaviour on OpenAI-compatible providers (GitHub Copilot, OpenAI direct). The command sends two identical API requests in sequence and compares the raw server response usage stats, showing: - Input/output token counts - Cache read tokens (from prompt_tokens_details or input_tokens_details) - Latency for each request - Cache hit rate percentage Usage: /cache-probe # test default model /cache-probe claude-sonnet-4 # test specific model /cache-probe gpt-5.4 --no-key # test without prompt_cache_key The --no-key flag omits prompt_cache_key/prompt_cache_retention/store to test whether the server does content-based auto-caching (it does on GitHub Copilot). This is a debugging/diagnostic tool, not intended for regular use. It was instrumental in discovering that: 1. Copilot auto-caches server-side based on content hash 2. prompt_cache_key is ignored by the proxy 3. The streaming path was not reporting cached tokens Only enabled when the provider is OpenAI or GitHub (not for firstParty Anthropic which has different caching semantics). Related: #515 Co-authored-by: Zartris <14197299+Zartris@users.noreply.github.com>	2026-04-10 21:34:38 +08:00
Kevin Codex	598651f423	fix: rebrand prompt identity to openclaude (#496) * fix: rebrand prompt identity to openclaude * fix prompt branding * fix: align prompt branding with config compatibility	2026-04-10 01:20:05 +08:00
KRATOS	c385047abb	feat: add auto-fix service — auto-lint and test after AI file edits (#508 ) * feat: add AutoFix config schema and reader module Implements AutoFixConfigSchema (Zod v4) with validation for lint/test commands, maxRetries (0-10, default 3), and timeout (1000-300000ms, default 30000). Adds getAutoFixConfig helper that returns null for disabled or invalid configs. All 9 unit tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat: add autoFix runner with lint/test command execution Implements AutoFixRunner (Task 2) - executes lint and test shell commands sequentially, short-circuits on lint failure, handles timeouts, and produces structured AutoFixResult with AI-friendly error summaries. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat: add autoFix field to SettingsSchema with integration tests Integrates AutoFixConfigSchema into SettingsSchema so autoFix settings are validated at the settings layer. Adds two integration tests verifying that valid configs are accepted and invalid configs (enabled with no commands) are rejected. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat: add autoFix hook integration helpers (Task 4) Implements shouldRunAutoFix and buildAutoFixContext functions used by the PostToolUse hook to determine when to run auto-fix and format errors as AI-readable context for injection. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat: wire autoFix into PostToolUse hook flow (Task 5) Add auto-fix lint/test check after existing PostToolUse hooks in runPostToolUseHooks. When autoFix is configured in settings, runs lint/test commands after file_edit/file_write tools and yields errors as hook_additional_context for the model to act on. 
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add /auto-fix slash command Adds the /auto-fix prompt command that helps users configure autoFix settings (lint/test commands, maxRetries, timeout) in .claude/settings.json. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove unused imports in autoFixRunner test Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address review feedback — enforce maxRetries, wire abort signal, use cross-platform shell 1. Enforce maxRetries: track auto-fix attempts per query chain in toolHooks.ts and stop feeding errors back after the configured limit is reached. 2. Wire abort signal to subprocess: subscribe to AbortController signal in runCommand() and kill the process tree on abort. Uses detached process groups on Unix to ensure child processes are also terminated. 3. Replace hardcoded bash with shell:true: use Node's cross-platform shell resolution instead of spawn('bash', ['-c', ...]) so auto-fix commands work on Windows and non-bash environments. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-09 21:18:57 +08:00
Kevin Codex	42b121bd0d	Fix/openclaude diagnostics settings (#483) * fix: use openclaude paths in diagnostics and settings * fix: strip leaked reasoning from assistant output * fix: preserve legacy claude config compatibility * fix: tighten path and reasoning compatibility * fix: buffer streamed reasoning leak preambles * test: cover openclaude migration and reasoning fixes * test: isolate execFileNoThrow from cross-file mocks	2026-04-09 20:42:51 +08:00
FluxLuFFy	32fbd0c7b4	fix: custom web search — WEB_URL_TEMPLATE not recognized, timeout too short, silent native fallback (#537 ) * fix: custom web search — WEB_URL_TEMPLATE not recognized, timeout too short, silent native fallback 1. custom.ts: Add WEB_URL_TEMPLATE to isConfigured() so the custom provider is recognized when configured via URL template alone. 2. custom.ts: Bump DEFAULT_TIMEOUT_SECONDS from 15s to 120s. Self-hosted search APIs (SearXNG, internal) commonly need 30-90s. 3. WebSearchTool.ts: When an explicit adapter is selected via WEB_SEARCH_PROVIDER=custom, do not silently fall through to the native Anthropic path on adapter errors or 0-hit results. - 0 hits: return directly (no fallback) - Error: throw the real error (no fallback) - Auto mode: existing fallback behavior preserved * fix: tighten auto-mode adapter fallback — only swallow transient errors Address review feedback: in auto mode, only fall through to native on transient errors (network failure, timeout, HTTP 5xx). Config and guardrail errors (SSRF, HTTPS, bad URL, header allowlist, etc.) now surface properly instead of being silently swallowed. --------- Co-authored-by: FluxLuFFy <fluxluffy@users.noreply.github.com>	2026-04-09 20:41:58 +08:00
sooth	e30ad17ae0	fix(tui): restore prompt rendering on startup (#498) * fix(tui): restore prompt rendering on startup * test(tui): document render-time command split * fix(tui): reduce ghostty prompt repaint scope	2026-04-09 20:40:06 +08:00
Kevin Codex	c328fdf9e2	feat: add wiki mvp commands (#532)	2026-04-09 14:54:38 +08:00
FluxLuFFy	4ad6bc50c1	refactor: provider adapter system + 7 new search providers (bug-fixed) (#512 ) * refactor: provider adapter system + 7 new search providers Architecture: - Each search backend is a small adapter implementing SearchProvider - 12 providers: custom, tavily, exa, you, jina, bing, mojeek, linkup, firecrawl, duckduckgo + native - WEB_SEARCH_PROVIDER controls selection: auto (fallback chain) or specific provider - Auth always in headers, never in query strings Bug fixes from review feedback: - Fix applyDomainFilters catch block: keep hits with malformed URLs on blocked_domains (can't confirm blocked), drop on allowed_domains (can't confirm allowed) - Add safeHostname() helper: safely extract hostname from URLs without throwing - Replace unsafe new URL(r.url).hostname in 7 providers with safeHostname() - Remove dead code: buildAllHeaders, buildAuthHeaders, parseExtraHeaders from types.ts - Fix WEB_PARMS typo: consistently use WEB_QUERY_PARAM everywhere - AbortSignal forwarded to fetch() in all 12 providers - DuckDuckGo: wrap dynamic import in try/catch for graceful error - Exa: remove double domain filtering (server-side already) - runSearch(): aggregate all provider errors instead of throwing only the last one - Retry logic: check numeric status code directly, retry 5xx/network, skip 4xx Test coverage (44 tests, all passing): - types.test.ts: safeHostname, normalizeHit, applyDomainFilters (20 tests) - index.test.ts: getProviderMode, getProviderChain, getAvailableProviders (13 tests) - custom.test.ts: extractHits flexible response parsing (11 tests) Co-authored-by: FluxLuFFy <195792511+FluxLuFFy@users.noreply.github.com> * security: add guardrails to custom search provider (Option B) - HTTPS-only by default (opt-out: WEB_CUSTOM_ALLOW_HTTP=true) - Private/localhost IPs blocked by default (opt-out: WEB_CUSTOM_ALLOW_PRIVATE=true) - Header allowlist: only known-safe headers allowed unless WEB_CUSTOM_ALLOW_ARBITRARY_HEADERS=true - Configurable timeout in 
seconds (WEB_CUSTOM_TIMEOUT_SEC, default 15) - Configurable POST body limit (WEB_CUSTOM_MAX_BODY_KB, default 300) - Removed max URL size restriction - Audit log warning on first custom search call - Updated .env.example and README_SEARCH_PROVIDERS.md with all new options * fix: remove custom provider from auto chain (Option 1) Remove customProvider from the auto fallback chain so it is only available when WEB_SEARCH_PROVIDER=custom is explicitly selected. Changes: - Remove customProvider from ALL_PROVIDERS array in providers/index.ts - Add 3 new tests verifying custom is excluded from auto chain - Update README_SEARCH_PROVIDERS.md: auto priority, mode table, note - Update .env.example: auto priority comment, custom mode annotation All 47 tests pass (44 existing + 3 new). Co-Authored-By: @Vasanthdev2004 * fix: address review blockers (routing, abort, config check, domain matching) 1. Native/Codex routing precedence in auto mode shouldUseAdapterProvider() now checks if native/first-party/vertex/foundry or Codex paths are available before falling back to adapter providers. Auto mode: native paths take precedence; adapter is fallback only. 2. AbortError stops provider chain immediately runSearch() now checks for AbortError/aborted signal before continuing the fallback chain. Cancelled searches don't create extra outbound requests. 3. Explicit provider mode fails fast on missing credentials runSearch() validates isConfigured() for explicit modes before attempting requests. Throws clear error: 'Search provider "X" is not configured.' 4. Domain filter exact-or-subdomain matching (fixes suffix collision) New hostMatchesDomain() helper: exact match or .subdomain match. badexample.com no longer matches example.com. 5. 
Tests: 56 pass (9 new) covering all 4 fixes Co-Authored-By: @Vasanthdev2004 --------- Co-authored-by: Claude Fix <fix@openclaude.local> Co-authored-by: FluxLuFFy <195792511+FluxLuFFy@users.noreply.github.com> Co-authored-by: bot <bot@openclaw.ai>	2026-04-09 02:51:25 +08:00
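The exact-or-subdomain rule from #512 (badexample.com must not match example.com) can be sketched as follows. The name hostMatchesDomain comes from the commit message; the body is an assumption about how such a helper could be written, including the lowercase normalization.

```typescript
// Sketch of exact-or-subdomain matching. Lowercasing both sides is an
// assumption made here; the repository's helper may normalize differently.
function hostMatchesDomain(hostname: string, domain: string): boolean {
  const host = hostname.toLowerCase();
  const target = domain.toLowerCase();
  // Exact match, or a match on a label boundary (".example.com"),
  // which avoids the suffix collision of a plain endsWith(domain).
  return host === target || host.endsWith("." + target);
}
```

A plain suffix check would accept badexample.com for example.com; requiring the leading dot restricts matches to real subdomains.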
José Zechel	284d9bda36	Error: Fix of an image in the conversation exceeds the dimension limit for many-image requests (2000px) (#520) Root cause: IMAGE_MAX_WIDTH and IMAGE_MAX_HEIGHT were set to 2000, exactly the API's many-image dimension limit. Images resized to exactly 2000px would get rejected when the conversation accumulated enough images to trigger the API's many-image mode. Fix: Changed both constants from 2000 to 1568 in src/constants/apiLimits.ts:42-43. This is the resolution the API internally downscales to anyway (documented in the API's encoding/full_encoding.py), so there is zero effective quality loss. All images are now safely below the many-image threshold. export const IMAGE_MAX_WIDTH = 1568 export const IMAGE_MAX_HEIGHT = 1568 Impact: The single constant change propagates everywhere: imageResizer.ts uses IMAGE_MAX_WIDTH/IMAGE_MAX_HEIGHT for all resize decisions, and the error messages reference these constants dynamically. No other files need changes.	2026-04-08 22:12:57 +08:00
Vasanth T	537c469c3a	fix: replace isDeepStrictEqual with navigation-aware options comparison (#507 ) The select cursor highlight was broken because isDeepStrictEqual in use-select-navigation.ts and use-multi-select-state.ts would fail when options contained identity-unstable properties (JSX label elements, function onChange callbacks, computed disabled booleans). This caused the reset logic to fire on every re-render, resetting focusedValue back to the first option. Replace isDeepStrictEqual with optionsNavigateEqual which only compares properties that affect navigation behavior: value, disabled, and type. ReactNode labels and function callbacks are intentionally excluded as they are identity-unstable but don't change navigation semantics. Fixes #472 Co-authored-by: OpenClaude Worker 3 <worker-3@openclaude.local>	2026-04-08 16:44:42 +08:00
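The navigation-aware comparison from #507 compares only value, disabled, and type. A minimal sketch follows; optionsNavigateEqual is the name used in the commit message, while the NavOption shape and the treatment of a missing disabled flag as false are assumptions made for illustration.

```typescript
// Hypothetical option shape; the real select options carry more fields.
interface NavOption {
  value: string;
  disabled?: boolean;
  type?: string;
  label?: unknown; // identity-unstable (e.g. JSX), intentionally ignored
  onChange?: (value: string) => void; // identity-unstable, intentionally ignored
}

function optionsNavigateEqual(a: NavOption[], b: NavOption[]): boolean {
  if (a.length !== b.length) return false;
  for (let i = 0; i < a.length; i++) {
    const left = a[i];
    const right = b[i];
    if (!left || !right) return false;
    if (left.value !== right.value) return false;
    // Assumption: an absent disabled flag is equivalent to false.
    if ((left.disabled ?? false) !== (right.disabled ?? false)) return false;
    if (left.type !== right.type) return false;
  }
  return true;
}
```

Because labels and callbacks are skipped, two renders that recreate the same options with fresh closures compare equal, so the focus reset does not fire.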
Juan Camilo Auriti	ccaa193eec	fix: preserve only originally-required properties in strict tool schemas (#471 ) Fixes #430. In normalizeSchemaForOpenAI(), the strict branch was adding every property key to required[], including optional ones. This caused providers like Groq, Azure OpenAI, and others to reject valid tool calls with a 400 / tool_use_failed error because the model correctly omits optional arguments but the provider sees them as missing required fields. Root cause: the strict branch used `[...existingRequired, ...allKeys]` instead of `existingRequired.filter(k => k in normalizedProps)`. The Gemini branch already had the correct logic. Fix: align the strict branch with the Gemini branch — only keep properties that were already marked required in the original schema. The additionalProperties: false constraint is preserved as strict-mode providers still require it. Add regression test covering the Read tool schema (file_path required, offset/limit/pages optional).	2026-04-08 16:42:11 +08:00
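The fix in #471 keeps only originally-required keys in required[] while preserving additionalProperties: false. The sketch below applies the filter expression quoted in the commit message to a simplified schema type; the function name and the ObjectSchema type are hypothetical, and this is not the body of normalizeSchemaForOpenAI().

```typescript
// Simplified object schema for illustration only.
interface ObjectSchema {
  type: "object";
  properties: Record<string, unknown>;
  required?: string[];
  additionalProperties?: boolean;
}

// Hypothetical helper showing the corrected strict-mode behaviour.
function keepOriginalRequired(schema: ObjectSchema): ObjectSchema {
  const existingRequired = schema.required ?? [];
  return {
    ...schema,
    // Only keys that were already required and still exist as properties.
    required: existingRequired.filter((k) => k in schema.properties),
    // Strict-mode providers still require this constraint.
    additionalProperties: false,
  };
}
```

For a Read-style schema with file_path required and offset, limit, and pages optional, the result keeps required as ["file_path"], so a call that omits the optional arguments remains valid.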
Vasanth T	2caf2fd982	fix: defer startup checks and suppress recommendation dialogs during startup window (issue #363) (#504) * fix: defer startup plugin checks and suppress recommendation dialogs during startup window (issue #363) Root cause: performStartupChecks() fires immediately on REPL mount, triggering plugin loading which populates trackedFiles, which triggers useLspPluginRecommendation to surface an LSP recommendation dialog. Since promptTypingSuppressionActive is false before any user input, getFocusedInputDialog() returns the dialog, unmounting PromptInput entirely and making the CLI appear frozen. Fix: Two-pronged approach: 1. Defer performStartupChecks by 1500ms and gate on promptTypingSuppressionActive so startup checks don't run while the user is typing or has early input buffered 2. Suppress lower-priority startup dialogs (LSP recommendation, plugin hint, desktop upsell) until startupChecksStartedRef is true, preventing them from stealing focus during the vulnerable startup window This also explains why --bare mode and disabling plugins work: --bare mode skips plugin loading entirely, and disabling the autoresearch plugin eliminates the LSP match, so lspRecommendation stays null and PromptInput renders normally. * fix: move startup checks effect after promptTypingSuppressionActive declaration Fixes temporal dead zone warning flagged by code-quality bot. promptTypingSuppressionActive is declared on line ~1340 but the useEffect was on line ~800, causing a reference-before-declaration. Also adds missing semicolons for style consistency. * fix: gate startup checks on prompt readiness, not just a timeout (issue #363) The previous approach used a fixed 1500ms timeout, but as gnanam1990 pointed out, if a user pauses for >1.5s before typing the timer can still fire and recommendation dialogs can steal focus. This is a timing mitigation, not a reliable fix. New approach: gate startup checks on actual prompt readiness: 1. 
After first message submission (submitCount > 0): always safe 2. After grace period (3s) elapsed AND user is idle: safe because no dialog will interrupt an idle user who hasn't started typing 3. While user is actively typing: deferred until they stop This ensures startup checks never steal focus from a prompt the user is about to type into, regardless of how long they pause before typing. Also removes the old STARTUP_CHECK_DELAY_MS constant in favor of STARTUP_GRACE_PERIOD_MS with clearer semantics. * fix: move startup checks after submitCount declaration to avoid temporal dead zone Code quality bot flagged that submitCount was used before its declaration. Moved the entire startup checks block to after the submitCount useState declaration. Also added nullish coalescing (submitCount ?? 0) per bot suggestion. * fix: gate startup checks strictly on first submission, remove grace period (issue #363) As gnanam1990 pointed out, the 3s grace period still allows the failure mode: if a user pauses for a few seconds before typing, startup checks fire and recommendation dialogs steal focus. A grace period is still a timing mitigation, not a reliable fix. New approach: startup checks only run after the user has submitted their first message (submitCount > 0). No grace period, no timeout. This guarantees the prompt gets first interaction: no dialog can steal focus before the user has actually used the CLI. If the user never submits a message, startup checks never run. That's acceptable because with no user interaction there's no need for plugin installations or marketplace seeding. --------- Co-authored-by: OpenClaude Worker 3 <worker-3@openclaude.local>	2026-04-08 16:08:36 +08:00
Meetpatel006	ad724dc3a4	Improve GitHub Copilot provider: official OAuth onboarding, Copilot API routing, and test hardening and auto refresh token logic (#288) * update GitHub Copilot API with official client id and update model configurations * test: add unit tests for exchangeForCopilotToken and enhance GitHub model normalization * remove PAT token feature * test(api): harden provider tests against env leakage * Added back trimmed github auth token * added auto refresh logic for auto token along with test * fix: remove forked provider validation in cli.tsx and clear stale provider env vars in /onboard-github * refactor: streamline environment variable handling in mergeUserSettingsEnv * fix: clear stale provider env vars to ensure correct GH routing * Remove internal-only tooling from the external build (#352) * Remove internal-only tooling without changing external runtime contracts This trims the lowest-risk internal-only surfaces first: deleted internal modules are replaced by build-time no-op stubs, the bundled stuck skill is removed, and the insights S3 upload path now stays local-only. The privacy verifier is expanded and the remaining bundled internal Slack/Artifactory strings are neutralized without broad repo-wide renames. 
Constraint: Keep the first PR deletion-heavy and avoid mass rewrites of USER_TYPE, tengu, or claude_code identifiers Rejected: One-shot DMCA cleanup branch \| too much semantic risk for a first PR Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Treat full-repo typecheck as a baseline issue on this upstream snapshot; do not claim this commit introduced the existing non-Phase-A errors without isolating them first Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Not-tested: Full repo typecheck (currently fails on widespread pre-existing upstream errors outside this change set) * Keep minimal source shims so CI can import Phase A cleanup paths The first PR removed internal-only source files entirely, but CI provider and context tests import those modules directly from source rather than through the build-time no-telemetry stubs. This restores tiny no-op source shims so tests and local source imports resolve while preserving the same external runtime behavior. Constraint: GitHub Actions runs source-level tests in addition to bundled build/privacy checks Rejected: Revert the entire deletion pass \| unnecessary once the import contract is satisfied by small shims Confidence: high Scope-risk: narrow Reversibility: clean Directive: For later cleanup phases, treat build-time stubs and source-test imports as separate compatibility surfaces Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (still noisy on this upstream snapshot) --------- Co-authored-by: anandh8x <test@example.com> * Reduce internal-only labeling noise in source comments (#355) This pass rewrites comment-only ANT-ONLY markers to neutral internal-only language across the source tree without changing runtime strings, flags, commands, or protocol identifiers. 
The goal is to lower obvious internal prose leakage while keeping the diff mechanically safe and easy to review. Constraint: Phase B is limited to comments/prose only; runtime strings and user-facing labels remain deferred Rejected: Broad search-and-replace across strings and command descriptions \| too risky for a prose-only pass Confidence: high Scope-risk: narrow Reversibility: clean Directive: Remaining ANT-ONLY hits are mostly runtime/user-facing strings and should be handled separately from comment cleanup Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy) Co-authored-by: anandh8x <test@example.com> * Neutralize internal Anthropic prose in explanatory comments (#357) This is a small prose-only follow-up that rewrites clearly internal or explanatory Anthropic comment language to neutral wording in a handful of high-confidence files. It avoids runtime strings, flags, command labels, protocol identifiers, and provider-facing references. 
Constraint: Keep this pass narrowly scoped to comments/documentation only Rejected: Broader Anthropic comment sweep across functional API/protocol references \| too ambiguous for a safe prose-only PR Confidence: high Scope-risk: narrow Reversibility: clean Directive: Leave functional Anthropic references (API behavior, SDKs, URLs, provider labels, protocol docs) for separate reviewed passes Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy) Co-authored-by: anandh8x <test@example.com> * Neutralize remaining internal-only diagnostic labels (#359) This pass rewrites a small set of ant-only diagnostic and UI labels to neutral internal wording while leaving command definitions, flags, and runtime logic untouched. It focuses on internal debug output, dead UI branches, and noninteractive headings rather than broader product text. Constraint: Label cleanup only; do not change command semantics or ant-only logic gates Rejected: Renaming ant-only command descriptions in main.tsx \| broader UX surface better handled in a separate reviewed pass Confidence: high Scope-risk: narrow Reversibility: clean Directive: Remaining ANT-ONLY hits are mostly command descriptions and intentionally deferred user-facing strings Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy) Co-authored-by: anandh8x <test@example.com> * Finish eliminating remaining ANT-ONLY source labels (#360) This extends the label-only cleanup to the remaining internal-only command, debug, and heading strings so the source tree no longer contains ANT-ONLY markers. The pass still avoids logic changes and only renames labels shown in internal or gated surfaces. 
Constraint: Update the existing label-cleanup PR without widening scope into behavior changes Rejected: Leave the last ANT-ONLY strings for a later pass \| low-cost cleanup while the branch is already focused on labels Confidence: high Scope-risk: narrow Reversibility: clean Directive: The next phase should move off label cleanup and onto a separately scoped logic or rebrand slice Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy) Co-authored-by: anandh8x <test@example.com> * Stub internal-only recording and model capability helpers (#377) This follow-up Phase C-lite slice replaces purely internal helper modules with stable external no-op surfaces and collapses internal elevated error logging to a no-op. The change removes additional USER_TYPE-gated helper behavior without touching product-facing runtime flows. Constraint: Keep this PR limited to isolated helper modules that are already external no-ops in practice Rejected: Pulling in broader speculation or logging sink changes \| less isolated and easier to debate during review Confidence: high Scope-risk: narrow Reversibility: clean Directive: Continue Phase C with similarly isolated helpers before moving into mixed behavior files Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy) Co-authored-by: anandh8x <test@example.com> * Remove internal-only bundled skills and mock helpers (#376) * Remove internal-only bundled skills and mock rate-limit behavior This takes the next planned Phase C-lite slice by deleting bundled skills that only ever registered for internal users and replacing the internal mock rate-limit helper with a stable no-op external stub. 
The external build keeps the same behavior while removing a concentrated block of USER_TYPE-gated dead code. Constraint: Limit this PR to isolated internal-only helpers and avoid bridge, oauth, or rebrand behavior Rejected: Broad USER_TYPE cleanup across mixed runtime surfaces \| too risky for the next medium-sized PR Confidence: high Scope-risk: moderate Reversibility: clean Directive: The next cleanup pass should continue with similarly isolated USER_TYPE helpers before touching main.tsx or protocol-heavy code Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy) * Align internal-only helper removal with remaining user guidance This follow-up fixes the mock billing stub to be a true no-op and removes stale user-facing references to /verify and /skillify from the same PR. It also leaves a clearer paper trail for review: the deleted verify skill was explicitly ant-gated before removal, and the remaining mock helper callers still resolve to safe no-op returns in the external build. 
Constraint: Keep the PR focused on consistency fixes and reviewer-requested evidence, not new cleanup scope Rejected: Leave stale guidance for a later PR \| would make this branch internally inconsistent after skill removal Confidence: high Scope-risk: narrow Reversibility: clean Directive: When deleting gated features, always sweep user guidance and coordinator prompts in the same pass Tested: bun run build Tested: bun run smoke Tested: bun run verify:privacy Tested: bun run test:provider Tested: bun run test:provider-recommendation Not-tested: Full repo typecheck (upstream baseline remains noisy; changed-file scan still shows only pre-existing tipRegistry errors outside edited lines) * Clarify generic workflow wording after skill removal This removes the last generic verification-skill wording that could still be read as pointing at a deleted bundled command. The guidance now talks about project workflows rather than a specific bundled verify skill. Constraint: Keep the follow-up limited to reviewer-facing wording cleanup on the same PR Rejected: Leave generic wording as-is \| still too easy to misread after the explicit /verify references were removed Confidence: high Scope-risk: narrow Reversibility: clean Directive: When removing bundled commands, scrub both explicit and generic references in the same branch Tested: bun run build Tested: bun run smoke Not-tested: Additional checks unchanged by wording-only follow-up --------- Co-authored-by: anandh8x <test@example.com> * test(api): add GEMINI_AUTH_MODE to environment setup in tests * test: isolate GitHub/Gemini credential tests with fresh module imports and explicit non-bare env setup to prevent cross-test mock/cache leaks * fix: update GitHub Copilot base URL and model defaults for improved compatibility * fix: enhance error handling in OpenAI API response processing * fix: improve error handling for GitHub Copilot API responses and streamline error body consumption * fix: enhance response handling in OpenAI 
API shim for better error reporting and support for streaming responses * feat: enhance GitHub device flow with fresh module import and token validation improvements * fix: separate Copilot API routing from GitHub Models, clear stale env vars, honor providerOverride.apiKey * fix: route GitHub GPT-5/Codex to Copilot API, show all Copilot models in picker, clear stale env vars * fix GitHub Models API regression * feat: update GitHub authentication to require OAuth tokens, normalize model handling for Copilot and GitHub Models * fix: update GitHub token validation to support OAuth tokens and improve endpoint type handling --------- Co-authored-by: Anandan <anandan.8x@gmail.com> Co-authored-by: anandh8x <test@example.com>	2026-04-08 16:03:31 +08:00
Urvish Lanje	648ae8053b	ci: run python provider tests in pr-checks (#477 ) * Add WakaTime extension to devcontainer configuration * ci: run python provider tests in pr-checks * Delete .devcontainer directory * ci: added requirements.txt for pip caching * ci: addressed security and maintenance issues * ci: updated release tag * Update .github/workflows/pr-checks.yml Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * ci: added full commit SHA for python setup --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-08 15:18:04 +08:00
lunamonke	3188f6ac66	fix example agents (#438 )	2026-04-08 02:55:27 +08:00
Kevin Codex	69ea1f1e4a	fix: restore default context window for unknown 3p models (#494 ) * fix: restore default context window for unknown 3p models * fix: add MiniMax context metadata	2026-04-08 02:45:49 +08:00
KRATOS	f9ce81bfb3	fix: handle missing skill parameter in SkillTool (#485 ) * fix: handle missing skill parameter in SkillTool * fix: preserve SkillTool schema contract * fix: align SkillTool schema error output	2026-04-08 00:33:52 +08:00
Juan Camilo Auriti	4975cfc2e0	fix: strip Anthropic params from 3P resume paths (#479 ) * fix: strip Anthropic-specific params from 3P provider paths Three silent failure modes affecting all third-party provider users: 1. Thinking blocks serialized as <thinking> text corrupt multi-turn context — strip them instead of converting to raw text tags. 2. Unknown models fall through to 200k context window default, so auto-compact never triggers — use conservative 8k for unknown 3P models with a warning log. 3. Session resume with thinking blocks causes 400 or context corruption on 3P providers — strip thinking/redacted_thinking content blocks from deserialized messages when resuming against a non-Anthropic provider. Addresses findings 2, 3, and 5 from #248. * test: align resume stripping expectation with orphan-thinking filter * test: isolate provider env in conversation recovery tests * test: move provider-sensitive resume coverage behind module mocks * test: trim extra blank lines in conversation recovery test Keep the focused provider-resume test diff clean so the regression branch stays easy to review. Co-Authored-By: Claude Opus 4.6 <noreply@openclaude.dev> --------- Co-authored-by: Claude Opus 4.6 <noreply@openclaude.dev>	2026-04-07 23:24:10 +08:00
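The resume-path fix in #479 (item 3) can be illustrated with a minimal sketch. The `Message` and `ContentBlock` shapes, the function name `prepareResumedMessages`, and the choice to drop messages left with no content are all assumptions made for illustration; the commit message only states that `thinking` and `redacted_thinking` blocks are stripped when resuming against a non-Anthropic provider.

```typescript
// Hypothetical message shapes; the repository's real types are not shown in the log.
type ContentBlock = { type: string; [key: string]: unknown };
interface Message {
  role: "user" | "assistant";
  content: ContentBlock[];
}

const THINKING_BLOCK_TYPES: ReadonlySet<string> = new Set(["thinking", "redacted_thinking"]);

// Strip thinking blocks from deserialized messages, but only for third-party providers.
function prepareResumedMessages(messages: Message[], isThirdPartyProvider: boolean): Message[] {
  if (!isThirdPartyProvider) {
    return messages; // Anthropic paths keep their thinking blocks untouched.
  }
  return messages
    .map((message) => ({
      ...message,
      content: message.content.filter((block) => !THINKING_BLOCK_TYPES.has(block.type)),
    }))
    // Assumption: a message that contained only thinking blocks is dropped entirely.
    .filter((message) => message.content.length > 0);
}
```

Stripping rather than converting to `<thinking>` text tags matches item 1 of the same commit: the raw tags would otherwise be replayed to the provider as ordinary assistant text.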
ibaaaaal	600c01faf7	fix: restore Grep and Glob reliability on OpenAI paths (#461 ) * fix: restore Grep and Glob reliability on OpenAI paths Preserve Grep and Glob pattern fields during OpenAI/Codex schema sanitization, and fall back to system ripgrep when the packaged binary is missing. This keeps search tool schemas intact and improves Linux usability for npm/source installs. Co-Authored-By: Claude Opus 4.6 <noreply@openclaude.dev> * test: clean up ripgrep fallback test helpers Remove the unused ripgrepCommand import and normalize mocked builtin ripgrep paths so the test behaves consistently across platforms. Co-Authored-By: Claude Opus 4.6 <noreply@openclaude.dev> * test: remove duplicate Codex URI schema case Drop the duplicated WebFetch URI-format test in codexShim.test.ts so test names stay unique and failures remain easier to read. Co-Authored-By: Claude Opus 4.6 <noreply@openclaude.dev> * test: stabilize ripgrep fallback coverage Avoid fs/module mocking in ripgrep fallback tests by extracting the config selection logic into a pure helper. This preserves the fallback coverage while removing the test interaction that caused the narrowed Bun hang repro. Co-Authored-By: Claude Opus 4.6 <noreply@openclaude.dev> * test: tighten ripgrep and schema coverage Align the ripgrep fallback test with the actual auto-fallback branch, clean up strict typing in schema sanitizer tests, and tighten ripgrep error narrowing for type safety. Co-Authored-By: Claude Opus 4.6 <noreply@openclaude.dev> --------- Co-authored-by: Claude Opus 4.6 <noreply@openclaude.dev>	2026-04-07 17:26:00 +08:00
makskopchan-tech	b07bafa5bd	Security code scanning (#459 ) * fix: address code scanning alerts Parse Gemini hostnames instead of matching raw URL substrings, redact gRPC error logs, and harden the Finder drag-drop test escape helper so the flagged paths are fixed without regressing working behavior. * Potential fix for pull request finding 'CodeQL / Clear-text logging of sensitive information' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix: restore safe grpc error summaries A later autofix commit removed the exported gRPC error summarizer while the new regression test still imported it. Restore the safe name/code-only summary so CI stays green without reintroducing clear-text logging. * fix: keep grpc logging generic Remove the stale helper/test pair and keep the gRPC startup and stream logs free of error-derived data so the CodeQL clear-text logging alert stays closed while the rest of the security fixes remain intact. --------- Co-authored-by: OpenClaude Worker 3 <worker-3@openclaude.local> Co-authored-by: Vasanth T <148849890+Vasanthdev2004@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2026-04-07 16:03:16 +08:00
changjiaoxigua	85aa8b0985	fix: add File polyfill for Node < 20 to prevent startup deadlock with proxy (#442 ) When a proxy is configured, configureGlobalAgents() loads undici to set a global dispatcher. However, undici v7.24.6 requires Node.js >= 20.18.1 and references globalThis.File at module evaluation time for webidl type assertions. Node 18 lacks the File global, causing ReferenceError inside the bundled __commonJS require chain, which deadlocks due to unresolved circular dependencies in the module initialization. Fix by polyfilling globalThis.File early in cli.tsx entrypoint, before any undici code loads. Try node:buffer.File (available in Node 18.13+), fallback to minimal Blob-based stub. Fixes: bun run start hangs indefinitely when HTTP_PROXY/HTTPS_PROXY is set Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 16:02:05 +08:00
Vasanth T	e365cb4010	fix: address code scanning alerts (#434 ) * fix: address code scanning alerts Parse Gemini hostnames instead of matching raw URL substrings, redact gRPC error logs, and harden the Finder drag-drop test escape helper so the flagged paths are fixed without regressing working behavior. * Potential fix for pull request finding 'CodeQL / Clear-text logging of sensitive information' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix: restore safe grpc error summaries A later autofix commit removed the exported gRPC error summarizer while the new regression test still imported it. Restore the safe name/code-only summary so CI stays green without reintroducing clear-text logging. * fix: keep grpc logging generic Remove the stale helper/test pair and keep the gRPC startup and stream logs free of error-derived data so the CodeQL clear-text logging alert stays closed while the rest of the security fixes remain intact. --------- Co-authored-by: OpenClaude Worker 3 <worker-3@openclaude.local> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2026-04-07 00:43:09 +08:00
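The "parse Gemini hostnames instead of matching raw URL substrings" change in #434 addresses a common code-scanning finding. A minimal sketch of the idea follows; the function name `isGeminiBaseUrl` and the specific host constant are assumptions for illustration, not the repository's actual code.

```typescript
// Assumed host for illustration; the exact hosts the project checks are not listed in the log.
const GEMINI_HOST = "generativelanguage.googleapis.com";

// Compare the parsed hostname rather than searching the raw URL string.
function isGeminiBaseUrl(baseUrl: string): boolean {
  let hostname: string;
  try {
    hostname = new URL(baseUrl).hostname.toLowerCase();
  } catch {
    return false; // Not a parseable absolute URL.
  }
  return hostname === GEMINI_HOST || hostname.endsWith("." + GEMINI_HOST);
}
```

A substring check such as `baseUrl.includes(GEMINI_HOST)` would also accept URLs that merely carry the host name in a query string or as a prefix of an unrelated domain, which is the class of bug the hostname comparison avoids.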
CRABHIVE	52d33a87a0	fix: include MCP tool results in microcompact to reduce token waste (#348 ) ## Summary - Added `isCompactableTool()` helper in `microCompact.ts` that matches both the existing COMPACTABLE_TOOLS set and any tool prefixed `mcp__` - MCP tool results were never compacted because the hardcoded allowlist only contained 9 built-in tools — MCP tools fell through and persisted in full for the entire session, wasting 10-500K tokens/session ## Impact - user-facing impact: long sessions using MCP servers (GitHub, Slack, Playwright, etc.) will compact stale MCP tool results, reducing token usage and delaying autocompact triggers - developer/maintainer impact: new MCP servers are automatically covered via prefix match — no need to update the allowlist per-server ## Testing - [x] `bun run build` - [x] `bun run smoke` - [x] focused tests: `bun test src/services/compact/microCompact.test.ts` - module exports load correctly - estimateMessageTokens counts MCP tool_use blocks - microcompactMessages processes MCP tools without error - microcompactMessages processes mixed built-in and MCP tools ## Notes - provider/model path tested: n/a (compaction logic is model-agnostic) - screenshots attached (if UI changed): n/a - follow-up work or known limitations: subagent results and thinking blocks are still not compacted (separate RFCs) https://claude.ai/code/session_01D7kprMn4c66a5WrZscF7rv Co-authored-by: Claude <noreply@anthropic.com>	2026-04-06 23:13:20 +08:00
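The `isCompactableTool()` helper described in #348 can be sketched as follows. The members of `COMPACTABLE_TOOLS` shown here are placeholders (the commit message says the real allowlist holds 9 built-in tools but does not name them); only the `mcp__` prefix rule is taken from the commit text.

```typescript
// Placeholder allowlist; the real set in microCompact.ts has 9 built-in tools.
const COMPACTABLE_TOOLS: ReadonlySet<string> = new Set(["Read", "Bash", "Grep", "Glob"]);
const MCP_TOOL_PREFIX = "mcp__";

// A tool result is compactable if the tool is allowlisted or comes from any MCP server.
function isCompactableTool(toolName: string): boolean {
  return COMPACTABLE_TOOLS.has(toolName) || toolName.startsWith(MCP_TOOL_PREFIX);
}
```

Matching on the prefix rather than on per-server names is what lets new MCP servers be covered without touching the allowlist.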
KRATOS	b4bd95b477	fix: normalize malformed Bash tool arguments from OpenAI-compatible providers (#385 ) * fix: normalize malformed Bash tool arguments from OpenAI-compatible providers * fix: keep invalid Bash tool args from becoming commands * fix: preserve malformed Bash JSON literals * test: stabilize rebased PR 385 checks * test: isolate provider profile env assertions * fix: extend tool argument normalization to all tools and harden edge cases - Extend STRING_ARGUMENT_TOOL_FIELDS to normalize Read, Write, Edit, Glob, and Grep plain-string arguments (fixes "Invalid tool parameters" errors reported by VennDev) - Normalize streaming Bash args regardless of finish_reason, not only when finish_reason is 'tool_calls' - Broaden isLikelyStructuredObjectLiteral to catch malformed object-shaped strings like {command:"pwd"} and {'command':'pwd'} (fixes CR2 from Vasanthdev2004) - Apply blank/object-literal guard to all tools, not just Bash - Extract duplicated JSON repair suffix combinations into shared constant - Add 32 isolated unit tests for toolArgumentNormalization Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: skip streaming normalization on finish_reason length Truncated tool calls (finish_reason: 'length') now preserve the raw buffer instead of normalizing into executable commands, preventing incomplete commands from becoming runnable. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: comprehensive tool argument normalization hardening - Remove all { raw: ... 
} returns that caused InputValidationError with z.strictObject schemas — return {} instead for clean Zod errors - Extend normalizeAtStop buffering to all mapped tools (Read, Write, Edit, Glob, Grep) so streaming paths also get normalized - Make repairPossiblyTruncatedObjectJson generic — repair any valid JSON object, not just ones with a command field - Export hasToolFieldMapping for streaming normalizeAtStop decision - Skip normalization on finish_reason: length to preserve raw truncated buffer - Update all test expectations to match new behavior Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 22:08:45 +08:00
Otávio Carvalho	1e057025d6	Fix GLM-5 and other reasoning models appearing to hang via OpenAI shim (#365 ) * Fix GLM-5 and other reasoning models appearing to hang via OpenAI shim Reasoning models like GLM-5 and DeepSeek stream chain-of-thought in `reasoning_content` while `content` stays empty (""). The OpenAI shim only read `delta.content`, so it saw empty strings and never emitted any Anthropic stream events — causing the UI to appear frozen. - Add `reasoning_content` to streaming chunk and non-streaming response types - Emit `reasoning_content` as thinking blocks (thinking_delta) in streaming mode - Properly transition from thinking to text blocks when content phase begins - Fall back to `reasoning_content` in non-streaming mode when content is null Fixes #214 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix non-streaming reasoning_content fallback and add tests - Use explicit empty-string check instead of \|\| for content fallback so content: "" doesn't leak reasoning_content as visible text - Close thinking block before tool call blocks in streaming path - Add non-streaming and streaming reasoning_content tests Co-Authored-By: GLM-5.1 <noreply@openclaude.dev> * Fix flaky Ink reconciler tests caused by react-compiler memoization Remove hard throw in createTextInstance that crashed when hostContext.isInsideText was stale due to react-compiler element caching. Add timeout guards to prevent test hangs when render errors prevent exit() from firing. Co-Authored-By: Claude GLM-5.1 <noreply@openclaude.dev> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: GLM-5.1 <noreply@openclaude.dev>	2026-04-06 22:02:29 +08:00
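The non-streaming fallback fix in #365 hinges on the difference between `||` and an explicit null check. A minimal sketch, assuming a simplified response shape and a hypothetical helper name `visibleText`:

```typescript
// Simplified, assumed shape of a non-streaming chat message from an OpenAI-compatible provider.
interface ShimMessage {
  content?: string | null;
  reasoning_content?: string | null;
}

// Fall back to reasoning_content only when content is absent, never when it is "".
function visibleText(message: ShimMessage): string {
  if (message.content === null || message.content === undefined) {
    return message.reasoning_content ?? "";
  }
  return message.content;
}

// For contrast, the leaky variant: "" is falsy, so reasoning text becomes visible output.
function leakyVisibleText(message: ShimMessage): string {
  return message.content || message.reasoning_content || "";
}
```

With `{ content: "", reasoning_content: "step 1..." }`, `visibleText` returns the empty string while `leakyVisibleText` returns the chain-of-thought text.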
Agent_J	aff2bd87e4	fix: avoid sync github credential reads in provider manager (#428 ) * fix: avoid sync github credential reads in provider manager * test: stabilize provider manager async credential test * fix: avoid first-frame github provider false negative --------- Co-authored-by: KRATOS <84986124+gnanam1990@users.noreply.github.com>	2026-04-06 21:29:53 +08:00
hsain9357	72e6a945fe	Fixed gemini error Function call is missing a thought_signature in functionCall parts (#426 ) * docs(docs): add agent guidance and repository instructions - Created `AGENTS.md` and `CLAUDE.md` to provide high-signal guidance for AI agents and developers working in the repository. - Outlined critical developer commands for building, testing, and running diagnostics using `bun`. - Documented the repository architecture, source entrypoints, and core service logic. - Defined framework-specific quirks, including module stubbing for internal modules and macro versioning. - Established style and workflow guidelines regarding telemetry, environment variables, and security scan requirements. * feat(api): support gemini thought signatures in openai shim - Added `isGeminiMode` utility to detect Gemini backends via `CLAUDE_CODE_USE_GEMINI` or `OPENAI_BASE_URL`. - Updated `convertMessages` to extract `thought_signature` from thinking blocks and inject them into tool calls. - Implemented a fallback mechanism that provides a `skip_thought_signature_validator` string to avoid 400 validation errors when a signature is missing. - Enhanced `openaiStreamToAnthropic` and `OpenAIShimMessages` to correctly preserve and pass through Gemini-specific metadata in `extra_content`. * refactor(api): improve gemini metadata handling and remove redundant docs - Updated `src/services/api/openaiShim.ts` to merge existing `google`-specific metadata within `extra_content` instead of overwriting it. - Simplified the `thought_signature` assignment logic to use a fallback value of `skip_thought_signature_validator` when no signature is provided. - Deleted `AGENTS.md` and `CLAUDE.md` files to eliminate redundant agent guidance documentation. * fix(api): propagate gemini thought signatures to all parallel tool calls - Removed the index constraint when assigning the `signature` from a `thinkingBlock` to tool calls in `openaiShim.ts`. 
- Ensured that the `thought_signature` is applied to every tool call in a parallel set, rather than just the first one. - Aligned the shim with Gemini API requirements, which mandate that the same signature must be present on every replayed function call part within an assistant turn.	2026-04-06 21:04:49 +08:00
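The behaviour that #426 converges on (same signature on every parallel tool call, fallback value when none is available, merge rather than overwrite of `google` metadata) can be sketched as below. The `ToolCall` shape, the nesting of `thought_signature` under `extra_content.google`, and the function name are assumptions for illustration; only the fallback string is quoted from the commit message.

```typescript
// Assumed tool-call shape; the real types live in src/services/api/openaiShim.ts.
interface ToolCall {
  id: string;
  name: string;
  extra_content?: { google?: Record<string, unknown> };
}

const FALLBACK_SIGNATURE = "skip_thought_signature_validator";

// Attach one signature to every tool call in the turn, preserving existing google metadata.
function attachThoughtSignature(toolCalls: ToolCall[], signature?: string): ToolCall[] {
  const value = signature ?? FALLBACK_SIGNATURE;
  return toolCalls.map((call) => ({
    ...call,
    extra_content: {
      ...call.extra_content,
      google: { ...call.extra_content?.google, thought_signature: value },
    },
  }));
}
```

Mapping over all calls, instead of writing to index 0 only, reflects the requirement stated in the commit that every replayed function call part in an assistant turn carries the same signature.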
Kevin Codex	39f3b2babd	test: isolate latest main suite regressions (#427 )	2026-04-06 19:50:31 +08:00
Agent_J	ff7d49990d	feat: GitHub provider lifecycle and onboarding hardening (#351 ) * feat: improve GitHub provider onboarding and lifecycle * fix: address copilot review in provider manager * fix: address follow-up copilot review comments * test: resolve rebase conflict in provider profiles suite * fix: clear stale github hydrated marker * fix: harden github onboarding auth precedence * fix: remove merge markers from provider tests * fix: resolve latest copilot onboarding comments --------- Co-authored-by: KRATOS <84986124+gnanam1990@users.noreply.github.com>	2026-04-06 19:18:58 +08:00
Vasanth T	8ece290087	fix: suppress startup dialogs when input is buffered (#423 ) Co-authored-by: OpenClaude Worker 3 <worker-3@openclaude.local>	2026-04-06 18:31:38 +08:00
Kevin Codex	6c61790063	test: fix leaked ink mocks in full suite (#424 )	2026-04-06 18:10:02 +08:00
NikitaBabenko	26eef92fe7	feat: add headless gRPC server for external agent integration (#278 ) * gRPC Server * gRPC fix * UpdProto * fix: address PR review feedback for gRPC server - Update bun.lock for new dependencies (frozen-lockfile CI fix) - Add multi-turn session persistence via initialMessages - Replace hardcoded done payload with real token counts - Default bind to localhost instead of 0.0.0.0 * fix(grpc): startup parity, cancel interrupt, and cli text fallback - Replace enableConfigs() with await init() in start-grpc.ts for full bootstrap parity with the main CLI (env vars, CA certs, mTLS, proxy, OAuth, Windows shell) - Call engine.interrupt() before call.end() in the cancel handler so in-flight model/tool execution is actually stopped - Show done.full_text in the CLI client when no text_chunk was received, preventing silent drops when streaming is unavailable * fix(grpc): wire session_id end-to-end and remove dead provider field - Move session_id from ClientMessage into ChatRequest to fix proto-loader oneofs encoding bug and make the field functional - Implement in-memory session store so reconnecting with the same session_id resumes conversation context across streams - Remove ChatRequest.provider — per-request provider routing requires global process.env mutation, unsafe for concurrent clients; provider is configured via env vars at server startup * fix(grpc): mirror CLI auth bootstrap in start-grpc and fix tool_name field scripts/start-grpc.ts now runs the same provider/auth bootstrap as the normal CLI entrypoint: enableConfigs, safe env vars, Gemini/GitHub token hydration, saved-profile resolution with warn-and-fallback, and provider validation before the server binds. ToolCallResult.tool_name was being populated with the tool_use_id UUID. Added a toolNameById map (filled in canUseTool) so tool_name now carries the actual tool name (e.g. "Bash"). The UUID moves to a new tool_use_id field (proto field 4) for client-side correlation. 
* fix(grpc): add tool_use_id to ToolCallStart and interrupt engine on stream close Two blocker-level issues flagged in code review: - ToolCallStart was missing tool_use_id, making it impossible for clients to correlate tool_start events with tool_result when the same tool runs multiple times. Added tool_use_id = 3 to the proto message and populated it from the toolUseID parameter in canUseTool. - On stream close without an explicit CancelSignal the server only nulled the engine reference, leaving the underlying model/tool work running as an orphan. Added engine.interrupt() in the call.on('end') handler to stop work immediately when the client disconnects. * fix(grpc): resolve pending promises on disconnect and guard post-cancel writes Four lifecycle and contract issues identified during proactive review: - Pending permission Promises in canUseTool would hang forever if the client disconnected mid-stream. On call 'end', all pending resolvers are now called with 'no' so the engine can unblock and terminate. - The done message and session save could fire after call.end() when a CancelSignal arrived mid-generation. Added an `interrupted` flag set on both cancel and stream close to gate all post-loop writes. - The session map had no eviction policy, allowing unbounded memory growth. Capped at MAX_SESSIONS=1000 with FIFO eviction of the oldest entry. - Field 3 was silently absent from ChatRequest. Added `reserved 3` to document the gap and prevent accidental reuse in future. * fix(grpc): reset previousMessages on each new request to prevent session history leak previousMessages was declared at stream scope and only overwritten when the incoming session_id already existed in the session store. A second request on the same stream with a new session_id would silently inherit the first request's conversation history in initialMessages instead of starting fresh, violating the session contract. 
Fix: reset previousMessages to [] at the start of each ChatRequest before the session-store lookup. * fix(grpc): reset interrupted flag between requests and guard against concurrent ChatRequest Two stream-scoped state bugs found during proactive audit: - The `interrupted` flag was never reset between requests on the same stream. If the first request was cancelled, all subsequent requests would silently skip the done message, causing the client to hang. - A second ChatRequest arriving while the first was still processing would overwrite the engine reference, corrupting the lifecycle of both requests. Now returns ALREADY_EXISTS error instead. Engine is nulled after the for-await loop completes so subsequent requests can proceed normally. --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-06 17:54:10 +08:00
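The in-memory session store with `MAX_SESSIONS=1000` and FIFO eviction described in #278 can be sketched as follows. The class name `SessionStore` and its method names are hypothetical; the sketch relies on the fact that a JavaScript `Map` iterates keys in insertion order, so the first key is the oldest session.

```typescript
const MAX_SESSIONS = 1000;

// Hypothetical store: maps session_id to that session's message history.
class SessionStore<T> {
  private readonly sessions = new Map<string, T[]>();
  private readonly maxSessions: number;

  constructor(maxSessions: number = MAX_SESSIONS) {
    this.maxSessions = maxSessions;
  }

  // Unknown session ids start from an empty history.
  get(sessionId: string): T[] {
    return this.sessions.get(sessionId) ?? [];
  }

  save(sessionId: string, messages: T[]): void {
    // Evict the oldest entry only when adding a new session would exceed the cap.
    if (!this.sessions.has(sessionId) && this.sessions.size >= this.maxSessions) {
      const oldest = this.sessions.keys().next().value;
      if (oldest !== undefined) {
        this.sessions.delete(oldest);
      }
    }
    this.sessions.set(sessionId, messages);
  }

  get size(): number {
    return this.sessions.size;
  }
}
```

Updating an existing session does not move it in the `Map`'s order, so eviction here is by first insertion (FIFO), not by last use.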
Paulo Reis	112df59117	fix: convert dragged file paths to @mentions for attachment (#382 ) * fix: convert dragged file paths to @mentions for attachment When non-image files are dragged into the terminal, the file path was inserted as plain text and never attached. Now detected absolute paths are converted to @mentions so they get picked up by the attachment system. * test: add tests for drag-and-drop file path detection * fix: multi-image drag-and-drop only showing last image insertTextAtCursor read input and cursorOffset from the React closure, which is stale when called in a synchronous loop (e.g. onImagePaste for multiple dragged images). Now uses refs so each insertion chains on the previous one. * fix: quote Windows absolute paths to avoid MCP mention collision Paths containing ':' (e.g. Windows drive letters) are now emitted in quoted @"..." form so they don't match the MCP resource mention regex. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: decouple dragDropPaths from imagePaste and harden image checks - Check image extension against the cleaned path (post quote/escape stripping) so quoted or backslash-escaped image drops are reliably routed to the image paste handler. - Inline the image extension regex and drop the imagePaste/fsOperations imports so the module (and its tests) no longer pull in `bun:bundle` and the heavier fs wrapper chain. Use plain `fs.existsSync` for the on-disk check. - Add tests covering quoted image paths, uppercase extensions, backslash-escaped image paths, escaped real files with spaces, mixed segments containing an image, quoted-nonexistent paths, and leading or trailing whitespace. 
* test: verify dragged paths with an `@` segment are preserved Adds a fixture under a scoped-package-style subdir (`@types/index.d.ts`) so we exercise the realistic `node_modules/@types/...` drag case and lock in that `extractDraggedFilePaths` returns the raw path unchanged — the `@` inside the path must not collide with the mention prefix the caller prepends downstream. * test: parametrize dragDropPaths cases with test.each Groups the 21 scenarios into four table-driven describes (empty-result, single-path, multi-path, backslash-escaped) so that adding a new case is a one-line row instead of a new `test()` block. Fixture directories are now created synchronously at describe-load time so their paths are available to the test.each tables, which are built before any hook runs. * test: add contract tests for @-mention extractor boundary Pins the contract between `extractAtMentionedFiles` and `extractMcpResourceMentions` so the MCP regex can't silently swallow quoted file-path mentions. These tests fail on current HEAD — 3 of 11 cases expose the regression pointed out in the review on #382: `extractMcpResourceMentions`'s trailing `\b` backtracks past the closing `"` of a quoted mention and produces a ghost match for `@"C:\Users\..."`, `@C:\Users\...`, and `@"/tmp/weird:name.txt"`. The remaining 8 cases lock in the behaviour that must not change (legitimate `server:resource` mentions and plain file-path mentions). Committed failing on purpose as the first half of a test-then-fix pair; the regex fix follows in a subsequent commit. * fix: prevent MCP extractor from ghost-matching quoted/Windows paths The MCP resource regex used `\b` as a trailing anchor with `[^\s]+` character classes. On any quoted file mention containing a colon (`@"C:\Users\me\file.txt"`, `@"/tmp/weird:name.txt"`), the engine backtracked past the closing `"` to satisfy `\b`, producing a ghost match that collided with `extractAtMentionedFiles`. 
Unquoted Windows drive-letter paths (`@C:\Users\me\file.txt`) also matched because a drive letter is structurally identical to an MCP `server:resource` token. Two guards: 1. `(?!")` right after `@` drops quoted tokens entirely, and adding `"` to the character classes blocks any mid-match backtracking. 2. A post-match filter discards `^[A-Za-z]:[\\/]` — a single-letter server followed by a path separator is always a Windows drive prefix, never a real MCP resource. Legitimate MCP forms (`@server:resource/path`, plugin-scoped like `@asana-plugin:project-status/123`, inline prose mentions) remain matched and are pinned by the contract tests added in `04998d5`. --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 17:49:38 +08:00
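The two guards described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the function name `extractMcpResourceMentionsSketch`, the exact regex, and the returned `server:resource` string format are all assumptions, since the log does not show the real implementation. It only demonstrates the negative lookahead after `@`, the `"` exclusion in the character classes, and the post-match drive-letter filter.

```typescript
// Hypothetical sketch of the two guards; the real regex in the repository
// is not shown in this log and may differ.
function extractMcpResourceMentionsSketch(text: string): string[] {
  // Guard 1: `(?!")` rejects quoted tokens right after `@`, and `"` is
  // excluded from both character classes so a match cannot run into or
  // past a quote character.
  const mention = /(^|\s)@(?!")([^\s":]+):([^\s"]+)/g;
  // Guard 2: a single-letter "server" followed by a path separator is
  // treated as a Windows drive prefix rather than an MCP resource.
  const windowsDrive = /^[A-Za-z]:[\\/]/;

  const results: string[] = [];
  let match: RegExpExecArray | null;
  while ((match = mention.exec(text)) !== null) {
    const token = match[2] + ":" + match[3];
    if (windowsDrive.test(token)) {
      continue; // discard drive-letter paths such as C:\Users\...
    }
    results.push(token);
  }
  return results;
}
```

Under these assumptions, `@server:resource/path` and `@asana-plugin:project-status/123` are still returned, while `@"C:\Users\me\file.txt"`, `@C:\Users\me\file.txt`, and `@"/tmp/weird:name.txt"` produce no match.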
Meetpatel006	8724d59d48	fix theme picker live preview broken by react-compiler memoization (#395 ) * fix: remove react-compiler memo cache, restore classical JSX so theme preview actually previews * added themepicker test	2026-04-06 17:46:42 +08:00
Jay Suryawanshi	af08b4f762	docs: add LiteLLM proxy setup guide (#418 ) * docs: add LiteLLM proxy setup guide Document the setup process for LiteLLM and its integration with OpenClaude, including prerequisites, configuration, and troubleshooting steps. * Revise LiteLLM setup steps for API key and model Updated setup instructions for LiteLLM provider configuration. * docs: fix /provider walkthrough to match actual OpenAI-compatible flow * docs: fix sub-bullet formatting in /provider steps * docs: clarify key scope in troubleshooting (LiteLLM proxy process env) Clarified instruction for upstream provider error regarding API key.	2026-04-06 17:01:56 +08:00
Sarath Babu	5012c160c9	feat: Add Gemini support with thought_signature fix (#404 ) * feat: Add Gemini support with thought_signature fix and branding updates * fix: gate thought_signature preservation strictly to Gemini provider * fix: explicit extra_content destructuring to seal cross-provider tool search leak	2026-04-06 17:01:06 +08:00
KRATOS	c1934974aa	fix: preserve unicode in Windows clipboard fallback (#388 ) * fix: preserve unicode in Windows clipboard fallback * fix: avoid Windows clipboard stdin codepage issues * test: fix Windows clipboard temp path fixture	2026-04-06 16:12:10 +08:00
Kevin Codex	94de37d44f	chore: release 0.1.8 v0.1.8	2026-04-06 13:45:02 +08:00

381 Commits