* feat: add streaming token counter
- Add StreamingTokenCounter for real-time token counting during generation
- Tracks output tokens as they arrive from stream
- Calculates tokens per second rate
- Add tests (4 passing)
PR 4A: Streaming Token Counter (Features 1.2, 1.7)
* refactor: move StreamingTokenCounter to separate file
- Extract StreamingTokenCounter from tokens.ts to streamingTokenCounter.ts
- Add getEstimatedRemainingTokens() method
- Update test import
* fix: word-boundary token counting for stable stream totals
- Accumulate raw content, count only at word boundaries
- Eliminates instability from arbitrary chunk boundaries
- Add finalize() to flush remaining content on stream end
- Add characterCount getter for raw content tracking
- Rename getEstimatedRemainingTokens -> getEstimatedGenerationTimeMs
- Add comprehensive tests
* fix: update streamingTokens test for word-boundary API
- Add finalize() call before checking output tokens
- Use characterCount for interim checks
- Add spaces to trigger word boundary counting
* fix: add estimateRemainingTokens/Time methods
- Add estimateRemainingTokens(target) method
- Add estimateRemainingTimeMs(target) method
- Addresses a non-blocking review comment: remaining tokens are now properly estimated
* fix: PR 797 - fix word boundary counting, consolidate tests
Blockers (Vasanthdev2004):
- recountAtWordBoundary now searches forward from lastCountedIndex+1
- Finds NEXT space after already-counted region, not before it
- Provides accurate live token counts during streaming, not just finalize()
Non-blocking (gnanam1990):
- Delete streamingTokens.test.ts, merge tests into streamingTokenCounter.test.ts
- Added interim-counting test to verify counting updates during streaming
* fix: PR 797 - fix word boundary advancement after space
Blocking:
- Fix recountAtWordBoundary to skip past space when searching for next boundary
- After counting at a space, indexOf(' ') returns 0 (the space itself)
- Now starts search from index 1 to find the NEXT word boundary
- Short chunks now properly trigger count advancement
Non-blocking:
- Add test verifying count increases after each word boundary
- Add test for space-skipping behavior
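A minimal sketch of the word-boundary counting these commits describe, in TypeScript. The internals are assumptions rather than the actual StreamingTokenCounter implementation, and `countTokens` is a hypothetical stand-in for the real tokenizer:

```typescript
// Sketch only: illustrates the word-boundary scheme from the commits above.
// countTokens() is a hypothetical stand-in for the real tokenizer.
function countTokens(text: string): number {
  return text.split(/\s+/).filter(Boolean).length; // crude approximation
}

class StreamingTokenCounter {
  private content = "";
  private lastCountedIndex = 0;
  private tokens = 0;

  get characterCount(): number {
    return this.content.length;
  }

  get outputTokens(): number {
    return this.tokens;
  }

  addChunk(chunk: string): void {
    this.content += chunk;
    this.recountAtWordBoundary();
  }

  // Search forward from just past the last counted boundary so the space
  // we already counted at is never matched again (the "skip past space" fix).
  private recountAtWordBoundary(): void {
    let boundary = this.lastCountedIndex;
    let next = this.content.indexOf(" ", boundary + 1);
    while (next !== -1) {
      boundary = next;
      next = this.content.indexOf(" ", boundary + 1);
    }
    if (boundary > this.lastCountedIndex) {
      // Count only up to a stable word boundary, so arbitrary chunk
      // splits cannot make the running total fluctuate.
      this.tokens = countTokens(this.content.slice(0, boundary));
      this.lastCountedIndex = boundary;
    }
  }

  // Flush the trailing partial word once the stream ends.
  finalize(): number {
    this.tokens = countTokens(this.content);
    return this.tokens;
  }

  estimateRemainingTokens(target: number): number {
    return Math.max(0, target - this.tokens);
  }
}
```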
OpenClaude
OpenClaude is an open-source coding-agent CLI for cloud and local model providers.
Use OpenAI-compatible APIs, Gemini, GitHub Models, Codex OAuth, Codex, Ollama, Atomic Chat, and other supported backends while keeping one terminal-first workflow: prompts, tools, agents, MCP, slash commands, and streaming output.
OpenClaude is also mirrored to GitLawb: gitlawb.com/node/repos/z6MkqDnb/openclaude
Quick Start | Setup Guides | Providers | Source Build | VS Code Extension | Community
Why OpenClaude
- Use one CLI across cloud APIs and local model backends
- Save provider profiles inside the app with `/provider`
- Run with OpenAI-compatible services, Gemini, GitHub Models, Codex OAuth, Codex, Ollama, Atomic Chat, and other supported providers
- Keep coding-agent workflows in one place: bash, file tools, grep, glob, agents, tasks, MCP, and web tools
- Use the bundled VS Code extension for launch integration and theme support
Quick Start
Install
npm install -g @gitlawb/openclaude
If the install later reports `ripgrep not found`, install ripgrep system-wide and confirm `rg --version` works in the same terminal before starting OpenClaude.
Start
openclaude
Inside OpenClaude:
- run `/provider` for guided provider setup and saved profiles
- run `/onboard-github` for GitHub Models onboarding
Fastest OpenAI setup
macOS / Linux:
export CLAUDE_CODE_USE_OPENAI=1
export OPENAI_API_KEY=sk-your-key-here
export OPENAI_MODEL=gpt-4o
openclaude
Windows PowerShell:
$env:CLAUDE_CODE_USE_OPENAI="1"
$env:OPENAI_API_KEY="sk-your-key-here"
$env:OPENAI_MODEL="gpt-4o"
openclaude
Fastest local Ollama setup
macOS / Linux:
export CLAUDE_CODE_USE_OPENAI=1
export OPENAI_BASE_URL=http://localhost:11434/v1
export OPENAI_MODEL=qwen2.5-coder:7b
openclaude
Windows PowerShell:
$env:CLAUDE_CODE_USE_OPENAI="1"
$env:OPENAI_BASE_URL="http://localhost:11434/v1"
$env:OPENAI_MODEL="qwen2.5-coder:7b"
openclaude
Using Ollama's launch command
If you have Ollama installed, you can skip the env var setup entirely:
ollama launch openclaude --model qwen2.5-coder:7b
This automatically sets `ANTHROPIC_BASE_URL`, model routing, and auth so all API traffic goes through your local Ollama instance. Works with any model you have pulled, local or cloud.
Setup Guides
Beginner-friendly guides:
Advanced and source-build guides:
Supported Providers
| Provider | Setup Path | Notes |
|---|---|---|
| OpenAI-compatible | `/provider` or env vars | Works with OpenAI, OpenRouter, DeepSeek, Groq, Mistral, LM Studio, and other compatible `/v1` servers |
| Gemini | `/provider` or env vars | Supports API key, access token, or local ADC workflow on current main |
| GitHub Models | `/onboard-github` | Interactive onboarding with saved credentials |
| Codex OAuth | `/provider` | Opens ChatGPT sign-in in your browser and stores Codex credentials securely |
| Codex | `/provider` | Uses existing Codex CLI auth, OpenClaude secure storage, or env credentials |
| Ollama | `/provider`, env vars, or `ollama launch` | Local inference with no API key |
| Atomic Chat | `/provider`, env vars, or `bun run dev:atomic-chat` | Local Model Provider; auto-detects loaded models |
| Bedrock / Vertex / Foundry | env vars | Additional provider integrations for supported environments |
What Works
- Tool-driven coding workflows: Bash, file read/write/edit, grep, glob, agents, tasks, MCP, and slash commands
- Streaming responses: Real-time token output and tool progress
- Tool calling: Multi-step tool loops with model calls, tool execution, and follow-up responses
- Images: URL and base64 image inputs for providers that support vision
- Provider profiles: Guided setup plus saved `.openclaude-profile.json` support
- Local and remote model backends: Cloud APIs, local servers, and Apple Silicon local inference
Provider Notes
OpenClaude supports multiple providers, but behavior is not identical across all of them.
- Anthropic-specific features may not exist on other providers
- Tool quality depends heavily on the selected model
- Smaller local models can struggle with long multi-step tool flows
- Some providers impose lower output caps than the CLI defaults, and OpenClaude adapts where possible
For best results, use models with strong tool/function calling support.
Agent Routing
OpenClaude can route different agents to different models through settings-based routing. This is useful for cost optimization or splitting work by model strength.
Add to `~/.openclaude.json`:
{
"agentModels": {
"deepseek-v4-flash": {
"base_url": "https://api.deepseek.com/v1",
"api_key": "sk-your-key"
},
"gpt-4o": {
"base_url": "https://api.openai.com/v1",
"api_key": "sk-your-key"
}
},
"agentRouting": {
"Explore": "deepseek-v4-flash",
"Plan": "gpt-4o",
"general-purpose": "gpt-4o",
"frontend-dev": "deepseek-v4-flash",
"default": "gpt-4o"
}
}
When no routing match is found, the global provider remains the fallback.
Note: `api_key` values in `settings.json` are stored in plaintext. Keep this file private and do not commit it to version control.
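As a conceptual sketch of the precedence described above (not OpenClaude's actual resolver), the lookup order is the agent name first, then the `default` entry, then the global provider:

```typescript
// Conceptual sketch of the routing precedence (not OpenClaude's real code).
// The shape mirrors the ~/.openclaude.json example shown earlier.
type Settings = { agentRouting?: Record<string, string> };

// Returns the model key to use, or null to fall back to the global provider.
function resolveAgentModel(agent: string, settings: Settings): string | null {
  return settings.agentRouting?.[agent]
    ?? settings.agentRouting?.["default"]
    ?? null;
}
```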
Web Search and Fetch
By default, WebSearch works on non-Anthropic models using DuckDuckGo. This gives GPT-4o, DeepSeek, Gemini, Ollama, and other OpenAI-compatible providers a free web search path out of the box.
Note: DuckDuckGo fallback works by scraping search results and may be rate-limited, blocked, or subject to DuckDuckGo's Terms of Service. If you want a more reliable supported option, configure Firecrawl.
For Anthropic-native backends and Codex responses, OpenClaude keeps the native provider web search behavior.
WebFetch works, but its basic HTTP plus HTML-to-markdown path can still fail on JavaScript-rendered sites or sites that block plain HTTP requests.
Set a Firecrawl API key if you want Firecrawl-powered search/fetch behavior:
export FIRECRAWL_API_KEY=your-key-here
With Firecrawl enabled:
- `WebSearch` can use Firecrawl's search API while DuckDuckGo remains the default free path for non-Claude models
- `WebFetch` uses Firecrawl's scrape endpoint instead of raw HTTP, handling JS-rendered pages correctly
Free tier at firecrawl.dev includes 500 credits. The key is optional.
Headless gRPC Server
OpenClaude can be run as a headless gRPC service, allowing you to integrate its agentic capabilities (tools, bash, file editing) into other applications, CI/CD pipelines, or custom user interfaces. The server uses bidirectional streaming to send real-time text chunks, tool calls, and request permissions for sensitive commands.
1. Start the gRPC Server
Start the core engine as a gRPC service on `localhost:50051`:
npm run dev:grpc
Configuration
| Variable | Default | Description |
|---|---|---|
| `GRPC_PORT` | `50051` | Port the gRPC server listens on |
| `GRPC_HOST` | `localhost` | Bind address. Use `0.0.0.0` to expose on all interfaces (not recommended without authentication) |
2. Run the Test CLI Client
We provide a lightweight CLI client that communicates exclusively over gRPC. It acts just like the main interactive CLI, rendering colors, streaming tokens, and prompting you for tool permissions (y/n) via the gRPC `action_required` event.
In a separate terminal, run:
npm run dev:grpc:cli
Note: The gRPC definitions are located in `src/proto/openclaude.proto`. You can use this file to generate clients in Python, Go, Rust, or any other language.
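As a starting point, here is a minimal Node/TypeScript client sketch using `@grpc/grpc-js` and `@grpc/proto-loader`. The service, method, and message field names below are assumptions for illustration only; check `openclaude.proto` for the real ones:

```typescript
// Sketch only: service/method/field names are hypothetical.
// Verify them against src/proto/openclaude.proto before use.
import * as grpc from "@grpc/grpc-js";
import * as protoLoader from "@grpc/proto-loader";

// Load the proto at runtime (no codegen needed for a quick experiment).
const def = protoLoader.loadSync("src/proto/openclaude.proto", {
  keepCase: true,
  defaults: true,
});
const proto = grpc.loadPackageDefinition(def) as any;

// Hypothetical service name.
const client = new proto.openclaude.Agent(
  `${process.env.GRPC_HOST ?? "localhost"}:${process.env.GRPC_PORT ?? "50051"}`,
  grpc.credentials.createInsecure()
);

// Hypothetical bidirectional-streaming RPC.
const call = client.Chat();

call.on("data", (event: any) => {
  if (event.action_required) {
    // The server pauses on sensitive commands; reply to approve or deny.
    call.write({ permission: { approved: true } });
  } else if (event.text) {
    process.stdout.write(event.text); // stream text chunks as they arrive
  }
});
call.on("end", () => console.log("\n[session ended]"));

call.write({ prompt: "Summarize the repository structure" });
```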
Source Build And Local Development
bun install
bun run build
node dist/cli.mjs
Helpful commands:
- `bun run dev`
- `bun test`
- `bun run test:coverage`
- `bun run security:pr-scan -- --base origin/main`
- `bun run smoke`
- `bun run doctor:runtime`
- `bun run verify:privacy`
- focused `bun test ...` runs for the areas you touch
Testing And Coverage
OpenClaude uses Bun's built-in test runner for unit tests.
Run the full unit suite:
bun test
Generate unit test coverage:
bun run test:coverage
Open the visual coverage report:
open coverage/index.html
If you already have `coverage/lcov.info` and only want to rebuild the UI:
bun run test:coverage:ui
Use focused test runs when you only touch one area:
- `bun run test:provider`
- `bun run test:provider-recommendation`
- `bun test path/to/file.test.ts`
Recommended contributor validation before opening a PR:
- `bun run build`
- `bun run smoke`
- `bun run test:coverage` for broader unit coverage when your change affects shared runtime or provider logic
- focused `bun test ...` runs for the files and flows you changed
Coverage output is written to `coverage/lcov.info`, and OpenClaude also generates a git-activity-style heatmap at `coverage/index.html`.
Repository Structure
- `src/` - core CLI/runtime
- `scripts/` - build, verification, and maintenance scripts
- `docs/` - setup, contributor, and project documentation
- `python/` - standalone Python helpers and their tests
- `vscode-extension/openclaude-vscode/` - VS Code extension
- `.github/` - repo automation, templates, and CI configuration
- `bin/` - CLI launcher entrypoints
VS Code Extension
The repo includes a VS Code extension in `vscode-extension/openclaude-vscode` for OpenClaude launch integration, provider-aware control-center UI, and theme support.
Security
If you believe you found a security issue, see SECURITY.md.
Community
- Use GitHub Discussions for Q&A, ideas, and community conversation
- Use GitHub Issues for confirmed bugs and actionable feature work
Contributing
Contributions are welcome.
For larger changes, open an issue first so the scope is clear before implementation. Helpful validation commands include:
- `bun run build`
- `bun run test:coverage`
- `bun run smoke`
- focused `bun test ...` runs for files and flows you changed
Disclaimer
OpenClaude is an independent community project and is not affiliated with, endorsed by, or sponsored by Anthropic.
OpenClaude originated from the Claude Code codebase and has since been substantially modified to support multiple providers and open use. "Claude" and "Claude Code" are trademarks of Anthropic PBC. See LICENSE for details.
License
See LICENSE.