Two fixes in `openaiContextWindows.ts`:

1. Sort lookup keys by length descending in `lookupByModel()` so the most specific prefix always wins. Without this, `gpt-4-turbo-preview` could match `gpt-4` (8k) instead of `gpt-4-turbo` (128k), depending on V8's object key iteration order.
2. Update the Llama 3.1/3.2/3.3 context windows from 8,192 to 128,000. These models support 128k context natively (per Meta's official specs). The previous 8k value was Ollama's default `num_ctx`, not the models' actual capability, and it caused premature auto-compact warnings.
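A minimal sketch of the longest-prefix-wins lookup described in fix 1. The table entries and the `lookupByModel` signature here are illustrative assumptions, not the actual contents of `openaiContextWindows.ts`:

```typescript
// Hypothetical subset of the context-window table; the real file has
// many more entries, including the updated 128k Llama 3.x values.
const CONTEXT_WINDOWS: Record<string, number> = {
  "gpt-4": 8_192,
  "gpt-4-turbo": 128_000,
  "llama3.1": 128_000,
};

function lookupByModel(model: string): number | undefined {
  // Sort keys longest-first so the most specific prefix wins,
  // independent of object key insertion/iteration order.
  const keys = Object.keys(CONTEXT_WINDOWS).sort(
    (a, b) => b.length - a.length,
  );
  const match = keys.find((key) => model.startsWith(key));
  return match !== undefined ? CONTEXT_WINDOWS[match] : undefined;
}

console.log(lookupByModel("gpt-4-turbo-preview")); // 128000, not 8192
console.log(lookupByModel("llama3.1:8b")); // 128000
```

Without the explicit sort, `Object.keys` iteration order depends on insertion order, so whichever of `gpt-4` and `gpt-4-turbo` was inserted first would win the prefix match.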