-
Notifications
You must be signed in to change notification settings - Fork 214
Pull requests: Luce-Org/lucebox-hub
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(server): soft-close thinking termination via logit-ratio peek
#326
opened Jun 1, 2026 by
easel
Collaborator
Loading…
4 tasks
feat(server): add DFlash disk prefix cache for target layer split
#325
opened May 31, 2026 by
weicj
Collaborator
Loading…
[codex] Retry dflash generations with no visible output
#324
opened May 31, 2026 by
OmarB97
Contributor
Loading…
Add real-time /status dashboard with SSE push
#322
opened May 31, 2026 by
howard0su
Contributor
Loading…
feat(server): support DFlash with mixed-backend target layer split
#321
opened May 31, 2026 by
weicj
Collaborator
Loading…
[LuceBox][DFlash][lucebox-pr314-common-empty-fallback][2/n] Default empty spec retry in backend calls
#319
opened May 31, 2026 by
OmarB97
Contributor
Loading…
[codex] Recover dflash spec-decode agent stalls
#315
opened May 31, 2026 by
OmarB97
Contributor
Loading…
feat(server): add selectable backend IPC payload transport
#312
opened May 30, 2026 by
weicj
Collaborator
Loading…
feat(server): reduce layer-split activation memory with backend precision policy
#310
opened May 29, 2026 by
weicj
Collaborator
Loading…
feat(dflash): reduce feature mirror memory with dtype policy
#309
opened May 29, 2026 by
weicj
Collaborator
Loading…
fix(server): route Qwen3.6/Laguna think-mode reasoning to reasoning_content channel
#308
opened May 29, 2026 by
easel
Collaborator
Loading…
refactor(server): share target layer-split runtime helpers
#306
opened May 29, 2026 by
weicj
Collaborator
Loading…
refactor: extract MoE hybrid mode into common layer for qwen and laguna
#305
opened May 29, 2026 by
howard0su
Contributor
Loading…
feat(server): add Laguna target-layer-split adapter
#297
opened May 28, 2026 by
weicj
Collaborator
Loading…
fix(server): support sampled requests in target layer split
#295
opened May 28, 2026 by
weicj
Collaborator
Loading…
feat(server): passthrough proxy, piecewise keep-ratio curve, query survival check
#294
opened May 28, 2026 by
smpurkis
Contributor
Loading…
feat(server): add Gemma4 draft residency support
#291
opened May 28, 2026 by
weicj
Collaborator
Loading…
feat(qwen35moe): pipelined hybrid MoE decode with GPU/CPU overlap
#289
opened May 28, 2026 by
howard0su
Contributor
Loading…
feat(lucebox): docker stack + CLI + bench/profile + harness + luce-bench in-tree
#285
opened May 27, 2026 by
easel
Collaborator
Loading…
fix(server): Qwen3.6-27B tool calling for claude-code Anthropic path
#276
opened May 25, 2026 by
dusterbloom
Collaborator
Loading…
5 of 7 tasks
feat(drafter): ee3 as production default (depends on #274)
#275
opened May 24, 2026 by
dusterbloom
Collaborator
•
Draft
feat(pflash): prefill compress up to 128k -> 2-12× prefill (content-dependent), decode at parity
#274
opened May 24, 2026 by
dusterbloom
Collaborator
Loading…
feat(harness): typed adapters + format-aware session-inject proxy + multi-turn bandit driver
#266
opened May 23, 2026 by
dusterbloom
Collaborator
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.