Skip to content

Eval bug: dflash: invalid reduced-logits token -1 in draft #47

@Riconec

Description

@Riconec

Name and Version

llama-cli --version
version: 9420 (22d66b5)
built with GNU 14.2.0 for Linux x86_64

Operating systems

Linux

GGML backends

CUDA

Hardware

Tesla P40

Models

unsloth qwen3.6-27b and Ornstein-Hermes-27b

Problem description & steps to reproduce

When trying to use --spec-type dflash any request generates ? or / char in loop. llama-server log shows
=

First Bad Commit

tried week ago and latest - result the same

Relevant log output

llama-server.txt

Logs

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions