Bug Description and Reproduction
Describe the Bug
After long-running inference with OnnxRuntimeGenAI.QNN, the model may start emitting nonsensical output or repeated punctuation.
EnableCaching is disabled during initialization:
var options = new OnnxRuntimeGenAIChatClientOptions
{
    StopSequences = Array.Empty<string>(),
    PromptFormatter = TestPromptFormatter,
    EnableCaching = false
};
To Reproduce
Steps to reproduce the issue:
- Initialize the model
- Repeat the inference cycle (start inference → wait for it to finish → start the next inference) until the output degrades
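The loop above can be sketched roughly as follows. This is a minimal repro sketch, not the exact code from the report: `modelPath`, the prompt text, the iteration count, and the `OnnxRuntimeGenAIChatClient` constructor shape are assumptions; `TestPromptFormatter` is the formatter referenced in the options snippet above.

```csharp
// Hypothetical repro loop for the degradation after many sequential inferences.
// Assumes Microsoft.Extensions.AI and Microsoft.ML.OnnxRuntimeGenAI.QNN are referenced;
// modelPath, the prompt, and the constructor overload are placeholders/assumptions.
using Microsoft.Extensions.AI;
using Microsoft.ML.OnnxRuntimeGenAI;

var options = new OnnxRuntimeGenAIChatClientOptions
{
    StopSequences = Array.Empty<string>(),
    PromptFormatter = TestPromptFormatter,
    EnableCaching = false   // caching already disabled, yet output still degrades
};

using var model = new Model(modelPath);
using IChatClient client = new OnnxRuntimeGenAIChatClient(model, options);

for (int i = 0; i < 1000; i++)  // iteration count is arbitrary
{
    // Start inference, wait for it to complete, then start the next one.
    var response = await client.GetResponseAsync("Describe the weather today.");
    Console.WriteLine($"[{i}] {response}");
    // After enough iterations, responses become nonsensical or repeat punctuation.
}
```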
Expected behavior
Inference continues to produce coherent output, regardless of how many times it has been run.
Desktop (please complete the following information)
- OS: Windows 11 Home 25H2 26200.6588
- OnnxRuntimeGenAI.QNN: 0.10.0
- NPU Driver: 30.0.140.1000/30.0.145.1000