Commit graph

2 commits

Author SHA1 Message Date
justLV
002ed7388d Refine stall classifier prompt and group benchmark cases by label
Rework the stall prompt to distinguish LOOKUP (say something specific,
three-to-seven words) from ACTION (content-free backchannel, two-to-five
words, no action verbs or promises) and restructure test_stall.py to
group cases by expected label for easier manual review.
2026-04-12 19:08:40 -07:00
justLV
f0f6e38e7c Move test scripts into tests/ and add stall/stream benchmarks
- git mv test_client, test_mic, test_speaker into tests/
- Add tests/test_stall.py (benchmarks the Gemini stall classifier against
  conversational/fetch/capture/act/follow-up queries)
- Add tests/test_stream.py (raw SSE chunk inspection against the agentic
  gateway)
- Update config path resolution in the new tests to climb one level
- Update README Testing section with new tests/ paths
2026-04-12 14:22:52 -07:00