eval.dataset_builder
chunktuner.eval.dataset_builder
Build EvalDataset from LLM outputs or user JSON/YAML files.
DatasetBuilder
Build EvalDataset from user files or LLM-generated Q&A over documents.
Source code in src/chunktuner/eval/dataset_builder.py
build_from_user_file
Parse a user-provided JSON or YAML file containing a queries list into an EvalDataset.
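As an illustration of the input shape this method describes, here is a minimal sketch that parses a JSON document with a top-level queries list. The names `Query`, `EvalDatasetSketch`, and `parse_queries`, and the `question`/`answer` fields, are hypothetical stand-ins; the real EvalDataset schema is not shown on this page and may differ. Only JSON is handled here, since YAML would need a third-party parser such as PyYAML.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Query:
    # Hypothetical fields; the real EvalDataset schema may differ.
    question: str
    answer: str = ""


@dataclass
class EvalDatasetSketch:
    # Stand-in for EvalDataset, holding only the parsed queries.
    queries: list[Query] = field(default_factory=list)


def parse_queries(text: str) -> EvalDatasetSketch:
    """Parse a JSON document that has a top-level "queries" list."""
    data = json.loads(text)
    if not isinstance(data, dict) or not isinstance(data.get("queries"), list):
        raise ValueError('expected a mapping with a "queries" list')
    return EvalDatasetSketch(
        queries=[
            Query(question=q["question"], answer=q.get("answer", ""))
            for q in data["queries"]
        ]
    )


example = (
    '{"queries": [{"question": "What does the builder return?", '
    '"answer": "An EvalDataset"}]}'
)
dataset = parse_queries(example)
print(len(dataset.queries), dataset.queries[0].question)
```

Rejecting input without a queries list up front keeps malformed files from producing an empty dataset silently.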
build_from_docs
Generate Q&A pairs from documents with an LLM, with span validation (requires litellm and API keys).
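One plausible reading of span validation is to keep a generated Q&A pair only when its supporting quote occurs verbatim in the source document. The sketch below shows that check on hard-coded data, with no LLM call. The helper `locate_span` and the `question`/`quote`/`span` keys are assumptions made for illustration, not the library's actual behavior.

```python
from typing import Optional


def locate_span(document: str, quote: str) -> Optional[tuple[int, int]]:
    """Return (start, end) character offsets of quote in document, or None."""
    if not quote:
        return None
    start = document.find(quote)
    if start == -1:
        return None
    return start, start + len(quote)


doc = "Chunk size controls how much text each chunk holds."

# Pretend these came back from an LLM; the second quote is not in the document.
generated = [
    {"question": "What does chunk size control?",
     "quote": "how much text each chunk holds"},
    {"question": "Who wrote it?", "quote": "the maintainers"},
]

# Keep only the pairs whose quote can be located, and record the span.
validated = [
    {**qa, "span": span}
    for qa in generated
    if (span := locate_span(doc, qa["quote"])) is not None
]
print(validated)
```

Exact substring matching is the strictest option; a real pipeline might normalize whitespace or allow fuzzy matches before rejecting a pair.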
build_code_function_qa
Generate heuristic code questions, each pointing at the span of the first function body in a file.
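To make the heuristic concrete, the following sketch finds the first function in a Python source string and reports the line span of its body, then builds a question from the function name. It assumes Python input and uses the standard `ast` module. The helper `first_function_body_span` and the question wording are hypothetical, and the actual implementation may locate functions differently or support other languages.

```python
import ast
from typing import Optional


def first_function_body_span(source: str) -> Optional[tuple[str, int, int]]:
    """Return (name, first_body_line, last_body_line) for the first function."""
    tree = ast.parse(source)
    functions = [
        node for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]
    if not functions:
        return None
    # "First" means the function that starts on the lowest line number.
    func = min(functions, key=lambda n: n.lineno)
    return func.name, func.body[0].lineno, func.body[-1].end_lineno


source = (
    "import os\n"
    "\n"
    "def load(path):\n"
    "    data = open(path).read()\n"
    "    return data\n"
    "\n"
    "def other():\n"
    "    pass\n"
)

span = first_function_body_span(source)
if span is not None:
    name, start_line, end_line = span
    question = f"What does the function `{name}` do?"
    print(question, (start_line, end_line))
```

Line numbers are 1-indexed, as reported by `ast`; converting them to character offsets would be needed if the dataset stores spans in characters.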