RAG
Source: examples/patterns/rag.py
Introduction
RAG establishes retrieval-grounded generation, and memory-centric agent systems such as Generative Agents and MemGPT show why persistent context is essential for longer design tasks. This example combines retrieval and reasoning steps so the flow of grounded evidence is explicit in traces and outputs.
Technical Implementation
- Configure `Tracer` with JSONL + console output so each run emits machine-readable traces and lifecycle logs.
- Build the runtime surface (public APIs only) and execute `RAGPattern.run(...)` with a fixed `request_id`.
- Configure and invoke `Toolbox` integrations (core/script/MCP/callable) before assembling the final payload.
- Persist and query context via `SQLiteMemoryStore` to demonstrate memory-backed workflow behavior.
- Print a compact JSON payload including `trace_info` for deterministic tests and docs examples.
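The retrieve-then-reason composition in the steps above can be sketched independently of the library. A minimal, self-contained illustration, where the `seed`/`retrieve`/`generate` helpers and the word-overlap ranking are hypothetical stand-ins for the pattern, not this library's API:

```python
import sqlite3


def seed(conn: sqlite3.Connection, records: list[str]) -> None:
    # Persist memory records, mirroring the example's seed step.
    conn.execute("CREATE TABLE IF NOT EXISTS memory (content TEXT)")
    conn.executemany("INSERT INTO memory (content) VALUES (?)", [(r,) for r in records])


def retrieve(conn: sqlite3.Connection, query: str, top_k: int = 3) -> list[str]:
    # Naive retrieval: rank stored records by word overlap with the query.
    rows = [row[0] for row in conn.execute("SELECT content FROM memory")]
    query_words = set(query.lower().split())
    ranked = sorted(rows, key=lambda r: -len(query_words & set(r.lower().split())))
    return ranked[:top_k]


def generate(prompt: str, evidence: list[str]) -> str:
    # Stand-in for the LLM call: compose a prompt grounded in retrieved evidence.
    context = "\n".join(f"- {item}" for item in evidence)
    return f"Context:\n{context}\n\nTask: {prompt}"


conn = sqlite3.connect(":memory:")
seed(conn, ["Design requirement: include graceful shutdown and runtime monitoring."])
evidence = retrieve(conn, "graceful shutdown design")
print(generate("Draft an architecture recommendation", evidence))
```

The real pattern swaps the overlap ranking for `SQLiteMemoryStore` retrieval and the echo for a traced LLM call, but the evidence flow is the same.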
```mermaid
flowchart LR
    A["Input prompt or scenario"] --> B["main(): runtime wiring"]
    B --> C["RAGPattern.run(...)"]
    C --> D["retrieval and reasoning are composed via memory steps"]
    C --> E["Tracer JSONL + console events"]
    D --> F["ExecutionResult/payload"]
    E --> F
    F --> G["Printed JSON output"]
```
```python
from __future__ import annotations

import json
from pathlib import Path

from design_research_agents import DirectLLMCall, LlamaCppServerLLMClient, Toolbox, Tracer
from design_research_agents.memory import SQLiteMemoryStore
from design_research_agents.patterns import RAGPattern


def main() -> None:
    """Run one local RAG workflow and print a compact JSON result."""
    # A fixed request id keeps traces and docs output deterministic across runs.
    request_id = "example-workflow-rag-design-001"
    tracer = Tracer(
        enabled=True,
        trace_dir=Path("artifacts/examples/traces"),
        enable_jsonl=True,
        enable_console=True,
    )
    db_path = Path.cwd() / "artifacts" / "examples" / "rag_example.sqlite3"
    db_path.parent.mkdir(parents=True, exist_ok=True)
    if db_path.exists():
        db_path.unlink()

    # Run the local RAG pattern using public runtime surfaces. The with statement
    # automatically closes the tool runtime, memory store, and managed client when
    # the example finishes.
    with (
        Toolbox() as seed_toolbox,
        SQLiteMemoryStore(db_path=db_path) as store,
        LlamaCppServerLLMClient() as llm_client,
    ):
        seed_toolbox.invoke_dict(
            "memory.write",
            {
                "db_path": str(db_path),
                "namespace": "design_examples",
                "records": [
                    {
                        "content": "Design requirement: include graceful shutdown and runtime monitoring.",
                        "metadata": {"kind": "requirement"},
                    }
                ],
            },
            request_id=f"{request_id}:seed_memory",
            dependencies={},
        )
        pattern = RAGPattern(
            reasoning_delegate=DirectLLMCall(llm_client=llm_client, tracer=tracer),
            memory_store=store,
            memory_namespace="design_examples",
            memory_top_k=3,
            write_back=False,
            tracer=tracer,
        )
        result = pattern.run(
            "Draft a concise architecture recommendation for a serviceable edge device.",
            request_id=request_id,
        )

    # Print the result as deterministic, sorted JSON.
    summary = result.summary()
    print(json.dumps(summary, ensure_ascii=True, indent=2, sort_keys=True))


if __name__ == "__main__":
    main()
```
Expected Results
Run Command
```bash
PYTHONPATH=src python3 examples/patterns/rag.py
```
Example output shape (values vary by run):
```json
{
  "success": true,
  "final_output": "<example-specific payload>",
  "terminated_reason": "<string-or-null>",
  "error": null,
  "trace": {
    "request_id": "<request-id>",
    "trace_dir": "artifacts/examples/traces",
    "trace_path": "artifacts/examples/traces/run_<timestamp>_<request_id>.jsonl"
  }
}
```
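Each run also writes a JSONL trace file at the `trace_path` shown in the payload, one JSON object per line. A hedged sketch for summarizing such a file offline (the helper name and the key-count summary are illustrative; the actual per-event schema is whatever `Tracer` emits):

```python
import json
from pathlib import Path


def summarize_trace(path: Path) -> dict[str, int]:
    # Count how often each top-level key appears across trace events.
    # JSONL format: one JSON object per non-empty line.
    counts: dict[str, int] = {}
    for line in path.read_text().splitlines():
        if not line.strip():
            continue
        record = json.loads(line)
        for key in record:
            counts[key] = counts.get(key, 0) + 1
    return counts
```

Pointing this at the emitted trace file gives a quick sanity check that events were recorded without parsing the full schema.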