Performance and Costs
Use this guide to balance output quality, run time, and provider spend.
What Drives Cost
Primary drivers:
- Number of requested outputs (
contentTargetscounts) - Model choice for text and image generation
- Retries caused by transient provider failures
- Higher token budgets (
maxTokens) for long outputs
You can inspect per-run totals in generation.analytics.json.
What Drives Runtime
Primary drivers:
- Total output count and mix of content types
- Primary content type (
articleprimary uses structured flow; non-article primary uses generic flow) - Image model speed and retry behavior
- Network latency and provider backoff windows
Article-primary runs execute structured article stages; non-article-primary runs execute generic primary stages and still render one primary cover image.
Cost-Control Patterns
- Validate with dry-run first:
ideon write --dry-run "Your idea" --primary article=1 --secondary x-thread=1 --secondary x-post=1
- Start with fewer variants, then scale:
ideon write "Your idea" --primary article=1 --secondary x-post=1
-
Use job files for repeatable experiments and adjust one variable at a time.
-
Prefer faster models during exploration, then switch to quality-focused models for final runs.
Runtime-Control Patterns
- Keep
contentTargetstight in early iterations. - Use
x-threadonly when thread structure is truly needed. - Resume interrupted runs instead of restarting from scratch:
ideon write resume
- If a run repeatedly fails at one stage, diagnose with Troubleshooting before increasing target counts.
Practical Workflow
- Run dry-run and confirm orchestration and outputs.
- Run a low-count live generation.
- Inspect markdown quality and
generation.analytics.json. - Increase counts or switch models.
- Save the resolved
job.jsonfor reproducible future runs.