Declarative DSL
Compose conversation blueprints like React components, turning dataset design into a predictable, readable workflow.
Like React, but for datasets: a declarative, type-safe DSL for composing conversations and generating thousands of AI-powered training examples.
import {
  generateDataset,
  generatedUser,
  generatedAssistant,
  assistant,
  oneOf,
  times,
  between,
} from "@qforge/torque";
import { openai } from "@ai-sdk/openai";

await generateDataset(
  () => [
    generatedUser({ prompt: "Friendly greeting or introduction" }), // AI generated
    oneOf([
      // pick one randomly (weights are optional)
      { value: assistant({ content: "Hello!" }), weight: 0.3 }, // static
      generatedAssistant({ prompt: "Respond to greeting" }), // AI generated, gets remaining weight
    ]),
    ...times(between(1, 3), [
      generatedUser({
        prompt: "Chat about the weather. Optionally mention the previous message",
      }),
      generatedAssistant({ prompt: "Respond to user. Short and concise." }),
    ]),
  ],
  {
    count: 2, // number of examples
    model: openai("gpt-5-mini"), // any ai-sdk model
    seed: 42, // replayable RNG
    metadata: { example: "quick-start" }, // optional per-row metadata
  }
);

Torque borrows from the best product-engineering playbooks: versioned blueprints, strict typing, reproducible runs, and streaming observability. No YAML, no bespoke tools: just code.
Harness Zod-powered typing with end-to-end inference so every AI training workflow stays safe, reliable, and maintainable.
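For example, here is a minimal sketch of pairing generated rows with a Zod schema. The row shape below is an assumption for illustration, not Torque's exported type; the point is that one schema gives you both runtime validation and an inferred TypeScript type.

import { z } from "zod";

// Hypothetical row shape for illustration -- check Torque's own
// exported types for the real output format.
const Message = z.object({
  role: z.enum(["user", "assistant", "system"]),
  content: z.string(),
});

const Row = z.object({
  messages: z.array(Message).min(2), // at least one user/assistant turn
  metadata: z.record(z.string()).optional(),
});

type Row = z.infer<typeof Row>; // end-to-end inferred TypeScript type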
Generate with OpenAI, Anthropic, DeepSeek, vLLM, LLaMA.cpp, or any other model provider compatible with the AI SDK.
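As a sketch, switching providers is just a different model argument; the Anthropic model id below is illustrative.

import { generateDataset, generatedUser, generatedAssistant } from "@qforge/torque";
import { anthropic } from "@ai-sdk/anthropic";

await generateDataset(
  () => [
    generatedUser({ prompt: "Ask a support question" }),
    generatedAssistant({ prompt: "Answer helpfully and briefly" }),
  ],
  {
    count: 10,
    model: anthropic("claude-3-5-haiku-latest"), // swap in any ai-sdk provider
  }
);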
Blend handcrafted prompts with AI dataset generation to produce realistic multi-message conversations at scale.
Reuse context across runs to minimize token spend while keeping dataset scaling fast, efficient, and reproducible.
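A minimal sketch of that reproducibility, using only the options from the quick-start above: pin the seed and the run replays the same random choices (caching internals are the library's concern; the seed is the documented knob shown here).

import { generateDataset, generatedUser, generatedAssistant } from "@qforge/torque";
import { openai } from "@ai-sdk/openai";

// The same blueprint plus the same seed replays the same RNG
// decisions (oneOf picks, between counts) on every run.
const blueprint = () => [
  generatedUser({ prompt: "Ask a quick onboarding question" }),
  generatedAssistant({ prompt: "Answer in one or two sentences" }),
];

await generateDataset(blueprint, {
  count: 1000, // scale the run without touching the blueprint
  model: openai("gpt-5-mini"),
  seed: 7, // change the seed for a fresh but equally reproducible set
});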
Stream progress in a beautiful CLI with concurrent workers, deterministic seeds, and instant feedback loops.
Every team building AI products hits the same roadblocks when trying to create high-quality training data at scale.
Sometimes you don't have enough real data
Maintaining quality and consistency across thousands of examples is extremely time-consuming
Tool-calling patterns require intricate message sequences and are error-prone
Generating different conversation flows means rewriting everything or juggling a collection of hard-to-maintain scripts
Designing generators that are random yet reproducible is surprisingly complex
Getting AI to understand complex composition scenarios (nested variations, conditional flows) takes significant prompt engineering time
The React moment for dataset generation
Just like React transformed UI development from imperative DOM manipulation to composable components, Torque transforms dataset generation from manual JSON editing to declarative schemas.
Use smaller, cheaper models while benefiting from cache optimization for dramatically lower costs.
Write your dataset logic once, generate thousands of variations automatically
Torque is built for product teams that expect the same polish from their AI training data as they do from their production apps. First, let's take a quick look at the pain that keeps teams from shipping.
Teams hack together prompts in spreadsheets and docs. Scaling to thousands of examples means hours of copy/paste, QA, and regressions.
Without types or policy guards, the first time you catch a broken run is when your agent drifts in production: an expensive way to discover bugs.
Traditional tooling lacks version control, seeds, or caching. Every update turns into a new round of manual cleanup and token spend.
Torque replaces sprawling prompts and brittle scripts with a composable TypeScript toolkit. Every phase is automated, typed, and observable.
Describe each conversation as a reusable component. Torque keeps prompts, metadata, and policy checks together in strongly typed modules.
Ship new flows faster with versionable building blocks, not ad-hoc scripts.
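As a sketch (the helper names here are illustrative, not part of the library), blueprints are just functions, so they compose like components:

import {
  generateDataset,
  generatedUser,
  generatedAssistant,
  assistant,
  oneOf,
  times,
  between,
} from "@qforge/torque";
import { openai } from "@ai-sdk/openai";

// A reusable "component": a greeting exchange.
const greeting = () => [
  generatedUser({ prompt: "Friendly greeting" }),
  oneOf([
    { value: assistant({ content: "Hello!" }), weight: 0.3 },
    generatedAssistant({ prompt: "Respond to greeting" }),
  ]),
];

// Another component: a short support exchange.
const supportTurn = () => [
  generatedUser({ prompt: "Ask a billing question" }),
  generatedAssistant({ prompt: "Answer clearly and concisely" }),
];

// Compose components into a full conversation blueprint.
await generateDataset(
  () => [...greeting(), ...times(between(1, 3), supportTurn())],
  { count: 100, model: openai("gpt-5-mini"), seed: 42 }
);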
Spin up tens of workers, hydrate examples with AI, and cache deterministic seeds so teams get repeatable datasets without wasting tokens.
Every run logs audit trails, token spend, and policy results; plug it straight into CI.
Replay production traffic through model providers, enforce quality gates, and ship regressions straight into Slack or GitHub checks.
Close the loop with built-in evaluations and targeted retries when policies fail.
Join developers building the future of LLM training data
bun add @qforge/torque
npm install @qforge/torque