The AI Runtime Field Lab

The AI Runtime Field Lab

Real AI problems, scoped for serious builders.

A public library of sanitized AI workflow problems from startups and operators, turned into buildable briefs with a stated reliability bar.

Submitting or building a problem does not guarantee mentorship, review, publication, referrals, funding, or hiring outcomes.

What this is

A library of real AI workflow problems scoped to be buildable, testable, and useful as portfolio-grade artifacts. Each brief includes the pain, the workflow, the inputs and outputs, evaluation ideas, the boundaries, and a reliability target. The goal is to help builders work on production-shaped problems instead of random demos.

What this is not

  • Not a bootcamp
  • Not a job board
  • Not a recruiting agency
  • Not an accelerator
  • Not a free consulting marketplace
  • Not a promise that companies review submissions

The problem library

Enterprise SaaS Open

Answer the Buyer: A Sales Engineer Copilot

Build a retrieval copilot that answers technical buyer questions from product, security, and integration documents, with citations and honest gaps.

Intermediate RAG Copilot R2 Draft

Reliability focus retrieval, citation, calibrated refusal

View Brief
Legal Open

Cite or Strike: A Citation Verifier for Legal Drafting

Build a verifier that checks every citation in an AI-drafted legal document against a real source and flags or strikes anything it cannot ground.

Intermediate Verifier R3 Draft

Reliability focus citation grounding, fabrication detection, export gating

View Brief
Productivity Open

From Notes to Owners: A Meeting-to-Action Operator

Build an agent that turns raw meeting notes into owners, action items, risks, and follow-ups, grounded in what was actually said.

Starter Agentic Workflow R2 Draft

Reliability focus extraction, attribution, grounded refusal

View Brief
Internal Tools Open

One Source of Truth: A Shared Context Layer Across Tools

Build a context layer that reconciles facts scattered across code, chat, docs, CRM, and product tools into one owned, cited, conflict-aware source of truth.

Advanced Knowledge Agent R3 Draft

Reliability focus source reconciliation, conflict detection, ownership and freshness

View Brief
AI Infrastructure Open

Right-Size Every Call: An AI Cost and Latency Router

Build middleware that routes each request between small and large models by predicted complexity, measuring the tradeoff across cost, latency, and quality.

Advanced Optimization Middleware R3 Draft

Reliability focus routing policy, quality floor, measurement

View Brief
E-commerce Open

Routing the Mess: A Merchant Operations Copilot

Build a merchant copilot that routes messy requests across catalog, sales, operations, and support while preventing unsafe mutations.

Intermediate Vertical Agent R3 Draft

Reliability focus routing, safe actions, layered evals

View Brief
B2B SaaS Open

Triage First: A Support Ticket Routing Agent

Build an agent that classifies inbound support tickets, drafts a suggested reply, escalates uncertain cases, and explains why it routed each one.

Starter Agentic Workflow R2 Draft

Reliability focus classification, calibrated escalation, explainability

View Brief

How problems are selected

A problem is accepted when it passes all seven questions.

  1. Can one solo builder ship it in 20 to 30 focused hours?
  2. Does it have a clear input and a clear output?
  3. Is there a measurable evaluation target?
  4. Can it run on public, synthetic, or sanitized data?
  5. Does it map to a real buyer or operator pain?
  6. Can its failure be meaningfully analyzed?
  7. Can the teardown teach the community?

Problems requiring confidential data, production credentials, regulated workflows, vague business goals, or heavy integration work are not accepted.

Have a real AI workflow problem? Submit it.

A good problem is specific, painful, buildable with sample or synthetic data, and evaluable without production access. Selected problems may be rewritten into public briefs, and submitting creates no contractor, delivery, or hiring relationship.

Submit a Problem

Want to build one of these?

Builders use Field Briefs to create portfolio-grade artifacts. Some problems may later become cohort capstones or teardown candidates.

Join the builder interest list