Field Reviewer
Review a reliability bar, artifact, or field report.
The lightest way to contribute. You might react to a deployment checklist, a web-agent verification rubric, an eval plan, or a builder report.
- Time
- 10-15 minutes
- Output
- Anonymous or attributed reviewer note
- Best for
- FDEs, applied AI architects, AI infra engineers, senior builders
Example asks:
- Does this reliability bar reflect what you see in real deployments?
- What failure mode is missing?
- Would this artifact be useful to a field team?
- What would make this safer before customer rollout?
I can review a reliability bar
Field Note Contributor
Share one practical lesson from field AI work.
Written or audio, anonymous or attributed. We turn your answers into a short field note and share a draft before publishing.
- Time
- 20-30 minutes, async
- Output
- Published Field Note
- Best for
- FDEs, customer engineers, applied AI architects, founding engineers, solutions architects
Field Notes usually cover:
- what your role actually looks like
- what breaks between demo and deployment
- how you scope ambiguous customer problems
- what ready to ship means in practice
- what skills matter most in the field
- one lesson other AI builders should copy
I can contribute a field note
Lab Mentor
Help builders during a Field Lab.
Lab Mentors support a cohort working on a production-shaped AI problem. This can be a short office-hours session, async feedback, final report review, or demo-day judging.
- Time
- 45-60 minutes
- Output
- Builder feedback, report review, or final judging
- Best for
- Senior engineers, FDEs, startup operators, AI architects, product-minded technical leaders
Mentors help builders think through:
- architecture tradeoffs
- eval design
- customer-readiness
- failure modes
- trust and safety
- cost and latency constraints
- what would need to change before production
I can mentor a lab
Problem Scout
Submit a recurring field problem.
If you keep seeing the same AI deployment problem across customers, teams, or startups, submit it to the Problem Bank.
- Time
- 5-10 minutes
- Output
- An anonymized field problem
- Best for
- Founders, operators, FDEs, customer engineers, consultants, AI platform teams
Good problems are specific enough to test:
- Can a web-grounded agent qualify design-partner leads without unsupported claims?
- Can an internal RAG assistant handle permissions and citations well enough for rollout?
- Can an agent detect when to stop, verify, or escalate?
- Can an AI workflow turn messy customer calls into CRM updates without corrupting records?
I have a field problem to submit
Artifact Reviewer
Improve a checklist, rubric, eval, or runbook.
The Field Lab publishes reusable artifacts for production AI teams. Reviewers help make them sharper before they are widely shared.
- Time
- 10-20 minutes
- Output
- Improved artifact
- Best for
- People with practical deployment experience
Artifacts may include:
- AI Deployment Readiness Checklist
- Agent Escalation Checklist
- Web-Grounded Claim Verification Rubric
- RAG Reliability Bar
- Demo-to-Production Risk Review
- Field Report Template
- Founder-Facing Trust Report Template
I can review an artifact