September 25, 2025
Product evaluation has always been messy. Frameworks exist, but applying them consistently is time-consuming, subjective, and often limited to a few “big bet” ideas. Smaller, incremental ideas get lost.
What’s different now? The rise of agentic LLMs — models that don’t just generate text, but can fetch data, apply structured reasoning, and produce actionable outputs. Rubricon harnesses this capability to transform product evaluation from a manual, biased process into a scalable, transparent, and repeatable system.
At its heart, Rubricon is not just a rubric — it’s a workflow powered by AI agents.
1. Input Integration
2. Rubric Application (UMSF)
3. LLM Agent Reasoning
4. Structured Output
Previous attempts at scoring ideas struggled with scale and bias. Rubricon’s agentic architecture enables a fundamental shift:
1. Agent fetches Jira issue IPI-19
2. Agent fetches UMSF rubric (v1.0)
3. Agent LLM applies rubric → scores + justifications
4. Agent outputs structured JSON with score, band, risks, opportunities
5. Workflow posts summary back into Jira
This loop is the engine that makes Rubricon practical, scalable, and trustworthy.
With Rubricon, evaluation is no longer:
Instead, teams can focus on strategic discussion, not whether someone applied the rubric correctly.
As agentic LLMs evolve, Rubricon will gain:
This isn’t just tooling — it’s a new discipline. Product evaluation as code.
Agentic LLMs mark a turning point in how we evaluate product features. With Rubricon, we’ve crossed the threshold — from subjective debate to structured, AI-augmented decision-making.
Next: we’ll show how the UMSF rubric is encoded for agent reasoning and what a sample evaluation looks like end-to-end.
{ "issueKey": "IPI-19", "issueSummary": "Add in-app referral rewards", "type": "B", "overall_score_100": 74, "priority_band": "Medium", "rubric_version": "1.0", "scores": { "M_market": 4, "A_adoption": 4, "T_effort": 2, "S_strategy": 3, "R_risk": 2 }, "opportunities": [ "Drives organic growth via referrals", "Increases engagement without ad spend" ], "risks": [ "Engineering complexity for fraud detection", "Requires marketing ops support" ], "dependencies": [ "Payments team", "Fraud detection API" ], "next_actions": [ "Run quick technical feasibility spike", "Validate reward mechanics with 10 pilot users" ], "confidence_0_1": 0.7 }