Planner · Executor · Critic · RAG

Your AI
Admissions
Committee

Admitly runs specialized Planner-Executor-Critic pipelines — Opus where judgment compounds, Haiku where work is mechanical. The Critic scores every draft on a 4-axis rubric calibrated against real SOPs from a study-abroad consulting partner, looping up to 3x until it passes.

Get started free Sign in

78%

SOP Rubric Pass-rate

Specialized Pipelines

Critic Iter. Cap

<200ms

Time to First Token

admitly — agent pipeline

█

avg $0.004 / query

Built with

Claude Opus 4.7LLM

Claude Sonnet 4.6LLM

Claude Haiku 4.5LLM

Next.js 14Framework

SupabaseDatabase

pgvectorVector DB

TypeScriptLanguage

Tailwind CSSStyling

ZodValidation

Prompt CachingOptimization

Tool UseClaude API

Server-Sent EventsStreaming

Row-Level SecuritySecurity

Structured OutputSchema

Claude Opus 4.7LLM

Claude Sonnet 4.6LLM

Claude Haiku 4.5LLM

Next.js 14Framework

SupabaseDatabase

pgvectorVector DB

TypeScriptLanguage

Tailwind CSSStyling

ZodValidation

Prompt CachingOptimization

Tool UseClaude API

Server-Sent EventsStreaming

Row-Level SecuritySecurity

Structured OutputSchema

Interactive Pipeline

Agent Orchestration Pipeline

Click any stage to inspect its input, output, and prompt. Toggle scenarios to see how the routing path changes.

Match pipeline: Planner decomposes fit-scoring, Executor ranks against RAG, Critic checks alignment.

Planner

Task Decomposer

opus-4.7 · $15/M

Strong-model planning. Decomposes the request into ordered sub-tasks for the Executor, decides what RAG context is needed, and sets the iteration budget (2–3 loops) so latency stays predictable.

Prompt Pattern

You are the Planner. Given pipeline="{tag}", profile, target → emit ordered sub-tasks[] with required RAG sources. Cap iterations at 3.

Mock Input

{
  "pipeline": "sop",
  "profile": { "publications": 1, "research_area": "instruction tuning" },
  "target": "CMU LTI PhD"
}

Mock Output

streaming

router_output.json

█

AI Engineering

Production-Grade Techniques

Not a wrapper. Every component is a deliberate engineering decision with measurable tradeoffs.

Hybrid RAG

pgvector cosine similarity + BM25 keyword re-ranking over 2,000+ program profiles. Retrieval latency under 50ms.

CMU LTI PhD

94%

Stanford CS PhD

91%

MIT EECS PhD

89%

UCB EECS PhD

85%

pgvectorcosine similarityBM25<50ms

Planner · Executor · Critic

Strong models where judgment compounds (Planner, Critic). Light models where work is mechanical (Executor). Loop capped at 3 iterations.

haiku-4.5

$0.25/MExecutor · Parse

opus-4.7

$15/MPlanner · Critic

sonnet-4.6

$3/MRouting · fallback

judgment-where-it-mattersiter ≤ 3

Domain Calibration + Feedback Loop

Rubric calibrated against real SOP cases from a study-abroad consulting partner. In-product 👍/👎 ratings flow back into prompt iteration.

Offline

LLM-as-judge calibrated on partner-scored SOPs

Online

User 👍/👎 → prompt + rubric refinement

expert-calibratedclosed loop

SOP Eval Rubric

4-axis LLM-as-judge rubric. Tested on 10 real prompts: 78% passed without human intervention; failures pinpointed weak personalization.

structure

0.92

relevance

0.89

personalization

0.81

program align.

0.87

78% pass-rate · n=10 · loop on any axis < 0.7

structurerelevancepersonalizationprogram alignment

Prompt Caching

Ephemeral cache_control on system prompts >1024 tokens. Up to 90% token reduction on repeated RAG contexts.

MonSun

Cache hit rate — avg 84% this week

ephemeral90% token reductioncache_control

Structured Output

Tool-use with Zod-derived JSON schemas. No "please return JSON" prompting — Claude fills a type-safe tool call every time.

const SchoolMatchSchema = z.object({
  schools: z.array(z.object({
    name: z.string(),
    fit_score: z.number().min(0).max(100),
    reasons: z.array(z.string()),
    deadline: z.string(),
  })),
  confidence: z.number(),
});
// → Claude tool_use, never hallucinated JSON

Zod schematool_usetype-safe

Live Demos

See the Agents in Action

Watch the pipeline execute end-to-end on real applicant scenarios.

Match Schools · CS PhD applicant · NLP focus

User Input

Name: Alex Chen
GPA: 3.8/4.0 (top 5% of class)
GRE: 165Q / 158V / 5.0W
Research: NLP — 1 pub at ACL 2024 (co-author)
Advisor interest: LLM alignment, instruction tuning
Location preference: East Coast or no preference
Budget: prefer funded positions

Agent Thinking

Final Output

## Your Top Matches **1. CMU LTI PhD** — Fit: 94% Prof. Graham (LLM alignment) actively recruiting. Your ACL pub is a strong signal. Fully funded. Deadline: Dec 15, 2024 **2. Stanford CS PhD** — Fit: 91% NLP Group — competitive. Prof. Manning's lab. GRE Q 165 puts you in 85th percentile of admits. Deadline: Dec 5, 2024 **3. MIT EECS PhD** — Fit: 89% Language & Intelligence group. Strong fit for instruction tuning research. Deadline: Dec 1, 2024 *Total cost: $0.0042 · Latency: 2.3s*

Your AIAdmissionsCommittee