Planner · Executor · Critic · RAG

Your AI
Admissions
Committee

Admitly runs specialized Planner-Executor-Critic pipelines — Opus where judgment compounds, Haiku where work is mechanical. The Critic scores every draft on a 4-axis rubric calibrated against real SOPs from a study-abroad consulting partner, looping up to 3x until it passes.

78%
SOP Rubric Pass-rate
4
Specialized Pipelines
3
Critic Iter. Cap
<200ms
Time to First Token
admitly — agent pipeline
avg $0.004 / query

Built with

Claude Opus 4.7LLM
Claude Sonnet 4.6LLM
Claude Haiku 4.5LLM
Next.js 14Framework
SupabaseDatabase
pgvectorVector DB
TypeScriptLanguage
Tailwind CSSStyling
ZodValidation
Prompt CachingOptimization
Tool UseClaude API
Server-Sent EventsStreaming
Row-Level SecuritySecurity
Structured OutputSchema
Claude Opus 4.7LLM
Claude Sonnet 4.6LLM
Claude Haiku 4.5LLM
Next.js 14Framework
SupabaseDatabase
pgvectorVector DB
TypeScriptLanguage
Tailwind CSSStyling
ZodValidation
Prompt CachingOptimization
Tool UseClaude API
Server-Sent EventsStreaming
Row-Level SecuritySecurity
Structured OutputSchema
Interactive Pipeline

Agent Orchestration Pipeline

Click any stage to inspect its input, output, and prompt. Toggle scenarios to see how the routing path changes.

Match pipeline: Planner decomposes fit-scoring, Executor ranks against RAG, Critic checks alignment.

Planner

Task Decomposer

opus-4.7 · $15/M

Strong-model planning. Decomposes the request into ordered sub-tasks for the Executor, decides what RAG context is needed, and sets the iteration budget (2–3 loops) so latency stays predictable.

Prompt Pattern

You are the Planner. Given pipeline="{tag}", profile, target → emit ordered sub-tasks[] with required RAG sources. Cap iterations at 3.

Mock Input

{
  "pipeline": "sop",
  "profile": { "publications": 1, "research_area": "instruction tuning" },
  "target": "CMU LTI PhD"
}

Mock Output

streaming
router_output.json
AI Engineering

Production-Grade Techniques

Not a wrapper. Every component is a deliberate engineering decision with measurable tradeoffs.

Hybrid RAG

pgvector cosine similarity + BM25 keyword re-ranking over 2,000+ program profiles. Retrieval latency under 50ms.

CMU LTI PhD
94%
Stanford CS PhD
91%
MIT EECS PhD
89%
UCB EECS PhD
85%
pgvectorcosine similarityBM25<50ms

Planner · Executor · Critic

Strong models where judgment compounds (Planner, Critic). Light models where work is mechanical (Executor). Loop capped at 3 iterations.

haiku-4.5
$0.25/MExecutor · Parse
opus-4.7
$15/MPlanner · Critic
sonnet-4.6
$3/MRouting · fallback
judgment-where-it-mattersiter ≤ 3

Domain Calibration + Feedback Loop

Rubric calibrated against real SOP cases from a study-abroad consulting partner. In-product 👍/👎 ratings flow back into prompt iteration.

Offline

LLM-as-judge calibrated on partner-scored SOPs

Online

User 👍/👎 → prompt + rubric refinement

expert-calibratedclosed loop

SOP Eval Rubric

4-axis LLM-as-judge rubric. Tested on 10 real prompts: 78% passed without human intervention; failures pinpointed weak personalization.

structure
0.92
relevance
0.89
personalization
0.81
program align.
0.87
78% pass-rate · n=10 · loop on any axis < 0.7
structurerelevancepersonalizationprogram alignment

Prompt Caching

Ephemeral cache_control on system prompts >1024 tokens. Up to 90% token reduction on repeated RAG contexts.

MonSun

Cache hit rate — avg 84% this week

ephemeral90% token reductioncache_control

Structured Output

Tool-use with Zod-derived JSON schemas. No "please return JSON" prompting — Claude fills a type-safe tool call every time.

const SchoolMatchSchema = z.object({
  schools: z.array(z.object({
    name: z.string(),
    fit_score: z.number().min(0).max(100),
    reasons: z.array(z.string()),
    deadline: z.string(),
  })),
  confidence: z.number(),
});
// → Claude tool_use, never hallucinated JSON
Zod schematool_usetype-safe
Live Demos

See the Agents in Action

Watch the pipeline execute end-to-end on real applicant scenarios.

Match Schools · CS PhD applicant · NLP focus

User Input
Name: Alex Chen
GPA: 3.8/4.0 (top 5% of class)
GRE: 165Q / 158V / 5.0W
Research: NLP — 1 pub at ACL 2024 (co-author)
Advisor interest: LLM alignment, instruction tuning
Location preference: East Coast or no preference
Budget: prefer funded positions
Agent Thinking
Final Output
## Your Top Matches **1. CMU LTI PhD** — Fit: 94% Prof. Graham (LLM alignment) actively recruiting. Your ACL pub is a strong signal. Fully funded. Deadline: Dec 15, 2024 **2. Stanford CS PhD** — Fit: 91% NLP Group — competitive. Prof. Manning's lab. GRE Q 165 puts you in 85th percentile of admits. Deadline: Dec 5, 2024 **3. MIT EECS PhD** — Fit: 89% Language & Intelligence group. Strong fit for instruction tuning research. Deadline: Dec 1, 2024 *Total cost: $0.0042 · Latency: 2.3s*