All Projects
The Problem
Everyone benchmarks LLM accuracy and speed. Nobody measures creative exhaustion — when a model starts recycling ideas. This matters for real-world use cases like brainstorming, content generation, and ideation.
The Solution
Built a 5-stage pipeline (generate → quality → verify → embed → deduplicate) that measures how many truly unique ideas each model produces before repeating. Used HNSW vector search for semantic deduplication and Perplexity for web-grounded verification.
Technical Highlights
- 14 LLM providers integrated — direct APIs + OpenRouter with auto-shadowing (direct keys override routing)
- Union-Find deduplication with survivor promotion — cluster-based, best entry per cluster survives
- HNSW vector search (Voyage AI embeddings, 768-d cosine) for semantic similarity
- 3 meta-AI loops: seed evolver, seed gatekeeper (Claude Opus), mobile rewriter
- Per-model cost tracking, budget caps, and stratified batch sampling
- Zod schema validation at every I/O boundary
Tech Stack
Language
TypeScript 5.6strict modeESMNode.js 20+
LLM
14 ProvidersAuto-ShadowingBudget CapsCost Tracking
Embeddings
Voyage AIDashScopeGoogle Vertex
Verification
Perplexity Sonar ProWeb-Grounded
Vector Search
hnswlib-nodeHNSW Cosine768-d
Tooling
ZodCommander.jsp-limitCustom Mutex
Key Numbers
1,856 generations10 frontier LLMs14 providers16x cost spread$69 total cost