This is a case study submission for Blooming Health's AI Engineer role. It implements a Prompt Similarity & Deduplication Service — a tool for managing growing prompt template libraries in voice AI platforms by finding semantic overlaps, searching by meaning, and identifying candidates for consolidation.
Prompts are embedded with OpenAI's text-embedding-3-small and stored in SQLite. Template variables ({{var}}) are normalized before embedding so variable names don't skew similarity. Select a dataset in the left panel and click Load & Embed to begin.
Every operation is timed with time.perf_counter() and reported in the X-Elapsed-Ms response header (or the elapsed_ms body field for embedding generation).
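A minimal sketch of how such timing could be wired up, assuming a generic wrapper (the `timed` helper here is illustrative, not the service's actual code):

```python
import time

def timed(fn, *args, **kwargs):
    """Run fn and return (result, elapsed milliseconds) measured
    with time.perf_counter(), a monotonic high-resolution clock."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return result, elapsed_ms

# Usage: wrap any handler body, then attach elapsed_ms to the
# X-Elapsed-Ms header or the response body.
result, ms = timed(sum, range(1_000_000))
```

time.perf_counter() is preferred over time.time() here because it is monotonic and unaffected by system clock adjustments.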
Semantic search latency includes the OpenAI API round-trip to embed the query on the fly, so it's much slower than similarity lookup, which operates entirely on local numpy arrays.
Duplicate clustering is O(n²) in the number of prompts — the benchmark shows how this scales.
Run a suite of benchmarks against the loaded dataset to measure query latency at scale. Tests semantic search (includes OpenAI API call), similarity lookup (local numpy), and duplicate clustering (all-pairs comparison). Each test runs multiple iterations for stable averages.
Each query is embedded with text-embedding-3-small and compared against all stored prompt embeddings using cosine similarity (which reduces to a dot product, since OpenAI embeddings are L2-normalized).
Template variables like {{question_text}} are normalized to [variable] before embedding so variable names don't skew results.
Results are ranked by similarity score with no minimum threshold — the full ranked list is returned up to the limit.
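The ranking step can be sketched as a single dot product plus an argsort. This is a toy example with small random vectors standing in for real 1536-dim embeddings; `top_k` is a hypothetical helper, not the service's actual function:

```python
import numpy as np

def top_k(query_vec: np.ndarray, matrix: np.ndarray, k: int = 5):
    """Rank stored embeddings by cosine similarity to the query.

    Assumes query_vec and the rows of matrix are L2-normalized,
    so cosine similarity reduces to a plain dot product.
    """
    scores = matrix @ query_vec            # (n,) similarity scores
    order = np.argsort(scores)[::-1][:k]   # highest first, up to the limit
    return [(int(i), float(scores[i])) for i in order]

# Toy data: 3 fake "embeddings" in 4 dims (real vectors are 1536-dim).
rng = np.random.default_rng(0)
m = rng.normal(size=(3, 4))
m /= np.linalg.norm(m, axis=1, keepdims=True)  # L2-normalize rows
q = m[1]                                       # query identical to row 1
print(top_k(q, m, k=2))                        # row 1 ranks first, score ~1.0
```

Because no minimum threshold is applied, even weak matches appear in the tail of the ranked list.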
Search prompts by meaning, not keywords. Enter a natural language description of what you're looking for and the service will find the most semantically relevant prompt templates. Try queries like "how to handle user interruptions" or "verify someone's identity" to see how the system matches intent rather than exact wording.
Similarity lookup runs as a single numpy matrix operation over the locally stored embeddings.
The threshold parameter filters out weak matches — only prompts scoring above the threshold are returned. The queried prompt is always excluded from its own results.
Content previews are truncated to 150 characters for display.
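A sketch of that lookup, assuming L2-normalized embedding rows (the `similar_to` name and toy 2-dim vectors are illustrative only):

```python
import numpy as np

def similar_to(idx: int, matrix: np.ndarray, threshold: float = 0.8):
    """Find prompts similar to the one at idx via one matrix-vector product.

    Rows of matrix are assumed L2-normalized embeddings; the queried
    prompt is excluded from its own results, and scores below the
    threshold are filtered out.
    """
    scores = matrix @ matrix[idx]
    hits = [(int(i), float(s)) for i, s in enumerate(scores)
            if i != idx and s >= threshold]
    return sorted(hits, key=lambda t: -t[1])

# Toy data: rows 0 and 1 point in nearly the same direction, row 2 is orthogonal.
m = np.array([[1.0, 0.0], [0.96, 0.28], [0.0, 1.0]])
print(similar_to(0, m))  # row 1 passes the 0.8 threshold; row 2 is filtered out
```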
Select any prompt from the library and find others with similar intent or purpose. Use the threshold slider to control how strict the matching is — higher values return only very close matches, lower values surface broader relationships. This is useful for identifying prompts that overlap in purpose and could potentially be consolidated.
The O(n²) all-pairs comparison is fine for hundreds of prompts; for thousands and beyond you'd switch to approximate nearest neighbors (FAISS, HNSW).
Automatically detect groups of prompts that are near-duplicates and could be merged into a single template. Adjust the threshold to control sensitivity — at 0.90+ only near-identical prompts cluster together, while lower values reveal broader families of related prompts. Each cluster shows the average similarity between its members.
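One way to implement the all-pairs clustering described above is union-find over similarity edges; this is a hedged sketch under the assumption of L2-normalized rows, not the service's actual algorithm:

```python
import numpy as np

def duplicate_clusters(matrix: np.ndarray, threshold: float = 0.9):
    """Group near-duplicate prompts by all-pairs cosine similarity — O(n^2).

    Rows of matrix are assumed L2-normalized; pairs scoring at or above
    the threshold are merged with union-find, and singletons are dropped.
    """
    n = len(matrix)
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    sims = matrix @ matrix.T               # full n x n similarity matrix
    for i in range(n):
        for j in range(i + 1, n):
            if sims[i, j] >= threshold:
                parent[find(i)] = find(j)  # merge the two clusters

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return [g for g in groups.values() if len(g) > 1]

# Toy data: rows 0 and 1 are near-duplicates, row 2 is unrelated.
m = np.array([[1.0, 0.0], [0.98, 0.199], [0.0, 1.0]])
print(duplicate_clusters(m, threshold=0.9))  # rows 0 and 1 cluster together
```

Averaging `sims[i, j]` over each cluster's member pairs would give the per-cluster average similarity the UI displays.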
Embeddings are stored in SQLite as float32 BLOBs (6KB per 1536-dim vector).
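A round-trip sketch of that storage scheme; the table and column names here are illustrative, not the service's actual schema:

```python
import sqlite3
import numpy as np

# Store a 1536-dim float32 embedding as a SQLite BLOB and read it back.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE prompts (id TEXT PRIMARY KEY, embedding BLOB)")

vec = np.random.default_rng(1).normal(size=1536).astype(np.float32)
conn.execute("INSERT INTO prompts VALUES (?, ?)", ("p1", vec.tobytes()))

row = conn.execute("SELECT embedding FROM prompts WHERE id = ?", ("p1",)).fetchone()
restored = np.frombuffer(row[0], dtype=np.float32)

print(len(row[0]))  # 6144 bytes: 1536 dims x 4 bytes each (~6KB)
assert np.array_equal(vec, restored)
```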
Before embedding, template variables like {{question_text}} are normalized to [variable] via regex — this is stored as normalized_content alongside the original.
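The normalization step could look like a single regex substitution (a sketch; the function name is hypothetical):

```python
import re

def normalize_template(content: str) -> str:
    """Replace {{variable}} placeholders with a generic [variable] token
    so embedding similarity reflects prompt structure, not variable naming."""
    return re.sub(r"\{\{\s*\w+\s*\}\}", "[variable]", content)

print(normalize_template("Please read {{question_text}} to {{member_name}}."))
# Please read [variable] to [variable].
```

Storing the normalized form alongside the original keeps display text intact while embeddings are computed over the variable-agnostic version.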
The layer field (engine, os) maps to the voice AI platform's prompt hierarchy: org → os → team → engine → directive.
Browse all prompt templates loaded into the system. Each prompt shows its ID, category, layer in the platform hierarchy, and full content text. Template variables (shown as {{variable_name}}) are placeholders filled at runtime by the voice AI platform.