Knowledge RAG Architecture — Operational Data to Semantic Search

Click any component to explore it in the full case study.

Live Wait Time Feed

Intraday attraction wait times sampled every 5 minutes. Aggregated into daily avg/median/percentile docs per ride

Cosmos DB · park_ops

Sellout & Crowd Events

Daily sellout events, top-10 rankings, crowd flow patterns, and historical anomaly flags across Park Whisperer properties

Cosmos DB · selloutEventv2

Park Entities & Metadata

286 park entities: attraction classifications, area mappings, operational flags, popularity tiers, and hidden gem scores

Cosmos DB · parkEntities

ETL Pipeline — build_knowledge_from_ops.py · Rolling 7–14 day window

Cosmos DB partition scans for ops window

low_wait_gem, crowd_trend, sellout_pattern, hidden_gem extractors

Quality gate: minimum signal threshold, dedup, freshness check

Azure OpenAI text-embedding-3-small → 1536-dim vector per doc

PostgreSQL + pgvector · HNSW index on embedding column

PostgreSQL + pgvector Knowledge Base

Primary retrieval store. 100% of knowledge docs write here. Cosmos used only for timeseries aggregates.

HNSW cosine index 1536-dim embeddings Partitioned by date (YYYY-MM-DD) precomputed_docs table pg_point_read_kpi fast path

Multi-Strategy Query Router

Every question is classified before retrieval. Fast deterministic paths fire first; semantic search is the fallback.

SLM fast-path (Phi-3 Mini) Confidence gate ≥ 0.85 Semantic: embed + BM25 rerank Structured: dynamic Cosmos SQL Date drift correction Strategy override layer

Park Operations Agent

Natural language queries from park operations staff. Live wait times, crowd patterns, anomaly alerts, and hidden gem recommendations

Azure Functions · REST API

SLM Content Pipelines

queryIntelligence called as a tool by content generation agents — supplies real operational data grounding to AI-written park articles and social posts

Tool Call · LangChain

Portal UI calls semantic search to surface crowd trends, sellout heatmaps, and ride performance comparisons for park intelligence monitoring

SWA · pipelineMonitor.html

1.6M

Records indexed

1536

Embedding dimensions

7–14d

Rolling data window

3

Retrieval strategies

Stack Azure OpenAI text-embedding-3-small PostgreSQL + pgvector HNSW index Cosmos DB Phi-3 Mini (SLM) Azure Functions LangChain