Retrieval System

Search movies by meaning — not keywords.

CineSeek is a semantic movie search system that understands natural-language queries. It combines embedding-based retrieval, FAISS-powered candidate search, and an optional LLM agent for reranking and explanation.

System Snapshot

Retrieval-first architecture, with an optional LLM agent.

The system keeps retrieval explicit and efficient: sentence embeddings, FAISS-based nearest neighbor search, and a strong frozen embedding baseline selected through evaluation. An optional LLM agent improves ranking and adds explanations, without replacing the underlying retrieval pipeline.

MSRD
Real query-to-movie relevance labels
FAISS
ANN retrieval for low-latency search
Agent
LLM-powered reranking, query rewriting, and explanation
What It Does

Discover movies from natural-language descriptions.

CineSeek is built for real, fuzzy movie queries — not just exact title matches. It connects natural-language intent to relevant results through semantic retrieval and optional agent-based refinement.

Semantic Search

Queries and movies are mapped into a shared embedding space, so the system can retrieve relevant titles beyond exact keyword overlap.

Fast Retrieval

FAISS provides low-latency nearest neighbor search, making the retrieval layer fast enough for interactive search experiences.

Agent Refinement

An optional LLM agent can refine vague queries, rerank retrieved candidates, and generate short explanations for the final results.

Live System

What happens after you submit a query

1. Retrieve

The query is embedded and matched against a FAISS index built from movie representations.

2. Rerank

An LLM agent reorders the top candidates and helps resolve vague, broad, or ambiguous descriptions.

3. Explain

The system returns short explanations so the final recommendations feel more transparent and easier to inspect.