Skip to main contentMoss is the runtime for real-time semantic search in conversational apps. It delivers sub-10 ms lookups and instant index updates without extra infrastructure. It runs in the browser, on-device, or in the cloud—wherever your agent lives—so search feels native. Connect your data once; Moss packages, distributes, and keeps indexes fresh.
- Sub-10 ms lookups with instant updates
- No infra to run; local-first with optional sync
- Browser, device, or cloud—same API
Where Moss shines
- Sub-10 ms answers for docs/FAQ/search
- Ground agents with your data without centralizing user info
- Local or hybrid embeddings; minimal infra
Using Moss Portal
- Sign up at Moss, confirm email, and sign in
- From the portal, click Create Index and copy your Project ID and Project Key for your SDK
- Join our Discord to get onboarded: Moss Discord
Samples
- View samples repo: moss-samples on GitHub
- JavaScript:
javascript/comprehensive_sample.ts, javascript/load_and_query_sample.ts
- Python:
python/comprehensive_sample.py, python/load_and_query_sample.py
- Adapt by swapping the FAQ data with your own, or plug Moss calls into your app
How it works (at a glance)
- Index: Convert your data into an efficient local index
- Embeddings: Generate semantic vectors locally or via a remote service
- Retrieval: Query with semantic similarity or hybrid approaches
- Storage: Persist indexes locally and optionally sync to cloud
Next steps