Home/Product

Vectorize. Index. Query. Delivered three ways.

NeuralDreams turns high-signal corpora into AI-ready vector data — cleaned, embedded, and indexed for retrieval. Consume it however your stack prefers: query it online, download the index, or hit the API.

The pipeline

From raw corpus to trustworthy query.

Every dataset moves through the same disciplined pipeline before it ever reaches you.

01 / VECTORIZE

Vectorize

Source data is cleaned, deduplicated and chunked, then embedded with domain-tuned models into dense vectors.

02 / INDEX

Index

Vectors are assembled into a hierarchical navigable small-world (HNSW) graph, tuned per dataset for recall and latency.

03 / QUERY

Query

Approximate nearest-neighbor search returns the most relevant vectors in single-digit milliseconds — online, on disk, or over the wire.

Catalog

Datasets, pre-vectorized
and search-ready.

Cleaned, chunked, embedded and indexed. Plug one into your RAG stack the moment you get access — no pipeline to build.

Indexed001

Newsletter Intelligence

scribem8.com

Curated newsletter and editorial corpus — semantic search across writers, topics and trends.

12.8M
chunks
1536
dims
cosine
metric
Indexed002

OEM Documentation

oedocs.com

Original equipment manuals, spec sheets and service docs — vectorized for technical retrieval at scale.

34.1M
chunks
1536
dims
cosine
metric
Indexed003

AI Agents Directory

638labs.com

A live registry of production AI agents — searchable by capability, interface and deployment surface.

2.3M
chunks
1024
dims
dot
metric
In pipeline004

SEC Filings

sec.gov · EDGAR

10-K, 10-Q and 8-K filings, parsed and chunked for financial research and disclosure search.

chunks
1536
dims
Q3
eta
In pipeline005

Energy Sector Regulations

multi-jurisdiction

Federal and state energy regulations, tariffs and permitting rules — structured for compliance retrieval.

chunks
1536
dims
Q3
eta
Delivery modes

Three ways to consume the same data.

Pick the surface that fits your workflow — or mix them. The underlying vectors and index are identical.

Mode 02

Direct download indexes

Pull a versioned HNSW index plus its backing-DB bundle. Run it next to your app — fully offline, air-gap friendly, your data never leaves.

  • Versioned artifacts + checksums
  • MongoDB / pgvector / FAISS bundles
  • Perpetual or subscription license
# pull a versioned bundle
$ nd pull oedocs/hnsw@v3
Mode 03

API access

Skip the infra. Query our managed vector store over a single authenticated endpoint, with SDKs and usage-based pricing that scales on demand.

  • One REST / gRPC endpoint
  • Python / TS / Go SDKs
  • Always on the latest version
POST /v1/oedocs/search
{ "query": embed(q), "k": 10 }
Mode 01 · preview

The query console, live.

search ›offshore wind permitting timelinesoedocs · k=5 · 7ms
#top matchdatasetdist
01BOEM lease-to-operation permitting sequenceOEDOCS0.0118
02State coastal commission review windowsENERGY0.0163
03Interconnection queue study phasesENERGY0.0204
04Environmental impact statement scheduleOEDOCS0.0231
05Turbine foundation inspection cadenceOEDOCS0.0258
Request access

Start querying precision data.

Tell us your use case and we'll set you up with a dataset, an index, or an API key — plus a short demo.

Request access → About NeuralDreams