Product — NeuralDreams

The pipeline

From raw corpus to trustworthy query.

Every dataset moves through the same disciplined pipeline before it ever reaches you.

01 / VECTORIZE

Vectorize

Source data is cleaned, deduplicated and chunked, then embedded with domain-tuned models into dense vectors.

02 / INDEX

Index

Vectors are assembled into a hierarchical navigable small-world (HNSW) graph, tuned per dataset for recall and latency.

03 / QUERY

Query

Approximate nearest-neighbor search returns the most relevant vectors in single-digit milliseconds — online, on disk, or over the wire.

Catalog

Datasets, pre-vectorized
and search-ready.

Cleaned, chunked, embedded and indexed. Plug one into your RAG stack the moment you get access — no pipeline to build.

Indexed001

Newsletter Intelligence

scribem8.com

Curated newsletter and editorial corpus — semantic search across writers, topics and trends.

12.8M

chunks

1536

dims

cosine

metric

Indexed002

OEM Documentation

oedocs.com

Original equipment manuals, spec sheets and service docs — vectorized for technical retrieval at scale.

34.1M

chunks

1536

dims

cosine

metric

Indexed003

AI Agents Directory

638labs.com

A live registry of production AI agents — searchable by capability, interface and deployment surface.

2.3M

chunks

1024

dims

dot

metric

In pipeline004

SEC Filings

sec.gov · EDGAR

10-K, 10-Q and 8-K filings, parsed and chunked for financial research and disclosure search.

—

chunks

1536

dims

Q3

eta

In pipeline005

Energy Sector Regulations

multi-jurisdiction

Federal and state energy regulations, tariffs and permitting rules — structured for compliance retrieval.

—

chunks

1536

dims

Q3

eta

Request a dataset

Need a corpus we don't list yet? We vectorize bespoke sources on request.

Tell us what you need →

Delivery modes

Three ways to consume the same data.

Pick the surface that fits your workflow — or mix them. The underlying vectors and index are identical.

Interactive query online

Search any dataset from the browser — no setup. Tune k, filters and metrics in a live console and copy the request straight into your app.

Hosted search console + saved queries
Live tuning of k, filters, metric
Export to cURL / SDK in one click

Open the console →

Mode 02

Direct download indexes

Pull a versioned HNSW index plus its backing-DB bundle. Run it next to your app — fully offline, air-gap friendly, your data never leaves.

Versioned artifacts + checksums
MongoDB / pgvector / FAISS bundles
Perpetual or subscription license

# pull a versioned bundle
$ nd pull oedocs/hnsw@v3

Mode 03

API access

Skip the infra. Query our managed vector store over a single authenticated endpoint, with SDKs and usage-based pricing that scales on demand.

One REST / gRPC endpoint
Python / TS / Go SDKs
Always on the latest version

POST /v1/oedocs/search
{ "query": embed(q), "k": 10 }

Mode 01 · preview

The query console, live.

search ›offshore wind permitting timelinesoedocs · k=5 · 7ms

#top matchdatasetdist

01BOEM lease-to-operation permitting sequenceOEDOCS0.0118

02State coastal commission review windowsENERGY0.0163

03Interconnection queue study phasesENERGY0.0204

04Environmental impact statement scheduleOEDOCS0.0231

05Turbine foundation inspection cadenceOEDOCS0.0258

Request access

Start querying precision data.

Tell us your use case and we'll set you up with a dataset, an index, or an API key — plus a short demo.

Request access → About NeuralDreams

Vectorize. Index. Query. Delivered three ways.

From raw corpus to trustworthy query.

Vectorize

Index

Query

Datasets, pre-vectorizedand search-ready.

Newsletter Intelligence

OEM Documentation

AI Agents Directory

SEC Filings

Energy Sector Regulations

Three ways to consume the same data.

Interactive query online

Direct download indexes

API access

The query console, live.

Start querying precision data.

Datasets, pre-vectorized
and search-ready.