An embedded vector database in Rust with Python bindings — delivering 10× compression, sub-millisecond search, and zero training time.
pip install tqdb
Built for teams running large embedding datasets on real hardware budgets.
Quantize and search from the first insert. No offline phase, no index rebuild.
Store 1M 1536-dim embeddings in ~600 MB instead of ~6 GB of raw float32 with 2-bit compression.
Random rotation + a residual sketch keep inner-product estimates unbiased after compression, so search rankings stay faithful.
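To make the idea concrete, here is a minimal NumPy sketch of rotate-then-quantize scoring. It is illustrative only, not tqdb's implementation: it uses a plain 2-bit uniform quantizer, omits the residual sketch that tightens the estimate, and every name in it is made up for the example.

import numpy as np

rng = np.random.default_rng(0)
d = 1536

# Random orthogonal rotation: preserves inner products and spreads
# energy evenly across dimensions before low-bit quantization.
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))

x = rng.standard_normal(d).astype("f4")   # stored embedding
q = rng.standard_normal(d).astype("f4")   # query embedding

xr = Q @ x
step = np.abs(xr).max() / 2               # 4 uniform levels -> 2 bits/dim
codes = np.clip(np.floor(xr / step), -2, 1).astype(np.int8)
x_hat = (codes + 0.5) * step              # dequantized approximation of Q @ x

exact = float(q @ x)
approx = float((Q @ q) @ x_hat)           # rotate the query, score the codes
print(f"exact={exact:.2f}  approx={approx:.2f}")

Without the residual correction the 2-bit estimate is deliberately coarse; the point of the sketch is only that the rotation lets a tiny per-dimension code stand in for the full float vector at query time.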
MongoDB-style filters — $eq, $gt, $in, $contains — applied at query time.
Swap out ChromaDB or LanceDB with zero code changes. First-class LangChain support.
HNSW + AVX2/SIMD. In-process — no network hops, no serialization overhead.
No daemon, no Docker, no config files.
Just pip install tqdb and write Python.
from tqdb import Database
import numpy as np

# Open a database of 1536-dim vectors
db = Database.open("./my_db", dimension=1536)

# Insert a vector with metadata and the original document text
db.insert(
    "doc-1",
    np.random.randn(1536).astype("f4"),
    metadata={"topic": "ml", "year": 2026},
    document="Machine learning intro",
)

# Search with a metadata filter applied at query time
results = db.search(
    np.random.randn(1536).astype("f4"),
    top_k=5,
    filter={"year": {"$gte": 2026}},
)

for r in results:
    print(r["id"], r["score"], r["document"])
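The quickstart filter uses a single range operator; the other MongoDB-style operators listed above compose the same way across fields. A hedged continuation of the example, where the "title" field and the substring semantics of $contains are illustrative assumptions, not documented behavior:

results = db.search(
    np.random.randn(1536).astype("f4"),
    top_k=10,
    filter={
        "topic": {"$in": ["ml", "nlp"]},      # membership test
        "year": {"$gt": 2024},                # numeric comparison
        "title": {"$contains": "intro"},      # substring match (assumed semantics, illustrative field)
    },
)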
Embedded library, HTTP server, or drop-in replacement — pick your integration style.
Spin up a full HTTP service with multi-tenancy, RBAC, per-tenant quotas, and Prometheus metrics. Designed for teams that need centralized vector search without the overhead of a heavyweight database.
# Start the server
tqdb-server --port 8080 \
  --data-dir ./data

# Create a collection
curl -X POST localhost:8080/collections \
  -d '{"name":"docs","dim":1536}'

# Insert vectors
curl -X POST localhost:8080/collections/docs/upsert \
  -d '{"id":"a1","vector":[...]}'
Drop-in TurboQuantRetriever for any LangChain pipeline. Zero boilerplate.
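A hedged sketch of wiring the retriever into a LangChain flow; only the TurboQuantRetriever name comes from the line above, while the module path and constructor arguments are assumptions.

# The module path, constructor arguments, and whether the retriever embeds
# queries itself are assumptions; only the class name is from the docs above.
from tqdb.langchain import TurboQuantRetriever

retriever = TurboQuantRetriever(path="./my_db", top_k=5)

# .invoke() is the standard LangChain retriever entry point and
# returns a list of Document objects.
docs = retriever.invoke("what is 2-bit quantization?")
for doc in docs:
    print(doc.page_content)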
Replace ChromaDB with zero code changes using the chroma_compat shim.
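A hedged sketch of the shim in use: the tqdb.chroma_compat module path and client class are assumptions, while the method calls follow ChromaDB's documented client API, which the shim is described as mimicking.

# Module path and client class are assumed; the calls mirror ChromaDB's client API.
from tqdb.chroma_compat import PersistentClient

client = PersistentClient(path="./my_db")
collection = client.get_or_create_collection("docs")

collection.add(
    ids=["doc-1"],
    embeddings=[[0.0] * 1536],
    metadatas=[{"topic": "ml"}],
    documents=["Machine learning intro"],
)

hits = collection.query(query_embeddings=[[0.0] * 1536], n_results=5)
print(hits["ids"], hits["documents"])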
Use as a backend replacement for LanceDB with native PyArrow table ingestion.
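A hedged sketch of Arrow-native ingestion: the pyarrow calls are standard and Database.open matches the quickstart above, but the insert_arrow method name and the column names are hypothetical placeholders for the actual Arrow ingestion entry point.

import numpy as np
import pyarrow as pa
from tqdb import Database

vectors = np.random.randn(2, 1536).astype("f4")
table = pa.table({
    "id": ["a1", "a2"],
    "vector": [v.tolist() for v in vectors],   # one list-of-floats column per row
    "topic": ["ml", "nlp"],
})

db = Database.open("./my_db", dimension=1536)
db.insert_arrow(table)                          # hypothetical method name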