Database Internals & Storage · Topic 117

Data persistence guarantees (ACID compliance in VDBs?)

Persistence guarantees in a vector database describe what happens when the process or node fails: whether writes are durable (on disk, survivable across restarts) and whether reads see consistent state. Full ACID (Atomicity, Consistency, Isolation, Durability) is common in relational DBs; VDBs often provide durability and per-operation atomicity, with isolation and multi-operation consistency varying by product. This topic covers ACID in practice and what to check when evaluating.

Summary

Durability: writes persisted to WAL, often fsync before ack; crash before flush can lose recent writes. Atomicity: single upsert/delete is common; multi-point transactions vary.
Consistency and Isolation: many VDBs offer read-your-writes or session consistency, not full serializability. Cross-shard/cross-node transactions are rare.
When evaluating: check durability (fsync, replication), single-point atomicity, and consistency level for reads/writes. Determines suitability for critical vs. search/recommendation workloads.
Trade-off: stronger guarantees (fsync, replication) improve durability and consistency but add latency and cost; eventual consistency is often sufficient for search and RAG.
Practical tip: match guarantee level to workload; for critical data require WAL flush and replication; for high-throughput search, session or eventual consistency may be acceptable.

ACID in practice for VDBs

Durability is usually achieved by persisting writes to a WAL and optionally syncing to disk before acknowledging the write. Until the WAL is flushed, a crash can lose recent writes. Atomicity for a single upsert or delete is common.

Consistency (invariants hold) and Isolation (concurrent transactions don’t see each other’s partial state) are harder: many VDBs offer read-your-writes or session consistency rather than full serializability. Cross-shard or cross-node transactions are rare. Pipeline: write → WAL append → (optional) flush → ack; read sees last visible state according to consistency level.

What to check when evaluating a VDB

When evaluating a VDB, check: durability (fsync policy, replication), single-point atomicity, and what consistency level is offered for reads and writes. That tells you whether the system is suitable for critical, audit-heavy workloads or primarily for search and recommendation where eventual consistency may be acceptable.

Trade-off: full ACID (serializability, multi-key transactions) is uncommon in VDBs and adds cost; per-point atomicity and durability are the norm. Practical tip: document your RPO/RTO and consistency requirements; then verify the vendor’s WAL, flush, and replication behavior against those requirements.

Frequently Asked Questions

Do vector databases support ACID?

Many support D (durability via WAL) and A (atomicity per point or per batch). C and I (full consistency and isolation across concurrent transactions) are less common; check your vendor for read-your-writes, session, or serializable guarantees.

What does “durable” mean for a VDB?

Once a write is acknowledged, it survives process or node failure. Typically the write is in a WAL that has been flushed to disk (or replicated). Without flush, a crash can lose the last few seconds of writes depending on buffer policy.

Can I run multi-key transactions in a VDB?

Some VDBs support batch upserts or multi-point operations with atomic visibility; fewer support full multi-key ACID transactions (e.g. “update A and B together or not at all”). Check the docs for transaction scope and consistency levels.

Is eventual consistency enough for vector search?

For many search and RAG workloads, yes: read-your-writes or session consistency is often sufficient. For audit, financial, or strict ordering requirements, look for stronger guarantees and confirm with the vendor.