Infinispan kNN Vector Search

With Infinispan 15.0.0.Dev06, we have started to expose vector search capabilities through Infinispan’s indexed queries. The newly introduced kNN predicate finds the k nearest neighbors of a given vector and orders the results by their proximity to it.
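To make the semantics concrete, here is a minimal sketch in plain Java of an exact (brute-force) k-nearest-neighbor lookup. It is not Infinispan code and not how an indexed query is evaluated. The class name `ExactKnn`, its method names, and the use of squared Euclidean distance are assumptions made purely for illustration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper, not part of Infinispan: exact kNN by scanning every vector.
final class ExactKnn {

    /** Squared Euclidean distance between two vectors of the same dimension. */
    static double squaredL2(float[] a, float[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("dimension mismatch");
        }
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    /** Returns the positions of the k stored vectors closest to the query, nearest first. */
    static List<Integer> nearest(float[][] stored, float[] query, int k) {
        List<Integer> ids = new ArrayList<>();
        for (int i = 0; i < stored.length; i++) {
            ids.add(i);
        }
        // Order all candidates by distance to the query, then keep the first k.
        ids.sort(Comparator.comparingDouble((Integer i) -> squaredL2(stored[i], query)));
        return new ArrayList<>(ids.subList(0, Math.min(k, ids.size())));
    }
}
```

A full scan like this costs time proportional to the number of stored vectors, which is why an index is used for larger data sets.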

Mapping the embeddings

The new @Vector indexing annotation marks a field as an embedding. Embeddings are vector representations of data, produced according to a defined model.

The vector dimension is mandatory and must be defined at mapping time. Other options that can be specified during mapping are:

the similarity (distance) function
the beam width
the maximum number of connections.

Bear in mind that these values affect the performance of the approximation algorithm that is used to compute the kNN search.
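The similarity function also decides which neighbors count as nearest. The following plain Java sketch compares two common choices, Euclidean (L2) distance and cosine similarity, on the same data. The class `SimilaritySketch` and its methods are hypothetical and do not reproduce Infinispan's own implementation or option names.

```java
// Hypothetical helper, not part of Infinispan: two common vector comparison functions.
final class SimilaritySketch {

    /** Euclidean (L2) distance: smaller means closer. */
    static double l2Distance(float[] a, float[] b) {
        checkSameDimension(a, b);
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    /** Cosine similarity: larger means closer; vector length is ignored. */
    static double cosineSimilarity(float[] a, float[] b) {
        checkSameDimension(a, b);
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += (double) a[i] * b[i];
            normA += (double) a[i] * a[i];
            normB += (double) b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            throw new IllegalArgumentException("cosine is undefined for a zero vector");
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    private static void checkSameDimension(float[] a, float[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("dimension mismatch");
        }
    }
}
```

For the query vector (1, 0), the candidate (1, 1) is nearer than (10, 0) under L2 distance, while (10, 0) is nearer under cosine similarity because it points in exactly the same direction. Choosing the function that matches the model that produced the embeddings therefore matters for result quality, not only for speed.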

We support byte[] embeddings. Here is an example of mapping:

That corresponds to the Proto schema:

We also support float[] embeddings. Here is an example of mapping:

That corresponds to the Proto schema:

Searching the embeddings

The following query shows how to perform a kNN search using a supplied vector and a specific distance:
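As a sketch only, the helper below assembles such a query as text. It assumes the kNN predicate is written as `<field> <-> [v1,v2,...]~k`, which should be verified against the query documentation for your Infinispan version. The entity name `Item`, the field name `floatVector`, and the class `KnnQueryText` are hypothetical.

```java
import java.util.StringJoiner;

// Hypothetical helper: builds the text of a kNN query from a literal vector.
// The "<->" operator and "~k" suffix are assumed syntax; check your version's docs.
final class KnnQueryText {

    static String knnQuery(String entity, String field, float[] vector, int k) {
        if (k <= 0) {
            throw new IllegalArgumentException("k must be positive");
        }
        // Render the vector as a bracketed, comma-separated list of literals.
        StringJoiner values = new StringJoiner(",", "[", "]");
        for (float v : vector) {
            values.add(Float.toString(v));
        }
        return "from " + entity + " i where i." + field + " <-> " + values + "~" + k;
    }
}
```

Calling `KnnQueryText.knnQuery("Item", "floatVector", new float[] {7f, 7f, 7f}, 3)` yields `from Item i where i.floatVector <-> [7.0,7.0,7.0]~3`. Inlining literals this way is only reasonable for trusted numeric input; query parameters are the safer choice for anything user-supplied.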

The query can be parameterized in several ways:

Or you can pass the entire vector as a single parameter:

If the cache is distributed, the query is broadcast, and the results are aggregated from all the nodes that contain shards of the indexes involved in the search. As usual, each result comes with all the metadata of its corresponding entity, so the returned items can easily be related to the application domain.
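The aggregation step can be pictured as merging each node's local top-k list into one global top-k list. The sketch below shows that idea in plain Java. It is an illustration under assumptions, not Infinispan's internal code: the `Hit` shape (an entity key plus a distance) and the class `BroadcastMerge` are hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration of merging per-node kNN results; not Infinispan internals.
final class BroadcastMerge {

    /** One result from a node: the entity key and its distance to the query vector. */
    static final class Hit {
        final String key;
        final double distance;

        Hit(String key, double distance) {
            this.key = key;
            this.distance = distance;
        }
    }

    /** Combines the local results of every node and keeps the k closest overall. */
    static List<Hit> mergeTopK(List<List<Hit>> perNode, int k) {
        List<Hit> all = new ArrayList<>();
        for (List<Hit> hits : perNode) {
            all.addAll(hits);
        }
        // Smaller distance means a closer neighbor, so sort ascending.
        all.sort(Comparator.comparingDouble((Hit h) -> h.distance));
        return new ArrayList<>(all.subList(0, Math.min(k, all.size())));
    }
}
```

Because every node returns its own k best candidates, the global k best are guaranteed to be among the merged candidates, so no node needs to return more than k hits.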
