Add integrated inference #181

rohanshah18 · 2025-03-21T21:38:30Z

Problem

Add integrated inference to Java SDK.

Solution

The code was already generated since integrated inference was a part of 2025-01 api spec. So as a part of this PR, I have added the following features:

Create index for model i.e. create an index with an associated embedding model
Configure an existing index to associate it with an embedding model
Upsert records
Search records by id
Search records by vector
Search records by text

Example:

import io.pinecone.clients.Index;
import io.pinecone.clients.Pinecone;
import io.pinecone.helpers.RandomStringBuilder;
import org.junit.jupiter.api.Assertions;
import org.junit.jupiter.api.Test;
import org.openapitools.db_control.client.model.CreateIndexForModelRequest;
import org.openapitools.db_control.client.model.CreateIndexForModelRequestEmbed;
import org.openapitools.db_control.client.model.DeletionProtection;
import org.openapitools.db_data.client.ApiException;
import org.openapitools.db_data.client.model.SearchRecordsRequestQuery;
import org.openapitools.db_data.client.model.SearchRecordsResponse;
import org.openapitools.db_data.client.model.UpsertRecord;

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
...

    Pinecone pinecone = new Pinecone.Builder(System.getenv("PINECONE_API_KEY")).build();
    String indexName = RandomStringBuilder.build("inf", 8);

    // Create index associated with a model
    HashMap<String, String> fieldMap = new HashMap<>();
    fieldMap.put("text", "chunk_text");
    CreateIndexForModelRequestEmbed embed = new CreateIndexForModelRequestEmbed()
            .model("multilingual-e5-large")
            .fieldMap(fieldMap);
    pinecone.createIndexForModel(indexName, CreateIndexForModelRequest.CloudEnum.AWS, "us-west-2", embed, DeletionProtection.DISABLED, new HashMap<>());

    // Wait for index to be created
    Thread.sleep(10000);

    Index index = pinecone.getIndexConnection(indexName);

    // Upsert records
    HashMap<String, String> record1 = new HashMap<>();
    record1.put("_id", "rec1");
    record1.put("category", "digestive system");
    record1.put("chunk_text", "Apples are a great source of dietary fiber, which supports digestion and helps maintain a healthy gut.");

    HashMap<String, String> record2 = new HashMap<>();
    record2.put("_id", "rec2");
    record2.put("category", "cultivation");
    record2.put("chunk_text", "Apples originated in Central Asia and have been cultivated for thousands of years, with over 7,500 varieties available today.");

    HashMap<String, String> record3 = new HashMap<>();
    record3.put("_id", "rec3");
    record3.put("category", "immune system");
    record3.put("chunk_text", "Rich in vitamin C and other antioxidants, apples contribute to immune health and may reduce the risk of chronic diseases.");

    HashMap<String, String> record4 = new HashMap<>();
    record4.put("_id", "rec4");
    record4.put("category", "endocrine system");
    record4.put("chunk_text", "The high fiber content in apples can also help regulate blood sugar levels, making them a favorable snack for people with diabetes.");

    upsertRecords.add(record1);
    upsertRecords.add(record2);
    upsertRecords.add(record3);
    upsertRecords.add(record4);

    index.upsertRecords("example-namespace", upsertRecords);

    // Wait for vectors to be upserted
    Thread.sleep(5000);

    String namespace = "example-namespace";
    List<String> fields = new ArrayList<>();
    fields.add("category");
    fields.add("chunk_text");

    SearchRecordsRequestRerank rerank = new SearchRecordsRequestRerank()
        .model("bge-reranker-v2-m3")
        .topN(2)
        .rankFields(Arrays.asList("chunk_text"));
    
    // Search records
    SearchRecordsResponse recordsResponse = index.searchRecordsByText("Disease prevention", namespace, fields, 4, null, rerank);
    System.out.println(recordsResponse);

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update
Infrastructure change (CI configs, etc)
Non-code change (docs, etc)
None of the above: (explain here)

Test Plan

Added integration test that creates an index associated with a model, upserts and queries records.

ssmith-pc · 2025-03-31T18:38:27Z

src/integration/java/io/pinecone/integration/dataPlane/UpsertAndSearchRecordsTest.java

+        HashMap<String, String> inputsMap = new HashMap<>();
+        inputsMap.put("text", "Disease prevention");
+        SearchRecordsRequestQuery query = new SearchRecordsRequestQuery()
+                .topK(4)
+                .inputs(inputsMap);
+
+        List<String> fields = new ArrayList<>();
+        fields.add("category");
+        fields.add("chunk_text");
+
+        // Wait for vectors to be upserted
+        Thread.sleep(5000);
+
+        SearchRecordsResponse recordsResponse = index.searchRecords(namespace, query, fields, null);


I think we should consider creating some API abstractions over the SearchRecordsRequestQuery interface.

For example, it might feel more natural if the user could query with something like:

RecordsQuery query = new TextRecordsQuery("Disease prevention") .topK(4); index.searchRecords(namespace, query, ["category", "chunk_text"]);

We could also support ID and vector queries in a similar way?

ssmith-pc · 2025-03-31T20:38:03Z

src/main/java/io/pinecone/clients/RestIndex.java

+        vectorOperations.setCustomBaseUrl(protocol + config.getHost());
+    }
+
+    public void upsertRecords(String namespace, List<UpsertRecord> upsertRecord) throws ApiException {


Can we add an overload that accepts List<Map<String, Object>>?

This feels kind of cumbersome:

UpsertRecord record1 = new UpsertRecord(); record1.id("rec1"); record1.putAdditionalProperty("category", "digestive system"); record1.putAdditionalProperty("chunk_text", "Apples are a great source of dietary fiber, which supports digestion and helps maintain a healthy gut."); ArrayList<UpsertRecord> records = new ArrayList<>(); UpsertRecord record2 = new UpsertRecord(); record2.id("rec2"); record2.putAdditionalProperty("category", "cultivation"); record2.putAdditionalProperty("chunk_text", "Apples originated in Central Asia and have been cultivated for thousands of years, with over 7,500 varieties available today.");

rohanshah18 added 4 commits March 21, 2025 17:38

add upsert and search records

b47d6f7

reorganize and add createIndexForModel

f4ceca5

update docstrings and reorganize

d79f288

add embed to configure request

8855415

rohanshah18 changed the title ~~Add upsert and search records~~ Add integrated inference Mar 28, 2025

rohanshah18 marked this pull request as ready for review March 28, 2025 17:29

rohanshah18 requested review from austin-denoble, jhamon and ssmith-pc and removed request for austin-denoble March 28, 2025 17:30

ssmith-pc reviewed Mar 31, 2025

View reviewed changes

rohanshah18 added 2 commits April 4, 2025 15:21

add overloads and clean up

1fe95d4

add tests for overloaded search methods

4f1b653

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add integrated inference #181

Add integrated inference #181

rohanshah18 commented Mar 21, 2025 •

edited

Loading

ssmith-pc Mar 31, 2025

ssmith-pc Mar 31, 2025

ssmith-pc Mar 31, 2025

Add integrated inference #181

Are you sure you want to change the base?

Add integrated inference #181

Conversation

rohanshah18 commented Mar 21, 2025 • edited Loading

Problem

Solution

Type of Change

Test Plan

ssmith-pc Mar 31, 2025

Choose a reason for hiding this comment

ssmith-pc Mar 31, 2025

Choose a reason for hiding this comment

ssmith-pc Mar 31, 2025

Choose a reason for hiding this comment

rohanshah18 commented Mar 21, 2025 •

edited

Loading