Pinecone Blog

General updates, blog posts about all things Pinecone and vector search.

Pinecone serverless on AWS is generally available

Since the public preview announcement, more than 20,000 companies have started building with Pinecone serverless. Today, we’re announcing the general availability of Pinecone serverless on AWS.

Cascading retrieval with multi-vector representations: balancing efficiency and effectiveness

This post explores how multi-vector retrieval improves search accuracy by capturing rich query-document interactions, and how to address its scalability challenges. It introduces a practical, staged retrieval pipeline that balances speed and effectiveness: fast first-stage retrieval, refinement with multi-vector embeddings, and final cross-encoder reranking. The post highlights ConstBERT, a constant-space multi-vector model co-developed by Pinecone and academic collaborators, and shows how to integrate it into Pinecone to build efficient, scalable, and accurate search systems. ConstBERT is now open source.

May 28, 2025
Cesare Campagnano, Senior Research Scientist

Antonio Mallia, Staff Research Scientist

Jack Pertschuk, Staff Engineer
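
The staged pipeline described in the summary above can be illustrated with a minimal, library-free sketch. It is not the code from the post: the function names, the `docs` layout, and the `rerank_score` callable are hypothetical, and the second stage assumes the common late-interaction MaxSim rule (sum over query vectors of the best-matching document vector) rather than anything specific to ConstBERT or the Pinecone API.

```python
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product between two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim(query_vecs: Sequence[Vector], doc_vecs: Sequence[Vector]) -> float:
    """Late-interaction score (assumed MaxSim): for each query vector,
    take its best-matching document vector, then sum those maxima."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


def top_k(ids: List[str], score: Callable[[str], float], k: int) -> List[str]:
    """Return the k ids with the highest score, best first."""
    return sorted(ids, key=score, reverse=True)[:k]


def cascade(
    query_single: Vector,
    query_multi: Sequence[Vector],
    docs: Dict[str, dict],
    rerank_score: Callable[[str], float],
    k1: int,
    k2: int,
    k3: int,
) -> List[str]:
    """Three-stage cascade; each stage is costlier but sees fewer candidates.

    docs maps a document id to {"single": Vector, "multi": [Vector, ...]}
    (a hypothetical layout). rerank_score stands in for a cross-encoder.
    """
    # Stage 1: cheap single-vector scoring over the whole collection.
    stage1 = top_k(list(docs), lambda i: dot(query_single, docs[i]["single"]), k1)
    # Stage 2: multi-vector refinement over the stage-1 survivors only.
    stage2 = top_k(stage1, lambda i: maxsim(query_multi, docs[i]["multi"]), k2)
    # Stage 3: most expensive scorer, applied to the smallest candidate set.
    return top_k(stage2, rerank_score, k3)
```

In a real system the first stage would typically be an index query rather than a full scan, and `rerank_score` would call a cross-encoder model on the query and document text; the shrinking candidate counts `k1 >= k2 >= k3` are what keep the later, slower stages affordable.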
