Google Cloud

AlloyDB Vector Database for Retrieval Augmented Generation

AlloyDB is a fork of PostgreSQL on Google Cloud, optimized for high performance with vector embedding & retrieval workloads. As a PostgreSQL-compatible database, AlloyDB can be used as a drop-in replacement for any application that relies on a Postgres backend.…
Read More

Serverless Deployment of AI Middleware, LiteLLM, with Google Cloud Run

AI middleware is an emerging term for the layer of the technology stack that facilitates the interfacing of AI end user applications with the Large Language Models and GPU-accelerated machines that drive them. Here are the major sub-categories of this…
Read More