AI Infrastructure

Open WebUI + Ollama with Azure Kubernetes Service & Ingress TLS

Open WebUI is a fork of LibreChat, an open source AI chat platform that we have extensively discussed on our blog and integrated on behalf of clients. Where LibreChat integrates with any well-known remote or local AI service on the…
Read More

AI Document Data Pipelines with S3 or Azure Blob Storage

Document data pipelines allow for the creation of automated workflows where an embedding model extracts text from documents added to a share and converts them into vector values that AI/ML applications can use for semantic search and as context for…
Read More

ELT Data Pipelines with Airbyte & BionicGPT for AI RAG

BionicGPT is an open source AI chat platform with features for teams to upload and share datasets for retrieval augmented generation using large language models. With role based access control, it is designed with a hierarchical user structure where team…
Read More

AlloyDB Vector Database for Retrieval Augmented Generation

AlloyDB is a fork of PostgreSQL on Google Cloud, optimized for high performance with vector embedding & retrieval workloads. As a PostgreSQL-compatible database, AlloyDB can be used as a drop-in replacement for any application that relies on a Postgres backend.…
Read More

RAG with any AI Model using Postgres pgVector + LibreChat

The addition of the RAG API microservice to LibreChat in version 0.7.0, the most rapidly trending open source ChatGPT clone, swings the door open to chatting with PDFs and documents using any supported AI model, in a private, self-hosted environment.…
Read More

Serverless Deployment of AI Middleware, LiteLLM, with Google Cloud Run

AI middleware is an emerging term for the layer of the technology stack that facilitates the interfacing of AI end user applications with the Large Language Models and GPU-accelerated machines that drive them. Here are the major sub-categories of this…
Read More

Integrating Azure OpenAI with Search & Retrieval Plugins for RAG

If you have ever used ChatGPT Plus, OpenAI’s SaaS GenAI offering, you are likely familiar with the Browsing extension which retrieves current information from the Internet using Bing search to inform the GPT model’s response to the user’s prompts. One…
Read More