Chat with an LLM and get answers enriched by the documents and data you already have — through Retrieval Augmented Generation.
Integrator of Open Source Software & Open Access AI Models
AI Technology Stack
Frontend / AI Middleware
- Bionic-GPT
- LibreChat
- Chatbot UI
- LangChain
- Ollama / HuggingFace TGI / LiteLLM
- n8n Data Pipeline Automation
Databases for AI Applications
- PostgreSQL & pgVector Extension
- MongoDB & MongoDB Atlas
- Supabase Platform
- Neo4j Graph Database
- Redis Cache / Redis as Vector Database
- Elasticsearch for Vector Search
Compute Infrastructure & AIaaS
- Amazon Web Services (Bedrock)
- Google Cloud (Vertex AI)
- Azure AI Services (AI Studio)
- Paperspace by DigitalOcean
- OctoAI Media & Text Gen Solution
- Docker, Swarm, & Kubernetes
Foundational Open Source AI Models
LLaMA 3 and 2
Llama 3 and Llama 2 is a suite of AI models by Meta, including Llama 2 Chat, Instruct, Llama Guard, and Code Llama.
Mistral AI
Mistral is an AI model fine-tuned for chat applications, developed in Europe and the first to partner with Microsoft after OpenAI.
Falcon LLM
Falcon LLM was among the first models made available for research & commercial applications by Abu Dhabi’s Advanced Technology Research Council.
Google Gemma
Google Gemma is the latest open access model made available by Google, based on its flagship Gemini model, but optimized for edge AI.
Microsoft Phi-3
Microsoft Phi-3 is a 3.8B parameter small language model (SLM) that can be deployed & fine-tuned with low resource usage but high quality results.
Run at the Edge, in the Datacenter, or Cloud
Self-Hosted AI Models
Integrate applications, such as chatbots, with local endpoints so that your users’ prompts and confidential data never leaves your environment. Enjoy predictable, flat costs of provisioning as many GPU instances as you require. Best for sensitive use cases, where data sovereignty is a concern.
AI Models as a Service (AI MaaS)
Integrate with external AI endpoints where you “pay as you go” with simple, per-token pricing and no up-front hardware cost. Take advantage of open models that are multiple times more cost effective than OpenAI GPT models. Best for general use cases and application prototyping.