OCI RAG - Retrieval-Augmented Generation
About This Architecture
Enterprise RAG architecture on OCI integrating Object Storage, OpenSearch vector database, and Generative AI Service for semantic search and LLM inference. Data flows from ingestion through ETL pipelines into embeddings, retrieval via OpenSearch, and orchestrated inference through OCI Compute and Functions. Multi-subnet VCN design isolates data ingestion, retrieval, and inference workloads with API Gateway, Load Balancer, and WAF protecting user-facing endpoints. Security and governance layers including Vault key management, IAM, Cloud Guard, and comprehensive logging ensure compliance and auditability. Fork this diagram to customize subnets, add additional compute nodes, or integrate with your existing OCI tenancy.
People also ask
How do I build a retrieval-augmented generation system on Oracle Cloud Infrastructure with vector search and LLM integration?
This OCI RAG architecture combines Object Storage and ETL for data ingestion, OpenSearch for semantic retrieval, and OCI Generative AI Service for inference, orchestrated through compute and functions. The design uses a multi-subnet VCN to isolate ingestion, retrieval, and inference workloads, with API Gateway, Load Balancer, and WAF protecting endpoints. Security is enforced via Vault key managem
- Domain:
- Cloud Oci
- Audience:
- OCI solutions architects designing retrieval-augmented generation systems
Generated by Diagrams.so — AI architecture diagram generator with native Draw.io output. Fork this diagram, remix it, or download as .drawio, PNG, or SVG.