📖Architecture

ChatBees is an advanced Retrieval Augmented Generation platform provided as a self-managed service. ChatBees enables data storage, processing, and retrieval solutions that are faster, easier to use, and far more flexible than traditional offerings.

ChatBees combines a completely new RAG engine with an innovative architecture natively designed for the cloud. ChatBees uses a central data repository, the cloud storage, for persisted data that is accessible from all compute nodes in the platform.

ChatBees unique architecture consists of two key layers:

Data Storage

When data is loaded into ChatBees, ChatBees parses the data, splits data into chunks, generates the embedding for each chunk, and persists all data into cloud storage. ChatBees reorganizes that data into its internal optimized and compressed format.

ChatBees manages all aspects of how this data is stored — the organization, compression, metadata, statistics, and other aspects of data storage are handled by ChatBees. The data objects are not directly visible nor accessible by customers; they are only accessible through provided APIs.

Elastic Service

The elastic service layer is a collection of services that coordinate activities. These services tie together all of the different components of ChatBees in order to process user requests. The cloud services layer runs on compute instances provisioned by ChatBees from the cloud provider.

Services managed in this layer include:

  • Authentication

  • Access control

  • RAG pipeline

  • Data and Vector Index

  • Security: encryption, secret key management

  • Private LLMs - in the future

Last updated