Configuration
InSpire AI Chat and Insight Services for RAG

Introduction

Mindbreeze InSpire AI Chat offers the possibility of a conversational interface based on generative AI. Thanks to the Insight Services for Retrieval Augmented Generation (RAG), a Large Language Model (LLM) using Mindbreeze InSpire is able to generate answers from the user's input based on the hits in Mindbreeze.

Preparation

Two prerequisites must be fulfilled:

Activation of the corresponding features in Mindbreeze InSpire so that the configuration can be carried out.
Operation of an LLM.

To activate the features, please contact Mindbreeze Support (support@mindbreeze.com) for details on activating the features via Mindbreeze InSpire Feature Flags.

For the operation of an LLM, please contact Mindbreeze Sales (sales@mindbreeze.com) to use an LLM. On-premise customers have the option of using GPU appliances and Mindbreeze SaaS customers can use a remote LLM. For the use of existing LLM infrastructures, the Huggingface TGI interface is currently supported and OAuth can be used as an authorisation option.

Activating the feature

Please contact Mindbreeze Support (support@mindbreeze.com) and, if available, submit your existing feature flag configuration "/etc/mindbreeze/enabled_features.json" to receive an updated version. Then update the new enabled_features.json that you received from Mindbreeze Support.

Then restart the InSpire container. To do this, open the "Setup" menu item in the Management Centre and then click on the "InSpire" submenu item. You can restart InSpire in the "Container Management" area.

Once the InSpire has been restarted and the feature has been activated correctly, you should see in the menu item "Insight Services" the new submenu item "RAG" in the Management Centre.

Configuration

Insight Services for Retrieval Augmented Generation

Open the menu item "Configuration" in the Mindbreeze Management Center. Create a new service in the "Indices" tab by clicking on "Add Service". Give the service a name under "Display Name" and select the option "Insight Services for RAG" in the "Service" setting.

Make sure that in the "Base Configuration" section the configured "Bind port" is not yet occupied.

The setting "Generate with empty Results" is activated by default. If you disable this option, a response will only be generated in the chat if results are found in the index, otherwise an error message will be displayed, this can be useful to prevent responses being given without reference to the indexed data.

The setting "Path to Store Base" can be used to optionally configure the path of the service.

If “Path to Store Base” is defined, based on the path, the service configuration may not be included in the snapshots. If no “Path to Store Base” is defined only the service configurations (pipelines and LLMs) get packed in the snapshot but not the service data (datasets). If all data should be packed into a snapshot, the “Path to Store Base” should be set to a directory which content is fully included in a snapshot. For more information about migrating with a Snapshot, see Handbook - Backup & Restore - Creating a snapshot.

Attention: If the setting "Path to Store Base" is not set, it can occur that the service data for the pipeline and the LLM is not displayed after the application of a Snapshot. To solve this, see the chapter Service data for the pipeline and LLM vanished after applying a Snapshot.

Logging Settings

Note: The setting “Full Logging” which is mentioned in the descriptions below can be found in the section “Support Mode”, which is visible when “Advanced Settings” are activated.

Include Prompt in app.telemetry	This setting is disabled per default. If this setting is enabled, the app.telemetry entry will contain the effective prompt (containing the retrieved contexts) which is sent to the LLM. Note: These prompts may contain excerpts from documents and therefore sensitive data!
Include User Input in app.telemetry	This setting is enabled per default. If this setting is enabled, the app.telemetry entry will contain the questions entered by the users.
Log Prompt in verbose Log Mode	This setting is disabled per default. If this setting and „Full Logging” are enabled, the logs will contain the prompts sent to the LLM. Note: These prompts can contain excerpts from documents and therefore sensitive data!
Log HTTP Requests in verbose Log Mode	This setting is disabled per default. If this setting and „Full Logging” are enabled, the logs will contain the details for the sent requests.

Advanced Settings

Activate the "Advanced Settings" for the following settings.

Self-Signed SSL Certificate

In the "Security Configuration" section, activate the setting "Disable SSL Certificate Validation". This setting is deactivated by default.

Finally, save the changes by clicking on "Save".

Impersonation Configuration

In the section “Impersonation Configuration”, you can enter the endpoint mapping pattern, among other things. The endpoints are defined in the “Network” tab in the Configuration of the Management Center.

$C:\Users\silke.lindenberger\AppData\Local\Microsoft\Windows\INetCache\Content.MSO\6E8B577A.tmp$

With this setting, the retriever retrieves the client certificate for the client service by comparing it with the pattern defined there. The following placeholder can be used for the pattern:

- client_service_id: ID of the client service

The endpoint mapping only takes effect if no "Trusted Peer Certificate" is selected and the HTTP header configured in the Impersonation Identifier Setting is also sent in the Generate request.

$C:\Users\silke.lindenberger\AppData\Local\Microsoft\Windows\INetCache\Content.MSO\25133D8C.tmp$

Finally, save the changes by clicking on "Save".

Configuration of the OpenAI proxy

The proxy API for LLMs can be enabled in the section “Proxy Configuration”. By configuring the proxy API, LLMs that have been configured in Insight Services for RAG can be used directly via Mindbreeze InSpire.

Attention: The OpenAI API must be supported by the respective LLM in to configure a working proxy API. The following table provides an overview with the available protocols:

Protocol	Supports OpenAI API?
InSpire LLM	Yes
OpenAI	Yes, but only if no request transformation is configured.
Azure OpenAI	Yes
InSpire LLM (TGI)	No

The following endpoints are currently supported:

These can then be reached in the Client Service as follows:

/api/openai.v1.chat.completions.create/v1/chat/completions
/api/openai.v1.audio.transcriptions.create/v1/audio/transcriptions

The setting “Allowed URLs Pattern for Images” can be used to restrict the URLs for images in the requests of the Completions API. By default, only Base64-encoded data URLs are used.

InSpire AI Chat

Switch to the "Client Services" tab and activate the advanced settings by ticking the box "Advanced Settings".

If available, use an existing client service. If no client service is available, add a client service by clicking on "Add Client Service". Give the client service a name with the setting "Display Name".

Go to the "Chat UI Settings" section and select the service you created in the "Indices" tab in the setting "Chat Service".

After the configuration, you can access the AI Chat using the path apps/chat, similar to the Insight App Designer. The full path looks like this: https://example.com/apps/chat/.

Personalised theme (optional configuration)

If a personalised theme is desired in AI Chat, this can be set as follows:

In the Management Center in the menu item "File Manager", the folder /data/apps/chat-theme must be created, if it has not yet been created, in which the following files are stored:

logo.png (necessary)	The logo at the top left of the chat.
custom.css	The Custom Stylesheet.
custom.js	The Custom JavaScript.
favicon.png (necessary)	The icon on the left-hand side of the generated response. The recommended size of the icon is 14 x 14 pixels.
favicon.svg	The favicon in the browser tab. If no favicon.svg is available, the favicon.png is used here.

Activate the "Advanced Settings" in the "Client Services" tab and now create an "Additional Context" in your client service in the section "Web Applications Contexts Settings". Activate "Override Existing Path" there and enter /apps/chat/theme as the "URL Path" and /data/apps/chat-theme as the "File Path".

Save the configuration by clicking on "Save".

Languages

The Mindbreeze Inspire AI Chat is available in the following languages by default:

German
English

To change the language of the AI Chat, click on the currently set language at the bottom left. Then click on the language you want to use in the list of available languages. The chat is immediately displayed in the selected language:

Custom Languages

Custom languages can also be added to the list of available languages. To configure a custom language, a simple hook is sufficient. This hook must be configured in the additional resources in the URL Path apps/chat/theme/. The necessary translations can then be added to apps/chat/theme/i18n/.

Hint: Please be aware that German and English are reserved and therefore cannot be changed or adapted.

The following step by step guide explains the full configuration process:

In the Management Center, go to „Configuration” and open the Client Service where the custom languages are needed. Activate “Advanced Settings” and go to the section “Web Applications Contexts Settings”. Then, add a new “Additional Context” with the following settings:

Setting	Value
URL Path*	Example: /apps/chat/theme
File Path*	Example: /data/apps/chat-theme
Override Existing Path*	Activated
Allow Symlinks	Default value is sufficient.
Authenticated URL pattern	Default value is sufficient.
Login URL pattern	Default value is sufficient.
Not Cached MIME-Types Pattern	Default value is sufficient.
* = This setting is required for this feature to work properly.

Hint: It is recommended to define the “Additional Context” first, so that the files you will define in the upcoming steps will be effective immediately.

After the new Additional Context is fully configured, go to “File Manager” in the Management Center and then to “Local Filesystem”. Go to the folder /data/apps and create a new folder called “chat-theme”. Then click on the newly created folder and add a new folder called “i18n”:

Next to the “i18n” folder, create a file called “custom.js”. In this file the aforementioned hook will be defined.

To create the hook, “getEffectiveLanguages” is used to override the default languages and to define the new chat language preference. For example “”es”:”Espanol”” and “”fr”:”Francais”” will be the custom languages and the new chat language preference. Save the changes.

Attention: Please be aware that valid custom locales (for example: es, fr, cs or nl) must be used or else the custom language will not be displayed in the AI Chat. In addition, make sure to avoid the locales “de” or “en” since German and English cannot be changed or adapted.

Hint: If you want to have a list of languages in the AI Chat consisting of German and/or English and custom languages, you can include the locals “de” and/or “en” in the hook, like ““es”:”Espanol”, “fr”:”Francais”, “de”:”German”, “en”:”English””. Creating JSON files for German and English will not be necessary in the next steps, since the translations are automatically provided.

After the hook is created, open the folder “i18n” and create the JSON files for the languages you want to add with the key values you defined in the hook. For example, es.json for Spanish and fr.json for French.

Hint: If German and/or English are defined in the hook, this and the next step are not necessary, since the translation is provided automatically.

Then, open the JSON files and define the translations that you want to see displayed in the AI Chat. Save the changes.

Attention: Please be aware that valid custom locales (for example: es, fr, cs or nl) must be used or else the custom language will not be displayed in the AI Chat. In addition, make sure to avoid the locales “de” or “en” since German and English cannot be changed or adapted.

The configuration for the custom languages is now complete. By refreshing the AI Chat, the new translations should be visible.

Activating the NLQA index

Activate "Natural Language Question Answering" (NLQA) in the desired indexes. For more information about the configuration, see Create NLQA index and Activate NLQA on existing index.

Setting up the Large Language Model (LLM)

On-premise customers have the option to use GPU appliances or Mindbreeze SaaS customers can use a remote LLM. Please contact sales@mindbreeze.com regarding options for running an LLM with Mindbreeze.

For the use of existing, external LLM models, the Huggingface Text Generation Interface (TGI) is currently supported. If necessary, OAuth can be used as an authorisation option. If you want to use a self-hosted LLM with a different interface, please contact Mindbreeze Support (support@mindbreeze.com) to check compatibility.

To set up the LLM, click on the menu item "Insight Services" in the Management Center and open the submenu item "RAG". Select the "LLMs" area there.

Click on "Add" to configure a new LLM. The following values are supplied by the respective LLM:

Setting	Description
Name	Name of the Large Language Model
URL	URL
User Message Token User Message End Token Assistant Message Token Assistant Message End Token Message End Token	To be filled in depending on the model.
Preprompt	A Pre-Prompt is used to apply specific roles, intents and limitations to each subsequent prompt of a Model.
Maximum amount of tokens	Limits the tokens of a prompt. "0" does not limit the tokens. No limitation can decrease the speed of generation when the prompt is too long. A value of 2000 is recommended.

Save your configuration by clicking on "Save". You can find more details on the administration of models and pipelines in Whitepaper - Administration of Insight Services for Retrieval Augmented Generation.

External LLM with authorisation

If you want to use an external LLM, please contact sales@mindbreeze.com. The sales team will discuss your specific situation with you and provide you with a tailored offer.

The mapping of LLM in the RAG Administration to the required credentials is done via Endpoint Mapping. To configure the authorisation, open the menu item "Configuration" in the Management Center and go to the tab "Network". Create a new credential of type "OAuth 2" with the information provided by Mindbreeze. See the following screenshot for an example.

Create a new endpoint with this credential. The "Location" of the endpoint is the URL of the LLM.

Creating a pipeline and using InSpire AI Chat

To use InSpire AI Chat, a pipeline with a model must be created. The steps required for this can be found in Whitepaper - Administration of Insight Services for Retrieval Augmented Generation.

Authorisation of Management Center users for RAG administration

To administer Insight Services for RAG, a user must have the following roles:

An administrator can assign these roles to a user or group in the Management Center under the menu item "Setup". To do this, click on the "Credentials" submenu item and assign the necessary roles to the user or group. For more information on assigning roles, see Configuration – Back-End Credentials.

Troubleshooting

Service Configuration data for the pipeline and LLM is not displayed after applying a Snapshot

After the application of a Snapshot, the problem can occur that already existing service configuration data for a pipeline and LLM is not displayed anymore. The reason is that there is no defined path in the setting “Path to Store Base” and therefore the pipeline and LLM data are not displayed in the RAG service.

To solve this, please restart your RAG service. After the restart, the service configuration data should be displayed again.

Configuration
InSpire AI Chat and Insight Services for RAG

Introduction

Preparation

Activating the feature

Configuration