Home
Home
German Version
Support
Impressum
25.2 Release ►

Start Chat with Collection

    Main Navigation

    • Preparation
      • Connectors
      • Create an InSpire VM on Hyper-V
      • Initial Startup for G7 appliances
      • Setup InSpire G7 primary and Standby Appliances
    • Datasources
      • Configuration - Atlassian Confluence Connector
      • Configuration - Best Bets Connector
      • Configuration - Box Connector
      • Configuration - COYO Connector
      • Configuration - Data Integration Connector
      • Configuration - Documentum Connector
      • Configuration - Dropbox Connector
      • Configuration - Egnyte Connector
      • Configuration - GitHub Connector
      • Configuration - Google Drive Connector
      • Configuration - GSA Adapter Service
      • Configuration - HL7 Connector
      • Configuration - IBM Connections Connector
      • Configuration - IBM Lotus Connector
      • Configuration - Jira Connector
      • Configuration - JVM Launcher Service
      • Configuration - LDAP Connector
      • Configuration - Microsoft Azure Principal Resolution Service
      • Configuration - Microsoft Dynamics CRM Connector
      • Configuration - Microsoft Exchange Connector
      • Configuration - Microsoft File Connector (Legacy)
      • Configuration - Microsoft File Connector
      • Configuration - Microsoft Graph Connector
      • Configuration - Microsoft Loop Connector
      • Configuration - Microsoft Project Connector
      • Configuration - Microsoft SharePoint Connector
      • Configuration - Microsoft SharePoint Online Connector
      • Configuration - Microsoft Stream Connector
      • Configuration - Microsoft Teams Connector
      • Configuration - Salesforce Connector
      • Configuration - SCIM Principal Resolution Service
      • Configuration - SemanticWeb Connector
      • Configuration - ServiceNow Connector
      • Configuration - Web Connector
      • Configuration - Yammer Connector
      • Data Integration Guide with SQL Database by Example
      • Indexing user-specific properties (Documentum)
      • Installation & Configuration - Atlassian Confluence Sitemap Generator Add-On
      • Installation & Configuration - Caching Principal Resolution Service
      • Installation & Configuration - Mindbreeze InSpire Insight Apps in Microsoft SharePoint On-Prem
      • Mindbreeze InSpire Insight Apps in Microsoft SharePoint Online
      • Mindbreeze Web Parts for Microsoft SharePoint
      • User Defined Properties (SharePoint 2013 Connector)
      • Whitepaper - Mindbreeze InSpire Insight Apps in Salesforce
      • Whitepaper - Web Connector - Setting Up Advanced Javascript Usecases
    • Configuration
      • CAS_Authentication
      • Configuration - Alerts
      • Configuration - Alternative Search Suggestions and Automatic Search Expansion
      • Configuration - Back-End Credentials
      • Configuration - Chinese Tokenization Plugin (Jieba)
      • Configuration - CJK Tokenizer Plugin
      • Configuration - Collected Results
      • Configuration - CSV Metadata Mapping Item Transformation Service
      • Configuration - Entity Recognition
      • Configuration - Exporting Results
      • Configuration - External Query Service
      • Configuration - Filter Plugins
      • Configuration - GSA Late Binding Authentication
      • Configuration - Identity Conversion Service - Replacement Conversion
      • Configuration - InceptionImageFilter
      • Configuration - Index-Servlets
      • Configuration - InSpire AI Chat and Insight Services for Retrieval Augmented Generation
      • Configuration - Item Property Generator
      • Configuration - Japanese Language Tokenizer
      • Configuration - Kerberos Authentication
      • Configuration - Management Center Menu
      • Configuration - Metadata Enrichment
      • Configuration - Metadata Reference Builder Plugin
      • Configuration - Mindbreeze Proxy Environment (Remote Connector)
      • Configuration - Personalized Relevance
      • Configuration - Plugin Installation
      • Configuration - Principal Validation Plugin
      • Configuration - Profile
      • Configuration - Reporting Query Logs
      • Configuration - Reporting Query Performance Tests
      • Configuration - Request Header Session Authentication
      • Configuration - Shared Configuration (Windows)
      • Configuration - Vocabularies for Synonyms and Suggest
      • Configuration of Thumbnail Images
      • Cookie-Authentication
      • Documentation - Mindbreeze InSpire
      • I18n Item Transformation
      • Installation & Configuration - Outlook Add-In
      • Installation - GSA Base Configuration Package
      • JWT Authentication
      • Language detection - LanguageDetector Plugin
      • Mindbreeze Personalization
      • Mindbreeze Property Expression Language
      • Mindbreeze Query Expression Transformation
      • SAML-based Authentication
      • Trusted Peer Authentication for Mindbreeze InSpire
      • Using the InSpire Snapshot for Development in a CI_CD Scenario
      • Whitepaper - AI Chat
      • Whitepaper - Create a Google Compute Cloud Virtual Machine InSpire Appliance
      • Whitepaper - Create a Microsoft Azure Virtual Machine InSpire Appliance
      • Whitepaper - Create AWS 10M InSpire Appliance
      • Whitepaper - Create AWS 1M InSpire Appliance
      • Whitepaper - Create AWS 2M InSpire Appliance
      • Whitepaper - Create Oracle Cloud 10M InSpire Application
      • Whitepaper - Create Oracle Cloud 1M InSpire Application
      • Whitepaper - MMC_ Services
      • Whitepaper - Natural Language Question Answering (NLQA)
      • Whitepaper - SSO with Microsoft AAD or AD FS
      • Whitepaper - Text Classification Insight Services
    • Operations
      • Adjusting the InSpire Host OpenSSH Settings - Set LoginGraceTime to 0 (Mitigation for CVE-2024-6387)
      • app.telemetry Statistics Regarding Search Queries
      • CIS Level 2 Hardening - Setting SELinux to Enforcing mode
      • Configuration - app.telemetry dashboards for usage analysis
      • Configuration - Usage Analysis
      • Deletion of Hard Disks
      • Handbook - Backup & Restore
      • Handbook - Command Line Tools
      • Handbook - Distributed Operation (G7)
      • Handbook - Filemanager
      • Handbook - Indexing and Search Logs
      • Handbook - Updates and Downgrades
      • Index Operating Concepts
      • Inspire Diagnostics and Resource Monitoring
      • Provision of app.telemetry Information on G7 Appliances via SNMPv3
      • Restoring to As-Delivered Condition
      • Whitepaper - Administration of Insight Services for Retrieval Augmented Generation
    • User Manual
      • Browser Extension
      • Cheat Sheet
      • iOS App
      • Keyboard Operation
    • SDK
      • api.chat.v1beta.generate Interface Description
      • api.v2.alertstrigger Interface Description
      • api.v2.export Interface Description
      • api.v2.personalization Interface Description
      • api.v2.search Interface Description
      • api.v2.suggest Interface Description
      • api.v3.admin.SnapshotService Interface Description
      • Debugging (Eclipse)
      • Developing an API V2 search request response transformer
      • Developing Item Transformation and Post Filter Plugins with the Mindbreeze SDK
      • Development of a Query Expression Transformer
      • Development of Insight Apps
      • Embedding the Insight App Designer
      • Java API Interface Description
      • OpenAPI Interface Description
    • Release Notes
      • Release Notes 20.1 Release - Mindbreeze InSpire
      • Release Notes 20.2 Release - Mindbreeze InSpire
      • Release Notes 20.3 Release - Mindbreeze InSpire
      • Release Notes 20.4 Release - Mindbreeze InSpire
      • Release Notes 20.5 Release - Mindbreeze InSpire
      • Release Notes 21.1 Release - Mindbreeze InSpire
      • Release Notes 21.2 Release - Mindbreeze InSpire
      • Release Notes 21.3 Release - Mindbreeze InSpire
      • Release Notes 22.1 Release - Mindbreeze InSpire
      • Release Notes 22.2 Release - Mindbreeze InSpire
      • Release Notes 22.3 Release - Mindbreeze InSpire
      • Release Notes 23.1 Release - Mindbreeze InSpire
      • Release Notes 23.2 Release - Mindbreeze InSpire
      • Release Notes 23.3 Release - Mindbreeze InSpire
      • Release Notes 23.4 Release - Mindbreeze InSpire
      • Release Notes 23.5 Release - Mindbreeze InSpire
      • Release Notes 23.6 Release - Mindbreeze InSpire
      • Release Notes 23.7 Release - Mindbreeze InSpire
      • Release Notes 24.1 Release - Mindbreeze InSpire
      • Release Notes 24.2 Release - Mindbreeze InSpire
      • Release Notes 24.3 Release - Mindbreeze InSpire
      • Release Notes 24.4 Release - Mindbreeze InSpire
      • Release Notes 24.5 Release - Mindbreeze InSpire
      • Release Notes 24.6 Release - Mindbreeze InSpire
      • Release Notes 24.7 Release - Mindbreeze InSpire
      • Release Notes 24.8 Release - Mindbreeze InSpire
      • Release Notes 25.1 Release - Mindbreeze InSpire
      • Release Notes 25.2 Release - Mindbreeze InSpire
    • Security
      • Known Vulnerablities
    • Product Information
      • Product Information - Mindbreeze InSpire - Standby
      • Product Information - Mindbreeze InSpire
    Home

    Path

    Sure, you can handle it. But should you?
    Let our experts manage the tech maintenance while you focus on your business.
    See Consulting Packages

    Configuration
    Mindbreeze Proxy Environment (Remote Connector)

    IntroductionPermanent link for this heading

    In this documentation, you will learn how to index into a Mindbreeze InSpire Appliance using a Mindbreeze Proxy Environment.

    A Mindbreeze Proxy Environment can be useful for you if your actual Mindbreeze InSpire Appliance does not have access to the data sources to be indexed due to the network infrastructure - or in other words - if your data sources in your LAN are not accessible from the outside (Internet or VPN) and your Mindbreeze InSpire Appliance is located at another site (different LAN). This can be the case, for example, if your Mindbreeze InSpire Appliance is hosted in the cloud (SaaS).

    In such a case, a Mindbreeze Proxy Environment is ideal. This can be run as a Virtual Machine (VM) on your LAN and crawl the documents from the local data sources. These documents are then sent to the Mindbreeze InSpire Appliance and indexed there. The Semantic Pipeline is then run on the appliance; search queries are also processed by the appliance. The Mindbreeze Proxy Environment is only responsible for crawling.

    PreparationPermanent link for this heading

    See Initial Operation (for VMs, the sections on hardware and iDRAC can be skipped). Furthermore, a special license for the product "Mindbreeze InSpire Remote Connector" must be installed on the Mindbreeze Proxy environment.

    Configuring the Mindbreeze InSpire AppliancePermanent link for this heading

    You need an index to which the Mindbreeze Proxy Environment sends the documents that should  be indexed. In addition, you need backend credentials with which the Mindbreeze Proxy Environment authenticates itself on the Mindbreeze InSpire Appliance.

    IndexPermanent link for this heading

    Create a new index on the Mindbreeze InSpire Appliance. To do so, navigate to "Configuration" and the "Indices" tab in the Mindbreeze Management Center. Then click on "Add Index". Note the "Index Port (HTTP)" that was automatically assigned. You can also change the port.

    Then disable the data source by clicking "Disable" under "Data Source". The data source will be configured later on the Mindbreeze Proxy Environment. Note that you must not delete the data source, but disable it instead, otherwise the search will not work correctly.

    Caching Principal Resolution ServicePermanent link for this heading

    Now create a new Caching Principal Resolution Service on the Mindbreeze InSpire appliance. To do this, navigate to "Configuration" and the "Indices" tab in the Mindbreeze Management Center. Then click on "Add Service".

    Now select the appropriate "Service" you need for your data source. Then activate the "Readonly" checkbox. Otherwise, no further configuration is necessary on the "Caching Principal Resolution Service".

    Finally, select the just configured "Caching Principal Resolution Service" in the data source (under "Data Sources" at the configured index).

    FilterPermanent link for this heading

    Now switch to the "Filters" tab and activate "Advanced Settings". Scroll down to the "Base Configuration" section and configure the following options:

    Setting

    Description

    Destination Pattern

    https://myorganization\.mindbreeze\.com:8443/realm/master/api/v1/index/(\d+)(.*)

    Replace "myorganization\.mindbreeze\.com" with your appliance hostname (see also "Remote Base URL" option in Mindbreeze Proxy Environment configuration). Also, replace the realm "master" (see the "Realm" option in the Mindbreeze Proxy Environment configuration).

    Destination Replacement

    http://localhost:\1/\2

    Create backend credentials (optional)Permanent link for this heading

    In order for the Mindbreeze Proxy environment to have access to the filters and index services via OAuth 2, a user is required that holds at least the "InSpire Index Writer" role. If you do not have a suitable user, please create a new one. For more information, see Configuration Backend Credentials.

    Configuring the Mindbreeze Proxy EnvironmentPermanent link for this heading

    Go to the Mindbreeze Management Center of the Mindbreeze Proxy Environment. Here you can configure the information required to use the services of the Mindbreeze Proxy environment. In addition, you can also configure the crawlers for your data sources here.

    Remote index and data sourcePermanent link for this heading

    In the "Configuration" menu, switch to the "Indices" tab and click on the "Add Index" drop down menu and then select "Add Remote Index".

    In the opened dialogue, select the node ID of the proxy environment under "Remote Index Node". Then select the desired data source under "Data Source".

    Only data sources that have a caching principal resolution service are suitable for remote connectors. Depending on the data source, you will find more information on setting it up in the associated data source documentation.

    Configure the following fields for the newly added remote index:

    Setting

    Description

    Remote Base URL

    The URL to the Mindbreeze Management Center of the Mindbreeze InSpire appliance. If hosted in the cloud, usually https://myorganization.mindbreeze.com:8443

    Realm

    For on-prem appliances "master" by default, in the cloud this value must be adjusted

    Index Port

    The Index Port on the Mindbreeze InSpire Appliance

    Filter Service ID

    The Filter Service ID on the Mindbreeze InSpire Appliance

    Filter Port

    The Filter Port on the Mindbreeze InSpire Appliance. The configuration of a Filter Port is only required, if the Filter Service ID is not configured.

    The configuration of the Filter Service ID:

    LimitationsPermanent link for this heading

    If the selected data source is to be used with ACLs and is not public, there are certain limitations.

    Remote indexes require a caching principal resolution service to handle ACLs, which are not available for all data sources.

    The following data sources cannot be used as a remote index with ACLs:

    • Best Bets
    • Data Integration
    • Egnyte
    • Github
    • GSAFeedAdapter
    • HL7
    • IBM Connections
    • Microsoft Graph
    • SemanticWeb
    • Web

    Remote caching principal resolution servicePermanent link for this heading

    Now create a new Caching Principal Resolution Service on the Mindbreeze proxy environment. To do this, navigate to "Configuration" and the "Indices" tab in the Mindbreeze Management Center. Then click on "Add Service".

    Configure the service according to the Configuration of Caching Principal Resolution Service. Then click on "Add Property" in the "Consumer Caching Principal Resolution Services" section and configure the following fields:

    Setting

    Description

    Readonly on Consumer

    This checkbox should be selected only on producer nodes of Mindbreeze InSpire environments.

    Consumer

    Disable

    Disable updating remote cache.

    Base URL

    The URL to the Mindbreeze Management Center of the Mindbreeze InSpire appliance. If hosted in the cloud, usually https://myorganization.mindbreeze.com:8443.

    Realm

    For on-prem appliances "master" by default, in the cloud this value must be adjusted.

    Service Port

    The Caching Principal Resolution Service Port on the Mindbreeze InSpire Appliance.

    CredentialsPermanent link for this heading

    Switch to the "Network" tab and click on "Add Credential". Configure the following fields:

    Setting

    Description

    Name

    Assign an arbitrary, but meaningful name

    Type

    OAuth 2

    Access Token URL

    The URL from which OAuth 2 access tokens can be requested. If hosted in the cloud, usually https://myorganization.mindbreeze.com:8443/auth/realms/master/protocol/openid-connect/token. Please note that the realm ("master") must be customized for cloud environments.

    Client ID

    OAuth 2 Client ID. The default client "mindbreeze-inspire-public" is recommended

    Username

    Username of a user who has the role "InSpire Index Writer". See also Configuration Backend Credentials

    Password

    Password of this user

    Click Add Endpoint and configure the following fields:

    Setting

    Description

    Location

    https://myorganization.mindbreeze.com:8443/realm/master (Please note that the realm ("master") must be customized for cloud environments).

    Credential

    The credential that you have previously created.

    TroubleshootingPermanent link for this heading

    If you have problems, here is a list of possible solutions:

    • Mindbreeze Proxy Environment is configured incorrectly:
      On the Mindbreeze Proxy environment, open the current crawler log file (in /data/logs/log-mescrawler_launchedservice-<service>/current/log-mescrawler_launchedservice.log). If you find an error message there, it will probably indicate incorrect or missing configuration parameters. Depending on the error message, please check the following parts of the configuration:
      • Credentials (Network Tab)
      • Endpoints (Network Tab)
      • Remote Index (Indices Tab)
      • Data Source (Indices Tab)
    • There are issues on the Mindbreeze InSpire Appliance:
      If you did not find any errors or other errors in the crawler logs on the Mindbreeze Proxy Environment that indicate filter or index problems, please check the filter or index logs on the Mindbreeze InSpire Appliance.

    Mindbreeze Proxy Environment is configured incorrectlyPermanent link for this heading

    On the Mindbreeze Proxy environment, open the current crawler log file (in /data/logs/log-mescrawler_launchedservice-<service>/current/log-mescrawler_launchedservice.log). If you find an error message there, it will probably indicate incorrect or missing configuration parameters. Depending on the error message, please check the following parts of the configuration:

    • Credentials (Network Tab)
    • Endpoints (Network Tab)
    • Remote Index (Indices Tab)
    • Data Source (Indices Tab)

    Issues on the Mindbreeze InSpire AppliancePermanent link for this heading

    If you did not find any errors or other errors in the crawler logs on the Mindbreeze Proxy Environment that indicate filter or index problems, please check the filter or index logs on the Mindbreeze InSpire Appliance.

    Connection failedPermanent link for this heading

    You can check whether a connection to the remote index is possible by executing the following command on your appliance in the inspire container. In this case, we will try to call the Remote Base URL https://myorganization.mindbreeze.com:8443/:
    curl -kv https://myorganization.mindbreeze.com:8443/

    If the connection is successful, you should receive an arbitrary HTML status code as a response (no login is performed here, this is only a connection test):
    …
    < HTTP/1.1 401 Unauthorized
    …

    However, if you get a response like this, it means that you may need to configure a proxy or that a firewall activation is necessary:

    * Connection refused
    * Failed connect to myorganization.mindbreeze.com:8443; Connection refused
    * Closing connection 0
    curl: (7) Failed connect to myorganization.mindbreeze.com:8443; Connection refused

    If a proxy is necessary, you must configure it in the Management Center as described here.
    Furthermore, in order to be able to make a connection test with the proxy using the curl command, an environment variable must also be set as follows:
    export https_proxy=myproxy.myorganization.com:8080

    Ideally, you should now be able to establish a successful connection with the curl connection test mentioned above.

    However, if you get an answer like this:
    < HTTP/1.1 403 Forbidden
    < Server: squid/...

    ...

    < X-Squid-Error: ERR_ACCESS_DENIED 0

    ...

    this means that you have successfully configured a proxy, but the proxy does not allow a connection. In this case, a proxy activation is necessary.

    Download PDF

    • Configuration - Mindbreeze Proxy Environment (Remote Connector)

    Content

    • Introduction
    • Preparation
    • Configuring the Mindbreeze InSpire Appliance
    • Configuring the Mindbreeze Proxy Environment
    • Troubleshooting

    Download PDF

    • Configuration - Mindbreeze Proxy Environment (Remote Connector)