Mindbreeze GmbH, A-4020 Linz.
All rights reserved. All hardware and software names used are registered trade names and/or registered trademarks of the respective manufacturers.
These documents are highly confidential. No rights to our software or our professional services, or results of our professional services, or other protected rights can be based on the handing over and presentation of these documents.
Distribution, publication or duplication is not permitted.
The term ‘user’ is used in a gender-neutral sense throughout the document.
Before installing the Data Integration Connector, ensure that the Mindbreeze Server is already installed and that this connector is included in the Mindbreeze license.
The Data Integration Connector is available as a ZIP file. This file must be registered with the Fabasoft Mindbreeze Enterprise Server via mesextension.exe as follows:
mesextension --interface=plugin --type=archive --file=DataIntegrationConnector<version>.zip install
PLEASE NOTE: The Connector can be updated by calling the same mesextension command. Fabasoft Mindbreeze Enterprise will then carry out the required update automatically.
To uninstall the Data Integration Connector, first delete all Data Integration Crawlers and then carry out the following command:
mesextension --interface=plugin --type=archive --file=DataIntegrationConnector<version>.zip uninstall
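For scripted deployments, the install and uninstall calls shown above can be assembled programmatically. The following is a minimal sketch in Python, assuming that mesextension is available on the PATH. The helper names mesextension_command and run_mesextension are hypothetical and are not part of the product; the archive name is passed in by the caller because the actual version suffix depends on your installation package.

```python
import subprocess


def mesextension_command(archive, action):
    """Build the argument list for the mesextension call described above.

    archive: file name of the connector ZIP (supplied by the caller).
    action:  'install' (also used for updates) or 'uninstall'.
    """
    if action not in ("install", "uninstall"):
        raise ValueError("action must be 'install' or 'uninstall'")
    return [
        "mesextension",
        "--interface=plugin",
        "--type=archive",
        f"--file={archive}",
        action,
    ]


def run_mesextension(archive, action):
    # Hypothetical wrapper: runs the command and raises
    # subprocess.CalledProcessError if it exits with a non-zero code.
    subprocess.run(mesextension_command(archive, action), check=True)
```

Building the argument list separately from running it makes it possible to log or review the exact command before it is executed.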
The Data Integration Connector contains components for Talend Open Studio, which will need to be installed separately. Unpack the file components.zip from the Data Integration Connector installation package into any folder (e.g. C:\custom-talend-components).
Create a new project after installing Talend Open Studio.
Open Window -> Preferences in the Talend Open Studio menu.
Select Talend -> Components and enter the name of the folder into which you unpacked the components in the field “User component folder”.
You can now create a new job. Add data sources according to your requirements. More information about working with Talend Open Studio is available in the Talend Open Studio documentation.
The target of this kind of processing chain must always be the component named "MindbreezeIndexOutput". Furthermore, please note that for the component to function correctly, the following fields (string type) must be defined in the data set schema:
The following fields are optional and can be used additionally for further processing in the Mindbreeze Index:
Should further fields be defined in the schema, these are imported as metadata. It is also possible to define annotations in the following format:
In this example, "val1" becomes an annotation with the categoryClass "cc1" and the value "v1".
All "list" type fields become lists of metadata; all other fields are automatically converted into "string" types.
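The type handling described above can be illustrated with a short sketch. This is not the connector's implementation, only a hedged Python illustration of the stated rule: list fields stay lists, and everything else is converted to a string. The function name to_metadata is hypothetical, and converting the individual list elements to strings is an assumption.

```python
def to_metadata(row):
    """Illustrate the conversion rule for additional schema fields.

    row: mapping of field name to value for one data set row.
    List fields are kept as lists of metadata values; all other
    fields are converted to strings.
    """
    metadata = {}
    for name, value in row.items():
        if isinstance(value, (list, tuple)):
            # Assumption: each list element is stored as a string.
            metadata[name] = [str(item) for item in value]
        else:
            metadata[name] = str(value)
    return metadata
```

For instance, a numeric field such as a page count would end up as a string value, while a field holding several keywords would remain a list with one entry per keyword.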
When your job configuration is complete, you can run it to test its functionality. During such a test run, the data are not sent to an index but are output within Talend Open Studio.
If the functionality test runs smoothly, the job still needs to be exported. This can be done via the context menu of the job:
The generated ZIP file must also be unpacked.
The "Main-Class" required for the configuration of Fabasoft Mindbreeze Enterprise can be found in the generated batch file.
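If many jobs need to be configured, the class name can also be extracted from the batch file with a small script. The following Python sketch rests on an assumption about the file's layout, namely that it contains a java call of the form "java ... -cp <classpath> <main class> ...". Check the result against the actual file. The function name find_main_class is hypothetical.

```python
import shlex


def find_main_class(batch_text):
    """Return the first non-option token after the classpath, or None.

    Assumes a line of the form: java [options] -cp <classpath> <class> [args]
    """
    for line in batch_text.splitlines():
        try:
            # posix=False keeps backslashes and quoted values intact.
            tokens = shlex.split(line, posix=False)
        except ValueError:
            continue  # unbalanced quotes: not a line this sketch can parse
        for flag in ("-cp", "-classpath"):
            if flag in tokens:
                start = tokens.index(flag) + 2  # skip the flag and its value
                for token in tokens[start:]:
                    # Skip further JVM options and batch placeholders like %*.
                    if not token.startswith(("-", "%")):
                        return token
    return None
```

The function returns None when no matching line is found, in which case the batch file should be inspected manually.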
Select the “Advanced” installation method:
Click on the “Indices” tab and then on the “Add new index” symbol to create a new index.
Enter the index path, e.g. “C:\Index”. Adapt the Display Name of the Index Service and the related Filter Service if necessary.
Add a new data source with the symbol “Add new custom source” at the bottom right.
To configure the Crawler, enter the job directory in “Directory of Job” and the Java job class in “Main Class”.
If the option “Delete Unprocessed Documents” is enabled, all documents in the index that were not processed during the crawl run are deleted, provided the crawl run was successful (the exit code of the Talend job is 0).
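The effect of this option can be summarized in a few lines. The sketch below is an illustration of the rule as described here, not the crawler's actual code. The function name documents_to_delete and the representation of documents as sets of keys are assumptions.

```python
def documents_to_delete(indexed_keys, processed_keys, exit_code,
                        delete_unprocessed=True):
    """Return the keys of documents that would be removed after a crawl run.

    indexed_keys:   keys of the documents currently in the index.
    processed_keys: keys of the documents the job processed in this run.
    exit_code:      exit code of the Talend job (0 means success).
    """
    if not delete_unprocessed or exit_code != 0:
        # Option disabled or crawl run failed: nothing is deleted.
        return set()
    return set(indexed_keys) - set(processed_keys)
```

The practical consequence is that a failed job (non-zero exit code) leaves the index untouched, so a partial run does not cause documents to disappear.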