Mindbreeze GmbH, A-4020 Linz, 2018.
All rights reserved. All hardware and software names are brand names and/or trademarks of their respective manufacturers.
These documents are strictly confidential. The submission and presentation of these documents does not confer any rights to our software, our services and service outcomes or other protected rights.
The dissemination, publication or reproduction hereof is prohibited.
For ease of readability, gender differentiation has been waived. Corresponding terms and definitions apply within the meaning and intent of the equal treatment principle for both sexes.
When selecting the installation method, select “Advanced”.
Click on the “Indices” tab and then click the “Add new index” icon to create a new index.
Enter the index path, for example, “C:\Index”. If necessary, adjust the Display Name of the index service and the associated filter service.
The option “Use ACL References” must be enabled.
Add a new data source with the symbol “Add new custom source” on the lower right.
If not already selected, select “Jive Software Jive” with the “Category” button.
Using the setting “Crawler Interval” you can configure the amount of time that should elapse between two indexing runs.
In the field “Crawling Root”, you can specify a URL where a Jive sitemap is accessible. If the Mindbreeze sitemap generator add-on is installed on your Jive server and a sitemap is generated, enter the URL <Jive URL>/rpc/rest/mindbreeze/sitemap?jobid=full here.
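As a minimal sketch, the Crawling Root value can be derived from the Jive base URL as shown below. The function name sitemap_url and the host jive.example.com are hypothetical; only the path /rpc/rest/mindbreeze/sitemap?jobid=full comes from this document.

```python
# Path of the sitemap generated by the Mindbreeze sitemap generator add-on,
# as given in this document.
SITEMAP_PATH = "/rpc/rest/mindbreeze/sitemap?jobid=full"


def sitemap_url(jive_url: str) -> str:
    """Build the Crawling Root URL from the Jive base URL (hypothetical helper)."""
    # Strip a trailing slash so the result never contains "//rpc".
    return jive_url.rstrip("/") + SITEMAP_PATH


# Example with a placeholder host name.
print(sitemap_url("https://jive.example.com/"))
```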
In the field “URL regex” you can define a regular expression, which sets a pattern for the links that should be indexed.
If certain URLs should be excluded from the crawl, they can be configured using a regular expression in “URL Exclude Pattern”.
If the DNS resolution of certain web servers does not work for network-related reasons, you can specify the IP addresses using the “Additional Hosts File”.
If you want to add specific HTTP headers (for example, Accept-Language), you can configure them using “Accept headers”.
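The interplay of “URL regex” and “URL Exclude Pattern” can be tested offline before a crawl with a sketch like the following. It assumes that a pattern applies as soon as it matches anywhere in the URL, which this document does not specify. It also uses Python's regular expression syntax, which may differ in details from the connector's. The function name should_crawl and the example URLs are hypothetical.

```python
import re


def should_crawl(url, url_regex=None, exclude_pattern=None):
    """Return True if the URL passes the include pattern and is not excluded.

    Assumption: a pattern applies when it matches anywhere in the URL.
    """
    if url_regex and not re.search(url_regex, url):
        return False  # does not match the include pattern
    if exclude_pattern and re.search(exclude_pattern, url):
        return False  # explicitly excluded
    return True


# Example: index documents only, but skip anything below /docs/archive/.
include = r"^https://jive\.example\.com/docs/"
exclude = r"/docs/archive/"
print(should_crawl("https://jive.example.com/docs/DOC-1001", include, exclude))
print(should_crawl("https://jive.example.com/docs/archive/DOC-7", include, exclude))
```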
With the “Use Stream Parser” option, a stream parser for processing the sitemaps is used. This option is suitable for sitemaps with a lot of URLs.
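To illustrate why a stream parser suits large sitemaps, the sketch below reads URLs incrementally instead of loading the whole document into memory. It is not the connector's implementation. It assumes a sitemap in the standard sitemaps.org XML format, and the function name iter_sitemap_urls and the sample URLs are hypothetical.

```python
import io
import xml.etree.ElementTree as ET

# Namespace of the standard sitemap protocol (assumed format of the sitemap).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def iter_sitemap_urls(stream):
    """Yield the <loc> values of a sitemap one at a time while parsing."""
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if elem.tag == SITEMAP_NS + "loc" and elem.text:
            yield elem.text.strip()
        elif elem.tag == SITEMAP_NS + "url":
            # The entry is complete; drop its children to keep memory use low.
            elem.clear()


sample = (
    b'<?xml version="1.0" encoding="UTF-8"?>'
    b'<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    b"<url><loc>https://jive.example.com/docs/DOC-1</loc></url>"
    b"<url><loc>https://jive.example.com/docs/DOC-2</loc></url>"
    b"</urlset>"
)
for url in iter_sitemap_urls(io.BytesIO(sample)):
    print(url)
```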
The “Max Retries” option determines how often the connector retries the download of a document when temporary errors (e.g. socket timeouts) occur. The default value is 0 (no further download attempts). If you are crawling across an unstable network that causes timeouts, this value should be increased, for example to 10. If the timeouts are caused by an overloaded data source, the value should be left at 0 so that the data source is not put under even more load.
The “Retry Delay Seconds” option determines the waiting time (in seconds) between download attempts (see "Max Retries"). The default value is 1.
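The combined effect of “Max Retries” and “Retry Delay Seconds” can be pictured with the following sketch. It only illustrates the documented behavior and is not the connector's code. The function download_with_retries and its parameters are hypothetical, and a Python TimeoutError stands in for any temporary error.

```python
import time


def download_with_retries(fetch, max_retries=0, retry_delay_seconds=1,
                          sleep=time.sleep):
    """Call fetch(); on a timeout, retry up to max_retries more times.

    The defaults mirror the documented defaults (0 retries, 1 second delay).
    """
    retries_done = 0
    while True:
        try:
            return fetch()
        except TimeoutError:
            if retries_done >= max_retries:
                raise  # retries exhausted, report the error
            retries_done += 1
            sleep(retry_delay_seconds)  # wait before the next attempt
```

With max_retries=0 the first timeout is reported immediately. With max_retries=10 a document is attempted at most eleven times in total.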
In this section (available only when “Advanced Settings” is selected) the crawl speed can be adjusted.
With the number of crawler threads (“Number Of Crawler Threads”) you can define how many threads should simultaneously fetch pages from the web server.
“Request Interval” defines the number of milliseconds that the crawler (crawler thread) should wait between the individual requests. However, a “crawl-delay” robots instruction is always taken into consideration and overrides this value.
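The precedence between “Request Interval” and a “crawl-delay” robots instruction can be summarized in a short sketch. The function name effective_delay_ms is hypothetical, and the sketch assumes that the crawl-delay value is given in seconds, as is common for robots.txt.

```python
def effective_delay_ms(request_interval_ms, crawl_delay_seconds=None):
    """Return the wait time between two requests of one crawler thread.

    A crawl-delay from robots.txt, if present, overrides the configured
    Request Interval (assumption: crawl-delay is expressed in seconds).
    """
    if crawl_delay_seconds is not None:
        return int(crawl_delay_seconds * 1000)
    return request_interval_ms


print(effective_delay_ms(250))      # no crawl-delay: configured interval applies
print(effective_delay_ms(250, 2))   # crawl-delay of 2 seconds overrides it
```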
In the “Proxy” section you can enter a proxy server if your infrastructure requires it. Enter the computer name and the port of the proxy server in “Proxy host” and “Proxy Port”.
The Jive Connector can also log in to the proxy using HTTP-BASIC. Enter the user into the “Proxy User” and the corresponding password in “Proxy Password” if the connections should be made through a proxy with authentication.
In the section “Basic access authentication”, a “credential” can be selected, which is used in the basic authentication for HTTP requests. A credential of the type “Username/Password” should be entered here.
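For reference, HTTP basic authentication transmits the user name and password of such a credential as a Base64-encoded Authorization header. The sketch below shows the general scheme and can help when verifying a credential with a separate HTTP client. The function name basic_auth_header is hypothetical, and the user name and password are sample values.

```python
import base64


def basic_auth_header(username, password):
    """Return the value of the Authorization header for HTTP basic auth."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8"))
    return "Basic " + token.decode("ascii")


# Sample values only; never hard-code real credentials.
print(basic_auth_header("Aladdin", "open sesame"))
```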
This credential can be added and configured in the “Network”-tab in the section “Credentials”.
If the Jive sitemap is accessible with HTTP form authentication, the login parameters in section “Form Based Login” can be configured as follows:
You can also define so-called access rules, which consist of the following options:
In the "Jive URL" section, the base URL of the Jive Server must be specified.
Open the “Indices” tab in the Mindbreeze configuration and add a Caching Jive Principal Resolution Service.
Specify the “Jive Server URL”. If access to Jive is allowed for unregistered users, enable the option “Jive Guest Access enabled”. The “Groups Containing All Users” option allows groups to be defined so that all users are treated as if they were members of these groups.
Specify the directory path for the cache in the “Database Directory Path” field and change the “Cache In Memory Items Size” if necessary, depending on the available memory capacity of the JVM. In the “Cache Update Interval” field, specify the time (in minutes) that should elapse before the cache is updated. This time interval is ignored the first time the service is started. The next time the service is started, this time will be taken into account. The settings “Health Check Interval”, “Health Check max. Retries On Failure” and “Health Check Request Timeout” allow this service to be restarted if, for instance, there are persistent connection problems.
The service will be available at the specified “Webservice Port”. If multiple principal resolution services are configured, make sure that the “Webservice Port” parameters are different and that the configured ports are available.
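Both conditions, distinct ports and available ports, can be checked on the server before the services are started. The following is a minimal sketch; the function names duplicate_ports and port_is_free and the port numbers are hypothetical, and the check only covers the local loopback address.

```python
import socket


def duplicate_ports(ports):
    """Return the ports that are configured more than once, in sorted order."""
    seen, duplicates = set(), set()
    for port in ports:
        if port in seen:
            duplicates.add(port)
        seen.add(port)
    return sorted(duplicates)


def port_is_free(port, host="127.0.0.1"):
    """Return True if the port can currently be bound on the given host."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            return False  # already in use or not permitted
    return True


# Hypothetical port numbers of three principal resolution services.
print(duplicate_ports([23800, 23801, 23800]))
```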
The option "Lowercase Principals" allows all principals from the cache to be delivered in lower case.
If users cannot be resolved for a search query, a request will be sent directly to Jive if the option "Suppress JIVE Service Calls" is not enabled. However, for performance reasons, it is recommended that you enable this option so that no live requests are made to Jive.
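The lookup order described above can be sketched as follows. This is an illustration of the documented behavior only, not the service's implementation. The function resolve_principals, the dictionary used as cache, and the callback query_jive are hypothetical.

```python
def resolve_principals(user, cache, query_jive,
                       suppress_jive_service_calls=True,
                       lowercase_principals=False):
    """Resolve a user's principals from the cache, optionally asking Jive."""
    principals = cache.get(user)
    if principals is None and not suppress_jive_service_calls:
        # Cache miss: send a live request to Jive (slower than the cache).
        principals = query_jive(user)
    principals = principals or []
    if lowercase_principals:
        principals = [p.lower() for p in principals]
    return principals


# Example with a hypothetical cache content and a stand-in for the live call.
cache = {"jdoe": ["JDoe", "Engineering"]}
print(resolve_principals("jdoe", cache, lambda u: [], lowercase_principals=True))
print(resolve_principals("unknown", cache, lambda u: ["Guests"]))
```

With the option enabled, an unknown user yields an empty result without any request to Jive. With the option disabled, the live request is made on every cache miss.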