How to configure a collection to crawl a WebDAV service


Background

WebDAV is an extension of HTTP that allows users to create, change and move documents on a server, typically a web server or web share.

Although a WebDAV service extends the HTTP protocol, Funnelback needs to crawl it differently from a standard web site or file share.

Configuration

The following steps will enable you to crawl a WebDAV service:

  1. Create a filecopy collection.
  2. Add the following pre_gather_command:

     pre_gather.sh
     curl -i --connect-timeout 60 --retry 3 --retry-delay 20 --ntlm -u '<DOMAIN>\<USERNAME>:<PASSWORD>' -X PROPFIND <PATH_TO_FOLDER> -H "Depth: 1"

The PROPFIND request with a Depth header of 1 returns an XML document describing the folder and its immediate children, which you can process like any other XML data source.
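
Before wiring the command into a collection, it can help to check which paths the WebDAV service actually reports. The following is a minimal sketch, not part of Funnelback: list_hrefs and sample_response are hypothetical helper names, the sample paths are invented, and the regular expression is only a quick sanity check rather than a real XML parser. It assumes a response in the standard WebDAV multistatus shape, where each resource is listed in an href element.

```shell
#!/bin/sh
# Print the text of every <href> element found in a WebDAV multistatus
# response read from stdin, one value per line.
# Assumption: a regex is good enough for a quick check. It handles an
# optional namespace prefix (D:, d:, none) but not entities or CDATA.
list_hrefs() {
  tr -d '\r\n' |
    grep -oE '<([A-Za-z0-9_]+:)?href>[^<]*' |
    sed -E 's/^<[^>]*>//'
}

# Hypothetical sample, shaped like the reply to a PROPFIND with Depth: 1:
# the folder itself plus its immediate children.
sample_response() {
  cat <<'EOF'
<?xml version="1.0" encoding="utf-8"?>
<D:multistatus xmlns:D="DAV:">
  <D:response>
    <D:href>/docs/</D:href>
    <D:propstat><D:status>HTTP/1.1 200 OK</D:status></D:propstat>
  </D:response>
  <D:response>
    <D:href>/docs/report.pdf</D:href>
    <D:propstat><D:status>HTTP/1.1 200 OK</D:status></D:propstat>
  </D:response>
  <D:response>
    <D:href>/docs/archive/</D:href>
    <D:propstat><D:status>HTTP/1.1 200 OK</D:status></D:propstat>
  </D:response>
</D:multistatus>
EOF
}

# Usage: pipe a saved response (or the sample) through the helper.
sample_response | list_hrefs
```

In practice the output of the curl command would be piped into list_hrefs in place of the sample. Because the helper only looks for href elements, any HTTP headers that curl -i prints ahead of the XML body are ignored.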
