How to configure a collection to crawl a WebDAV service
Managed by | Updated .
Background
WebDAV is an extension of HTTP, that allows users to create, change and move documents on a server typically a web server or web share.
Although a WebDAV service extends the HTTP protocol, there are some differences in how Funnelback needs to crawl compared with a standard web site or file share.
Configuration
The following steps will enable you to crawl a WebDAV service:
- Create a filecopy collection
- Add the following pre_gather_command:
pre_gather.sh
curl -i --connect-timeout 60 --retry 3 --retry-delay 20 --ntlm -u '<DOMAIN>\<USERNAME>:<PASSWORD>' -X PROPFIND <PATH_TO_FOLDER> -H "Depth: 1"
This will download an XML file which you can process like any other XML datasource.
Was this artcle helpful?