Couldn't access seed page


Error message

Displayed in the update-COLLECTION.log file

Crawl: Error running crawler: Crawler couldn't access seed page

Error type

Web crawler

Cause

The web crawler couldn't access any of the collection's specified start URLs (Administer > Edit Collection > Start URL(s)).

Resolution

  1. Confirm that the collection's Start URLs are valid and return a status 200 (OK).
  2. Examine the entries for the Start URLs in the collection's crawl logs (crawl.log.*, url_errors.log). Common issues include:
    • A robots.txt file on the web server blocking access
    • The seed URL requiring authentication
  3. Update the collection.
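
The checks in steps 1 and 2 can be approximated from outside the crawler with a short script. The following is a minimal sketch using only the Python standard library, not part of the product itself. The user agent string `ExampleCrawler` and the function names are hypothetical; substitute the user agent your crawler actually sends, since robots.txt rules can differ per agent. Treating 401 and 403 as authentication problems is also an assumption about how the web server responds.

```python
import urllib.error
import urllib.request
import urllib.robotparser
from urllib.parse import urlsplit

# Hypothetical user agent; substitute the one your crawler actually sends.
USER_AGENT = "ExampleCrawler"


def diagnose_status(status):
    """Map an HTTP status code to a likely cause of the seed page error."""
    if status == 200:
        return "ok"
    if status in (401, 403):
        return "authentication required or access denied"
    return f"unexpected status {status}"


def robots_allows(robots_lines, url, user_agent=USER_AGENT):
    """Return True if the given robots.txt lines permit fetching url."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


def fetch(url, user_agent=USER_AGENT, timeout=10):
    """Return (status, body_text); HTTP error statuses are returned, not raised."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        return err.code, ""


def check_start_url(url):
    """Check robots.txt first, then the start URL, and describe the result."""
    parts = urlsplit(url)
    robots_url = f"{parts.scheme}://{parts.netloc}/robots.txt"
    try:
        robots_status, robots_body = fetch(robots_url)
        if robots_status == 200 and not robots_allows(robots_body.splitlines(), url):
            return "blocked by robots.txt"
        status, _ = fetch(url)
    except OSError as err:  # covers URLError, DNS failures, and timeouts
        return f"unreachable: {err}"
    return diagnose_status(status)


# Example (performs network requests):
# print(check_start_url("https://www.example.com/"))
```

Note that `urlopen` follows redirects, so the status reported is that of the final page, which may differ from what the crawler records for the start URL itself. Any result other than "ok" points to the corresponding entries in the crawl logs for more detail.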