Crawler out of memory: java.lang.OutOfMemoryError: PermGen space
Error message
Displayed in the url_errors.log file
[Crawl URLs: java.lang.OutOfMemoryError: PermGen space]
Error type
Web crawler
Cause
The Java virtual machine (JVM) PermGen space allocated to the web crawler has been used up. PermGen is allocated separately from the heap and is used to store class object information. The default allocation is quite small, at only about 25MB, and can be consumed by complex collection configurations, given the large number of classes already included in the Funnelback class path. JVM memory usage can be checked with the Java jmap utility and its -heap option by running the following command from the terminal:
jmap -heap <jvm_process_id>
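The jmap report covers every memory region, so it can help to filter it down to the PermGen figures. The following is a minimal sketch, not part of the Funnelback tooling: perm_gen_section is a hypothetical helper name, and it assumes the report has a "Perm Generation" heading followed by four lines (capacity, used, free, percentage used), as older JVMs that still have PermGen typically print.

```shell
# Hypothetical helper: reads `jmap -heap` output on stdin and prints only
# the Perm Generation heading plus the four lines that follow it.
# Assumption: the section is laid out as heading, capacity, used, free, % used.
perm_gen_section() {
  grep -A 4 'Perm Generation'
}

# Example usage (replace <jvm_process_id> with the crawler's process ID;
# the JDK's `jps -l` command lists running Java processes and their IDs):
#   jmap -heap <jvm_process_id> | perm_gen_section
```

A "used" percentage close to 100 in this section would be consistent with the PermGen exhaustion described above.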
Resolution
- Increase the allocated PermGen memory by adding the following Java options to the collection.cfg:
java_options=-XX:MaxPermSize=128m
- Start a new crawl