YaCy-Bugtracker

ID: 0000762
Project: YaCy
Category: Wishlist - Wunschliste
View Status: public
Date Submitted: 2017-07-09 03:58
Last Update: 2017-07-09 12:28
Reporter: smokingwheels
Assigned To:
Priority: low
Severity: tweak
Reproducibility: sometimes
Status: new
Resolution: open
ETA: none
Platform:
OS:
OS Version:
Product Version: YaCy 1.9
Target Version:
Fixed in Version:
Summary: 0000762: Have crawler queue for domains that have large robots delay time.
Description: While crawling, I have noticed that the crawler comes across sites with a robots crawl delay of 10 to 60 seconds. During this time the PPM (pages per minute) drops to 0, even though the delayed site is not the one I intended to crawl; it is just background noise.

The effect is that the crawl of the site of interest takes much longer.
A temporary workaround is to add the delaying site to the blacklist.

There are some sites you do want to crawl despite their robots delay, and for those you accept that it will take some time to gather the information.
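
The requested behaviour could be sketched roughly as follows. This is only an illustration of the idea, not how YaCy's crawler is implemented: the class name HostDelayScheduler, its methods, and the example host names are all hypothetical. Each host gets its own queue, and a heap ordered by "next allowed fetch time" lets the crawler skip hosts that are still waiting out their robots delay instead of stalling on them.

```python
import heapq
from collections import deque


class HostDelayScheduler:
    """Hypothetical per-host scheduler; an illustration, not YaCy code."""

    def __init__(self):
        self._queues = {}   # host -> deque of pending URLs
        self._delays = {}   # host -> crawl delay in seconds
        self._next_ok = {}  # host -> earliest time of the next fetch
        self._ready = []    # min-heap of (next allowed time, host)

    def add(self, host, url, delay, now=0.0):
        """Queue a URL; `delay` is the host's robots crawl delay in seconds."""
        self._delays[host] = delay
        queue = self._queues.setdefault(host, deque())
        if not queue:
            # A host is in the heap only while it has pending URLs.
            ready_at = max(now, self._next_ok.get(host, now))
            heapq.heappush(self._ready, (ready_at, host))
        queue.append(url)

    def next_url(self, now):
        """Return a URL that may be fetched at `now`, or None if every
        queued host is still inside its delay window."""
        if not self._ready or self._ready[0][0] > now:
            return None
        _, host = heapq.heappop(self._ready)
        queue = self._queues[host]
        url = queue.popleft()
        self._next_ok[host] = now + self._delays[host]
        if queue:
            heapq.heappush(self._ready, (self._next_ok[host], host))
        return url


if __name__ == "__main__":
    sched = HostDelayScheduler()
    sched.add("slow.example", "http://slow.example/a", delay=60)
    sched.add("slow.example", "http://slow.example/b", delay=60)
    sched.add("fast.example", "http://fast.example/a", delay=1)
    sched.add("fast.example", "http://fast.example/b", delay=1)
    for t in (0, 0, 1, 60):
        print(t, sched.next_url(now=t))
```

With a structure like this, the slow host only ever occupies one slot in the heap, so the site of interest keeps being served while the slow host waits, and blacklisting it is no longer needed.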

Tags: No tags attached.
Attached Files

- Relationships

- Notes
There are no notes attached to this issue.

- Issue History
Date Modified Username Field Change
2017-07-09 03:58 smokingwheels New Issue

