|Anonymous | Login | Signup for a new account||2021-10-25 22:50 CEST|
|Main | My View | View Issues | Change Log | Roadmap|
|View Issue Details|
|ID||Project||Category||View Status||Date Submitted||Last Update|
|0000606||YaCy||[All Projects] General||public||2015-10-10 18:32||2015-10-13 02:46|
|Product Version||YaCy 1.8|
|Target Version||Fixed in Version|
|Summary||0000606: Recorded crawler mangled|
|Description||A registered crawler in Table_API_p.html fails to execute.|
Clicking its "clone" icon, browser is redirected to CrawlStartExpert.html, where I notice that the "Use filter" input field is pre-filled with an url-encoded string. Once manually url-decoded, the string becomes my correct, valid regex and the crawler task can be successfully started again.
|Steps To Reproduce||1) Create crawler with regex filter;|
2) Re-execute crawler from "Process Scheduler".
|Tags||No tags attached.|
|Attached Files||url-encoded.pdf [^] (102,930 bytes) 2015-10-12 11:01|
were not able to reproduce it.
Used in all 4 filter fields
- Load Filter on URLs - Use filter
- Load Filter on IPs - must-match
- Filter on URLs - must-match
- Filter on Content of Document - must-match
some regex with chars which would be URL-encoded, but all were fine after clone button.
Maybe give a example of the regex you used and which of above filter you encountered it.
The uploaded page url-encoded.pdf shows what I get just after pressing the "clone" button from Table_API_p.html. Notice that the field with html ID "intention" ("Index Attributes" → "Do Remote Indexing") is url-encoded, too.
Here are the original, unencoded values I entered for each field, listed by their input ID:
Hardware product pages in YaCy index
fixed in v1.83/9403
|2015-10-10 18:32||Davide||New Issue|
|2015-10-11 22:50||BuBu||Note Added: 0001115|
|2015-10-12 11:01||Davide||File Added: url-encoded.pdf|
|2015-10-12 11:13||Davide||Note Added: 0001117|
|2015-10-13 02:46||BuBu||Note Added: 0001118|
|2015-10-13 02:46||BuBu||Status||new => resolved|
|2015-10-13 02:46||BuBu||Resolution||open => fixed|
|2015-10-13 02:46||BuBu||Assigned To||=> BuBu|
|Copyright © 2000 - 2021 MantisBT Team|