YaCy-Bugtracker - YaCy
View Issue Details
0000735YaCy[All Projects] Generalpublic2017-04-09 21:222019-07-28 08:38
shni 
 
normalmajorhave not tried
newopen 
none 
YaCy 1.9 
 
0000735: indexing script content of html websites
In html embedded scripts are not always filtered. The code then gets indexed and becomes searchable.
Index bing.com and www.hertie.de/Laufrad/HUDORA-Koolbike-Boy_p1959387

When searching for "bing" results are as seen on the attached screenshot.

When searching for "bing google" the latter website is returned, but it contains the keywords only in <script> sections (not in visible content).
No tags attached.
png 2017-04-09 21_14_43-bing - YaCy '_anonufe-15030389-11'_ Search Page.png (22,145) 2017-04-09 21:22
http://mantis.tokeek.de/file_download.php?file_id=264&type=bug
png
Issue History
2017-04-09 21:22shniNew Issue
2017-04-09 21:22shniFile Added: 2017-04-09 21_14_43-bing - YaCy '_anonufe-15030389-11'_ Search Page.png
2017-04-14 02:06BuBuNote Added: 0001392
2017-04-17 16:14shniNote Added: 0001393
2017-04-27 23:24lucNote Added: 0001397

Notes
(0001392)
BuBu   
2017-04-14 02:06   
Tested your example URL with current dev version, and script was successful filtered.
In Feb. 2017 was a fix applied to this subject.
https://github.com/yacy/yacy_search_server/commit/f254fcfc67d0ed8c585987c4815c5da885a1159f [^]

The above commit was after Version 1.92/9000 was published.
(0001393)
shni   
2017-04-17 16:14   
So it's already fixed. Thanks for clarifying this!

How long would it usually take until the official packages are updated? I mean, is there a fixed release cycle one can rely on, or should I better try build it myself?
(0001397)
luc   
2017-04-27 23:24   
As far as I know the releases cycle is not fixed, but rather relies on the amount of modifications/bug fixes and the time available for maintainer(s) to officially release a new version.

If you whish to always run the latest version you've better regularly build yourself from sources on the github repository, or run YaCy as a docker container (images built automatically from the latest sources are available here : https://hub.docker.com/r/luccioman/yacy/ [^]).

And for this specific bug fix, a developer release including it is already available here : https://github.com/luccioman/yacy_search_server/releases/tag/Release_1.92.9159-dev [^]