YaCy-Bugtracker

View Issue Details
ID: 0000692
Project: YaCy
Category: [All Projects] General
View Status: public
Date Submitted: 2016-09-25 11:18
Last Update: 2016-09-27 19:46
Reporter: luc
Assigned To: BuBu
Priority: normal
Severity: minor
Reproducibility: always
Status: resolved
Resolution: fixed
ETA: none
Platform:
OS: Microsoft Windows
OS Version: 10
Product Version:
Target Version:
Fixed in Version:
Summary: 0000692: Intranet mode: duplicate MS Windows file URLs
Description: In intranet mode, different spellings of the same Microsoft Windows file-scheme URL are not detected as duplicates. Examples:
 - file://V:/Test/image.jpg
 - file:///V:/Test/image.jpg
 - file:///V:Test/image.jpg
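
One way to make the three variants above compare equal is to rewrite each of them to a single canonical spelling before it is used as an index key. The following is a minimal Java sketch of that idea. The class name FileUrlNormalizer and the method normalize are hypothetical, and this is not the code from the YaCy commits referenced in this issue; it only assumes that a Windows file URL starts with "file:", followed by any number of slashes or backslashes and a drive letter.

```java
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of YaCy.
public class FileUrlNormalizer {

    // "file:", any run of '/' or '\', a drive letter, ':',
    // any run of '/' or '\', then the rest of the path.
    private static final Pattern WINDOWS_FILE_URL = Pattern.compile(
            "^file:[/\\\\]*([A-Za-z]):[/\\\\]*(.*)$", Pattern.CASE_INSENSITIVE);

    /**
     * Rewrites a Windows file URL to the form file:///X:/path.
     * URLs that do not contain a drive letter are returned unchanged.
     */
    public static String normalize(String url) {
        Matcher m = WINDOWS_FILE_URL.matcher(url);
        if (!m.matches()) {
            return url; // not a Windows drive URL, leave as-is
        }
        String drive = m.group(1).toUpperCase(Locale.ROOT);
        String path = m.group(2).replace('\\', '/'); // unify separators
        return "file:///" + drive + ":/" + path;
    }
}
```

With this sketch, all three example URLs map to file:///V:/Test/image.jpg, so a duplicate check based on the normalized string would treat them as one document. Upper-casing the drive letter is a design choice of the sketch, made because Windows drive letters are case-insensitive.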
Steps To Reproduce:
- Run YaCy in intranet mode on a Microsoft Windows OS.
- Start a new crawl (/CrawlStartSite.html) with a starting point such as "file://V:\Test".
- Documents in this folder are indexed.
- Start new crawls from the same location written differently, such as "file:///V:/Test" or "file:///V:Test".
- Documents are re-indexed instead of being detected as already in the index.
- Search for something in the indexed documents: the results contain duplicates, one for each URL variant.
Tags: No tags attached.

Notes
(0001306)
BuBu (developer)
2016-09-25 22:10

This does not solve the main case, but it slightly improves handling of mixed notation
(c:\tmp\test.txt vs. c:\tmp/test.txt)
with this commit:
https://github.com/yacy/yacy_search_server/commit/6f8c3ccea4cc70368c2f4dda989e27365eb4e860
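
The mixed-notation case mentioned in this note can be illustrated with a small sketch: if every backslash is replaced by a forward slash before comparison, both spellings of the path become the same string. The class name PathSeparators and the method unify are hypothetical and are not taken from the linked commit.

```java
// Hypothetical helper for illustration only; not taken from the linked commit.
public class PathSeparators {

    /** Replaces every backslash with a forward slash so that mixed notations compare equal. */
    public static String unify(String path) {
        return path.replace('\\', '/');
    }
}
```

Here unify applied to c:\tmp\test.txt and to c:\tmp/test.txt gives c:/tmp/test.txt in both cases.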
(0001307)
luc (reporter)
2016-09-27 08:05

Thank you BuBu.
With this complementary commit (https://github.com/yacy/yacy_search_server/commit/1bb0b135ac5dab0adab423d89612f7b1e13f2e61), the described use cases are fixed.
Tested on MS Windows 10.
Non-regression testing done on Debian Jessie.

Issue History
Date Modified | Username | Field | Change
2016-09-25 11:18 | luc | New Issue |
2016-09-25 22:10 | BuBu | Note Added: 0001306 |
2016-09-27 08:05 | luc | Note Added: 0001307 |
2016-09-27 19:46 | BuBu | Status | new => resolved
2016-09-27 19:46 | BuBu | Resolution | open => fixed
2016-09-27 19:46 | BuBu | Assigned To | => BuBu

