Anonymous | Login | Signup for a new account | 2021-04-16 04:30 CEST | ![]() |
Main | My View | View Issues | Change Log | Roadmap |
View Issue Details [ Jump to Notes ] | [ Issue History ] [ Print ] | ||||||||
ID | Project | Category | View Status | Date Submitted | Last Update | ||||
0000692 | YaCy | [All Projects] General | public | 2016-09-25 11:18 | 2016-09-27 19:46 | ||||
Reporter | luc | ||||||||
Assigned To | BuBu | ||||||||
Priority | normal | Severity | minor | Reproducibility | always | ||||
Status | resolved | Resolution | fixed | ||||||
ETA | none | ||||||||
Platform | OS | Microsoft Windows | OS Version | 10 | |||||
Product Version | |||||||||
Target Version | Fixed in Version | ||||||||
Summary | 0000692: Intranet mode : duplicates MS Windows file URLs | ||||||||
Description | In intranet mode, various Microsoft Windows file scheme URL variants are not detected as duplicates. Examples : - file://V:/Test/image.jpg [^] - file:///V:/Test/image.jpg [^] - file:///V:Test/image.jpg [^] | ||||||||
Steps To Reproduce | - Run YaCy in Intranet mode on a Microsoft Windows OS - Start a new crawl (/CrawlStartSite.html) with a starting point such as : "file://V:\Test" [^] - Documents in this folder are indexed - Start new crawls starting at the same URL but written differently, such as : "file:///V:/Test" [^] or "file:///V:Test" [^] - Documents are re-indexed but not detected as already in the index - Search something in the indexed documents : it produces duplicated results with the various URLs flavours | ||||||||
Tags | No tags attached. | ||||||||
Attached Files | |||||||||
![]() |
|
(0001306) BuBu (developer) 2016-09-25 22:10 |
Not solving the main case but improving a little bit on mixed notation (c:\tmp\test.txt vs. c:\tmp/test.txt ) with this commit https://github.com/yacy/yacy_search_server/commit/6f8c3ccea4cc70368c2f4dda989e27365eb4e860 [^] |
(0001307) luc (reporter) 2016-09-27 08:05 |
Thank you BuBu. And with this complementary commit (https://github.com/yacy/yacy_search_server/commit/1bb0b135ac5dab0adab423d89612f7b1e13f2e61 [^]) the described use cases are fixed. Tested on MS Windows 10. Non regresionn testing on Debian Jessie. |
![]() |
|||
Date Modified | Username | Field | Change |
2016-09-25 11:18 | luc | New Issue | |
2016-09-25 22:10 | BuBu | Note Added: 0001306 | |
2016-09-27 08:05 | luc | Note Added: 0001307 | |
2016-09-27 19:46 | BuBu | Status | new => resolved |
2016-09-27 19:46 | BuBu | Resolution | open => fixed |
2016-09-27 19:46 | BuBu | Assigned To | => BuBu |
Copyright © 2000 - 2021 MantisBT Team |