YaCy-Bugtracker - YaCy
View Issue Details
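ID: 0000671
Project: YaCy
Category: [All Projects] General
View Status: public
Date Submitted: 2016-07-12 02:30
Last Update: 2016-07-16 02:02
Reporter: BuBu
Assigned To: BuBu
Priority: normal
Severity: minor
Reproducibility: always
Status: resolved
Resolution: fixed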
none 
Windows
YaCy 1.8 
 
0000671: Crawling file:// URIs on Windows creates duplicate index entries
On Windows (in intranet mode), a file system can be crawled using either path style, e.g.
file:///C:\tmp\test\ on one day, or
file:///C:/tmp/test/ on another.

The document hashes differ depending on whether forward slashes or backslashes are used,
so each file ends up in the index twice, each copy with a different index.document.hash.

On Windows, backslash paths should probably be normalized to avoid such duplicates of the same resource.
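
The normalization suggested above could look roughly like the following sketch. It is only an illustration: the class FileUrlNormalizer and its method normalize are hypothetical names, not part of YaCy, and the choice of forward slashes as the canonical form is an assumption. The actual fix in YaCy may work differently.

```java
/**
 * Hypothetical helper (not YaCy code): maps both spellings of a
 * Windows file: URL to one canonical string before hashing.
 */
final class FileUrlNormalizer {

    private FileUrlNormalizer() {
        // static utility, no instances
    }

    /**
     * Replaces every backslash with a forward slash, but only for
     * URLs using the file: scheme (compared case-insensitively).
     * Other URLs and null are returned unchanged.
     * Assumption: forward slashes are the canonical form.
     */
    static String normalize(final String url) {
        if (url == null) {
            return null;
        }
        final String scheme = "file:";
        if (url.regionMatches(true, 0, scheme, 0, scheme.length())) {
            return url.replace('\\', '/');
        }
        return url;
    }
}
```

With this sketch, file:///C:\tmp\test\ and file:///C:/tmp/test/ both become file:///C:/tmp/test/, so a hash computed from the normalized string would be the same for both. Since a backslash can be a legal file name character on other systems, a real implementation would likely apply this on Windows only.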
No tags attached.
Issue History
2016-07-12 02:30  BuBu  New Issue
2016-07-16 02:02  BuBu  Note Added: 0001262
2016-07-16 02:02  BuBu  Status: new => resolved
2016-07-16 02:02  BuBu  Resolution: open => fixed
2016-07-16 02:02  BuBu  Assigned To: => BuBu

Notes
(0001262)
BuBu   
2016-07-16 02:02   
commit https://github.com/yacy/yacy_search_server/commit/87fcfc6d7854531abde4ff27df566ce4f4379738