Gnutella Forums  


  #1 (permalink)  
Old October 11th, 2002
Unregistered
Guest
 
Posts: n/a
Hashing - with a (hash)pipe?

I'm just wondering...

I started up LimeWire about 3 hours ago, defined my shares, and left the computer "idle" with no workload but the hashing of the files.

It has gotten through about 6500 files so far and is still not complete, with about 1500 to go.

My computer is a 1.1 GHz AMD with 256 MB RAM and ATA100/Ultra SCSI disks, currently running a Win32 system (XP) with Java VM 1.31, and frankly:
**** I don't quite see why this should take so long?! ****

It's a little annoying. I'm not saying that the Java I/O interface deserves a pat on the back for speed and agility, but what the hell?!

I notice that it takes longer in directories with "huge" files. Do you read the entire file? If so, WHY are you doing this?
Isn't it enough to read the header of the file?

Are there any alternatives to the Java approach that could be incorporated in a standalone (REAL (C/C++/asm)) program?

By the way: the client is on a good path. A few months ago I found it rather unusable! KEEP UP THE GOOD WORK!

Best regards from a
Slightly bored user
  #2 (permalink)  
Old October 18th, 2002
Connoisseur
 
Join Date: April 26th, 2002
Posts: 328
Treatid is flying high
Default

Hashing does read every byte of every file that you are sharing.

The idea of hashing is to identify whether two files are identical or not.

The hard way of testing two files to see if they are identical is to compare each byte of each file. This is impractical if one of the files is sitting on a remote machine.

Hashing reads each byte of a file and creates a number based on those bytes. Two identical files will generate the same number (hash). Two different files (even if they are only a little different) will (probably) generate different numbers.

Now, a comparison between two files can be made by comparing the numbers rather than the whole file.
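
To make this concrete, here is a minimal Python sketch (not LimeWire's actual Java code) of hashing a file by reading every byte in chunks. The helper names `sha1_of_file`, `sha1_of_header`, and `demo`, the 64 KiB chunk size, and the 1 KiB "header" size are all assumptions chosen for illustration. The demo also suggests why hashing only the header is not enough: two files that differ only in their last byte look identical if you stop reading early.

```python
import hashlib
import os
import tempfile


def sha1_of_file(path, chunk_size=64 * 1024):
    """Return the hex SHA-1 of a file, reading it chunk by chunk."""
    digest = hashlib.sha1()
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:  # empty read means end of file
                break
            digest.update(chunk)
    return digest.hexdigest()


def sha1_of_header(path, header_size=1024):
    """Hash only the first header_size bytes (hypothetical shortcut)."""
    digest = hashlib.sha1()
    with open(path, "rb") as f:
        digest.update(f.read(header_size))
    return digest.hexdigest()


def demo():
    """Compare full-file and header-only hashing on three small test files."""
    with tempfile.TemporaryDirectory() as tmp:
        a = os.path.join(tmp, "a.bin")
        b = os.path.join(tmp, "b.bin")
        c = os.path.join(tmp, "c.bin")
        payload = b"x" * 200_000
        for path, data in ((a, payload), (b, payload), (c, payload[:-1] + b"y")):
            with open(path, "wb") as f:
                f.write(data)
        return {
            # a and b have identical contents
            "identical_match": sha1_of_file(a) == sha1_of_file(b),
            # c differs from a only in its final byte
            "changed_differs": sha1_of_file(a) != sha1_of_file(c),
            # the header-only hash cannot see that difference
            "header_only_misses_change": sha1_of_header(a) == sha1_of_header(c),
        }


if __name__ == "__main__":
    print(demo())
```

Reading in chunks keeps memory use flat for huge files, but the total time still grows with the number of bytes on disk, which fits the observation that directories with large files take longer.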

Mark
  #3 (permalink)  
Old October 20th, 2002
Connoisseur
 
Join Date: January 2nd, 2002
Location: Ooltewah TN
Posts: 495
Norm is flying high
Default

Mark:

Interesting. If I understand what you are saying, LimeWire will identify, say, a 6,707 byte file on your computer as the same file as a 6,707 byte file on my computer. If so, it must also check the filenames for identical or at least some common words, to prevent same-length but totally different files from appearing alike and a download being split across two totally different files.

Norm
  #4 (permalink)  
Old October 20th, 2002
Unregistered
Guest
 
Posts: n/a
Default

not quite, norm.

hashing is more or less a relic (although relic is the wrong word) from cryptography. one example of a bad way of hashing would be to add up every byte of the file multiplied by the position of that byte. there are many different ways to hash a file; most clients today support an SHA-1 hash, which is computed from the file's contents (not its size) and is designed so that two different files are extremely unlikely to produce the same value. so, one file that's 500MB wouldn't have the same hash as another file that's 500MB unless they're exactly the same file. this is completely disregarding the file name.

the very nice thing with hashes is that people can change the filenames of files, but limewire (and other clients) will still be able to identify the specific file.
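
a rough python sketch of both points, under stated assumptions: `weak_hash` is a made-up toy checksum (each byte times its position, summed), not something any client actually uses, and `group_by_hash` plus the file names are hypothetical. it shows two different same-size "files" colliding under the toy checksum but not under SHA-1, and a renamed copy still being matched by its hash.

```python
import hashlib


def weak_hash(data: bytes) -> int:
    """Toy checksum: sum of each byte times its 1-based position."""
    return sum(b * (i + 1) for i, b in enumerate(data)) % (2 ** 32)


def sha1_hex(data: bytes) -> str:
    """Hex SHA-1 digest of the given bytes."""
    return hashlib.sha1(data).hexdigest()


def group_by_hash(shared: dict) -> dict:
    """Map each SHA-1 digest to the sorted file names that carry it."""
    groups = {}
    for name, data in shared.items():
        groups.setdefault(sha1_hex(data), []).append(name)
    return {digest: sorted(names) for digest, names in groups.items()}


# Two different two-byte "files" of the same size.
file_one = bytes([2, 0])  # 2*1 + 0*2 = 2
file_two = bytes([0, 1])  # 0*1 + 1*2 = 2

# The toy checksum cannot tell them apart; SHA-1 can.
weak_collides = weak_hash(file_one) == weak_hash(file_two)
sha1_differs = sha1_hex(file_one) != sha1_hex(file_two)

# Hypothetical shared folder: one file, a renamed copy, and something else.
shared = {
    "song.mp3": b"same audio bytes",
    "renamed_song.mp3": b"same audio bytes",
    "other.mp3": b"different bytes!",
}
groups = group_by_hash(shared)

if __name__ == "__main__":
    print(weak_collides, sha1_differs)
    for digest, names in groups.items():
        print(digest, names)
```

the file names never enter the hash, so the renamed copy lands in the same group as the original.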
  #5 (permalink)  
Old October 26th, 2002
Apprentice
 
Join Date: October 26th, 2002
Location: Here
Posts: 5
Muati is flying high
Default

Another reason that LimeWire takes so long to hash files is that it doesn't use 100% of your CPU for file hashing. It is more like 50%.