Craig Crossman photo Craig Crossman
National Newspaper Computer Columnist

Click here to see the index of Craig's columns.

COMPARER TOOLS HELP FIND DUPLICATES

The cost of storage these days has become extremely affordable. With terabyte hard drives selling for less than $200 and 2 terabyte drives available for just a little bit more, you really don't have to worry so much about running out of storage space like we used to with our puny megabyte models. I remember learning about the first gigabyte drive and how amazing it was. A thousand megabytes on one hard disk drive? It was truly awesome. History repeats itself with the terabyte drive. A thousand gigabytes on one hard disk drive? It was truly awesome. And so it continues with a 1.5 and 2 terabyte model already being sold.

With each generation, I also remember thinking that owning such a large capacity drive would be all the storage I'd ever need. But as operating systems and applications got larger, the need for more storage grew correspondingly. So what takes up the most room on a typical user's hard drive these days? Usually it's digital photographs, music and videos. To combat the enormous sizes and the storage space these data files can gobble up, we use various types of compression. For example, raw music files can be paired down to manageable sizes using mp3. And digital image sizes are reduced using the JPEG compression algorithm. But there's one other culprit that can insidiously rob us of precious storage space and until recently there was no easy way to combat it. I'm talking about duplicates.

Duplicate files are easily detected if their names happen to be the same or if the files contain some kind of embedded descriptive tag. The trouble begins when you have files of the same thing that are named differently and have dissimilar or no tags within them. If you only have a few files, it's no big deal to find the duplicates but what if your growing collection of
mp3 files and digital photos are in the hundreds, even thousands? We certainly have the hard drive capacities to store all that stuff but duplicates are not only wasteful of space, they can also lead to confusion and a myriad of misfiling problems. And what about similar images? You may not want 4 pictures of the same thing you quickly took at just a slightly different angle. But when you downloaded everything from that 2 gigabyte memory card in your camera a few weeks later, you may not remember those similar images you took. But now there are two applications available from Bolide Software that can help you single everything out.

Audio Comparer is a program that actually "listens" to audio files. Using some fairly sophisticated artificial intelligence (AI) algorithms, Audio Comparer can listen to audio files at about a second per music file. At that rate, combing through all of your music should be pretty quick. Since the most frequent type of duplication is having the same music stored in a different compression format such as mp3, WMA and OGG files, it's a good thing that Audio Comparer works with all these formats. After doing its listening check, it displays them in different audio groups making duplicates easy to identify. Then it's just a matter of listening to them as a confirmation before deleting.

Image Comparer breezes though your digital photos in much the same way. Using a content-based image search AI technique, Image Comparer "looks" at your images and decides what may be a duplicate and won't be fooled if the images are rotated or reversed. However with Image Comparer you can actually adjust the level of visual similarity to let you further constrain or relax the results. This comes in handy when you want to eliminate photos with similar content like those 4 photos you quickly snapped of the same thing. To speed things up, it can automatically select the best image from the similar duplicates for you making the weeding out process even faster. As with audio, images come in a wide variety of file formats that can lead to accidental duplication and Image Comparer works with most of them.

Audio Comparer ($29.95) and Image Comparer ($34.95) require and work with most versions of Windows. Get them both as they make a perfect match.

www.bolidesoft.com

 

Craig Crossman is a McClatchy-Tribune newspaper columnist writing about computers and technology. He also hosts the nation's longest running nationally syndicated radio talk show on computers and technology, Computer America, heard on both the Business TalkRadio Network® and the Lifestyle TalkRadio Network®, weeknights at 10PM Eastern time.  Visit his website at http://www.computeramerica.com

Jump to:
[ Index of Craig's Columns | Main Columns Page | Computer America Home Page ]