Click here to Skip to main content
6,594,432 members and growing! (16,133 online)
Email Password   helpLost your password?
General Programming » String handling » General     Beginner License: The Code Project Open License (CPOL)

Easy String Compression and Encryption

By BTDex

Class module to easily de-/compress and en-/decrypt big strings
VB (VB 8.0, VB 9.0), .NET (.NET 2.0, .NET 3.0, .NET 3.5), ASP.NET, Visual Studio (VS2005), WinForms, WebForms, Dev
Version:2 (See All)
Posted:1 Jul 2008
Views:13,980
Bookmarked:19 times
Announcements
Loading...
 
Search    
Advanced Search
Add to IE Search
printPrint   add Share
      Discuss Discuss   Broken Article?Report  
12 votes for this article.
Popularity: 3.70 Rating: 3.43 out of 5
2 votes, 16.7%
1
1 vote, 8.3%
2
1 vote, 8.3%
3
4 votes, 33.3%
4
4 votes, 33.3%
5

Introduction

This article discusses string compression with optional decent encryption with pure VB.NET code, and no external tools required.

It can easily be integrated into existing projects. As the code is kept simple, it's suitable for beginners and a conversion to C# can be done easily.

Background

In need of a routine to quickly and safely deflate and inflate big strings, I searched the net for a solution. A comprehensive set of functionalities didn't show up, so I decided to write this class module, which encapsulates all the functionality needed to complete the task.

Using the Code

Although strings of any length can be applied to the process, the compression of short strings (i.e. 'Hello World!') is counterproductive as it results in even bigger compressed counterparts. The CompressionRatio property of the class tells you how effective the compression was. You can decide then, if you want to use the compressed string and if so, prefix and suffix can be automatically applied to it, to distinguish between compressed and uncompressed content afterwards.

Process overview:

Plain text -> to byte array -> gzip compression -> encryption -> to base64 string = shrinked text

shrinked text -> to byte array -> decryption -> gzip decompression -> to string = plain text

The code is simple to use. Here's the quick way to compress a string:

    Dim CompStr As New clsCompressedString(System.Text.Encoding.UTF8)

    CompStr.UnCompressed = "some large text content..."

    MsgBox "The compressed string is: " & CompStr.Compressed  

... and the way back:

    Dim CompStr As New clsCompressedString(System.Text.Encoding.UTF8)

    CompStr.Compressed = "..." '<- the previously compressed string

    MsgBox "The uncompressed string is: " & CompStr.UnCompressed

Error handling is kept at the minimum. The class returns empty strings when fed with corrupt data or supplied with wrong passphrase.

Optional encryption is performed by utilizing the .NET built-in RijndaelManaged at maximum key length and simplified usage: You just need to provide a single passphrase for encryption and decryption. Encryption key and iv are generated based on the passphrase by using SHA256 and MD5 hash value generation.

The demo project shows all features available.

Points of Interest

With string conversions involved, text encoding has to be addressed properly. Otherwise some or all characters could get messed up in the process of compression/decompression, depending on what content you try to compress/decompress.

Why Not Use ICSharpCode.SharpZipLib?

Well, you can easily alter the compression routines in the class to use ZipLib. I experimented with that and it showed, that ZipLib (0.85.4.369) is only up to 7% more efficient than the built in GZip. To get this slight better performance, you have to set ZipLib to the highest compression level (9). But that comes with a price: ZipLib at highest level is very slow compared to GZip and therefore takes several times longer to compress a huge string. So I prefer GZip for this task as it is fast, reliable and doesn't require to link to additional DLLs and I don't run into licensing and security issues by using comprehensive third party code.

Preferences could possibly change when it comes to binary file compression. Maybe then ZipLib outruns GZip - but binary file compression was not the assigned task in this case.

History

  • 1st July, 2008: This is the first version. Participate and help to optimize and extend the code.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

About the Author

BTDex


Member

Location: Germany Germany

Other popular String handling articles:

Article Top
You must Sign In to use this message board.
FAQ FAQ 
 
Noise Tolerance  Layout  Per page   
 Msgs 1 to 9 of 9 (Total in Forum: 9) (Refresh)FirstPrevNext
GeneralMini-Bug? PinmemberFridolin4:53 22 Sep '09  
GeneralRe: Mini-Bug? Pinmemberjessehogan7:52 8 Oct '09  
GeneralThanks! PinmemberMember 195161021:37 2 Jun '09  
GeneralParadox PinmemberMr.PoorEnglish0:29 18 Jul '08  
GeneralRe: Paradox PinmemberBTDex4:46 20 Jul '08  
GeneralRe: Paradox [modified] PinmemberMr.PoorEnglish11:44 20 Jul '08  
GeneralRe: Paradox PinmemberBTDex14:09 22 Jul '08  
Generalok, I see PinmemberMr.PoorEnglish22:20 22 Jul '08  
GeneralGZip... PinmemberFrankidoze5:32 5 Jul '08  

General General    News News    Question Question    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

PermaLink | Privacy | Terms of Use
Last Updated: 1 Jul 2008
Editor: Deeksha Shenoy
Copyright 2008 by BTDex
Everything else Copyright © CodeProject, 1999-2009
Web21 | Advertise on the Code Project