Click here to Skip to main content
15,879,239 members
Articles / Programming Languages / C#

C# Use Zip Archives without External Libraries

Rate me:
Please Sign up or sign in to vote.
4.92/5 (80 votes)
12 Jun 2011CPOL3 min read 295.8K   8K   150   125
C# use Zip archives without external libraries

Introduction

I found a lot of articles on how to access Zip archives in C# but all with significant disadvantages. The main problem is that Microsoft has Zip archives implemented in the operating system but there is no official API that we can use. In C# for example, we have the System.IO.Compression.GZip but there is no adequate System.IO.Compression.Zip class.

There are some free .NET compression libraries like SharpZipLib and .NET Zip Library, but this leads to additional installation effort and licensing problems.

It is also possible to use the free J# Library. J# has included Zip to keep compatible with the Java libraries. But to bundle a 3.6 MB DLL vjslib.dll, just to support Zip, seems like a really goofy hack.

Since .NET 3.0, we can use the System.IO.Packaging ZipPackage class in WindowsBase.DLL. It's just 1.1 MB, and it just seems to fit a lot better than importing Java libraries.

Problem only that the ZipPackage class isn't a generic Zip implementation, it's a packaging library for formats like XPS and Office Open XML that happen to use Zip.

To access simple Zip archives with ZipPackage fails because the content is checked for Package conventions.

For example, there has to be a file [Content_Types].xml in the root and only files with specified extensions are accessible. Filenames with special characters and spaces are not allowed and the access time is not the best because of the additional Package link logic.

However, the assembly WindowsBase.DLL is preinstalled and the generic Zip implementation is inside. The only problem is that the generic Zip classes are not public and visible for the programmers. But there is a simple way to get access to this hidden API and I wrote a small wrapper class for this.

Background

A quick check in the Object Browser shows us that WindowsBase.DLL has a namespace MS.Internal.IO.Zip. This sounds good, but there are no public classes visible.

However, the following call:

C#
var types = typeof(System.IO.Packaging.Package).Assembly.GetTypes();

gives us 824 class types, public and non-public and especially one with the name MS.Internal.IO.Zip.ZipArchive. Now it is easy to get this special class type and the methods and properties:

C#
var type = typeof(System.IO.Packaging.Package).Assembly.GetType
		("MS.Internal.IO.Zip.ZipArchive");
var static_methodes = type.GetMethods(BindingFlags.Static | 
		BindingFlags.Public | BindingFlags.NonPublic);
var nostatic_methodes = type.GetMethods(BindingFlags.Instance | 
		BindingFlags.Public | BindingFlags.NonPublic);

and we get the most important methods:

C#
static ZipArchive OpenOnFile(string path, FileMode mode, 
	FileAccess access, FileShare share, bool streaming);
static ZipArchive OpenOnStream(Stream stream, FileMode mode, 
	FileAccess access, bool streaming);
ZipFileInfo AddFile(string path, 
	CompressionMethodEnum compmeth, DeflateOptionEnum option);
ZipFileInfo GetFile(string name);
ZipFileInfo DeleteFile(string name);
ZipFileInfoCollection GetFiles();
void Dispose();

The same procedure for ZipFileInfo and we get:

C#
Stream GetStream(FileMode mode, FileAccess access);

and properties like: Name, LastModFileDateTime, FolderFlag...
This is all what we need to implement a small wrapper class and access over Reflection:

C#
class ZipArchive : IDisposable
{
  private object external;
  public static ZipArchive OpenOnFile
      (string path, FileMode mode, FileAccess access, FileShare share, bool streaming)    
  {
    var type = typeof(System.IO.Packaging.Package).Assembly.GetType
		("MS.Internal.IO.Zip.ZipArchive");
    var meth = type.GetMethod("OpenOnFile", BindingFlags.Static | 
		BindingFlags.Public | BindingFlags.NonPublic);
    return new ZipArchive { external = meth.Invoke(null, new object[] 
		{ path, mode, access, share, streaming }) };
  } 
  //...
  public class ZipFileInfo //...
}

The complete ZipArchive wrapper implementation is in the demo project ZipArchiveTest in Program.cs.
Only 97 lines for this class and we can use it in a code sequence like this:

C#
var str = new MemoryStream();

//create some files:
using (var arc = ZipArchive.OpenOnStream(str))
{
  var doc1 = new XDocument(new XElement
	("root", new XElement("item"), new XElement("item"), new XElement("item")));
  var doc2 = new XDocument(new XElement("root", Enumerable.Repeat
		("item", 1000).Select(p => new XElement(p))));
  using (var fs = arc.AddFile("test1.xml").GetStream
		(FileMode.Open, FileAccess.ReadWrite)) doc1.Save(fs);
  using (var fs = arc.AddFile("test2.xml").GetStream
		(FileMode.Open, FileAccess.ReadWrite)) doc2.Save(fs);
}

// read the files:
using (var arc = ZipArchive.OpenOnStream(str))
{
  var doc1 = XDocument.Load(arc.GetFile("test1.xml").GetStream());
  var doc2 = XDocument.Load(arc.GetFile("test2.xml").GetStream());
  var doc3 = XDocument.Load(arc.GetFile("dir/test3.xml").GetStream());
}

Using the Demo

The demo program ZipArchiveTest using the ZipArchive class is as small as possible and can show the content of Zip archives without any restrictions. Double click to files in the ListBox opens a new window to show the file content as text.

Image 1

Conclusion

Microsoft should publish its hidden ZipArchive class, but till then, we can use such a simple wrapper to save terabytes of data worldwide.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Germany Germany
This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Comments and Discussions

 
GeneralRe: My vote of 5 Pin
Ralf_14-Mar-13 0:30
Ralf_14-Mar-13 0:30 
GeneralRe: My vote of 5 Pin
D. Christian Ohle14-Mar-13 6:13
D. Christian Ohle14-Mar-13 6:13 
GeneralRe: My vote of 5 Pin
Ralf_14-Mar-13 23:58
Ralf_14-Mar-13 23:58 
GeneralMy vote of 5 Pin
DanielSheets11-Mar-13 4:58
DanielSheets11-Mar-13 4:58 
GeneralMy vote of 5 Pin
Jaap Lamfers12-Feb-13 8:32
Jaap Lamfers12-Feb-13 8:32 
Suggestion.NET 3.5 version Pin
StehtimSchilf11-Feb-13 8:49
StehtimSchilf11-Feb-13 8:49 
GeneralThanks! Pin
rbignu5-Jan-13 1:44
rbignu5-Jan-13 1:44 
QuestionAwesome! Pin
Wildbird3-Jan-13 10:04
Wildbird3-Jan-13 10:04 
GeneralMy vote of 5 Pin
Awchie13-Nov-12 15:28
Awchie13-Nov-12 15:28 
GeneralThanks! High 5 Dude Pin
Awchie13-Nov-12 15:21
Awchie13-Nov-12 15:21 
GeneralMy vote of 5 Pin
gelo_one12-Nov-12 23:07
gelo_one12-Nov-12 23:07 
GeneralThanks Pin
TheGrandBazaar12-Nov-12 4:21
TheGrandBazaar12-Nov-12 4:21 
GeneralMy vote of 5 Pin
Elron022-Nov-12 13:28
Elron022-Nov-12 13:28 
exactly what i wanted! Thank you.
GeneralMy vote of 5 Pin
Collin Heine21-Sep-12 7:57
Collin Heine21-Sep-12 7:57 
GeneralMy vote of 5 Pin
Mark Lemke12-Sep-12 0:57
Mark Lemke12-Sep-12 0:57 
QuestionNo file compression ?? Pin
BillJam119-Aug-12 8:25
BillJam119-Aug-12 8:25 
AnswerRe: No file compression ?? Pin
BillJam119-Aug-12 8:49
BillJam119-Aug-12 8:49 
QuestionProblem with non ascii characters in file names Pin
Nyarlatotep18-Jun-12 7:45
Nyarlatotep18-Jun-12 7:45 
AnswerRe: Problem with non ascii characters in file names Pin
D. Christian Ohle19-Jun-12 6:54
D. Christian Ohle19-Jun-12 6:54 
GeneralRe: Problem with non ascii characters in file names Pin
Mark Lemke11-Sep-12 8:34
Mark Lemke11-Sep-12 8:34 
GeneralRe: Problem with non ascii characters in file names Pin
D. Christian Ohle11-Sep-12 23:56
D. Christian Ohle11-Sep-12 23:56 
Questionfs.GetStream gives a zero byte length Pin
James Garfield10-Feb-12 4:17
James Garfield10-Feb-12 4:17 
AnswerRe: fs.GetStream gives a zero byte length Pin
D. Christian Ohle15-Feb-12 4:26
D. Christian Ohle15-Feb-12 4:26 
GeneralRe: fs.GetStream gives a zero byte length Pin
James Garfield23-Feb-12 9:37
James Garfield23-Feb-12 9:37 
Questionhandy Pin
BillW3315-Jul-11 6:08
professionalBillW3315-Jul-11 6:08 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.