Click here to Skip to main content
Click here to Skip to main content

A Fast/Compact Serialization Framework

, 13 Oct 2010 GPL3
Rate this:
Please Sign up or sign in to vote.
A framework for object serializiation/deserialization that is many times faster and yields a compact output.

Introduction

NxSerialization is an easy to use object serialization framework that replaces the functionality of the default serialization providers for .NET and Mono. The binary formatter for NxSerialization can be up to 50 times faster than the default binary formatter for .NET and Mono. This is evident from the screenshot of the benchmark application shown above. There are three main benefits this framework provides to applications that serialize objects. The main benefits being that of increased space and time performance, and enhanced security comes as a byproduct.

Quick Facts

The figure below contains benchmark results using the sample application shipped with NxSerialization for CLI. The important values are given in bold. The time measured was for 100 iterations of 100 runs each. In each run, an object of the specified type was serialized and then deserialized. These results may vary depending upon the system configuration; however, the important thing to consider is the relative difference or the performance factors between the native and the NxSerializer.

Warning: These stats are from the previous release, and do not reflect comparison with latest native formatters.

Size based comparison of .NET and NxSerialization formatters

Size based comparison of .NET and NxSerialization formatters

Time based comparison of .NET and NxSerialization formatters

Time based comparison of .NET and NxSerialization formatters

What is New in 3.0?

There is nothing substantially new in this release, except the inclusion of the Remoting sub-system and a few unfinished features. After an extended long period of inactivity and quite some queries to release the Remoting specific portions, I have finally decided to release all that I had in my dev folders, and it's probably going to be the last release ever.

An interesting observation is that the latest versions of CLR have much improved native formatters, and what used to be on the average >5 times speed gain in the past is now much reduced. The stats above are therefore not representative of comparison with latest .NET versions. It also follows that the toolkit has probably seen its time Smile | :)

Unfinished Features

EAR - (Emit Avoid Reflection)

Some of the surrogates have an EAR property that when set uses dynamic IL to facilitate creation of objects and avoids the abhorred Activator.CreateInstance that is not known to be a super-fast way. The support is in early stages, and not rigorously tested, and therefore issues may popup. Moreover, there is no way to configure EAR externally, and source modifications are needed should you want to try it.

Remoting

The ability to use NxSerialization in Remoting sinks should theoretically speedup Remoting code - though the network latency may overshadow it - but surprisingly, the results have always been quite the opposite (which is why I never released it). There are also issues with HTTP channels (some functionality is missing), as well as Channel security that does not work at all.

Surrogates for System.Data.*

Still unimplemented - even though a straightforward task.

I would love to know if anyone still finds it useful and could spot the shortcomings in Remoting slowdown and suggest a fix. As always, your feedback is highly welcome!

Using the Framework

Application objects can be integrated with the framework in two ways. By writing a surrogate for the object type and registering the surrogate with the framework, or by implementing INxSerializable. The framework provides a built-in surrogate for types that implement INxSerializable. For unknown types, native .NET serialization is used.

The following sample of code demonstrates a type that implements INxSerializable. Note the line at the bottom that registers the type with the framework.

// Sample class that implements INxSerializable
[Serializable]
class SampleCompactableClass : INxSerializable
{
   private String title = "SampleCompactableClass";

   void INxSerializable.Serialize(INxBinaryWriter w)
   {
      w.Write(title);
   }

   void INxSerializable.Deserialize(INxBinaryReader r)
   {
      title = r.ReadString();
   }
}

...
// Register the class with the framework.
NxFormatterServices.Default.RegisterKnownType(typeof(SampleCompactableClass));

The following sample of code demonstrates a sample surrogate for another type that does not implement INxSerializable. Using surrogates is the only way the framework is able to compactly serialize .NET native types.

// Sample surrogate for SampleSurrogatedClass
class SampleSurrogate : NxSerializationSurrogate
{
   public SampleSurrogate() : base(typeof(SampleSurrogatedClass)) {}

   public override object Read(INxBinaryReader r)
   {
      SampleSurrogatedClass obj = new SampleSurrogatedClass();
      obj.title = r.ReadString();
      return obj;
   }

   public override void Write(INxBinaryWriter w, object graph)
   {
      SampleSurrogatedClass obj = (SampleSurrogatedClass) graph;
      w.Write(obj.title);
   }
}

// Sample class that does not implement INxSerializable
[Serializable]
class SampleSurrogatedClass
{
   internal string title = "SampleSurrogatedClass";
}

...
// Register the surrogate with the framework.
NxTypeSurrogateSelectorNative.Default.Register(new SampleSurrogate());

Everything else is pretty much self-explanatory. For more information, look at the sample benchmark application provided with the source code.

Comments

Please note that for objects where the actual data size to type-info size ratio is very large, not much memory reduction will occur. Try a byte array of size 100K. It is also possible to come up with a case where the native serializer is actually more efficient in terms of CPU.

Among other possibilities with the framework are:

  • Enhanced security as custom serialization protects your object's data from prying eyes. Excluding the possibilities of complete reverse engineering, objects cannot be deserialized from persistent streams.
  • .NET CLR 1.x objects can be deserialized into 2.0 objects. Objects of type A can be deserialized to objects of type B etc.

History

OpenNxSerialization 2.0 (August 08, 2008)

Changes in this version include:

  • Arrays and collections serialization is now significantly faster.
  • New surrogates for a lot of built-in types.
  • Support for serialization of containers in the System.Collections.Generic namespace.
  • Support for serialization of BitVector32, BitArray and KeyValuePair objects.
  • Support for serialization of Type objects.
  • Surrogate redirection support now provided.
  • Dynamic (on the fly) surrogates now supported.
  • Major refactoring of the API.
  • Quite a few enhancements and utilities everywhere.

OpenNxSerialization 1.5 (March 12, 2008)

Changes in this version include:

  • NxFormatter now implements IRemotingFormatter.
  • New surrogates for a lot of built-in types.
  • Support for serialization of ISerializable objects.
  • Support for serialization of MarshalByRef objects.
  • Support for generic versions of SerializeAs and DeserializeAs functions.
  • Streaming context can now contain application specific items.
  • Quite a few enhancements and utilities everywhere.

OpenNxSerialization 1.0 (CompactSerialization 2.5) (July 21, 2007)

Once again, thanks to all contributors. Changes in this version include:

  • CompactSerialization 2.5 is now OpenNxSerialization 1.0.
  • Support for multiple instances of TypeSurrogateSelector.
  • Support for SerializeAs and DeserializeAs functions (faster and more compact).
  • Reader does not close the base stream.
  • Support to configure types using a config file.
  • Quite a few enhancements and utilities everywhere.

CompactSerialization 2.0 (May 17, 2006)

This has been possible due to the wonderful feedback I've received. Thanks to all contributors. Changes in this version include:

  • Support for .NET 2.0 Nullable types.
  • Circular and shared references are now handled wisely.
  • Support for permanent/hard type handles.
  • Support for enumerations, SortedList etc.
  • Major refactoring of the internal and public APIs.
  • Improved performance at places, and decreased at places Smile | :) .

CompactSerialization 1.0 (Feb 15, 2006)

  • Released the initial version of the framework.

License

This article, along with any associated source code and files, is licensed under The GNU General Public License (GPLv3)

Share

About the Author

.Shoaib
Architect
Pakistan Pakistan
Let a = b ....... (1)
a - b = a - b
a^2 - ab = a^2 - ab
a^2 - ab = a^2 - b^2 (from 1)
a (a - b) = (a + b) (a - b)
a = (a + b) ...... (2)
 
if a = 1
1 = (1 + 1) (from 1 & 2)
1 = 2 !!

Comments and Discussions

 
GeneralXmlSerialization PinmemberjklBruin_nyc3-Jul-08 4:18 
AnswerRe: XmlSerialization Pinmember.Shoaib3-Jul-08 20:17 
GeneralRe: XmlSerialization [modified] Pinmemberjboarman7-Jul-08 16:10 
QuestionRe: XmlSerialization Pinmember.Shoaib20-Jul-08 19:53 
AnswerRe: XmlSerialization Pinmemberjboarman21-Jul-08 7:22 
GeneralOpenNxSerialization within Shared Cache Pinmemberschuetz25-Apr-08 7:13 
AnswerRe: OpenNxSerialization within Shared Cache Pinmember.Shoaib28-Apr-08 22:32 
GeneralSurrogate Emitter PinmemberAaron Jackson11-Apr-08 9:01 
This is certainly some nice work. For my own data sets, the serialization has resulted in about a 50% reduction in the size of the payload. Even more impressive was the cost of serializing the structure which dropped so dramatically that I'm reviewing everything to make sure I didn't overlook something obvious.
 
One thing that could certainly be done to improve performance is to do a one-time pass over types when they are registered and emit a surrogate. Just for sanities sake I would probably use on general type and embed a DynamicMethod inside of it that's attached to the module or the type directly. This could result in object graphs built at runtime that run as if they were compiled (i.e. not using reflection).
 
Lastly, I saw you comment on integrating this with Remoting. While I'd love to say we're doing WCF at this point we're in a bit of a wait-and-see holding pattern. I think it would be fantastic if we could replace the BinaryFormatter with this, but as you pointed out replacing the IClientFormatterSink and IServerFormatterSink are a tedious job. Assuming you just wanted to serialize the payload though how might you approach that problem. If the BinaryFormatter was in the chain wouldn't it serialize the content before you even got a chance at it?
 
Again, nice work.
AnswerRe: Surrogate Emitter Pinmember.Shoaib13-Apr-08 20:48 
GeneralRe: Surrogate Emitter PinmemberDerek Viljoen18-Jul-08 6:08 
AnswerRe: Surrogate Emitter Pinmember.Shoaib20-Jul-08 19:50 
GeneralRe: Surrogate Emitter PinmemberAaron Jackson23-Jul-08 9:48 
GeneralRe: Surrogate Emitter [modified] PinmemberAlex_120-Apr-08 21:02 
GeneralFix the license please PinmemberJean-Paul Mikkers2-Apr-08 12:29 
NewsRe: Fix the license please Pinmember.Shoaib3-Apr-08 22:34 
GeneralThe download link is wrong Pinmemberkjetilroe26-Jul-07 0:15 
GeneralIt would be nice if TypeSurrogateSelector wasn't static Pinmemberpatperry11-Jul-07 13:31 
AnswerRe: It would be nice if TypeSurrogateSelector wasn't static Pinmember.Shoaib11-Jul-07 20:27 
GeneralCommercial License PinmemberKay Herzam10-Jul-07 2:44 
AnswerRe: Commercial License Pinmember.Shoaib10-Jul-07 3:51 
QuestionCan you verify compatibility with Mono 1.2.4? PinmemberFelipe Decroaux28-May-07 4:59 
AnswerRe: Can you verify compatibility with Mono 1.2.4? Pinmember.Shoaib4-Jun-07 1:17 
AnswerRe: Can you verify compatibility with Mono 1.2.4? Pinmemberpatperry11-Jul-07 11:14 
GeneralRe: Can you verify compatibility with Mono 1.2.4? Pinmemberpatperry11-Jul-07 11:17 
Joke1=2, Not so fast :) PinmemberAlexandru Lungu30-Apr-07 6:03 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

| Advertise | Privacy | Terms of Use | Mobile
Web01 | 2.8.141223.1 | Last Updated 13 Oct 2010
Article Copyright 2006 by .Shoaib
Everything else Copyright © CodeProject, 1999-2014
Layout: fixed | fluid