Click here to Skip to main content
Click here to Skip to main content

A serialization primer - Part 1

By , 25 Nov 2002
 

This article is the first of a 3 part tutorial on serialization.

  • Part 1 introduces the basics of serialization.
  • Part 2 explains how to gracefully handle reading invalid data stores and support versioning.
  • Part 3 describes how to serialize complex objects.

Serialization is the process of writing or reading an object to or from a persistent storage medium, such as a disk file. Serializing an object requires 3 ingredients:

  • A CFile object representing the datafile
  • A CArchive object that provides the serialization context
  • The object being serialized
Serialization data flow

Step 1 - Open the datafile

To serialize an object to the file "foo.dat", open the file with the appropriate access mode. In this example, the file is opened for exclusive read/write access.
  // Open file "foo.dat"
  CFile* pFile = new CFile();
  ASSERT (pFile != NULL);
  if (!pFile->Open ("foo.dat", CFile::modeReadWrite | CFile::shareExclusive)) {
      // Handle error
      return;
  }

Step 2 - Hook up the archive

Next, a CArchive object is hooked up to the file. The archive provides an efficient conduit to persistent storage. Instead of directly reading and writing the file, you serialize data to and from the archive. The archive needs to know if you're going to be using it to read or write data. In this example, we'll assume we're writing data.
  // Create archive ...
  bool bReading = false;  // ... for writing
  CArchive* pArchive = NULL;
  try
  {
    pFile->SeekToBegin();
    UINT uMode = (bReading ? CArchive::load : CArchive::store);
    pArchive = new CArchive (pFile, uMode);
    ASSERT (pArchive != NULL);
  }
  catch (CException* pException)
  {
    // Handle error
    return;
  }

Step 3 - Serialize the object

Finally, we serialize the object by calling its serialize() method. serialize() is just a method we made up. It has nothing to with MFC's CObject::Serialize(). Also, you don't have to derive your object from CObject. Our serialize() method takes a pointer to a CArchive and returns an integer status.
  int CFoo::serialize
    (CArchive* pArchive)
  {
    int nStatus = SUCCESS;

    // Serialize the object ...
    ...
    
    return (nStatus);
  }
We'll get to the actual serialization process in a minute. Meanwhile, let's recognize a couple of important points:
  • The same method CFoo::serialize() is used to read/write the object from/to persistent storage.
  • CFoo::serialize() doesn't know anything about the datafile bring accessed.
Assume CFoo represents an employee record that contains a couple of data members.
  class CFoo
  {
    // Construction/destruction
    public:
      CFoo::CFoo();
      virtual CFoo::~CFoo();

    // Methods
    public:
      int serialize (CArchive* pArchive);

    // Data members
    public:
      CString  m_strName;  // employee name
      int      m_nId;      // employee id
  };
We use CArchive's streaming operators << and >> to read/write the data members from/to the archive. CArchive knows how to serialize simple data types like int, float, DWORD, and objects like CString. The archive also knows whether it's in read or write mode. You can query its mode by calling CArchive::IsStoring(). CFoo's serialization method can then be written as:
  int CFoo::serialize
    (CArchive* pArchive)
  {
    int nStatus = SUCCESS;

    // Serialize the object ...
    ASSERT (pArchive != NULL);
    try
    {
      if (pArchive->IsStoring()) {
         // Write employee name and id
         (*pArchive) << m_strName;
         (*pArchive) << m_nId;
      }
      else {
         // Read employee name and id
         (*pArchive) >> m_strName;
         (*pArchive) >> m_nId;
      }
    }
    catch (CException* pException)
    {
      nStatus = ERROR;
    }
    return (nStatus);
  }

Step 4 - Clean up

When you've finished serializing, you should clean up by closing the archive and datafile.
  pArchive->Close();
  delete pArchive;
  pFile->Close();
  delete pFile();

Conclusion

Well, there you have it - serialization in a (very small) nutshell. In Part 2, we'll see how to gracefully handle reading invalid data stores and support different versions of our object. In Part 3, we'll see how to serialize complex objects.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

About the Author

Ravi Bhavnani
Technical Lead
Canada Canada
Ravi Bhavnani is an ardent fan of Microsoft technologies who loves building Windows apps, especially PIMs, system utilities, and things that go bump on the Internet. During his career, Ravi has developed expert systems, desktop imaging apps, marketing automation software, EDA tools, a platform to help people find, analyze and understand information, trading software for institutional investors and advanced data visualization solutions. He currently works for a company that provides enterprise workforce management solutions to large clients.
 
His interests include the .NET framework, reasoning systems, financial analysis and algorithmic trading, NLP, CHI and UI design. Ravi holds a BS in Physics and Math and an MS in Computer Science and was a Microsoft MVP (C++ and C# in 2006 and 2007). He is also the co-inventor of 2 patents on software security and generating data visualization dashboards. His claim to fame is that he crafted CodeProject's "joke" forum post icon.
 
Ravi's biggest fear is that one day he might actually get a life, although the chances of that happening seem extremely remote.

Sign Up to vote   Poor Excellent
Add a reason or comment to your vote: x
Votes of 3 or less require a comment

Comments and Discussions

 
You must Sign In to use this message board.
Search this forum  
    Spacing  Noise  Layout  Per page   
GeneralMy vote of 4membersagar131015-Aug-12 20:56 
Good article.
Generalthank you !membermayjune5624-Jun-12 18:46 
in a short,its very good
GeneralRe: thank you !memberRavi Bhavnani25-Jun-12 1:16 
Thank you for your comments!
 
/ravi
My new year resolution: 2048 x 1536
Home | Articles | My .NET bits | Freeware
ravib(at)ravib(dot)com

GeneralThe links in the conclusion to the other parts are deadmemberHarrison H7-Oct-10 10:23 
Missing the "x" after .asp
GeneralRe: The links in the conclusion to the other parts are deadmemberRavi Bhavnani7-Oct-10 10:33 
Yes, unfortunately this is a site-wide problem. Frown | :(
 
/ravi
My new year resolution: 2048 x 1536
Home | Articles | My .NET bits | Freeware
 
ravib(at)ravib(dot)com

GeneralRe: The links in the conclusion to the other parts are deadmemberHarrison H7-Oct-10 10:38 
Boooo. But quick reply! Smile | :)
QuestionChanging the format of the data in the filememberAORD29-Sep-07 12:26 
What is the best way of making the data in the file unreadable?
 
Currently I can see all the stings and integers when I open the file in notepad.
Can the data be saved so the file is not readable by somebody opening with notepad etc.
 
Cheers,
AORD
 
___________________________
 
Here come the machines!

AnswerRe: Changing the format of the data in the filememberRavi Bhavnani9-Oct-07 17:51 
[ Sorry for the delay in replying. For some reason, I no longer receive CP emails when someone posts a note in my article forums. ]
 
A simple way to deter casual viewers is to invert the bits of the characters in a string before/after serialization. For example:
void InvertString
  (CString& strString)  // string to be inverted
{
  for (int nIndex=0; (nIndex < strString.GetLength()); nIndex++) {
      char ch = strString.GetAt (nIndex);
      ch = ~ch;
      strString.SetAt (nIndex, ch);
  }
}
Hope this helps!
 
/ravi
 
This is your brain on Celcius
Home | Music | Articles | Freeware | Trips
ravib(at)ravib(dot)com

GeneralAbout the contents of the filemembercristitomi12-Feb-07 22:24 
Hi!
I am creating an object of type CFoo with some values for the data types, for example
m_strName = "someName"
m_nID = 98
and I am using the CArchive in store mode.
But when I look at the file it shows me something like:
 
someNameb
 
instead of the right data.
Does anyone know why this happens ?
Thanks.
 
cristitomi
AnswerRe: About the contents of the filememberRavi Bhavnani13-Feb-07 1:34 
cristitomi wrote:
Does anyone know why this happens ?

The file contains a binary representation of the data. If you'd like to see "98" in the file, you'd need to store the string version of m_nId.
 
Also, data stored for a CString will be preceded by its length (stored in binary form), which is required for the CArchive to know when to stop reading the CString.
 
/ravi
 
This is your brain on Celcius
Home | Music | Articles | Freeware | Trips
ravib(at)ravib(dot)com

GeneralRe: About the contents of the filemembercristitomi13-Feb-07 2:22 
Hi!
Thanks for your helpful comments.
I can tell you this is a great article, very helpful to me indeed.
Thanks again.
GeneralRe: About the contents of the filememberRavi Bhavnani13-Feb-07 3:25 
Glad I could be of help, Cristi!
 
/ravi
 
This is your brain on Celcius
Home | Music | Articles | Freeware | Trips
ravib(at)ravib(dot)com

Generaltrouble in reading in serialized list structuressussitsh1115-Dec-04 12:01 
I am trying to read in a list structure which is serialized as
 
m_DataPointerList.Serialize( ar );
 
where m_DataPointerList is defined as :
 
CTypedPtrList m_DataPointerList;
 
and class CDataPointer is declared as:
 
class CDataPointer : public CObject
{
public:
CDataPointer();
 
protected:
DECLARE_SERIAL(CDataPointer)
 
protected:
 
public:
unsigned long Time; unsigned char Channel;
unsigned char Rate;
 
public:
virtual void Serialize(CArchive& ar);
};
 
and defined as:
 
IMPLEMENT_SERIAL(CDataPointer, CObject, 1)
CDataPointer::CDataPointer()
{
 
}

void CDataPointer::Serialize(CArchive& ar)
{
if (ar.IsStoring())
{
ar << Time;
ar << Channel;
ar << Rate;
}
else
{
ar >> Time;
ar >> Channel;
ar >> Rate;
}
}
 
I am having some difficulties( i donno why?) in reading it in using the same serializing command ie:
 
m_DataPointerList.Serialize( ar );
 
is there any thing i am overlooking while i am doing this??
 
on the other hand,
I try to read this using a 'for' loop
which iterates for the number of occurances of CDataPointer. but i have to add an offset at the end of every iteration if I were to read the next iteration correctly.
something like
 
nDataCounter=ar.ReadCount();
for(i=0;i>Time[i];
ar>>Channel[i];
ar>>Rate[i];
 
ar.Flush();
pFile->Seek(offSET, CFile::current);
}
 
i dont understand why i have to provide this offset.and i have trouble choosing the correct offSET as it varies for different lists. again is there something i am doing wrong ?
 
Can please you give me some insight into these two issues??
 
Thanks a lot
Hemanth
GeneralRe: trouble in reading in serialized list structures(typo corrected)memberitsh1116-Dec-04 4:11 
m_DataPointerList has to be defined as
 
CTypedPtrList < CObList,CDataPointer* > m_DataPointerList;
 
and not as wrongly typed by me
GeneralReading a file using serializationsusshemanth_phk9-Sep-04 11:55 
I am trying to read in a file, which was written using serialization (by someone else). The size of the variables present in the file from which I need to read is way greater than the stack size. So this is giving me problems like stack overflow. Is there any way to solve such problems? Please give me your suggestions .Thanks a lot
GeneralRe: Reading a file using serializationmemberRavi Bhavnani9-Sep-04 12:28 
See my reply in Part 3. No need to cross-post. Smile | :)
 
/ravi
 
My new year's resolution: 2048 x 1536
Home | Articles | Freeware | Music
ravib@ravib.com

GeneralRe: Reading a file using serializationmemberThomas Mielke11-Oct-07 14:58 
Have just ran into it as well. Large data amounts are usually stored in pointer lists, but these are being processed recursively by CArchive resulting in extreme stack consumtion. Two workarounds that are not fully satisfying: disintegrate the list (by zeroing the 'next' pointer) and store each element with a distinct '<<' operation, or increase the default maximum stack size of 1 MB by using a linker option (for example /STACK:0xa00000 for 10 MB).
GeneralCArchive and .NETsussDaniel S. Horwitz12-Jul-04 6:37 
Dear Folks,
 
Is there a class in .NET that can read documents written using
the MFC CArchive class?
The problem is that I am porting an existing MFC C++
application to .NET. It uses CArchive to persist and
restore its documents. I need the ported version of
the code to be able to read the existing documents that
were persisted by CArchive.
 
Thanks for your help,
Daniel S. Horwitz

GeneralIn memory serializationmemberno_reg_name7-May-04 2:59 
Can you do this? Serialize an object and send it accross the wire, without actually storing it to the disk?
GeneralRe: In memory serializationmemberRavi Bhavnani7-May-04 6:02 
I believe this can be done by using CSocketFile instead of CFile.
 
/ravi
 
My new year's resolution: 2048 x 1536
Home | Articles | Freeware | Music
ravib@ravib.com

GeneralSerialization using Dialog applicationmemberHybridLlama12-Jan-04 16:57 
How do i serialize data collected from a dialog using one button to save and one button to load.
GeneralRe: Serialization using Dialog applicationmemberRavi Bhavnani16-Jan-04 6:37 
Store the data in an object, then serialize the object.
 
/ravi
 
My new year resolution: 2048 x 1536
Home | Articles | Freeware | Music
ravib@ravib.com

GeneralRe: Serialization using Dialog applicationmemberHybridLlama16-Jan-04 9:56 
Thanks,
I got the serialization to work but I am having a problem loading the data. It is giving me an error something to the effect that an attempt was made to acessan unamed file past its end. How can I fix this?
GeneralRe: Serialization using Dialog applicationmemberRavi Bhavnani17-Jan-04 3:32 
It's hard to say without looking at the code. Perhaps you could post a snippet? It's possible that you may not have opened the file or hooked up the archive correctly.
 
/ravi
 
My new year resolution: 2048 x 1536
Home | Articles | Freeware | Music
ravib@ravib.com

GeneralRe: Serialization using Dialog applicationmemberHybridLlama17-Jan-04 8:28 
Here is a small segment of the storeing code and the loading code:
Storing:
 
CFile file;
 
file.Open(fileName, CFile::modeCreate|CFile::modeWrite);

CArchive ar(&file, CArchive::store);
 
Serialize(ar);
 
ar.Close();
file.Close();
 
Loading:
 
CFile file;
 
file.Open(filename,CFile::modeRead);
 
CArchive ar(&file, CArchive::load);
 
Serialize(ar);
 

The program would load the information just fine until I added 5 strings to be stored, after that it went down hill. I hope the above is what you wanted.

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Permalink | Advertise | Privacy | Mobile
Web04 | 2.6.130617.1 | Last Updated 25 Nov 2002
Article Copyright 2002 by Ravi Bhavnani
Everything else Copyright © CodeProject, 1999-2013
Terms of Use
Layout: fixed | fluid