Click here to Skip to main content
Licence 
First Posted 22 Mar 2001
Views 106,100
Bookmarked 23 times

AMMimeUtils

By | 13 Jun 2001 | Article
An article on how to decode Base64 and Quoted-Printable text without using MFC.

Introduction

There are already other articles here on The Code Project that shows how to decode Base64 and Quoted-Printable, but they all use MFC. I needed some code that didn't use MFC, so I wrote AMMimeUtils.

I wrote these classes because I was working with receiving and sending emails and Usenet messages. Almost all email messages and attachments are either Base64 or Quoted-Printable encoded. Attachments in Usenet messages are often UU encoded, I still need to write a class to handle this, but it might come in a later version.

When you get an email, the subject and other header fields might also be encoded, so this code also includes some code to decode these fields. Different mail programs encode the subject in different ways. The following text:

Just a small text (for demo), and some more text...

can look both like

=?iso-8859-1?Q?Just a small text =28for demo=29, and some more text...?=

or like

Just a small text =?iso-8859-1?Q?=28for demo=29?=, and some more text...

The first line is easy, because we can see that the entire string is encoded with Quoted-Printable (the ?Q? part means Quoted-Printable). In the second string, it's only a part of it that's encoded, so we have to get the first non-encoded part, decode the encoded part, and get the last non-encoded part, and add the 3 parts together to get the final subject.

I made a function char* MimeDecodeMailHeaderField(char *s); to handle this. If you have a string called s containing the subject you want to decode, simply call it like this:

s = MimeDecodeMailHeaderField(s);

Now s contains the decoded text.

I have 2 classes CBase64Utils and CQPUtils for general encoding and decoding of Base64 and Quoted-Printable. The interface looks like:

//class to handle all base64 stuff...
class CBase64Utils
{
private:
  int ErrorCode;
public:
  int GetLastError() {return ErrorCode;};
  CBase64Utils();
  ~CBase64Utils();
  //caller must free the result, bufsize holds the decoded length
  char* Decode(char *input, int *bufsize);
  //caller must free the result, bufsize is the length of the input buffer
  char* Encode(char *input, int bufsize);
};

//class to handle quoted-printable stuff
class CQPUtils
{
private:
  char* ExpandBuffer(char *buffer, int UsedSize, 
             int *BufSize, bool SingleChar = true);
  int ErrorCode;
public:
  int GetLastError() {return ErrorCode;};
  char* Decode(char *input); //caller must free the result
  char* Encode(char *input); //caller must free the result
  CQPUtils();
  ~CQPUtils();
};

The only difference is the Decode() and Encode() functions. Quoted-Printable is always text, therefore it only takes one parameter, the string containing encoded text, and returns a pointer to a new buffer containing the decoded text. Base64 might be an encoded binary file, so it puts the length of the returned buffer in the bufsize variable. Then it's possible to save the decoded buffer as a binary file.

Both classes have a function GetLastError(), if you decode something, and this variable is zero, everything is fine, if it's non-zero there was an error in the input, but you still get the (maybe) encoded/decoded result.

Right now, this code only has functions for what I needed when I wrote it. In the future, it I might add some better error handling.

If you want to know more about MIME and email messages, you can take a look at:

  • RFC 2045 - Multipurpose Internet Mail Extensions (MIME) Part One, Format of Internet Message Bodies
  • RFC 2046 - Multipurpose Internet Mail Extensions (MIME) Part Two, Format of Internet Message Bodies
  • RFC 2044 - Multipurpose Internet Mail Extensions (MIME) Part Three, Format of Internet Message Bodies
  • RFC 822 - STANDARD FOR THE FORMAT OF ARPA INTERNET TEXT MESSAGES

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here

About the Author

Anders Molin

Software Developer (Senior)

Denmark Denmark

Member

Huh! Wink | ;-)

Sign Up to vote   Poor Excellent
Add a reason or comment to your vote: x
Votes of 3 or less require a comment

Comments and Discussions

 
You must Sign In to use this message board. (secure sign-in)
 
Search this forum  
 FAQ
    Noise  Layout  Per page   
  Refresh
BugMajor bug [modified] PinmemberPavel674:06 29 Mar '12  
Generalunicode version Pinmembermengkim22:39 3 Apr '07  
GeneralNotice!!! Pinmembersongzd22:51 7 Dec '06  
GeneralBug in mail decoder Pinmembersuperagenteg23:24 22 Nov '06  
QuestionBUG when decode b64 HTML!? PinmemberKFC12321:45 4 Jul '06  
GeneralSmall, but nasty bugs :mad: Pinmemberscjurgen2:41 16 Sep '04  
General2 bugs found PinmemberDavid Connet13:45 28 Apr '04  
GeneralRe: 2 bugs found PinmemberNate Austin18:53 8 Mar '07  
GeneralQ-Printable decode improvement PinmemberIrek Zielinski8:53 13 Apr '04  
GeneralRe: Q-Printable decode improvement Pinsussthenickname9:55 19 Jan '05  
GeneralRe: Q-Printable decode improvement PinmemberIrek Zielinski10:21 19 Jan '05  
I agree. But the world is not perfect. There is a lot of software around that produces non RFC content. If you want to create a reliable e-mail reader (like in my case) you need to handle also a non fully RFC compatible messages (otherwise your users will start complaing that your e-mail reader sometimes don't work (and MS Outlook has no problem to decode the message)).
 
Check out my software at: http://www.ireksoftware.com
GeneralI think it is bug (base64 encode) PinmemberKiangGiap Lau16:52 8 Jan '04  
GeneralRe: I think it is bug (base64 encode) Pinmemberratzfatz23:27 18 Jan '04  
Generaldecoding string with many ISO PinmemberDevCrazy4:01 17 Jun '03  
GeneralBinary data PinmemberJuan Carlos Cobas23:10 5 Sep '02  
GeneralRe: Binary data PinmemberIrek Zielinski7:29 9 Apr '04  
GeneralGet Base64 Encoded Length PinmemberPaul Kissel8:25 22 Aug '02  
GeneralRe: Get Base64 Encoded Length PinmemberPaul Kissel8:49 22 Aug '02  
GeneralRe: Get Base64 Encoded Length PinmemberAnders Molin13:55 26 Aug '02  
Generalquoted-printable result truncated. PinmemberAnonymous13:06 2 Nov '01  
GeneralRe: quoted-printable result truncated. PinmemberAnonymous7:23 5 Nov '01  
GeneralRe: quoted-printable result truncated. PinmemberAnders Molin9:30 17 Nov '01  
GeneralBug in decode PinmemberAnonymous4:55 25 Oct '01  
QuestionIn the class CBase64,why the decode fun can't decode the jpg file whice had been encode?? PinmemberAnonymous17:02 17 Oct '01  
AnswerRe: In the class CBase64,why the decode fun can't decode the jpg file whice had been encode?? PinmemberAnders Molin10:09 19 Oct '01  

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

Permalink | Advertise | Privacy | Mobile
Web02 | 2.5.120528.1 | Last Updated 14 Jun 2001
Article Copyright 2001 by Anders Molin
Everything else Copyright © CodeProject, 1999-2012
Terms of Use
Layout: fixed | fluid