Click here to Skip to main content
15,888,610 members
Articles / Programming Languages / C++

CString-clone Using Standard C++

Rate me:
Please Sign up or sign in to vote.
4.93/5 (128 votes)
7 Dec 2011CPOL4 min read 3.9M   14.1K   227   568
A Drop-In replacement for CString that builds on the Standard C++ Library's basic_string template

Introduction

As much as I use and appreciate the Standard C++ Library, I've never liked its string template - basic_string<>. At times, it seems the designers went out of their way to make it difficult to use.

On the other hand, I've always loved the ease of use of MFC's CString class. It checks for NULL pointers, implicitly converts to const TCHAR*, and has some very handy member functions (Format, Load, etc.) that make string programming a breeze. But of course, I don't want to use MFC anymore. In fact, I don't want to rely on any proprietary library because I want portability.

Therefore I decided to combine the best of both worlds and create:

CStdString

This is a class (a template instantiation actually) that derives from from basic_string<TCHAR>. To the basic_string it adds the entire CString API. You get CString ease of use with 100% basic_string compatibility. In short, a CStdString object is a basic_string that (with very few exceptions (noted below) it is also a drop-in replacement for CString. The best part of this is that both APIs (basic_string and CString) are well known and well documented.

I originally submitted this article to another code site (which shall remain nameless :)) a few years ago. I like CodeProject so much I thought I'd submit it here too. I have used this class in almost every professional project I've done over the past 4 years. It has proven to be the single most useful piece of code I've ever written. It is also extensively debugged. I hope you like it. If you ever have any problems with it, please e-mail me. I'm happy to help.

I provided a simple source application here to prove some of the CString functions work but it's really just a token. The list of sample projects out there that use CString and/or basic_string is massive.

Features

  • Drop in Replacement for CString (see below for exceptions)
  • Two instantiations available at all times -- wchar_t-based version CStdStringW and char-based version CStdStringA. The name CStdString is just a typedef of one of these two.
  • Safely checks for NULL string pointer inputs (like CString) in all functions
  • Extra constructors and assignment operators to automatically convert between wide (wchar_t-based) and thin (char-based) strings for you.
  • Implicit conversion to c_str(). The C++ committee doesn't like this but I sure do.
  • Builds on several platforms, including Windows, Unix and Linux. Works with several implementations of the Standard C++ Library, including Dinkumware, GNU, CodeWarrior, and STLPort.
  • Win32 builds give you some extra goodies like UNICODE/MBCS conversion macros (just like MFCs) as well as member functions for persisting CStdString objects to and from DCOM IStreams.
  • Makes no use of any implementation details of the base class template (basic_string)
  • The derived template adds no member data to basic_string and adds no virtual functions

There are a couple of issues about this code of that I should point out.

CString Compatibility

I was unable to exactly reproduce the CString API. There are a two functions that both CString and basic_string; share, but implement differently. In these cases, I felt it best to make CStdString behave like basic_string (the base class) rather than CString. To be specific.

  • CStdString::operator[] returns characters by value (unlike CString which returns them by reference)
  • The constructor that takes a character and a count takes them in the order (count, value) which is the opposite of the order CString declares them. That's the order that basic_string<>; needs and it was impossible to implement both versions.

There were also two CString functions I could not implement at all -- LockBuffer and UnlockBuffer.

Deriving From basic_string<>

The template I wrote derives from basic_string, a class template without a virtual destructor. Any introductory text to C++ will tell you that it is dangerous to derive from a class without a virtual destructor. It can lead to behavior that is undefined. So if you were to code the following (deleting a CStdStringA through a pointer to the base class), you would technically get undefined behavior:

C++
// assign DERIVED object to  BASE pointer
std::string* pstr = new CStdStringA("Hi"); 

// delete  DERIVED through BASE class pointer -- UNDEFINED
delete pstr;   

Personally, I don't think this is much of an issue. I mean really how often do you actually do this with string objects? I have rarely (if ever) needed to dynamically allocate a string object on the heap. And if I ever do, I won't using a base-class pointer. So if you don't do this, you'll never have to worry. In fact, even if you do code this way, I doubt you'll have any problems with CStdString. I can tell you that at least with Microsoft Visual C++, even the above code runs just fine with no errors or memory leaks. I doubt many other compilers would give you problems either. However my doubt does not impose reality on the C++ world. Caveat Emptor.

History

  • 7 Dec 2011: Updated source code.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Web Developer
United States United States
I've been a software developer since 1990.

While my main page is out of date (and I have therefore blanked it), you can read about the CStdString pages here

http://home.earthlink.net/~jmoleary/stdstring.htm

Comments and Discussions

 
GeneralProblem with Format Pin
12-Jul-01 23:25
suss12-Jul-01 23:25 
GeneralRe: Problem with Format Pin
13-Jul-01 4:47
suss13-Jul-01 4:47 
GeneralRe: Problem with Format Pin
peterchen2-Dec-01 2:29
peterchen2-Dec-01 2:29 
GeneralMemory Leak! Pin
29-May-01 19:33
suss29-May-01 19:33 
GeneralRe: Memory Leak! Pin
Christian Graus29-May-01 19:58
protectorChristian Graus29-May-01 19:58 
GeneralRe: Memory Leak! Pin
29-May-01 20:01
suss29-May-01 20:01 
GeneralHelp for an other article Pin
Arnaud Brejeon29-May-01 15:53
Arnaud Brejeon29-May-01 15:53 
GeneralRe: Help for an other article Pin
29-May-01 19:39
suss29-May-01 19:39 
Hi Arnaud,

You don't need CStdString to do work under UNICODE. What you need to understand is what UNICODE is and how it fits into Windows.

UNICODE is a 16-bit character encoding designed to be able to represent almost all scripts in all languages. That's actually an over simplification but let's not get too technical... Anyway, since the wchar_t type is two bytes on the Windows platform, UNICODE characters fit very neatly into a std::wstring on Windows.

If you need your program to be able to handle characters of any language, UNICODE is an excellent choice. You use wstrings instead of strings and read and write everything as wchar_t-based strings.

This works pretty well until you need to run your UNICODE-string-based program on one of the less powerful versions of Windows (Win95, 98, or ME) and want to call some operating system function that takes a string.

For example, suppose you want to set the text of a window: You'll see that the Win32 headers provide two versions of SetWindowText -- one that takes an ANSI string and one that takes a wide string. Almost all Win32 functions that take strings are declared this way. Here's roughly how the function is declared in the Win32 headers:

. BOOL WINAPI SetWindowTextA(HWND hWnd, LPCSTR lpString);
. BOOL WINAPI SetWindowTextW(HWND hWnd, LPCWSTR lpString);

. #ifdef UNICODE
. #define SetWindowText SetWindowTextW
. #else
. #define SetWindowText SetWindowTextA
. #endif // !UNICODE

Unfortunately, on Win9x and ME, the wide character-based version will not work. Unless your program runs on Windows NT, 2000, or XP, you must manually convert your UNICODE string to ANSI and call SetWindowTextA. Not much fun. And the same rule applies to every Win32 function that takes a string.

Generally when people talk about "doing a UNICODE build" on Windows, they're talking about #define-ing the preprocessor macro _UNICODE (note the leading underscore) early on -- before #include-ing the Win32 header TCHAR.H. What this does is change the mapping of the generic character type TCHAR. It also maps the generic versions of Win32 functions (e.g. "SetWindowText" above) to the specific one ("SetWindowTextA" or "SetWindowTextW").

I'm tempted to go and on about this topic but the fact is there are established sources out there which explain it far better than I ever could. One such source is the MSDN itself. Search it for the topic "UNICODE" and you'll find a wealth of articles. Here are some links to a couple of good ones that should get you started:

http://msdn.microsoft.com/library/psdk/winbase/unicode_0mw9.htm
http://msdn.microsoft.com/library/periodic/period99/multilangUnicode.htm
http://msdn.microsoft.com/library/psdk/msaa/msaaovrw_1zcj.htm

If that's not enough, you could consult a good book, such as "International Programming for Microsoft Windows" by David Schmitt

http://www.amazon.com/exec/obidos/ASIN/1572319569/qid=991197339/sr=1-2/ref=sc_b_2/102-1896360-1959348.

or even one about the Unicode standard itself:

http://www.amazon.com/exec/obidos/ASIN/0764546252/ref=sim_books/102-1896360-1959348

Hope this helps



Joe O'Leary
GeneralRe: Help for an other article Pin
Arnaud Brejeon29-May-01 20:37
Arnaud Brejeon29-May-01 20:37 
Generaloperator[] and other incompatibilities Pin
William E. Kempf29-May-01 4:17
William E. Kempf29-May-01 4:17 
GeneralRe: operator[] and other incompatibilities Pin
Tomasz Sowinski29-May-01 4:36
Tomasz Sowinski29-May-01 4:36 
GeneralRe: operator[] and other incompatibilities Pin
William E. Kempf29-May-01 4:39
William E. Kempf29-May-01 4:39 
GeneralRe: operator[] and other incompatibilities Pin
29-May-01 8:02
suss29-May-01 8:02 
GeneralRe: operator[] and other incompatibilities Pin
William E. Kempf29-May-01 8:26
William E. Kempf29-May-01 8:26 
GeneralRe: operator[] and other incompatibilities Pin
Tim Smith29-May-01 13:59
Tim Smith29-May-01 13:59 
GeneralRe: operator[] and other incompatibilities Pin
Anna-Jayne Metcalfe30-May-01 23:09
Anna-Jayne Metcalfe30-May-01 23:09 
GeneralRe: operator[] and other incompatibilities Pin
William E. Kempf4-Jun-01 4:34
William E. Kempf4-Jun-01 4:34 
GeneralRe: operator[] and other incompatibilities Pin
Howard Kapustein2-Jun-01 17:28
Howard Kapustein2-Jun-01 17:28 
GeneralRe: operator[] and other incompatibilities Pin
William E. Kempf4-Jun-01 4:39
William E. Kempf4-Jun-01 4:39 
GeneralRe: operator[] and other incompatibilities Pin
4-Jun-01 6:27
suss4-Jun-01 6:27 
GeneralRe: operator[] and other incompatibilities Pin
Marc Brooks7-Jun-01 14:10
Marc Brooks7-Jun-01 14:10 
GeneralRe: operator[] and other incompatibilities Pin
7-Jun-01 15:03
suss7-Jun-01 15:03 
GeneralRe: operator[] and other incompatibilities Pin
Marc Brooks7-Jun-01 15:38
Marc Brooks7-Jun-01 15:38 
GeneralRe: operator[] and other incompatibilities Pin
7-Jun-01 18:06
suss7-Jun-01 18:06 
GeneralExcellent! Pin
Simon Brown29-May-01 3:07
Simon Brown29-May-01 3:07 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.