Q: Convert std::string to std::wstring using std::ctype widen()

J

Jeffrey Walton

Hi All,

I've done a little homework (I've read responses to similar from P.J.
Plauger and Dietmar Kuehl), and wanted to verify with the Group. Below
is what I am performing (Stroustrup's Appendix D recommendation won't
compile in Microsoft VC++ 6.0).

My question is in reference to MultiByte Character Sets. Will this code
perform as expected? I understand every problem has a simple and
elegant solution that is wrong.

I generally use US English or Unicode, so I don't encounter a lot of
issues others may see (a multibyte character using std::string). I have
verified it works with a Hello World sample.

Jeff
Jeffrey Walton

std::string s = "Hello World";
std::ctype<wchar_t> ct;
std::wstring ws;

for( std::string::const_iterator it = s.begin(); it != s.end();
it++ )
{
ws += ct.widen( *it );
}

// http://www.research.att.com/~bs/3rd_loc.pdf
// by Bjourne himself...
// page 28 of the above reference
// or
// The C++ Programming Language, Special Edition
// Section D.4.2.2, p 895 (Full Manual)
//
// const std::locale& loc = s.getloc();
// wchar_t w = std::use_facet< std::ctype<char> >(loc).widen(c);
// does not compile in Microsft's environment...
// getloc() is not a member of std::basic_string< ... > ...

//
// Dietmar Kuehl code
// Does not compile in VC++ 6.0
//
// std::wstring to_wide_string(std::string const& source) {
// typedef std::ctype<wchar_t> CT;
// std::wstring rc;
// rc.resize(source.size());
// CT const& ct = std::use_facet<CT>(std::locale());
// ct.widen(source.data(), source.data() + source.size(),
rc.data());
// return rc;
 
J

Jeffrey Walton

Jeffrey said:
Hi All,

I've done a little homework (I've read responses to similar from P.J.
Plauger and Dietmar Kuehl), and wanted to verify with the Group.

SNIP...

Jeff
Jeffrey Walton

std::string s = "Hello World";
std::ctype<wchar_t> ct;
std::wstring ws;

for( std::string::const_iterator it = s.begin(); it != s.end();
it++ )
{
ws += ct.widen( *it );
}

// http://www.research.att.com/~bs/3rd_loc.pdf
// by Bjourne himself...
// page 28 of the above reference
// or
// The C++ Programming Language, Special Edition
// Section D.4.2.2, p 895 (Full Manual)

SNIP Code...

SNIP Code...

Before I get flamed for not using std::codecvt, Stroustrup states
(D.4.6 Character Code Conversion, p 925):
The codecvt facet provides conversion between different character sets
when a character is moved between a stream buffer and external
storage...

Jeff
Jeffrey Walton
 
K

kwikius

Jeffrey said:
Hi All,

I've done a little homework (I've read responses to similar from P.J.
Plauger and Dietmar Kuehl), and wanted to verify with the Group. Below
is what I am performing (Stroustrup's Appendix D recommendation won't
compile in Microsoft VC++ 6.0).

// Dietmar Kuehl code
// Does not compile in VC++ 6.0

<..>

Unfortunately the problem is probably VC++6.0 , not the source code.

Save yourself the pain of using VC++6 and get hold of a copy ov VC7.1
or VC8.0 or gcc.,

By doing so you will open up a whole new world...

regards
Andy Little
 
J

Jeffrey Walton

kwikius said:
<..>

Unfortunately the problem is probably VC++6.0 , not the source code.

SNIP ...

regards
Andy Little

Hi Andy,
Save yourself the pain of using VC++6 and get hold of a copy ov VC7.1
or VC8.0 or gcc.,
Not feasible...

I can't even get the following to compile (from Stroustrup):
wchar_t wc = std::use_facet< std::ctype<wchar_t> >
(std::wcout.getloc()).widen('e') ;

I think I am going to punt if I can't get input on walking the
std::string. Since my example compiles (and uses Ch = widen( c ) ), it
should perform as expected.

Stroustrup again (Section D.4.5, p. 923):
A call widen(c) transforms the character c into its corresponding Ch
value. If Ch's character set provides several characters
corresponding to c, the standard specifies that "the implest reasonable
transformation" be used.

Jeff
Jeffrey Walton
 
K

kwikius

Jeffrey said:
Hi Andy,

Not feasible...

All I can say is that I used to use VC++6 for a long time And at that
time I basically gave up on anything with templates then as nothing
worked. I kind of assumed it was my fault. Anyway I got VC7.1 and
suddenly I started to find I could do things with templates and I
progressed quite rapidly onto metaprogramming and so on and so forth.
IOW VC6 really holds youre work back.

If you are forced to use VC6, I can only suggest getting a private copy
of a more modern compiler for your own use and then trying to convince
the powers that be to let you upgrade to it, by showing them the
difference. Of course then you may end up with C# I guess ... :)

OTOH you could try some of the boost people. They have a lot of VC6
workarounds, but even they have got fed up with **** VC6 AFAICS and are
spending much less time trying to support it now better technology is
available.



regards
Andy Little

regards
Andy Little
 
K

kwikius

Jeffrey said:
Hi Andy,

Not feasible...

I can't even get the following to compile (from Stroustrup):
wchar_t wc = std::use_facet< std::ctype<wchar_t> >
(std::wcout.getloc()).widen('e') ;

I think I am going to punt if I can't get input on walking the
std::string. Since my example compiles (and uses Ch = widen( c ) ), it
should perform as expected.

I seem to remember one 'cute' feature of VC6 was static variables in
function templates Oh they compiled and built fine but they never
seemed to get updated correctly or you would have one and one only
despite several instantiations. These are the sort of 'fun' problems
you get with VC6. Maybe this is what is happening here.

regards
Andy Little
 
P

P.J. Plauger

Not feasible...

I can't even get the following to compile (from Stroustrup):
wchar_t wc = std::use_facet< std::ctype<wchar_t> >
(std::wcout.getloc()).widen('e') ;

Look at how _USE_FACET is used in the standard headers. Those
macros continue to work right in later versions of VC++ as well.

P.J. Plauger
Dinkumware, Ltd.
http://www.dinkumware.com
 
J

Jeffrey Walton

Mathias said:
Upgrading MSVC is free.

Hi Mathias,

Details (I'm currently using VC++ Enterprise Edition)? I'm still busy
at the moment trying to get the code to work with recomendations...

Jeff
 
K

kwikius

Jeffrey said:
Hi Mathias,

Details (I'm currently using VC++ Enterprise Edition)? I'm still busy
at the moment trying to get the code to work with recomendations...

Jeff

// The destructor of ctype is protected hence you can't practically
construct one yourself except presumably by derivation.
(22.2.1.1)
//#################
std::ctype<wchar_t> ct;
//#################

// In the other example a const reference is used rather than an object
std::ctype said:
(std::locale());

so you will need to do something else AFAICS to gain access.

regards
Andy Little
 
M

Mathias Gaunard

Jeffrey said:
Details (I'm currently using VC++ Enterprise Edition)?

Microsoft Visual C++ 2005 Express Edition is free. You can download it
from MSDN. I think the IDE isn't as good as the one from Visual Studio
though.

You can also configure your MSVC6 IDE to actually use the newer compiler
if you want also.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,731
Messages
2,569,432
Members
44,835
Latest member
KetoRushACVBuy

Latest Threads

Top