Interesting warnings from latest MS compiler

Phlip · Jun 15, 2006

Noah said:
Interestingly, several of the operations in the standard library,
including some in basic_string, are "depricated"

Potentially unsafe method
Safer equivalent

basic_string::copy
basic_string::_Copy_s

Are the equivalents safer because they are harder to overflow?

(And could you practice writing "deprecated"? That spelling doesn't
inspire my newsreader to underline it with a wavy red line...)

Noah Roberts · Jun 15, 2006

Phlip said:
(And could you practice writing "deprecated"? That spelling doesn't
inspire my newsreader to underline it with a wavy red line...)

Get a less annoying newsreader. Might help you to refrain from being a
pedantic, lecturing, butthead.

Victor Bazarov · Jun 15, 2006

Noah said:
Oh, are you drifting again?

Catch me! Catch me! Oh, I am drifting away!...

Markus Schoder · Jun 15, 2006

Phlip said:
The Standards are (sometimes) careful to leave things out that must then
get changed. islower etc are _not_ case-aware. They only do specific
case-like things to raw ASCII letters, so the Standards must leave them in
as the rock-bottom must-have functions.

So when C achieves a useful locale system, it may then support a
high-level strcompare() routine that rates encoded strings for equivalence.

int strcompare(const char *s1, const char *s2)
{
while(tolower(*s1) == tolower(*s2) && *s1)
++s1, ++s2;
return *s1 - *s2;
}

A locale aware case insensitive string compare function. Why should
there anything be missing?

The question is just if it is common enough to put it in the standard
library or not. I think it is.

Victor Bazarov · Jun 15, 2006

Markus said:
int strcompare(const char *s1, const char *s2)
{
while(tolower(*s1) == tolower(*s2) && *s1)

... && *s1 && *s2)

++s1, ++s2;
return *s1 - *s2;
}

A locale aware case insensitive string compare function. Why should
there anything be missing?

Missing? Wide char processing, maybe? What's it called, Unicode?

The question is just if it is common enough to put it in the standard
library or not. I think it is.

Well, with so many Unicode versions, stuffing all the things into the
library doesn't make much sense to me.

V

Phlip · Jun 15, 2006

Markus said:
int strcompare(const char *s1, const char *s2) {
while(tolower(*s1) == tolower(*s2) && *s1)
++s1, ++s2;
return *s1 - *s2;
}
}
A locale aware case insensitive string compare function. Why should there
anything be missing?

The question is just if it is common enough to put it in the standard
library or not. I think it is.

You aren't allowed to call it str[a-z].*.

If you didn't, then the Committee did its job. You found that function
very easy to write, because the Committee provided tolower(). And the
Committee prevented your code from breaking when a future version of a C
language comes along with a real locale system, which can detect upper
case, lower case, and title case correctly in all the scripts that have
cases. Your code would continue to work correctly for ASCII, per your
present requirements, and would not conflict with any str function they
added.

Markus Schoder · Jun 15, 2006

Phlip said:
Markus said:

int strcompare(const char *s1, const char *s2) {
while(tolower(*s1) == tolower(*s2) && *s1)
++s1, ++s2;
return *s1 - *s2;
}
}
A locale aware case insensitive string compare function. Why should there
anything be missing?

The question is just if it is common enough to put it in the standard
library or not. I think it is.

Click to expand...

You aren't allowed to call it str[a-z].*.

That's understood I was putting myself in the role of a library
implementor.

If you didn't, then the Committee did its job. You found that function
very easy to write, because the Committee provided tolower(). And the
Committee prevented your code from breaking when a future version of a C
language comes along with a real locale system, which can detect upper
case, lower case, and title case correctly in all the scripts that have
cases. Your code would continue to work correctly for ASCII, per your
present requirements, and would not conflict with any str function they
added.

The function is fully locale aware. You make it sound like we are
waiting for some kind of addition or change to the standard until such
a function can be part of the standard library. I just have no idea
what that would be.

kwikius · Jun 15, 2006

Noah said:
Get a less annoying newsreader. Might help you to refrain from being a
pedantic, lecturing, butthead.

Yeah... but Phlip's a lovely, pedantic, lecturing, butthead though aint
he ?

regards
Andy Little

Markus Schoder · Jun 15, 2006

Richard said:
char what = toupper('ß');

toupper('ß') == 'ß'
tolower('ß') == 'ß'

Phlip · Jun 15, 2006

Markus said:
The function is fully locale aware.

?

Okay, maybe I don't understand tolower(). Will it handle LATIN SMALL
LIGATURE OE (Å“) correctly?

Phlip · Jun 15, 2006

kwikius said:
Yeah... but Phlip's a lovely, pedantic, lecturing, butthead though aint he
?

ain't

Victor Bazarov · Jun 15, 2006

Markus said:
[..]
toupper('ß') == 'ß'
tolower('ß') == 'ß'

But isn't it wrong? How about toupper('?') or tolower('?')?
At least on my computer I naively expect it to be '?' and '?',
respectively. (Yes, I said *naively*, I know it most likely
not going to work)

V

Markus Schoder · Jun 15, 2006

Victor said:
... && *s1 && *s2)

No this is unnecessary. Good example though why not everybody should be
required to think this through again.

Missing? Wide char processing, maybe? What's it called, Unicode?

Well, with so many Unicode versions, stuffing all the things into the
library doesn't make much sense to me.

There is just one additional wide character function required
(wcscompare). The different Unicode versions are handled by the locale
specific low-level functions which are already part of the standard
(e.g. towlower(wint_t)).

kwikius · Jun 15, 2006

Phlip said:
ain't

Geez! Butthead!

........... ;-)

regards
Andy Little

Markus Schoder · Jun 15, 2006

Phlip said:
?

Okay, maybe I don't understand tolower(). Will it handle LATIN SMALL
LIGATURE OE (œ) correctly?

If it is a valid letter in the currently set locale it will.

Some letters may be only representable in a wide character set for
those you would need the wide character version of the compare function
which would use the towlower function instead (also standard). But that
is a different issue since you obviously need a complete set of new
functions to cover wide character sets.

Phlip · Jun 15, 2006

Markus said:
If it is a valid letter in the currently set locale it will.

Please examine the source to your tolower(). One of mine calls this:

ctype<char>::do_tolower(char __c) const
{ return (char) _S_lower[(unsigned char) __c]; }

And _S_lower is a big static table of character mappings. The top half
of the table trivially maps each character to itself. I'm aware that more
advanced versions of tolower() are possible, but this one appears
locale-proof. It's STLPort, and I don't know how compliant it is.

So let's simplify the question by picking ISO Latin 1 (ISO/IEC 8859-1)
letters. Most desktops default to that.

So here's Æ, LATIN CAPITAL LIGATURE AE, at '\xC6'. Its lowercase is at
'\xE6'. You think you can make this assertion pass:

assert('\xE6' == tolower('\xC6'));

Is there some way to set the locale to ISO Latin 1 first, to get that to
pass?

Victor Bazarov · Jun 15, 2006

Phlip said:
[..]
So here's Æ, LATIN CAPITAL LIGATURE AE, at '\xC6'. Its lowercase is at
'\xE6'. You think you can make this assertion pass:

assert('\xE6' == tolower('\xC6'));

Since both chars are not present in the basic character set, your question
cannot be answered in implementation-independent manner, I believe. But
once you enter implementation-specific behaviour, anything is possible, no?

V

Markus Schoder · Jun 15, 2006

Phlip said:
Markus said:

If it is a valid letter in the currently set locale it will.

Click to expand...

Please examine the source to your tolower(). One of mine calls this:

ctype<char>::do_tolower(char __c) const
{ return (char) _S_lower[(unsigned char) __c]; }

And _S_lower is a big static table of character mappings. The top half
of the table trivially maps each character to itself. I'm aware that more
advanced versions of tolower() are possible, but this one appears
locale-proof. It's STLPort, and I don't know how compliant it is.

So let's simplify the question by picking ISO Latin 1 (ISO/IEC 8859-1)
letters. Most desktops default to that.

So here's Æ, LATIN CAPITAL LIGATURE AE, at '\xC6'. Its lowercase is at
'\xE6'. You think you can make this assertion pass:

assert('\xE6' == tolower('\xC6'));

Is there some way to set the locale to ISO Latin 1 first, to get that to
pass?

You can try

setlocale(LC_ALL, "");

which should set the locale to some sane value (may depend on
environment variables).

The only locale that must exist is "C" which is also the default until
you call setlocale(). This of course is just plain ASCII.

Anyway the following program

#include <cctype>
#include <iostream>
#include <clocale>

using namespace std;

int main()
{
cout << hex << tolower('\xC6') << endl;
setlocale(LC_ALL, "");
cout << hex << tolower('\xC6') << endl;
}

produces:

c6
e6

So yes works like a charm for me.

Phlip · Jun 15, 2006

Markus said:
int main()
{
cout << hex << tolower('\xC6') << endl; setlocale(LC_ALL, "");
cout << hex << tolower('\xC6') << endl;
}
}
produces:

c6
e6

So yes works like a charm for me.

Yay! I learned something new about tolower()! (And STLport!)

Your strcompare() still won't work, because it won't handle multiple byte
character sets, such as UTF-8. ;-)

Phlip · Jun 15, 2006

kwikius said:
Geez! Butthead!

How necessary, the apostrophe.

So small, so cute, so quaint.

It fits between the letters

To point out where they ain't.

Compiler warnings	5	Nov 14, 2005
Strange compiler warnings	1	Jun 12, 2008
Help with 3 compiler warnings from g++	6	Apr 13, 2009
Getchar() problem	8	Jan 2, 2022
jQuery Attribute Summit--Latest Coverage	16	Dec 20, 2009
Compilation of old source code.	0	Mar 3, 2022
Free MS Compiler	3	Nov 8, 2005
Compiler warnings vs remarks mystery	6	Jun 25, 2007

Interesting warnings from latest MS compiler

Phlip

Noah Roberts

Victor Bazarov

Markus Schoder

Victor Bazarov

Phlip

Markus Schoder

kwikius

Markus Schoder

Phlip

Phlip

Victor Bazarov

Markus Schoder

kwikius

Markus Schoder

Phlip

Victor Bazarov

Markus Schoder

Phlip

Phlip

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads