out-of-range parsing with <istream>

Ersek, Laszlo · Feb 7, 2012

Hi,

please consider the following program:

// -------------------------------------------------------------------
#include <sstream>
#include <iostream>

static void
check(short val, const std::istringstream & s)
{
std::cout << "val=" << val << " " << (s.fail() ? "failed" : "ok")
<< std::endl;
}

int
main(void)
{
std::istringstream s1("-1"),
s2("FFFF");
short val = 0;

s1 >> std::hex >> val;
check(val, s1);

val = 0;
s2 >> std::hex >> val;
check(val, s2);

return 0;
}
// -------------------------------------------------------------------

When run (g++ (Debian 4.4.5-8) 4.4.5, Debian GNU/Linux 6.0.4, x86_64), it
prints:

val=-1 ok
val=0 failed

I'd like to ask for help with explaining the behavior, based on the C++03
standard.

I followed "27.6.1.2.2 Arithmetic Extractors" to "22.2.2.1.2 num_get
virtual functions".

Case "-1":

- Stage 1 should determine
- basefield == hex --> %X (table is ordered)
- type: short --> "h" length modifier

- Stage 2 should accumulate all characters until the end of string.

- Stage 3 implies the string "-1" is converted as in:

result = sscanf("-1", "%hX", &val);

Unfortunately, this seems to be undefined behavior in ISO C90 (see below),
and neither of the two branches listed in Stage 3 (successful conversion
or input failure) seem to cover that.

- "%hX" takes a pointer to an unsigned short, not a signed short (C90
7.9.6.2)

- even the identical representation mandated inside the intersecting range
of "short" and "unsigned short" is no remedy, because "-1" is outside of
that range.

So, is the statement that reads from s1 correct? (I'm asking about the
source code, not how g++ translated it.) I must surely be misunderstanding
the C++ standard.

Case "FFFF":

- Stage 1 and Stage 2 should work as before.

- Stage 3:

result = sscanf("-1", "%hX", &val);

and I can only repeat the same two concerns as above.

Thus, is the statement reading from s2 correct?

If both statements are correct, is the output of the translated program
correct? (Considering a 16-bit short.)

I think I "agree" with how the program works (the mathematical value -0x1
can be stored in a 16-bit short, while +0xFFFF can not), but I can't reach
this conclusion based on the standard.

Thank you very much,
Laszlo

Implementing a Q-Learning Algorithm with Logistic Regression Normalization in C++	0	Jun 4, 2025
IndexError: pop index out of range	0	May 15, 2013
RSA implementation issues in public key pem loader function	0	May 21, 2025
Using range-based for with alternative ranges	2	May 18, 2012
indexerror: list index out of range??	8	Jun 28, 2013
Vector help, subscript out of range, and basic tutoring	11	Mar 15, 2008
how to compute roots of a cubic function with minimal truncationerrors?	2	Sep 10, 2008
Bad use of stringstream temporary?	20	Mar 24, 2011

out-of-range parsing with <istream>

Ersek, Laszlo

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads