Can't figure this code

janus · Feb 14, 2013

int TokenList_init(TokenList * tl, Lexer * lex)
{
IG_ASSERT(tl, "TokenList should be defined.");
IG_ASSERT(lex, "Lexer should be defined.");
tl->m_head = NULL;
Token * tok;
do
{
tok = (Token *) malloc(sizeof(Token));
Lexer_read(lex, tok);
tok->m_next = NULL;
if (NULL == tl->m_head) {
tl->m_head = tok;
tl->m_tail = tok;
}
else {
tl->m_tail->m_next = tok;
tl->m_tail = tok;
}
}
while (LEXER_EOF != tok -> m_type);

typedef struct TokenList
{
Token * m_head;
Token * m_tail;
Token * m_curr;
} TokenList;

typedef struct Token
{
int m_type; // an int representing the type of symbol
igstr * m_value; // the string value of the token
int m_line; // the line the token was found at
int m_column; // the column the token was found at
struct Token * m_next; // for later use
} Token;

I am studying the above code I stumbled on. I feel that this is wrong:
tl->m_tail->m_next = tok;
tl->m_tail = tok;

It should be:
tl->m_tail->m_next = tok;

and at the end of the loop we should have
tl->m_tail = tok;

Eric Sosman · Feb 14, 2013

[... building a linked list; this is the crucial bit ...]
tok->m_next = NULL;
if (NULL == tl->m_head) {
tl->m_head = tok;
tl->m_tail = tok;
}
else {
tl->m_tail->m_next = tok;
tl->m_tail = tok;
}
[...]

I am studying the above code I stumbled on. I feel that this is wrong:
tl->m_tail->m_next = tok;
tl->m_tail = tok;

It should be:
tl->m_tail->m_next = tok;

and at the end of the loop we should have
tl->m_tail = tok;

The first time the loop body executes, it creates a list
of one Token. That Token is the first in the list (so m_head
points at it) and the last in the list (so m_tail points at
it, too).

Each additional trip through the loop adds a new Token to
the end of the list. Specifically, it adds the new Token just
after the token that used to be the last one, that is, just
after the token m_tail points at. And after that new Token has
been added, what Token is at the end? The newly-added one, which
is why the code changes m_tail to point at it.

If you left m_tail unchanged until after the loop, then the
first trip through the loop would behave just as it does now,
leaving you with both m_head and m_tail pointing at what I'll
call Token1, and Token1's m_next field equal to NULL. After
the second trip you'd have m_head and m_tail *still* pointing
at Token1, Token1's m_next pointing at Token2, and Token2's
m_next equal to NULL. Each trip through the loop adds the new
Token just after the m_tail token; clear? Okay, now for the
third trip: You'll have m_head and m_tail still pointing to
Token1, Token1's m_next field pointing to Token3, and Token3's
m_next equal to NULL. Who's pointing at Token2 now? Nobody!
Token2 has been lost, and there is no way to find it any more.
Go around again to add Token4, and you'll lose Token3. At the
end, you'll have a situation like:

m_head } --> Token1
m_tail } m_next --> TokenN
m_next == NULL

Lost in the bit bucket:
Token2 Token3 Token4 ...
m_next == NULL m_next == NULL m_next == NULL

janus · Feb 18, 2013

[... building a linked list; this is the crucial bit ...]

Click to expand...

tok->m_next = NULL;

Click to expand...

if (NULL == tl->m_head) {

Click to expand...

tl->m_head = tok;

Click to expand...

tl->m_tail = tok;

else {

Click to expand...

tl->m_tail->m_next = tok;

Click to expand...

tl->m_tail = tok;

}

Click to expand...

[...]

Click to expand...

I am studying the above code I stumbled on. I feel that this is wrong:

Click to expand...

tl->m_tail->m_next = tok;

Click to expand...

tl->m_tail = tok;

It should be:

Click to expand...

tl->m_tail->m_next = tok;

and at the end of the loop we should have

Click to expand...

tl->m_tail = tok;

Click to expand...

The first time the loop body executes, it creates a list

of one Token. That Token is the first in the list (so m_head

points at it) and the last in the list (so m_tail points at

it, too).

Each additional trip through the loop adds a new Token to

the end of the list. Specifically, it adds the new Token just

after the token that used to be the last one, that is, just

after the token m_tail points at. And after that new Token has

been added, what Token is at the end? The newly-added one, which

is why the code changes m_tail to point at it.

If you left m_tail unchanged until after the loop, then the

first trip through the loop would behave just as it does now,

leaving you with both m_head and m_tail pointing at what I'll

call Token1, and Token1's m_next field equal to NULL. After

the second trip you'd have m_head and m_tail *still* pointing

at Token1, Token1's m_next pointing at Token2, and Token2's

m_next equal to NULL. Each trip through the loop adds the new

Token just after the m_tail token; clear? Okay, now for the

third trip: You'll have m_head and m_tail still pointing to

Token1, Token1's m_next field pointing to Token3, and Token3's

m_next equal to NULL. Who's pointing at Token2 now? Nobody!

Token2 has been lost, and there is no way to find it any more.

Go around again to add Token4, and you'll lose Token3. At the

end, you'll have a situation like:

m_head } --> Token1

m_tail } m_next --> TokenN

m_next == NULL

Lost in the bit bucket:

Token2 Token3 Token4 ...

m_next == NULL m_next == NULL m_next == NULL

--

Eric Sosman

(e-mail address removed)

Thanks so much!!!

Lexical Analysis on C++	1	Oct 31, 2023
Does this program produce undefined behavior?	0	Mar 7, 2010
Type punning, code reordering and overloaded operator new() withcustom allocator	0	Nov 23, 2007
why does this segfault??	6	Sep 11, 2003
working with big packets and sockets.	1	Apr 29, 2006
New implementation of FSM using functions returning pointers tofunctions of the same type!	10	Mar 10, 2008
embarrassing spaghetti code needs stylistic advice	72	Mar 20, 2009
Tokenizer interface	11	Nov 28, 2003

Can't figure this code

janus

Eric Sosman

janus

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads