HTML tags optimization [ interesting problem]

D

DENG

hi all,

i use SGMLParser to process HTML files, in order to do some
optimizations,

something like this:

<i><b>TEXT1</b></i><b><i><u>TEXT2</u></i></b>

optimise to

<i><b>TEXT1<u>TEXT2</u></b><i>


at the very beginning, i was thinking of analysing each text-block, to
know their color, size, if is bold or italic, but i found it was too
complicated.

e.g

<font color=red><font size=6>TEXT1</font></font>

optimise to

<font color=red size=6>TEXT1</font>


but if there is TEXT2 exist

<font color=red><font size=6>TEXT1</font>TEXT2</font>

we can not do any optimization.

my problem is I can not find a method to treat all those situation, I
had too much thinking and get fool now


anyone can give me some advices?

thanks




PS:

other examples:

1
<font size=5><font size=7>TEXT</font></font>
=>
<font size=7>TEXT</font>

2
<i>TEXT </i><i>TEXT</i>
=>
<i>TEXT TEXT</i>

3
<i>TEXT<i>TEXT</i></i>
=>
<i>TEXT</i>

etc...
 
B

Bruno Desthuilliers

DENG a écrit :
hi all,

i use SGMLParser to process HTML files, in order to do some
optimizations,

something like this:

<i><b>TEXT1</b></i><b><i><u>TEXT2</u></i></b>

optimise to

<i><b>TEXT1<u>TEXT2</u></b><i>

Doesn't Tidy do this already ?
at the very beginning, i was thinking of analysing each text-block, to
know their color, size, if is bold or italic, but i found it was too
complicated.

e.g

<font color=red><font size=6>TEXT1</font></font>

Whoever writes such a thing in 2005 ought to be shot down for the sake
of mankind !-)
optimise to

<font color=red size=6>TEXT1</font>

You call this "optimized HTML" ? Even MS Word produces better code...
but if there is TEXT2 exist

<font color=red><font size=6>TEXT1</font>TEXT2</font>

we can not do any optimization.

my problem is I can not find a method to treat all those situation,

Tidy is open source AFAICT.
I
had too much thinking and get fool now


anyone can give me some advices?

Learn to use Tidy and CSS.
 
D

DENG

I know very well Tidy, sir

Tidy do a nice job but it is writen in Java, and have Python ported

my aim is to learn Python, learn how to program

I know many people write "hello the world" in 2005, why I can not write
this program in 2005?

you are french, right? peut etre we can talk about it in ecole
polytechnique? i'll be there waiting for you

thanks
 
D

DENG

I know very well Tidy, sir

Tidy do a nice job but it is writen in Java, and have Python ported

my aim is to learn Python, learn how to program

I know many people write "hello the world" in 2005, why I can not write
this program in 2005?

you are french, right? peut etre we can talk about it in ecole
polytechnique? i'll be there waiting for you

thanks
 
B

bruno modulix

DENG said:
I know very well Tidy, sir

Tidy do a nice job but it is writen in Java,

Seems like we're not talking about the same program here. Tidy (aka
HTMLTidy) is written in C. You must be talking about it's Java port JTidy.
and have Python ported

my aim is to learn Python, learn how to program

No one could have guess from your post, and I dont have psychic powers,
ok ?

From a professional POV, using existing tools that have proven to be
reliable is far better than reinventing the square wheel, hence my answer.
you are french, right? peut etre we can talk about it in ecole
polytechnique?

Peut-être pas, je ne mets jamais les pieds chez les polytechniciens.
i'll be there waiting for you

I'm afraid you'll learn a looooong time.

Regards,
 
S

Sybren Stuvel

DENG enlightened us with:
i use SGMLParser to process HTML files, in order to do some
optimizations,

something like this:

<i><b>TEXT1</b></i><b><i><u>TEXT2</u></i></b>

optimise to

<i><b>TEXT1<u>TEXT2</u></b><i>

Why not optimize it to:

<span class='onerole'>TEXT1</span>
[ snipped stuff about <font> tags ]

If you're serious about using HTML, I suggest you read a book that's
not ten years old.
anyone can give me some advices?

Yes, read the following:

http://www.w3.org/QA/Tips/
http://www.w3.org/TR/CSS21/
http://www.w3.org/TR/xhtml1/
http://validator.w3.org/

Sybren
 
D

DENG

hi, Sybren,

thanks for your reply, if use CSS:

<span class=type1>text<span class=type2>text</span></span><span
class=type1>text</span>

optimise to:

<span class=type1>text<span class=type2>text</span>text</span>

what i need is the METHOD to do optimization, in fact, i have ready
write a program to analyse the syntax of CSS, to make it works with all
situation
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,770
Messages
2,569,583
Members
45,074
Latest member
StanleyFra

Latest Threads

Top