Java and HTML parsing.

Mathias Mejborn · May 7, 2007

Hello.

I am trying to write my first HTML parser in Java, but I have some
problems that I can't figure out how to solve.

The interesting method in my program looks like this:

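public void findHTML() {
    try {
        while (s != null) {
            // Only react to the line that contains the DR1 marker.
            if (s.indexOf("title=\"DR1\"") > -1) {
                System.out.println("DR1 fundet"); // "DR1 found"
                dr1Fundet = true;
                if (dr1Fundet) {
                    // Skip past the 20-character marker to the start of the time.
                    int start = s.indexOf("style=\"margin:0px;\">") + 20;
                    System.out.println("Udskriver start: " + start); // "Printing start"

                    // substring takes an end index, not a length.
                    tid = s.substring(start, start + 5);
                    System.out.println("Udskriver tid" + tid); // "Printing time"
                }
            }
            s = ind.readLine();
        }
    } catch (Exception e) {
        // Report the problem instead of silently swallowing it.
        e.printStackTrace();
    }
}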

(I hope that the code turns out right when I post this.)

What I am trying to achieve is:

On the website http://ontv.dk/tv/1 I would like to parse the following HTML:

Senere i dag på
DR1<table cellspacing="0" style="width:100%;"><tr
style="background-color:#eeeeee;"><td style="width:40px;
text-align:right;">17.00:</td><td><a href="/programinfo/11178550000">Troldspejlet

You can see the HTML block starting on line 159 in the HTML source, and
ending on line 171.

What I want to extract from the HTML is: 17.00 followed by Troldspejlet.

My problem is that I can't figure out how to do this at all. I hope
some of you can help me out.
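
For anyone reading along: since the time and the title sit in adjacent table cells, one possible direction is to run a regular expression over the page text instead of counting characters with indexOf. The following is only a minimal sketch under some assumptions: the class name ProgramExtractor and the method extract are made up for illustration, the pattern is guessed from the single snippet quoted above, and the whole page is assumed to have been read into one String first (the snippet wraps across several lines, so matching line by line would miss it). The real markup on the site may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; the name and the pattern are assumptions,
// derived only from the one HTML snippet quoted in this post.
class ProgramExtractor {
    // Matches a time cell such as ">17.00:</td>" that is followed by a cell
    // holding a /programinfo/ link. Group 1 is the time, group 2 the link text.
    private static final Pattern ENTRY = Pattern.compile(
        ">\\s*(\\d{1,2}\\.\\d{2}):\\s*</td>\\s*<td>\\s*"
        + "<a\\s+href=\"/programinfo/\\d+\"[^>]*>\\s*([^<]+)",
        Pattern.CASE_INSENSITIVE);

    // Returns one "time title" string for every match found in the HTML text.
    static List<String> extract(String html) {
        List<String> result = new ArrayList<>();
        Matcher m = ENTRY.matcher(html);
        while (m.find()) {
            result.add(m.group(1) + " " + m.group(2).trim());
        }
        return result;
    }
}
```

Calling ProgramExtractor.extract(page) on the snippet above should give a list with the single entry "17.00 Troldspejlet". A regular expression like this is fragile if the site changes its layout, so a proper HTML parser would probably be the sturdier choice for anything long-lived.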
