E
Ezra Zygmuntowicz
Hey Guys-
I have a regex problem that I am not sure how to tackle. I am
parsing some classified ads in order to format them for display
online. I have most of the parsing done but I need help with the
final step. So the file has one ad per line and a line looks like this:
<ftditm><begad:11559303>Selah Country Home 1.5 acres. 3 bdrm, 2 bath,
irrigation, horse barn. $122,000. 509-697-6519<endad>
Now I have already parsed everything to get it to this state but
what I need to do next is to count 50 chars after the <begad:
11559303> tag and insert </ftditm>
But the tricky part is that I need to place the </ftditm> 50
characters in to the line but if the 50 chars ends in the middle of a
word then I need to match the rest of the word as well. So I need a
way to match at least 50 chars plus the rest of the current word if
the 50'th char lands in the middle of a word.
So for this particular ad 50 chars makes it to here:
<ftditm><begad:11559303>Selah Country Home 1.5 acres. 3 bdrm, 2 bath,
irri #<= 50 chars ends here# gation, horse barn. $122,000.
509-697-6519<endad>
So it ends in the middle of the word irrigation and I need it to
consume the whole word.
Any help is much appreciated-
-Ezra Zygmuntowicz
Yakima Herald-Republic
WebMaster
509-577-7732
(e-mail address removed)
I have a regex problem that I am not sure how to tackle. I am
parsing some classified ads in order to format them for display
online. I have most of the parsing done but I need help with the
final step. So the file has one ad per line and a line looks like this:
<ftditm><begad:11559303>Selah Country Home 1.5 acres. 3 bdrm, 2 bath,
irrigation, horse barn. $122,000. 509-697-6519<endad>
Now I have already parsed everything to get it to this state but
what I need to do next is to count 50 chars after the <begad:
11559303> tag and insert </ftditm>
But the tricky part is that I need to place the </ftditm> 50
characters in to the line but if the 50 chars ends in the middle of a
word then I need to match the rest of the word as well. So I need a
way to match at least 50 chars plus the rest of the current word if
the 50'th char lands in the middle of a word.
So for this particular ad 50 chars makes it to here:
<ftditm><begad:11559303>Selah Country Home 1.5 acres. 3 bdrm, 2 bath,
irri #<= 50 chars ends here# gation, horse barn. $122,000.
509-697-6519<endad>
So it ends in the middle of the word irrigation and I need it to
consume the whole word.
Any help is much appreciated-
-Ezra Zygmuntowicz
Yakima Herald-Republic
WebMaster
509-577-7732
(e-mail address removed)