P
peter pilsl
For feeding the content of an xml-file to a search-indexer I need to
remove all tags and extract the plaintext out of a xml-file.
I use the null-xls-stylesheet
<?xml version="1.0"?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
version="1.0">
</xsl:stylesheet>
to remove all tags.
Problem now is that, while it actually works, I found out that removing
all tags is not exactely what I want, cause I ended up with all content
in one string without any whitespace in between. So actually what i want
is to replace all tags with a space.
I use linux/xsltproc/perl and I am definitely no master of xml. I rarely
used it until now and while I do quite fine in perl, I cannot master
this simple xml-problem on my own
thnx for any help
peter
remove all tags and extract the plaintext out of a xml-file.
I use the null-xls-stylesheet
<?xml version="1.0"?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
version="1.0">
</xsl:stylesheet>
to remove all tags.
Problem now is that, while it actually works, I found out that removing
all tags is not exactely what I want, cause I ended up with all content
in one string without any whitespace in between. So actually what i want
is to replace all tags with a space.
I use linux/xsltproc/perl and I am definitely no master of xml. I rarely
used it until now and while I do quite fine in perl, I cannot master
this simple xml-problem on my own
thnx for any help
peter