Replacing text on the fly

kelvSYC · Jun 5, 2010

I'm having a tough JavaScript problem. I'm fairly new to JavaScript,
and what I'm trying to do is to replace instances of a particular
string inside a particular element with another string. Here's what I
have so far:

var result = document.evaluate('//div[@class="code"]/code', document,
null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);

if (result) {
for (var i = 0, len = result.snapshotLength; i < len; i++) {
// use snapshotItem(i) to get nodes
var val = result.snapshotItem(i).nodeValue;
val = val.replace(/<-/gi, "â†");
result.snapshotItem(i).nodeValue = val;
}
}

The problem is that, even though I checked that my xpath works, the
result list is empty for some reason even though I know that it should
not be empty. Because of this, I haven't even checked to see that the
inside code works. Why is that?

Thomas 'PointedEars' Lahn · Jun 5, 2010

kelvSYC said:
var result = document.evaluate('//div[@class="code"]/code', document,
null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);

if (result) {
for (var i = 0, len = result.snapshotLength; i < len; i++) {
// use snapshotItem(i) to get nodes
var val = result.snapshotItem(i).nodeValue;
val = val.replace(/<-/gi, "â†");
result.snapshotItem(i).nodeValue = val;
}
}

The problem is that, even though I checked that my xpath works, the
result list is empty for some reason even though I know that it should
not be empty. Because of this, I haven't even checked to see that the
inside code works. Why is that?

The `nodeValue' of an element node is not the content of the element; it is
`null' by definition, and assigning to that property has no effect then.
The element node has a text node as child node, which `nodeValue' you need
to change. As an alternative, and to deal summarily with further child
nodes, you can change the `textContent' of the element node (DOM Level 3
Core):

<http://www.w3.org/TR/DOM-Level-3-Core/core.html#ID-745549614>
<http://www.w3.org/TR/DOM-Level-3-Core/core.html#ID-1950641247>

It is however possible that you only want to change the content of a
specific text node that is the child node of a specific child element of the
`code' element. In that case you can either modify exactly that text node,
which can be tricky if inline elements are nested, or you can use the
proprietary `innerHTML' property. The synhl() function used in
<http://PointedEars.de/es-matrix> does the latter, although it does not use
XPath (yet).

PointedEars

VK · Jun 6, 2010

var result = document.evaluate('//div[@class="code"]/code', document,
null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);

if (result) {
Â Â Â Â for (var i = 0, len = result.snapshotLength; i < len; i++) {
Â Â Â Â Â Â Â Â // use snapshotItem(i) to get nodes
Â Â Â Â Â Â Â Â var val = result.snapshotItem(i).nodeValue;
Â Â Â Â Â Â Â Â val = val.replace(/<-/gi, "â†");
Â Â Â Â Â Â Â Â result.snapshotItem(i).nodeValue = val;
Â Â Â Â }

}

The problem is that, even though I checked that my xpath works, the
result list is empty for some reason even though I know that it should
not be empty. Â Because of this, I haven't even checked to see that the
inside code works. Â Why is that?

nodeValue property has a completely idiotic standard mechanics and
very seldom does what its name seemingly suggests:
https://developer.mozilla.org/En/NodeValue
The purpose of such standardization and naming is obscure, possibly it
is of a too strong grass they smoke in W3C

:-|

To overcome the idioticy, browser producers introduced additional
textContent property that does what you need and what nodeValue should
be doing if W3C members would be using right part of the body for
thinking.
https://developer.mozilla.org/En/DOM/Node.textContent

IE doesn't have this property because it has for ages its own
innerText property that does the same.
http://msdn.microsoft.com/en-us/library/ms533899(VS.85).aspx

To not be bothered with the environment check every single time, you
may define once a string variable with the right property name:

var text = (/*@cc_on true || @*/ false) ? 'innerText' : 'textContent';

// ...

var val = result.snapshotItem(i)[text];

VK · Jun 6, 2010

To not be bothered with the environment check every single time, you
may define once a string variable with the right property name:

var text = (/*@cc_on true || @*/ false) ? 'innerText' : 'textContent';

// ...

var val = result.snapshotItem(i)[text];

To be complete, DOM1 defines for text nodes read/write "data"
property, so theoretically you may use it instead:
var val = result.snapshotItem(i).data;

I never used it so I cannot comment on possible advantages/
disadvantages.

Thomas 'PointedEars' Lahn · Jun 6, 2010

VK said:
nodeValue property has a completely idiotic standard mechanics and
very seldom does what its name seemingly suggests:
https://developer.mozilla.org/En/NodeValue
The purpose of such standardization and naming is obscure, possibly it
is of a too strong grass they smoke in W3C :-|

Will you *please* stop perpetuating your ongoing delusions? The value of an
element node is not supposed to be the element's text content, for an
element can have non-text content.

To overcome the idioticy, browser producers introduced additional
textContent property that does what you need and what nodeValue should
be doing if W3C members would be using right part of the body for
thinking.

Utter nonsense. For one, Microsoft is a W3C member.

https://developer.mozilla.org/En/DOM/Node.textContent

IE doesn't have this property

You only know IE and Mozilla, don't you? The majority of DOM
implementations supports the `textContent' property. That is
why it became and stays a Web standard, not vice-versa.

because it has for ages

"For ages" -- 13 years?

its own innerText property that does the same.

http://msdn.microsoft.com/en-us/library/ms533899(VS.85).aspx

Those properties are _not_ equivalent. For one, `innerText' does not
include leading white-space text nodes, while `textContent' does.

To not be bothered with the environment check every single time, you
may define once a string variable with the right property name:

var text = (/*@cc_on true || @*/ false) ? 'innerText' : 'textContent';

This is insane. The scripting language has nothing to do with whether
either of those DOM properties are supported.

OP: Ignore VK, they do not know what they are talking about.

PointedEars

Thomas 'PointedEars' Lahn · Jun 6, 2010

VK said:
VK said:

To not be bothered with the environment check every single time, you
may define once a string variable with the right property name:

var text = (/*@cc_on true || @*/ false) ? 'innerText' : 'textContent';

// ...

var val = result.snapshotItem(i)[text];

Click to expand...

To be complete, DOM1 defines for text nodes read/write "data"
property, so theoretically you may use it instead:
var val = result.snapshotItem(i).data;

Since result.snapshotItem(i) does not refer to a text node but to an element
node here, a `data' property it might have is certainly not defined by W3C
DOM Level 1 (which was obsoleted by W3C DOM Level 2 a decade ago).

Unsurprisingly, however, it does not have a `data' property, and the `data'
property of the first child text node (which could but does not have to be
result.snapshotItem(i).firstChild) does not need to equal the value of the
`textContent' property; the text content of an element may include several
text nodes, not necesarily adjacent ones.

I never used it so I cannot comment on possible advantages/disadvantages.

You have not the shadow of a clue of what you are talking about. And that
is only a crude approximation.

PointedEars

VK · Jun 6, 2010

Will you *please* stop perpetuating your ongoing delusions? The value of an
element node is not supposed to be the element's text content, for an
element can have non-text content.

Surely: for instance, <textNode>Text</textNode> node has nodeValue
null. It is an obvious behavior additionally hinted by the property
name itself...

You only know IE and Mozilla, don't you? The majority of DOM
implementations supports the `textContent' property. That is
why it became and stays a Web standard, not vice-versa.

It became Web standard for the same reason why CSS cursor:hand is
"wrong" and rather idiotic cursor

ointer is now "right" (any type of
cursor is pointer). The reason is that in Microsoft it is called that
way, so the "standard" one has to be called differently. This practice
has been abandoned only after the W3C's famous letter that ended the
2nd (Religious) Browser Wars (November 26, 2007):
http://www.w3.org/TR/2007/WD-html-design-principles-20071126/
In the particular see "2.3. Do not Reinvent the Wheel".

"For ages" -- 13 years?

Don't make yourself silly so often. Yes, 13 years for IT and the Web
is like 13 centuries for the human history.

Those properties are _not_ equivalent. For one, `innerText' does not
include leading white-space text nodes, while `textContent' does.

It is a separate problem of the phantom nodes introduced by Gecko and
then reproduced by others, see
https://bugzilla.mozilla.org/show_bug.cgi?id=26179
IE never had that problem. So it is not a property specific, it is a
underlaying structure difference. See also
https://developer.mozilla.org/en/Whitespace_in_the_DOM
I especially like that multi-line code under the title "Making things
easier". Yet another proof that some programmers in Mozilla are real
humorists...

This is insane. The scripting language has nothing to do with whether
either of those DOM properties are supported.

OP: Ignore VK, they do not know what they are talking about.

OP: do as you wish.

Thomas 'PointedEars' Lahn · Jun 6, 2010

VK said:
Thomas said:

Will you *please* stop perpetuating your ongoing delusions? The value of
an element node is not supposed to be the element's text content, for an
element can have non-text content.

Click to expand...

Surely: for instance, <textNode>Text</textNode> node has nodeValue
null. [...]

No, certainly not.

It became Web standard for the same reason why CSS cursor:hand is
"wrong"

It is wrong because it implies a shape that is not necessarily used, and
because it is insufficient to describe the shape that is used if it is used.

and rather idiotic cursorointer is now "right" (any type of
cursor is pointer).

Utter nonsense. Pointer here means an shape that can point to something.
This could be an icon that is the pointer digit of a human hand, but does
not need to (it depends on the cursor theme).

The reason is that in Microsoft it is called that way,

No, the reason is that "hand" does not reflect the shape.

so the "standard" one has to be called differently.
Nonsense.

It is a separate problem of the phantom nodes

"Phantom nodes" are an fantasy of yours. Fortunately, the Wikipedia article
you created did not live long enough for this nonsense to be perpetuated.

introduced by Gecko and then reproduced by others,
Nonsense.

see
https://bugzilla.mozilla.org/show_bug.cgi?id=26179

Your delusions have been disproved there, too (the bug is rightfully marked
VERIFIED INVALID, stupid). White-space text nodes are _not_ introduced by
non-MSHTMLs, they are sometimes disregarded by MSHTML. Everybody who has
the slightest clue about the DOM API knows that.

IE never had that problem.

It has another, more serious problem of not providing a consistent document
tree. White-space text nodes are seemingly disregarded at random in MSHTML.
No, we do not really want that.

PointedEars

kelvSYC · Jun 7, 2010

kelvSYC said:
kelvSYC said:

var result = document.evaluate('//div[@class="code"]/code', document,
null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);

Click to expand...

if (result) {
for (var i = 0, len = result.snapshotLength; i < len; i++) {
// use snapshotItem(i) to get nodes
var val = result.snapshotItem(i).nodeValue;
val = val.replace(/<-/gi, "â†");
result.snapshotItem(i).nodeValue = val;
}
}

Click to expand...

The problem is that, even though I checked that my xpath works, the
result list is empty for some reason even though I know that it should
not be empty. Â Because of this, I haven't even checked to see thatthe
inside code works. Â Why is that?

Click to expand...

The `nodeValue' of an element node is not the content of the element; it is
`null' by definition, and assigning to that property has no effect then. Â
The element node has a text node as child node, which `nodeValue' you need
to change. Â As an alternative, and to deal summarily with further child
nodes, you can change the `textContent' of the element node (DOM Level 3
Core):

<http://www.w3.org/TR/DOM-Level-3-Core/core.html#ID-745549614>
<http://www.w3.org/TR/DOM-Level-3-Core/core.html#ID-1950641247>

It is however possible that you only want to change the content of a
specific text node that is the child node of a specific child element of the
`code' element. Â In that case you can either modify exactly that text node,
which can be tricky if inline elements are nested, or you can use the
proprietary `innerHTML' property. Â The synhl() function used in
<http://PointedEars.de/es-matrix> does the latter, although it does not use
XPath (yet).

Interesting about the inner part. But I still don't get why the xpath
expression "//div[@class="code"]/code" isn't matching anything, even
though there is such a thing in my document. I'm seeing "//div" not
matching anything, "//body" not matching anything, even "//html" not
matching anything. I can get "//*[@class="code"]" to match, though.

I should tell you about my eventual code. I'm planning on having it
so that a string like "<-" is entered in my HTML code in that form,
only for some javascript to transform it to an arrow character upon
rendering (whenever it appears in the XPath expression stated). If I
select the text and copy it, then I should get back the original "<-".

Relating to that is how I can replace some text in a text node with
node content - for example, replacing the word "foo" with a <span
class="...">foo</span>.

Thomas 'PointedEars' Lahn · Jun 7, 2010

kelvSYC said:
Thomas said:

kelvSYC said:

var result = document.evaluate('//div[@class="code"]/code', document,
null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);

if (result) {
for (var i = 0, len = result.snapshotLength; i < len; i++) {
// use snapshotItem(i) to get nodes
var val = result.snapshotItem(i).nodeValue;
val = val.replace(/<-/gi, "â†");
result.snapshotItem(i).nodeValue = val;
}
}

Click to expand...

The problem is that, even though I checked that my xpath works, the
result list is empty for some reason even though I know that it should
not be empty. Because of this, I haven't even checked to see that the
inside code works. Why is that?

Click to expand...

The `nodeValue' of an element node is not the content of the element; it
is `null' by definition, and assigning to that property has no effect
then. The element node has a text node as child node, which `nodeValue'
you need to change. [...]

Click to expand...

Interesting about the inner part. But I still don't get why the xpath
expression "//div[@class="code"]/code" isn't matching anything, even
though there is such a thing in my document. I'm seeing "//div" not
matching anything, "//body" not matching anything, even "//html" not
matching anything. I can get "//*[@class="code"]" to match, though.

In your first posting you have said that your XPath expression works, only
that the element content does not change. Therefore, I have explained why
it does not change and how to change it.

But now you are saying the XPath expression does not work in the first
place. Something does not add up here.

Suppose your XPath expression does not work, then it is impossible to say
why that would be so without you providing further details about your
runtime environment and the document; at best, a test case. I have since
updated synhl() and '//code[not(contains(concat(" ", @class, " "),
"donthl"))]' works fine in "Mozilla/5.0 (X11; U; Linux i686; en-US;
rv:1.9.2.3) Gecko/20100404 Iceweasel/3.6.3 (like Firefox/3.6.3) GTB7.0".

PointedEars

Thomas 'PointedEars' Lahn · Jun 8, 2010

Thomas said:
Suppose your XPath expression does not work, then it is impossible to say
why that would be so without you providing further details about your
runtime environment and the document; at best, a test case. I have since
updated synhl() and '//code[not(contains(concat(" ", @class, " "),
"donthl"))]' works fine in "Mozilla/5.0 (X11; U; Linux i686; en-US;
rv:1.9.2.3) Gecko/20100404 Iceweasel/3.6.3 (like Firefox/3.6.3) GTB7.0".

Supplemental: This XPath expression is bogus. Its purpose was to match
`code' elements only when their `class' attribute contained "foo", but only
when it is a word (i.e. a class name), and not in "foobar". So it needs to
be

'//code[not(contains(concat(" ", @class, " "), " donthl "))]'

assuming one would use only spaces as white-space delimiter of a `class'
attribute. Suggestions welcome.

PointedEars

kelvSYC · Jun 8, 2010

kelvSYC said:
kelvSYC said:

Thomas said:

kelvSYC wrote:
var result = document.evaluate('//div[@class="code"]/code', document,
null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);
if (result) {
for (var i = 0, len = result.snapshotLength; i < len; i++) {
// use snapshotItem(i) to get nodes
var val = result.snapshotItem(i).nodeValue;
val = val.replace(/<-/gi, "â†");
result.snapshotItem(i).nodeValue = val;
}
}
The problem is that, even though I checked that my xpath works, the
result list is empty for some reason even though I know that it should
not be empty. Â Because of this, I haven't even checked to see that the
inside code works. Â Why is that?
The `nodeValue' of an element node is not the content of the element; it
is `null' by definition, and assigning to that property has no effect
then. The element node has a text node as child node, which `nodeValue'
you need to change. Â [...]

Click to expand...

Click to expand...

Interesting about the inner part. Â But I still don't get why the xpath
expression "//div[@class="code"]/code" isn't matching anything, even
though there is such a thing in my document. Â I'm seeing "//div" not
matching anything, "//body" not matching anything, even "//html" not
matching anything. Â I can get "//*[@class="code"]" to match, though.

Click to expand...

In your first posting you have said that your XPath expression works, only
that the element content does not change. Â Therefore, I have explained why
it does not change and how to change it.

But now you are saying the XPath expression does not work in the first
place. Â Something does not add up here.

Suppose your XPath expression does not work, then it is impossible to say
why that would be so without you providing further details about your
runtime environment and the document; at best, a test case. Â I have since
updated synhl() and '//code[not(contains(concat(" ", @class, " "),
"donthl"))]' works fine in "Mozilla/5.0 (X11; U; Linux i686; en-US;
rv:1.9.2.3) Gecko/20100404 Iceweasel/3.6.3 (like Firefox/3.6.3) GTB7.0".

Sorry. I'm using Safari 5.0 right now. I can't seem to find a place
where you could get the user agent string. Can anyone point out where
I can do so?

To reiterate, XPath expressions where you name specific child nodes
don't appear to work: even "//html" returns no results. However, "//
*" returns nodes, as is "//*[position() = 1]", say. I'm stumped in
that regard.

Thomas 'PointedEars' Lahn · Jun 9, 2010

kelvSYC said:
Thomas said:

Suppose your XPath expression does not work, then it is impossible to say
why that would be so without you providing further details about your
runtime environment and the document; at best, a test case. I have since
updated synhl() and '//code[not(contains(concat(" ", @class, " "),
"donthl"))]' works fine in "Mozilla/5.0 (X11; U; Linux i686; en-US;
rv:1.9.2.3) Gecko/20100404 Iceweasel/3.6.3 (like Firefox/3.6.3) GTB7.0".

Click to expand...

Sorry. I'm using Safari 5.0 right now. I can't seem to find a place
where you could get the user agent string. Can anyone point out where
I can do so?

Use the script console to display navigator.userAgent, use web-sniffer.net,
set up a dummy Web server, check your server logs, etc.

To reiterate, XPath expressions where you name specific child nodes
don't appear to work: even "//html" returns no results. However, "//
*" returns nodes, as is "//*[position() = 1]", say. I'm stumped in
that regard.

To reiterate, you are not providing sufficient information.

If I were to make an educated guess from the available data, the problem
might be that XPath is case-sensitive with regard to element type names;
HTML is not.

Please trim your quotes to the relevant minimum.

PointedEars

I want to include fees depending on the payment method, using the plugin "Deposits for Woocommerce"	0	Aug 17, 2022
In C, the longest palindromic subsequence multithread exists	0	Nov 23, 2022
Trying to build a SARIMAX model to forecast the S&P500 trend	0	Nov 5, 2023
Fly outmenu on the fly	0	Apr 8, 2005
IE events on the fly	1	Jan 12, 2006
Creating unit tests on the fly	3	Apr 8, 2011
Collect Excel Data from Website	5	Apr 30, 2022
How can I upload a tar.bz2 file to OpenStack swift object storage container using the Python swift client?	1	Mar 22, 2024

Replacing text on the fly

kelvSYC

Thomas 'PointedEars' Lahn

VK

VK

Thomas 'PointedEars' Lahn

Thomas 'PointedEars' Lahn

VK

Thomas 'PointedEars' Lahn

kelvSYC

Thomas 'PointedEars' Lahn

Thomas 'PointedEars' Lahn

kelvSYC

Thomas 'PointedEars' Lahn

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads