document.evaluate and @innerHTML

Csaba Gabor · Dec 14, 2005

In Firefox 1.5 (this question is Mozilla specific as I am using
greasemonkey) I would like to be able to use document.evaluate to
return the first TD entry that shows ^\s*MySearchText\s*$. As I
understand it, xpath doesn't yet have regular expressions so I thought
to do:

function findNode (srch) {
  var node;
  var expr="//td[contains(@innerHTML,'"+srch+"')]";
  var RE = new RegExp("\\s*" + srch + "\\s*$");
  var xpathResult = document.evaluate(expr, document, null,
      XPathResult.UNORDERED_NODE_SNAPSHOT_TYPE, null);
  for (var i = 0; i < xpathResult.snapshotLength; i++) {
    node = xpathResult.snapshotItem(i);
    if (node.innerHTML.match(RE)) return (node); // node found
  }

  return null; // node was not found
}

This always returns null though. Specifically,
xpathResult.snapshotLength is always 0 since document.evaluate
evidently doesn't like that @innerHTML. Short of looping through all
TDs, is there another approach someone might suggest?

Thanks,
Csaba Gabor from Vienna

Martin Honnen · Dec 14, 2005

Csaba Gabor wrote:

var expr="//td[contains(@innerHTML,'"+srch+"')]";

@name in XPath accesses an attribute of an element, innerHTML is not an
attribute of an element, it is only a property exposed in the browser DOM.

Martin Honnen · Dec 14, 2005

Csaba said:
In Firefox 1.5 (this question is Mozilla specific as I am using
greasemonkey) I would like to be able to use document.evaluate to
return the first TD entry that shows ^\s*MySearchText\s*$.

var xpathResult = document.evaluate(expr, document, null,
XPathResult.UNORDERED_NODE_SNAPSHOT_TYPE, null);

Note also that you could or even should use
XPathResult.FIRST_ORDERED_NODE_TYPE if you are only looking for the
first node e.g.
var singleResultNode =
document.evaluate(xPathExpression, document, null, 9,
null).singleNodeValue;
if (singleResultNode != null) {
// use node here
}
That way the XPath implementation only needs to find the first node and
does not need to build the complete resulting node set.

Other optimizations make sense, for instance if you want to look for td
elements then those usually sit only inside of the body so you could
write an expression .//td and evaluate it with document.body being the
context node.

Csaba Gabor · Dec 14, 2005

Martin said:
Note also that you could or even should use
XPathResult.FIRST_ORDERED_NODE_TYPE if you are only looking for the
first node e.g.
var singleResultNode =
document.evaluate(xPathExpression, document, null, 9,
null).singleNodeValue;
if (singleResultNode != null) {
// use node here
}
That way the XPath implementation only needs to find the first node and
does not need to build the complete resulting node set.

Thank you Martin, your comments were very helpful. So it seems that I
really do have to stumble through all the elements (i.e. I'm not really
saving anything over document.body.getElementsByTagName('TD')).
Given that document.evaluate is supposed to be used for
trawling through the DOM, it's unfortunate that the designers did not
include the ability to search on properties, as they are some of the
most important characteristics (such as text / positioning) to search
on.

Other optimizations make sense, for instance if you want to look for td
elements then those usually sit only inside of the body so you could
write an expression .//td and evaluate it with document.body being the
context node.

You show a period in front of that //td
(How) does that alter the meaning? That is, how does
document.evaluate("//td", document.body, null, 9, null)
differ from
document.evaluate(".//td", document.body, null, 9, null)

The greasemonkey documentation has a few examples at:
http://diveintogreasemonkey.org/patterns/match-attribute.html
but they are very primitive. On the other hand, I didn't see any //
examples in the documentation that they pointed me to. Would you know
of a good source for examples?

Thanks again,
Csaba

Martin Honnen · Dec 14, 2005

Csaba said:
You show a period in front of that //td
(How) does that alter the meaning? That is, how does
document.evaluate("//td", document.body, null, 9, null)
differ from
document.evaluate(".//td", document.body, null, 9, null)

The first (//td) is an absolute XPath expression, it always starts from
the root node (document node in the DOM) while the second (.//td) is a
relative XPath expression.
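
To make the difference concrete, here is a minimal sketch assuming a browser DOM. The helper name firstMatch, its optional doc parameter, and the element someTable are made up for this illustration; the constant 9 is the numeric value of XPathResult.FIRST_ORDERED_NODE_TYPE used earlier in the thread.

```javascript
// Numeric value of XPathResult.FIRST_ORDERED_NODE_TYPE.
const FIRST_ORDERED_NODE_TYPE = 9;

// Hypothetical helper: returns the first node matching expr when the
// expression is evaluated with contextNode as the context node.
// doc defaults to the page's document; it is a parameter only so the
// helper can be exercised outside a browser.
function firstMatch(expr, contextNode, doc) {
  doc = doc || document;
  return doc.evaluate(expr, contextNode, null,
      FIRST_ORDERED_NODE_TYPE, null).singleNodeValue;
}

// In a page, with someTable being some element inside the body:
//   firstMatch("//td", someTable)   searches from the document root, so it
//                                   can return a td outside someTable
//   firstMatch(".//td", someTable)  searches only the descendants of someTable
```

With document.body as the context node the two forms should normally pick the same td, since td elements live inside the body; the difference shows up once the context node is something narrower, such as a single table.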

cyberrus · Dec 29, 2005

Csaba said:
In Firefox 1.5 (this question is Mozilla specific as I am using
greasemonkey) I would like to be able to use document.evaluate to
return the first TD entry that shows ^\s*MySearchText\s*$. As I
understand it, xpath doesn't yet have regular expressions so I thought

....

The function you are looking for is called string() - see here
http://www.w3.org/TR/xpath#section-String-Functions
- it returns the context node converted into a string (but it doesn't handle
RegExps). If you want to find the first occurrence of the string "srch" in all td
nodes, the evaluator should look like this:

function findNode (srch) {
  return document.evaluate(
    "//td[contains(string(),'"+srch+"')]",
    document,
    null,
    9,
    null).singleNodeValue;
}

As I mentioned above, it won't handle a regexp. To handle that you have
to use standard DOM methods with a loop like this:

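function findNodeRE (srch) {
  var allTd = document.getElementsByTagName('TD');
  for (var i = 0; i < allTd.length; i++) {
    // test each cell in turn, not the collection itself
    if (srch.test(allTd[i].textContent))
      return allTd[i];
  }
  return null; // node was not found
}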

Note that the "srch" variable in the second function is a RegExp object.
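
Since the original goal was a td whose text is the search text with only whitespace around it, XPath 1.0's normalize-space() may get close without a RegExp. The sketch below is one possible approach, not a drop-in replacement: normalize-space() also collapses runs of whitespace inside the text, it works on the cell's text (like textContent) rather than on innerHTML, and its idea of whitespace is narrower than \s in a RegExp. The function names xpathLiteral, exactTdExpression and findNodeExact are invented for this example. xpathLiteral is there because XPath 1.0 string literals have no escape character, so a search text containing a quote would otherwise break the expression.

```javascript
// Hypothetical helper: turn arbitrary text into an XPath 1.0 string literal.
// XPath 1.0 has no escape character inside literals, so text containing
// both quote kinds has to be assembled with concat().
function xpathLiteral(text) {
  if (text.indexOf("'") === -1) return "'" + text + "'";
  if (text.indexOf('"') === -1) return '"' + text + '"';
  // Split on single quotes and glue the pieces back with a quoted "'".
  return "concat('" + text.split("'").join("', \"'\", '") + "')";
}

// Build an expression matching td elements whose text, with leading and
// trailing whitespace removed and inner whitespace collapsed, equals srch.
function exactTdExpression(srch) {
  return "//td[normalize-space(.)=" + xpathLiteral(srch) + "]";
}

// Browser-only: first matching td, or null if there is none.
// 9 is the numeric value of XPathResult.FIRST_ORDERED_NODE_TYPE.
function findNodeExact(srch) {
  return document.evaluate(exactTdExpression(srch), document, null,
      9, null).singleNodeValue;
}
```

In a page, findNodeExact("MySearchText") would be called the same way as findNode above. If the search text itself contains runs of spaces or line breaks, it would have to be normalized the same way before the comparison can succeed.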
