Inger Helene Falch-Jacobsen said:
It would be too much work to do that with the approx. 8000 Norwegian
names!
Not at all. On page
http://home.no.net/ingernet/surnfreq.php for example
you have the list of name frequencies inside a DIV element. It is
And what about those who aren't Norwegians?
To comply with the recommendations, you would need to set the lang
attribute for those <a> elements that contain such names, then.
The language of names is often debatable, though - but not the names are
in English, which is what the markup now says.
The dates are in English format, too.
Then lang="en" is the adequate markup for the part where the dates
appear. On the other hand, on the Web, especially on pages that are more
or less bi- or multilingual, the ISO 8601 notation (e.g., 2004-04-06) is
often superior, since it is unique and understandable irrespectively of
language.
I thought I was smart when I
removed "Norway" from the Norwegian place names to save space...
Maybe I have to use that, even though I tell my visitors that "My
ancestors are mainly from ... Norway" on the index page. And of
course replace Danmark, Sverige and Tyskland with Denmark, Sweden and
Germany. My priority now is to optimize for Google.
Sorry, but that's worse than pointless. Use the language that best serves
the purpose of the pages, instead of trying to tune it to please Google.
Besides, it might well fail - unless you put lots of English on the
pages, creating more distraction among human readers. Anyone who wants to
find genealogy information on the Web and uses a language filter in
Google will miss quite a lot anyway
But you do have the option to search for pages *located* in a
specific country...
No I don't. Google just misleads us into thinking that way. It offers a
language filter and a domain filter, but neither of them is a country
filter. Pick up a .com or .org domain and try to deduce the country from
the domain name, or from the language (as announced or as actually used),
and you get wrong guesses. Does someone think that .to pages are really
located in Tonga, or .tv pages in Tuvalu?
So the problem is partly that the English text is too short?
For the purposes of language markup, the question is irrelevant.
As regards to Google's guessing game, the answer is obviously that to the
extent it uses the actual text content, having a larger proportion of
English text increases the odds of its guessing English. But I would
strongly advice against such games. A little reductio ad absurdum:
To maximize the odds of its guessing English, the entire page should
contain nothing but very simple statements in English, using the most
common words. Such as
"All work and no play makes Jack a dull boy.
All work and no play makes Jack a dull boy.
All work and no play makes Jack a dull boy.
..."
This would not do good to the purpose of the page. And the same applies
to less drastic moves in that direction.