Hi P
So what you’re looking for is a way of parsing the “free text” within the
tags in the wikipedia articles into words ?
Is this some sort of linguistic study with a free corpus from Wikipedia ?
I’m not sure xquery is the optimal tool for this analysis - at least I haven’t see any xquery examples regarding this parsing into words.
But I guess it can be implemented with userdefined functions, or perhaps more effectively by a Tamino server-extension function.