Full text search

Michel_Teunissen · February 10, 2005, 4:11pm

What’s involved in setting up full text search on a certain xml schema? How does this translate to XPath/XQuery

Michel

system · February 11, 2005, 10:18pm

Hi,

do you mean you want to search the whole document content (not content of a single sub-level element)?

You need an index of type “text” on the root element of you schema. Then Tamino builds a text-index for every word in every document you store in this schema.

Then you can do index-supported querying, which would look like this in XQL (say your root element is “chapter”):

_xql=chapter[.~=“Excalibur”]
which means “gimme every chapter that contains (~=) the word Excalibur”.

You can go wild with wildcard ():
_xql=chapter[.~=“calibur"]
_xql=chapter[.~="Excali”]
_xql=chapter[.~="Exbur"]

Index-supported, all of these queries are fast even with thousands of documents. But beware of leading and trailing asterisk in one query, like this:
_xql=chapter[.~=“cali”]

This is called a word fragment search and this is by default not index-supported. If you need index-support for such queries, you should engange “word fragment index” (to be found the database properties section in Tamino Manager). It can be enabled for the whole database only, and it somewhat slows down processing of inserts. But it makes all kinds of word-fragment searches possible.

Best regards, Andreas

Dr_Harald_Schoening · February 13, 2005, 5:26pm

Hi,

in addition to Andreas’ remarks: all the queries
he mentions are possible without any index. They are, of course, not that fast as with an index, but for testing purposes with a small database, you do not need text indexes

Regards

Harald

Michel_Teunissen · February 14, 2005, 2:56pm

Is this also going to work if my document is in a custom workspace? How do handle this in xql?

Michel

Dr_Harald_Schoening · February 15, 2005, 10:04pm

Hi,

are you referring to a collection when writing “workspace”?
Yes, same procedure - you give teh name of the collection in the URL in case of direct http addressing.
Andreas’ queries are xql (X-Query), same is possible in XQuery with tf:containsText

Regards

Harald

Topic		Replies	Views
Search across multiple document types Tamino	2	3188	April 2, 2021
Document content indexing Tamino	2	10455	April 2, 2021
Full-text search capabilities? Tamino	3	3327	April 2, 2021
Does Tamino support Word 2003? Tamino	5	12742	April 2, 2021
Creating a search engine using Tamino Tamino	1	3130	April 2, 2021

Full text search

Related topics