Q: Text search for character entity references (*')

Rudolf · July 19, 2002, 9:12pm

Could someone help us out? We are trying to locate documents in which an element contains an apostrophe (').

A request like …?_xql=a/b[c="'"] fails, i.e. it behaves like _xql=a/b[c=""], which is not what we intended.

Of course we also tried some variations like the hexadecimal variant or just an apostrophe. They all lead to the same (unwanted) result.

Does anyone know how we can get the desired result?

regards,
Rudolf

Dr_Harald_Schoening · July 22, 2002, 2:32pm

Hi Rudolf,

you cannot use * as a wildcard with the = operator; you must use ~= instead. This is the first reason why the query …?_xql=a/b[c="'"] fails.
The second reason is more involved: the query …?_xql=a/b[c~="'"] will not produce the desired result either, because ' is classified as white space in Tamino. You can change this classification by adding the following line to ino:transliteration (see “Character Handling and Word Recognition” in the Tamino documentation for details):
<ino:character ino:value="'" ino:class="character" />
Then recreate the text indexes and restart the Tamino server for the change to take effect. After that, the query …?_xql=a/b[c~="'"] will find all c elements that contain an apostrophe (').
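
As a side note, when such a query is sent as an HTTP GET request, the quotes, brackets and the apostrophe in the _xql value are best percent-encoded so that they reach the server unchanged. The following is only a minimal Python sketch of that step: the base URL and the helper name build_xql_url are hypothetical, and the pattern is copied as it appears in this thread. It illustrates standard URL encoding only, not a Tamino API.

```python
from urllib.parse import quote, unquote

# Hypothetical database/collection URL (an assumption); replace with your own.
BASE_URL = "http://localhost/tamino/mydb/mycollection"


def build_xql_url(base_url: str, xql: str) -> str:
    """Return a GET URL whose _xql parameter carries the percent-encoded query."""
    # safe="" encodes every reserved character, including / " ' [ ] = and *.
    return f"{base_url}?_xql={quote(xql, safe='')}"


if __name__ == "__main__":
    # Query pattern as quoted in this thread; adjust it to your own schema.
    query = 'a/b[c~="*\'"]'
    url = build_xql_url(BASE_URL, query)
    print(url)
    # Decoding the parameter again yields the original query text.
    print(unquote(url.split("_xql=", 1)[1]))
```

Encoding the query this way only protects it in transit; whether the apostrophe is found still depends on the ino:transliteration change described above.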

Regards

Harald

Rudolf · July 22, 2002, 3:13pm

Of course Harald, you’re right. Actually we did use the ~= operator and it does not work. You can easily test it yourself on any document type containing ' as text.

Regards,
Rudolf

Dr_Harald_Schoening · July 23, 2002, 4:51pm

Rudolf,

yes, that’s what I wrote: you must tell Tamino to handle ' differently:

The query …?_xql=a/b[c~="*'"] will not produce the desired result either, because ' is classified as white space in Tamino. You can change this classification by adding the following line to ino:transliteration (see “Character Handling and Word Recognition” in the Tamino documentation for details):
<ino:character ino:value="'" ino:class="character" />
Then recreate the text indexes and restart the Tamino server for the change to take effect. After that, the query …?_xql=a/b[c~="'"] will find all c elements that contain an apostrophe (').

This works - I have tested that.

Regards

Harald

Rudolf · July 23, 2002, 6:17pm

You’re absolutely right. Thank you.

Regards,
Rudolf
