{ "ns" : "http://zorba.io/modules/full-text", "description" : " This module provides an XQuery API to full-text functions.\n For general information about this implementation of the\n XQuery and XPath Full Text 1.0 specification\n as well as instructions for building and installing a thesaurus,\n see the Full Text Thesaurus documentation.\n

Notes on languages

\n To refer to particular human languages,\n this module uses either the\n ISO 639-1\n or\n ISO 639-2\n language codes.\n Note that only a subset of the\n complete list of language codes\n is supported and not every function supports the same subset.\n

\n Most functions in this module take a language as a parameter\n using the\n xs:language\n XML schema data type.\n
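
As a minimal sketch of how a language code is turned into an xs:language value and checked before use, the following query casts a few ISO 639-1 codes and asks whether each is supported for stemming and for stop words. The particular codes chosen are only illustrative; the result depends on the installation.

```xquery
import module namespace ft = 'http://zorba.io/modules/full-text';

(: The codes below are illustrative; support varies per function. :)
for $code in ( 'en', 'de', 'tr' )
let $lang := xs:language( $code )
return concat(
  $code,
  ': stemming=', ft:is-stem-lang-supported( $lang ),
  ', stop words=', ft:is-stop-word-lang-supported( $lang )
)
```

Checking support first avoids err:FTST0009, which the functions that take a language raise when that language is not supported.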

Notes on stemming

\n The stem() functions return the\n stem\n of a word.\n The stem of a word itself, however, is not guaranteed to be a word.\n It is best to consider a stem as an opaque byte sequence.\n All that is guaranteed about a stem is that,\n for a given word,\n the stem of that word will always be the same byte sequence.\n Hence,\n you should never compare the result of one of the stem()\n functions against a non-stemmed string,\n for example:\n
\n  if ( ft:stem( \"apples\" ) eq \"apple\" )             ** WRONG **\n 
\n Instead do:\n
\n  if ( ft:stem( \"apples\" ) eq ft:stem( \"apple\" ) )  ** CORRECT **\n 
\n
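
A slightly fuller sketch of the same rule, assuming English input and using the predeclared $ft:LANG-EN constant: both operands are passed through ft:stem() with the same language, so only stems are ever compared with each other.

```xquery
import module namespace ft = 'http://zorba.io/modules/full-text';

(: Compare two words by their stems; never compare a stem to a raw string. :)
let $a := 'apples'
let $b := 'apple'
return
  if ( ft:is-stem-lang-supported( $ft:LANG-EN ) )
  then ft:stem( $a, $ft:LANG-EN ) eq ft:stem( $b, $ft:LANG-EN )
  else ()  (: stemming unavailable for English on this installation :)
```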

Notes on the thesaurus

\n The thesaurus-lookup() functions have \"levels\"\n and \"relationship\" parameters.\n The values for these are implementation-defined.\n The default implementation uses the\n WordNet lexical database,\n version 3.0.\n

\n In WordNet,\n the number of \"levels\" that two phrases are apart\n is how many hierarchical meanings apart they are.\n For example,\n \"canary\" is 5 levels away from \"vertebrate\"\n (canary > finch > oscine > passerine > bird > vertebrate).\n
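
The level bounds can be sketched with the six-argument form of ft:thesaurus-lookup(). The URI below is a hypothetical placeholder, not a value defined by this module; it must be replaced by a thesaurus that is among the statically known thesauri, otherwise err:FTST0018 is raised.

```xquery
import module namespace ft = 'http://zorba.io/modules/full-text';

(: ASSUMPTION: hypothetical URI; substitute a statically known thesaurus. :)
let $uri := 'http://example.com/my-thesaurus'
(: Ask for hypernyms between 1 and 5 levels away from the phrase. :)
return ft:thesaurus-lookup( $uri, 'canary', $ft:LANG-EN, 'hypernym', 1, 5 )
```

The last two arguments are $level-least and $level-most; a negative or too-large value for either raises err:FOCA0003.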

\n When using the WordNet implementation,\n all of the relationships (and their abbreviations)\n specified by\n ISO 2788\n and\n ANSI/NISO Z39.19-2005\n with the exceptions of \"HN\" (history note)\n and \"X SN\" (see scope note for) are supported.\n These relationships are:\n
Rel.  Meaning                   WordNet Rel.
BT    broader term              hypernym
BTG   broader term generic      hypernym
BTI   broader term instance     instance hypernym
BTP   broader term partitive    part meronym
NT    narrower term             hyponym
NTG   narrower term generic     hyponym
NTI   narrower term instance    instance hyponym
NTP   narrower term partitive   part holonym
RT    related term              also see
SN    scope note                n/a
TT    top term                  hypernym
UF    non-preferred term        n/a
USE   preferred term            n/a
\n Note that you can specify relationships\n either by their abbreviation\n or their meaning.\n Relationships are case-insensitive.\n In addition to the\n ISO 2788\n and\n ANSI/NISO Z39.19-2005\n relationships,\n all of the relationships offered by WordNet are also supported.\n These relationships are:\n
Relationship  Meaning
also see\n A word that is related to another,\n e.g., for \"varnished\" (furniture)\n one should also see \"finished.\"\n
antonym\n A word opposite in meaning to another,\n e.g., \"light\" is an antonym for \"heavy.\"\n
attribute\n A noun for which adjectives express values,\n e.g., \"weight\" is an attribute\n for which the adjectives \"light\" and \"heavy\"\n express values.\n
cause\n A verb that causes another,\n e.g., \"show\" is a cause of \"see.\"\n
derivationally related form\n A word that is derived from a root word,\n e.g., \"metric\" is a derivationally related form of \"meter.\"\n
derived from adjective\n An adverb that is derived from an adjective,\n e.g., \"correctly\" is derived from the adjective \"correct.\"\n
entailment\n A verb that presupposes another,\n e.g., \"snoring\" entails \"sleeping.\"\n
hypernym\n A word with a broad meaning that more specific words fall under,\n e.g., \"meal\" is a hypernym of \"breakfast.\"\n
hyponym\n A word of more specific meaning than a general term applicable to it,\n e.g., \"breakfast\" is a hyponym of \"meal.\"\n
instance hypernym\n A word that denotes a category of some specific instance,\n e.g., \"author\" is an instance hypernym of \"Asimov.\"\n
instance hyponym\n A term that denotes a specific instance of some general category,\n e.g., \"Asimov\" is an instance hyponym of \"author.\"\n
member holonym\n A word that denotes a collection of individuals,\n e.g., \"faculty\" is a member holonym of \"professor.\"\n
member meronym\n A word that denotes a member of a larger group,\n e.g., a \"person\" is a member meronym of a \"crowd.\"\n
part holonym\n A word that denotes a larger whole comprised of some part,\n e.g., \"car\" is a part holonym of \"engine.\"\n
part meronym\n A word that denotes a part of a larger whole,\n e.g., an \"engine\" is a part meronym of a \"car.\"\n
participle of verb\n An adjective that is the participle of some verb,\n e.g., \"breaking\" is the participle of the verb \"break.\"\n
pertainym\n An adjective that classifies its noun,\n e.g., \"musical\" is a pertainym in \"musical instrument.\"\n
similar to\n Similar, though not necessarily interchangeable, adjectives.\n For example, \"shiny\" is similar to \"bright\",\n but they have subtle differences.\n
substance holonym\n A word that denotes a larger whole containing some constituent\n substance, e.g., \"bread\" is a substance holonym of \"flour.\"\n
substance meronym\n A word that denotes a constituent substance of some larger whole,\n e.g., \"flour\" is a substance meronym of \"bread.\"\n
verb group\n A verb that is a member of a group of similar verbs,\n e.g., \"live\" is in the verb group\n of \"dwell\", \"live\", \"inhabit\", etc.\n
\n
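
Since a relationship may be given by abbreviation, by meaning, or by its WordNet name, the three lookups in this sketch are intended to request the same relationship. The URI is a hypothetical placeholder and must be replaced by a statically known thesaurus.

```xquery
import module namespace ft = 'http://zorba.io/modules/full-text';

(: ASSUMPTION: hypothetical URI; substitute a statically known thesaurus. :)
declare variable $uri as xs:string := 'http://example.com/my-thesaurus';

(
  (: ISO 2788 abbreviation :)
  ft:thesaurus-lookup( $uri, 'breakfast', $ft:LANG-EN, 'BT' ),
  (: ISO 2788 meaning :)
  ft:thesaurus-lookup( $uri, 'breakfast', $ft:LANG-EN, 'broader term' ),
  (: WordNet relationship name :)
  ft:thesaurus-lookup( $uri, 'breakfast', $ft:LANG-EN, 'hypernym' )
)
```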

Notes on tokenization

\n For general information about the implementation of tokenization,\n including what constitutes a token,\n see the Full Text Tokenizer documentation.\n", "sees" : [ ], "authors" : [ "Paul J. Lucas" ], "version" : null, "encoding" : "utf-8", "namespaces" : [ { "uri" : "http://www.w3.org/2005/xqt-errors", "prefix" : "err" }, { "uri" : "http://zorba.io/modules/full-text", "prefix" : "ft" }, { "uri" : "http://zorba.io/options/versioning", "prefix" : "ver" }, { "uri" : "http://zorba.io/errors", "prefix" : "zerr" } ], "functions" : [ { "arity" : 0, "name" : "current-compare-options", "qname" : "ft:current-compare-options", "signature" : "() as object() external", "description" : " Gets the current compare options.\n", "summary" : "

Gets the current compare options.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ ], "returns" : { "type" : "object()", "description" : "said compare options." }, "errors" : [ ] }, { "arity" : 0, "name" : "current-lang", "qname" : "ft:current-lang", "signature" : "() as xs:language external", "description" : " Gets the current\n language:\n either the language specified by the\n declare ft-option using\n language\n statement (if any)\n or the one returned by ft:host-lang() (if none).\n", "summary" : "

Gets the current\n language:\n either the language specified by the\n declare ft-option using\n language\n statement (if any)\n or the one returned by ft:host-lang() (if none).

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ ], "returns" : { "type" : "xs:language", "description" : "said language." }, "errors" : [ ] }, { "arity" : 0, "name" : "host-lang", "qname" : "ft:host-lang", "signature" : "() as xs:language external", "description" : " Gets the host's current\n language.\n The \"host\" is the computer on which the software is running.\n The host's current language is obtained as follows:\n \n", "summary" : "

Gets the host's current\n language.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ ], "returns" : { "type" : "xs:language", "description" : "said language." }, "errors" : [ ] }, { "arity" : 1, "name" : "is-stem-lang-supported", "qname" : "ft:is-stem-lang-supported", "signature" : "($lang as xs:language) as xs:boolean external", "description" : " Checks whether the given\n language\n is supported for stemming.\n", "summary" : "

Checks whether the given\n language \n is supported for stemming.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language to check.
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if the language is supported." }, "errors" : [ ] }, { "arity" : 1, "name" : "is-stop-word-lang-supported", "qname" : "ft:is-stop-word-lang-supported", "signature" : "($lang as xs:language) as xs:boolean external", "description" : " Checks whether the given\n language\n is supported for stop words.\n", "summary" : "

Checks whether the given\n language \n is supported for stop words.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language to check.
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if the language is supported." }, "errors" : [ ] }, { "arity" : 1, "name" : "is-stop-word", "qname" : "ft:is-stop-word", "signature" : "($word as xs:string) as xs:boolean external", "description" : " Checks whether the given word is a stop-word.\n", "summary" : "

Checks whether the given word is a stop-word.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "word", "type" : "xs:string", "occurence" : null, "description" : "
The word to check. The word's language is assumed to be the one returned by ft:current-lang().
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if $word is a stop-word." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported." ] }, { "arity" : 2, "name" : "is-stop-word", "qname" : "ft:is-stop-word", "signature" : "($word as xs:string, $lang as xs:language) as xs:boolean external", "description" : " Checks whether the given word is a stop-word.\n", "summary" : "

Checks whether the given word is a stop-word.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "word", "type" : "xs:string", "occurence" : null, "description" : "
The word to check.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of $word.
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if $word is a stop-word." }, "errors" : [ "err:FTST0009 if $lang is not supported." ] }, { "arity" : 1, "name" : "is-thesaurus-lang-supported", "qname" : "ft:is-thesaurus-lang-supported", "signature" : "($lang as xs:language) as xs:boolean external", "description" : " Checks whether the given\n language\n is supported for look-up using the default thesaurus.\n", "summary" : "

Checks whether the given\n language \n is supported for look-up using the default thesaurus.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language to check.
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if the language is supported." }, "errors" : [ ] }, { "arity" : 2, "name" : "is-thesaurus-lang-supported", "qname" : "ft:is-thesaurus-lang-supported", "signature" : "($uri as xs:string, $lang as xs:language) as xs:boolean external", "description" : " Checks whether the given\n language\n is supported for look-up using the thesaurus specified by the given URI.\n", "summary" : "

Checks whether the given\n language \n is supported for look-up using the thesaurus specified by the given URI.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "uri", "type" : "xs:string", "occurence" : null, "description" : "
The URI specifying the thesaurus to use.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language to check.
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if the language is supported." }, "errors" : [ "err:FTST0018 if $uri refers to a thesaurus that is not found in the statically known thesauri." ] }, { "arity" : 1, "name" : "is-tokenizer-lang-supported", "qname" : "ft:is-tokenizer-lang-supported", "signature" : "($lang as xs:language) as xs:boolean external", "description" : " Checks whether the given\n language\n is supported for tokenization.\n", "summary" : "

Checks whether the given\n language \n is supported for tokenization.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language to check.
" } ], "returns" : { "type" : "xs:boolean", "description" : "true only if the language is supported." }, "errors" : [ ] }, { "arity" : 1, "name" : "stem", "qname" : "ft:stem", "signature" : "($word as xs:string) as xs:string external", "description" : " Stems the given word.\n", "summary" : "

Stems the given word.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "word", "type" : "xs:string", "occurence" : null, "description" : "
The word to stem. The word's language is assumed to be the one returned by ft:current-lang().
" } ], "returns" : { "type" : "xs:string", "description" : "the stem of $word." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported." ] }, { "arity" : 2, "name" : "stem", "qname" : "ft:stem", "signature" : "($word as xs:string, $lang as xs:language) as xs:string external", "description" : " Stems the given word.\n", "summary" : "

Stems the given word.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "word", "type" : "xs:string", "occurence" : null, "description" : "
The word to stem.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of $word.
" } ], "returns" : { "type" : "xs:string", "description" : "the stem of $word." }, "errors" : [ "err:FTST0009 if $lang is not supported." ] }, { "arity" : 1, "name" : "strip-diacritics", "qname" : "ft:strip-diacritics", "signature" : "($string as xs:string) as xs:string external", "description" : " Strips all diacritical marks from all characters.\n", "summary" : "

Strips all diacritical marks from all characters.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "string", "type" : "xs:string", "occurence" : null, "description" : "
The string to strip diacritical marks from.
" } ], "returns" : { "type" : "xs:string", "description" : "$string with diacritical marks stripped." }, "errors" : [ ] }, { "arity" : 1, "name" : "thesaurus-lookup", "qname" : "ft:thesaurus-lookup", "signature" : "($phrase as xs:string) as xs:string* external", "description" : " Looks-up the given phrase in the default thesaurus.\n", "summary" : "

Looks up the given phrase in the default thesaurus.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "phrase", "type" : "xs:string", "occurence" : null, "description" : "
The phrase to look up. The phrase's language is assumed to be the one returned by ft:current-lang().
" } ], "returns" : { "type" : "xs:string*", "description" : "the related phrases if $phrase is found in the thesaurus or the empty sequence if not." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported.", "zerr:ZXQP8401 if the thesaurus data file's version is not supported by the currently running version of the software.", "zerr:ZXQP8402 if the thesaurus data file's endianness does not match that of the CPU on which the software is currently running.", "zerr:ZXQP8403 if there was an error reading the thesaurus data." ] }, { "arity" : 2, "name" : "thesaurus-lookup", "qname" : "ft:thesaurus-lookup", "signature" : "($uri as xs:string, $phrase as xs:string) as xs:string* external", "description" : " Looks-up the given phrase in a thesaurus.\n", "summary" : "

Looks up the given phrase in a thesaurus.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "uri", "type" : "xs:string", "occurence" : null, "description" : "
The URI specifying the thesaurus to use.
" }, { "name" : "phrase", "type" : "xs:string", "occurence" : null, "description" : "
The phrase to look up. The phrase's language is assumed to be the one returned by ft:current-lang().
" } ], "returns" : { "type" : "xs:string*", "description" : "the related phrases if $phrase is found in the thesaurus or the empty sequence if not." }, "errors" : [ "err:FTST0009 if ft:current-lang() is unsupported.", "err:FTST0018 if $uri refers to a thesaurus that is not found in the statically known thesauri.", "zerr:ZOSE0001 if the thesaurus data file could not be found.", "zerr:ZOSE0002 if the thesaurus data file is not a plain file.", "zerr:ZXQP8401 if the thesaurus data file's version is not supported by the currently running version of the software.", "zerr:ZXQP8402 if the thesaurus data file's endianness does not match that of the CPU on which the software is currently running.", "zerr:ZXQP8403 if there was an error reading the thesaurus data file." ] }, { "arity" : 3, "name" : "thesaurus-lookup", "qname" : "ft:thesaurus-lookup", "signature" : "($uri as xs:string, $phrase as xs:string, $lang as xs:language) as xs:string* external", "description" : " Looks-up the given phrase in the thesaurus specified by the given URI.\n", "summary" : "

Looks up the given phrase in the thesaurus specified by the given URI.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "uri", "type" : "xs:string", "occurence" : null, "description" : "
The URI specifying the thesaurus to use.
" }, { "name" : "phrase", "type" : "xs:string", "occurence" : null, "description" : "
The phrase to look up.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of $phrase.
" } ], "returns" : { "type" : "xs:string*", "description" : "the related phrases if $phrase is found in the thesaurus or the empty sequence if not." }, "errors" : [ "err:FTST0009 if $lang is not supported.", "err:FTST0018 if $uri refers to a thesaurus that is not found in the statically known thesauri.", "zerr:ZOSE0001 if the thesaurus data file could not be found.", "zerr:ZOSE0002 if the thesaurus data file is not a plain file.", "zerr:ZXQP8401 if the thesaurus data file's version is not supported by the currently running version of the software.", "zerr:ZXQP8402 if the thesaurus data file's endianness does not match that of the CPU on which the software is currently running.", "zerr:ZXQP8403 if there was an error reading the thesaurus data file." ] }, { "arity" : 4, "name" : "thesaurus-lookup", "qname" : "ft:thesaurus-lookup", "signature" : "($uri as xs:string, $phrase as xs:string, $lang as xs:language, $relationship as xs:string) as xs:string* external", "description" : " Looks-up the given phrase in a thesaurus.\n", "summary" : "

Looks up the given phrase in a thesaurus.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "uri", "type" : "xs:string", "occurence" : null, "description" : "
The URI specifying the thesaurus to use.
" }, { "name" : "phrase", "type" : "xs:string", "occurence" : null, "description" : "
The phrase to look up.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of $phrase.
" }, { "name" : "relationship", "type" : "xs:string", "occurence" : null, "description" : "
The relationship the results are to have to $phrase.
" } ], "returns" : { "type" : "xs:string*", "description" : "the related phrases if $phrase is found in the thesaurus or the empty sequence if not." }, "errors" : [ "err:FTST0018 if $uri refers to a thesaurus that is not found in the statically known thesauri.", "err:FTST0009 if $lang is not supported.", "zerr:ZOSE0001 if the thesaurus data file could not be found.", "zerr:ZOSE0002 if the thesaurus data file is not a plain file.", "zerr:ZXQP8401 if the thesaurus data file's version is not supported by the currently running version of the software.", "zerr:ZXQP8402 if the thesaurus data file's endianness does not match that of the CPU on which the software is currently running.", "zerr:ZXQP8403 if there was an error reading the thesaurus data file." ] }, { "arity" : 6, "name" : "thesaurus-lookup", "qname" : "ft:thesaurus-lookup", "signature" : "($uri as xs:string, $phrase as xs:string, $lang as xs:language, $relationship as xs:string, $level-least as xs:integer, $level-most as xs:integer) as xs:string* external", "description" : " Looks-up the given phrase in a thesaurus.\n", "summary" : "

Looks up the given phrase in a thesaurus.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "uri", "type" : "xs:string", "occurence" : null, "description" : "
The URI specifying the thesaurus to use.
" }, { "name" : "phrase", "type" : "xs:string", "occurence" : null, "description" : "
The phrase to look up.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of $phrase.
" }, { "name" : "relationship", "type" : "xs:string", "occurence" : null, "description" : "
The relationship the results are to have to $phrase.
" }, { "name" : "level-least", "type" : "xs:integer", "occurence" : null, "description" : "
The minimum number of levels within the thesaurus to be traversed.
" }, { "name" : "level-most", "type" : "xs:integer", "occurence" : null, "description" : "
The maximum number of levels within the thesaurus to be traversed.
" } ], "returns" : { "type" : "xs:string*", "description" : "the related phrases if $phrase is found in the thesaurus or the empty sequence if not." }, "errors" : [ "err:FOCA0003 if either $level-least or $level-most is either negative or too large.", "err:FTST0018 if $uri refers to a thesaurus that is not found in the statically known thesauri.", "err:FTST0009 if $lang is not supported.", "zerr:ZOSE0001 if the thesaurus data file could not be found.", "zerr:ZOSE0002 if the thesaurus data file is not a plain file.", "zerr:ZXQP8401 if the thesaurus data file's version is not supported by the currently running version of the software.", "zerr:ZXQP8402 if the thesaurus data file's endianness does not match that of the CPU on which the software is currently running.", "zerr:ZXQP8403 if there was an error reading the thesaurus data file." ] }, { "arity" : 1, "name" : "tokenize-node", "qname" : "ft:tokenize-node", "signature" : "($node as node()) as object()* external", "description" : " Tokenizes the given node and all of its descendants.\n", "summary" : "

Tokenizes the given node and all of its descendants.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "node", "type" : "node()", "occurence" : null, "description" : "
The node to tokenize. The node's default language is assumed to be the one returned by ft:current-lang().
" } ], "returns" : { "type" : "object()*", "description" : "a (possibly empty) sequence of tokens." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported." ] }, { "arity" : 2, "name" : "tokenize-node", "qname" : "ft:tokenize-node", "signature" : "($node as node(), $lang as xs:language) as object()* external", "description" : " Tokenizes the given node and all of its decendants.\n", "summary" : "

Tokenizes the given node and all of its descendants.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "node", "type" : "node()", "occurence" : null, "description" : "
The node to tokenize.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The default language of $node.
" } ], "returns" : { "type" : "object()*", "description" : "a (possibly empty) sequence of tokens." }, "errors" : [ "err:FTST0009 if $lang is not supported." ] }, { "arity" : 2, "name" : "tokenize-nodes", "qname" : "ft:tokenize-nodes", "signature" : "($includes as node()+, $excludes as node()*) as object()* external", "description" : " Tokenizes the set of nodes comprising $includes (and all of its\n descendants) but excluding $excludes (and all of its\n descendants), if any.\n", "summary" : "

Tokenizes the set of nodes comprising $includes (and all of its\n descendants) but excluding $excludes (and all of its\n descendants), if any.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "includes", "type" : "node()", "occurence" : null, "description" : "
The set of nodes (and their descendants) to include. The default language is assumed to be the one returned by ft:current-lang().
" }, { "name" : "excludes", "type" : "node()", "occurence" : null, "description" : "
The set of nodes (and their descendants) to exclude.
" } ], "returns" : { "type" : "object()*", "description" : "a (possibly empty) sequence of tokens." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported." ] }, { "arity" : 3, "name" : "tokenize-nodes", "qname" : "ft:tokenize-nodes", "signature" : "($includes as node()+, $excludes as node()*, $lang as xs:language) as object()* external", "description" : " Tokenizes the set of nodes comprising $includes (and all of its\n descendants) but excluding $excludes (and all of its\n descendants), if any.\n", "summary" : "

Tokenizes the set of nodes comprising $includes (and all of its\n descendants) but excluding $excludes (and all of its\n descendants), if any.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "includes", "type" : "node()", "occurence" : null, "description" : "
The set of nodes (and their descendants) to include.
" }, { "name" : "excludes", "type" : "node()", "occurence" : null, "description" : "
The set of nodes (and their descendants) to exclude.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The default language for nodes.
" } ], "returns" : { "type" : "object()*", "description" : "a (possibly empty) sequence of tokens." }, "errors" : [ "err:FTST0009 if $lang is not supported." ] }, { "arity" : 1, "name" : "tokenize-string", "qname" : "ft:tokenize-string", "signature" : "($string as xs:string) as xs:string* external", "description" : " Tokenizes the given string.\n", "summary" : "

Tokenizes the given string.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "string", "type" : "xs:string", "occurence" : null, "description" : "
The string to tokenize. The string's language is assumed to be the one returned by ft:current-lang().
" } ], "returns" : { "type" : "xs:string*", "description" : "a (possibly empty) sequence of tokens." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported." ] }, { "arity" : 2, "name" : "tokenize-string", "qname" : "ft:tokenize-string", "signature" : "($string as xs:string, $lang as xs:language) as xs:string* external", "description" : " Tokenizes the given string.\n", "summary" : "

Tokenizes the given string.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "string", "type" : "xs:string", "occurence" : null, "description" : "
The string to tokenize.
" }, { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of $string.
" } ], "returns" : { "type" : "xs:string*", "description" : "a (possibly empty) sequence of tokens." }, "errors" : [ "err:FTST0009 if $lang is not supported." ] }, { "arity" : 0, "name" : "tokenizer-properties", "qname" : "ft:tokenizer-properties", "signature" : "() as object() external", "description" : " Gets properties of the tokenizer for the\n language\n returned by ft:current-lang().\n", "summary" : "

Gets properties of the tokenizer for the\n language\n returned by ft:current-lang().

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ ], "returns" : { "type" : "object()", "description" : "said properties." }, "errors" : [ "err:FTST0009 if ft:current-lang() is not supported." ] }, { "arity" : 1, "name" : "tokenizer-properties", "qname" : "ft:tokenizer-properties", "signature" : "($lang as xs:language) as object() external", "description" : " Gets properties of the tokenizer for the given\n language.\n", "summary" : "

Gets properties of the tokenizer for the given\n language.

", "annotation_str" : "", "annotations" : [ ], "updating" : false, "parameters" : [ { "name" : "lang", "type" : "xs:language", "occurence" : null, "description" : "
The language of the tokenizer to get the properties of.
" } ], "returns" : { "type" : "object()", "description" : "said properties." }, "errors" : [ "err:FTST0009 if $lang is not supported. tokenization specifically." ] } ], "variables" : [ { "name" : "ft:LANG-DA", "type" : "xs:language", "description" : " Predeclared constant for the Danish\n xs:language.\n" }, { "name" : "ft:LANG-DE", "type" : "xs:language", "description" : " Predeclared constant for the German\n xs:language.\n" }, { "name" : "ft:LANG-EN", "type" : "xs:language", "description" : " Predeclared constant for the English\n xs:language.\n" }, { "name" : "ft:LANG-ES", "type" : "xs:language", "description" : " Predeclared constant for the Spanish\n xs:language.\n" }, { "name" : "ft:LANG-FI", "type" : "xs:language", "description" : " Predeclared constant for the Finnish\n xs:language.\n" }, { "name" : "ft:LANG-FR", "type" : "xs:language", "description" : " Predeclared constant for the French\n xs:language.\n" }, { "name" : "ft:LANG-HU", "type" : "xs:language", "description" : " Predeclared constant for the Hungarian\n xs:language.\n" }, { "name" : "ft:LANG-IT", "type" : "xs:language", "description" : " Predeclared constant for the Italian\n xs:language.\n" }, { "name" : "ft:LANG-NL", "type" : "xs:language", "description" : " Predeclared constant for the Dutch\n xs:language.\n" }, { "name" : "ft:LANG-NO", "type" : "xs:language", "description" : " Predeclared constant for the Norwegian\n xs:language.\n" }, { "name" : "ft:LANG-PT", "type" : "xs:language", "description" : " Predeclared constant for the Portuguese\n xs:language.\n" }, { "name" : "ft:LANG-RO", "type" : "xs:language", "description" : " Predeclared constant for the Romanian\n xs:language.\n" }, { "name" : "ft:LANG-RU", "type" : "xs:language", "description" : " Predeclared constant for the Russian\n xs:language.\n" }, { "name" : "ft:LANG-SV", "type" : "xs:language", "description" : " Predeclared constant for the Swedish\n xs:language.\n" }, { "name" : "ft:LANG-TR", "type" : "xs:language", 
"description" : " Predeclared constant for the Turkish\n xs:language.\n" } ] }