|Zeug|Zeuge|Zeugen| |
(three elements) |
|Baum| |
(unique lemma) |
| |
(not in lexicon) |
> [ambiguity(alemma) > 3];
> [alemma contains "Zeuge"];
words which can be lemmatised as Zeuge
equivalent to [alemma = ".*\|Zeuge\|.*"]
(alemma not contains "Zeuge")
!(alemma contains "Zeuge")
> /region[np, a] :: a.np_f contains "quot";
> define macro find('$0=Tag $1=Property')
'<$0_f contains "$1"> []* </$0_f>';
> /find[np, brac];
> /find[advp, temp];
etc.
> [ (pos = "NN") & (agr matches ".*:Pl:.*") ];
nouns which are uniquely identified as plurals
/unify[agr, <label1>, <label2>, ...]
> a:[pos="ART"] b:[pos="ADJA"]? c:[pos="NN"]
:: /unify[agr, a,b,c] matches "Gen:.*";
(simple) NPs uniquely identified as genitive
> a:[pos="ART"] b:[pos="ADJA"]? c:[pos="NN"]
:: /unify[agr, a,b,c] contains "Dat:.:Sg:.*";
NPs which might be dative singular
> ... :: ambiguity(/unify[agr, a,b,c]) >= 1;
to check agreement within NP
> /region[np, a] :: a.np_agr matches "Dat:.:Pl:.*";
> <np_agr matches "Dat:.:Pl:.*"> []* </np_agr>;
/unify[]
expects features sets
in canonical format, with members sorted according to CWB's internal sort order;
this is usually ensured with the -m
option to cwb-s-encode
ambiguity()
, contains
and matches
are guaranteed
to work as long as the |
-separated set notation is used correctly and consistently
/unify[]
macro cannot be used unless the features
within each set are sorted in the canonical format. Only if an attribute is explicitly
declared as a feature set at indexing-time are the members of the sets sorted into the
canonical order.
|1:Zeuge|2:Zeug|3:Zeugen|