Lingua::Interset::Tagset::HE::Conll - Driver for the Hebrew tagset.
version 2.047
use Lingua::Interset::Tagset::HE::Conll; my $driver = Lingua::Interset::Tagset::HE::Conll->new(); my $fs = $driver->decode("NN\tNN\tM|S");
or
use Lingua::Interset qw(decode); my $fs = decode('he::conll', "NN\tNN\tM|S");
Interset driver for the Hebrew tagset in CoNLL format. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT.
Tagset described in Yoav Goldberg: Automatic Syntactic Processing of Modern Hebrew Automatic Syntactic Processing of Modern Hebrew (2011), p. 32, http://www.cs.bgu.ac.il/~nlpproj/yoav-phd.pdf
TODO: try to use the official (but not as easy to process) resource: BGU Computational Linguistics Group. Hebrew morphological tagging guidelines. Technical report, Ben Gurion University of the Negev, 2008. http://www.cs.bgu.ac.il/~adlerm/tagging-guideline.pdf
Lingua::Interset, Lingua::Interset::Tagset, Lingua::Interset::Tagset::Conll, Lingua::Interset::FeatureStructure
Rudolf Rosa <rosa@ufal.mff.cuni.cz>
Dan Zeman <zeman@ufal.mff.cuni.cz>
This software is copyright (c) 2014 by Univerzita Karlova v Praze (Charles University in Prague).
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.