Methodology

We extract every token whose pos_tag.vt value is in {"ptca", "ptcp"}, that is, active and passive participles.
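
The filter above can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes each token is a dict carrying a "pos_tag" dict, and the names PARTICIPLE_VT, is_participle and extract_participles are hypothetical.

```python
# Assumption: a token looks like {"text": ..., "pos_tag": {"vt": ..., ...}}.
PARTICIPLE_VT = {"ptca", "ptcp"}


def is_participle(pos_tag):
    """Return True when the tag's vt value marks a participle."""
    return pos_tag.get("vt") in PARTICIPLE_VT


def extract_participles(tokens):
    """Keep only the tokens whose pos_tag.vt is ptca or ptcp."""
    return [t for t in tokens if is_participle(t.get("pos_tag", {}))]
```

Tokens without a vt value, or without a pos_tag at all, are simply skipped.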

The stem (binyan) comes from pos_tag.vs. The usage class is derived from pdp, which is mapped to one of verbal, adjectival, or substantive.

Gender and number are taken from pos_tag.gn and pos_tag.nu when present.
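
A rough sketch of how these fields might be gathered into one record per participle. The pdp keys in USAGE_BY_PDP ("verb", "adjv", "subs") are an assumption about the tag set, as is reading pdp from the same pos_tag dict; the function name describe_participle is hypothetical.

```python
# Assumption: pdp takes the values "verb", "adjv" and "subs".
USAGE_BY_PDP = {
    "verb": "verbal",
    "adjv": "adjectival",
    "subs": "substantive",
}


def describe_participle(pos_tag):
    """Collect stem, usage class, gender and number from one pos_tag dict.

    Missing features come back as None rather than raising an error.
    """
    return {
        "stem": pos_tag.get("vs"),
        "usage": USAGE_BY_PDP.get(pos_tag.get("pdp")),
        "gender": pos_tag.get("gn"),
        "number": pos_tag.get("nu"),
    }
```

Returning None for absent values keeps the "when present" condition explicit in the output instead of silently dropping rows.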

The presence of an article, negation, preposition, or conjunction is detected from the tokens immediately preceding the participle within the same verse.
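
One possible reading of this step, sketched under stated assumptions: the verse is a list of token dicts, each token's part of speech is in pos_tag.sp with the values "art", "nega", "prep" and "conj", and the scan walks backwards from the participle until it meets any other kind of token. The source does not say how many preceding tokens are inspected, so the stopping rule here is a guess, and the names CONTEXT_SP and preceding_context are hypothetical.

```python
# Assumption: sp values "art", "nega", "prep", "conj" mark the four categories.
CONTEXT_SP = {
    "art": "article",
    "nega": "negation",
    "prep": "preposition",
    "conj": "conjunction",
}


def preceding_context(verse_tokens, index):
    """Flag which function words directly precede the token at `index`.

    Only tokens of the same verse are passed in, so the scan cannot
    cross a verse boundary. It stops at the first non-function word.
    """
    found = {label: False for label in CONTEXT_SP.values()}
    i = index - 1
    while i >= 0:
        sp = verse_tokens[i].get("pos_tag", {}).get("sp")
        label = CONTEXT_SP.get(sp)
        if label is None:
            break  # reached a content word; stop scanning
        found[label] = True
        i -= 1
    return found
```

A participle at the start of a verse yields all four flags as False, since there is nothing before it to inspect.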

The full dataset and the aggregates can be downloaded from the Data page.