       |
|
Mats Rooth:
- English 97 is a Lopar lexicalized statistical language model trained on circa 50 million words of Wall Street Journal data. The grammar is as in Carroll and Rooth 1998, except that the the chunk trigram robustness rules have been replaced with bigram rules, and the lexicon is lemmatized.
- BankBaseline is a lexicalized Lopar model based on Penn Treebank II sections 0-15. Andrew Jonas created the lemma mapping in the lexicon.
- PF Linear Expectation is an implementation in Java of a generalization of the governor algorithm of Schmid and Rooth (2001). Expected values for markup functions of a certain linear form can be computed; governor and depth-of-embedding functions are included.
John Hale:
- The CYK Parser for Minimalist Grammars described in Appendix A of my dissertation implements a design presented in Henk Harkema's 2001 UCLA dissertation. The implementation also calculates the entropy of the suffix language that grammatically completes a given prefix string, under a context-free probability model of MG derivations.
Marisa Boston:
- DepParse Version 2.2. An incremental dependency parser developed for human sentence processing modeling.
- DepPrint Version 1.1. A dependency grammar visualizer that outputs SVG files.
- NegraToConfig. A conversion tool from Negra format to Nivre parser configurations.
Effi Georgala:
- Computational lexicon of Modern Greek annotated with part of speech and lemma
- Newspaper Corpus of Modern Greek
Please contact Effi if you are interested in either one of these resources.
|
 |
|