Jeffrey Heinz

UCLA Linguistics Dept.
3125 Campbell Hall
Los Angeles CA 90095-1543
jheinz@humnet.ucla.edu


Software

There are six sets of software available here. They are all implemented in the OCaml programming language, so OCaml must be installed for them to compile and run. You can learn more about OCaml and download it from the OCaml homepage at http://caml.inria.fr/. Another option for downloading and installing is GODI, which organizes many libraries for OCaml. It is easy to use and I recommend it; go to http://godi.ocaml-programming.de/.
Name: Tools
Description: A basic set of tools that are useful for writing bigger programs.
Link: Tools.tar.gz
Requirements: OCaml 3.07+
Last Updated: November 27, 2006

Name: Feature
Description: Feature implements a feature system over a set of elements. Sequences of elements (e.g. words) can be converted to sequences of feature bundles and vice versa. Additional programs allow one to compute the natural classes, find the partitions by natural class, compute a measure of similarity based on natural class (Frisch, Pierrehumbert, and Broe 2004), and find the smallest natural class containing two elements (i.e. minimal generalization; Albright and Hayes 2002, 2003).
Link: Feature.tar.gz
Requirements: OCaml 3.07+ and the Tools library (above).
Last Updated: October 5, 2006

Name: Brent
Description: A program that implements the Dynamic Programming Algorithm for Probabilistic Utterance Segmentation (Brent 1999).
Link: Brent.tar.gz
Requirements: OCaml 3.07+ and the Tools library (above).
Last Updated: September 2006

Name: FSA
Description: FSA is a program which implements finite state acceptors. The implementation includes functions for determinizing, minimizing (i.e. finding a canonical acceptor), multiplying, adding, generating words, merging states to get new acceptors, etc. You can use the OCaml code directly to create your own programs, or you can run these commands from the command line, pipe them together, etc.
Link: FSA.tar.gz
Requirements: OCaml 3.07+ and the Tools library (above).
Last Updated: December 6, 2006

Name: Ngram
Description: Ngram is a program which implements n-gram grammars. It runs from the command line and finds the n-grams in a list of strings (written in a file) for any value of n. It also lets you compute a finite state representation of the n-gram grammar. Options allow you to customize delimiters, etc.
Link: ngram.tar.gz
Requirements: OCaml 3.07+, the Tools library, and the FSA library (above).
Last Updated: June 2, 2007

Name: Prec
Description: Prec is a program which implements precedence grammars. It runs from the command line and finds the precedence relations in a list of strings (written in a file). It also lets you compute a finite state representation of the precedence grammar. Options allow you to customize delimiters, etc.
Link: prec.tar.gz
Requirements: OCaml 3.07+, the Tools library, and the FSA library (above).
Last Updated: June 2, 2007
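
To give a rough idea of what the Ngram and Prec entries compute, here is a minimal OCaml sketch. It is not the code shipped in ngram.tar.gz or prec.tar.gz, and all function names in it are hypothetical. It assumes that each symbol is a single character, that no word-boundary delimiters are added, and that a precedence relation is an ordered pair (a, b) such that a occurs somewhere before b in a string, not necessarily adjacent to it.

```ocaml
(* Illustrative sketch only: NOT the code from ngram.tar.gz or prec.tar.gz.
   Assumptions: symbols are single characters, no boundary delimiters. *)

(* Remove adjacent duplicates from an already sorted list. *)
let rec uniq = function
  | x :: (y :: _ as rest) -> if x = y then uniq rest else x :: uniq rest
  | l -> l

(* Sort a list and drop duplicate elements. *)
let dedup xs = uniq (List.sort compare xs)

(* All contiguous substrings of length n in s, in left-to-right order. *)
let ngrams_of_string n s =
  let len = String.length s in
  let rec go i acc =
    if i + n > len then List.rev acc
    else go (i + 1) (String.sub s i n :: acc)
  in
  if n <= 0 then [] else go 0 []

(* All distinct pairs (a, b) such that a occurs somewhere before b in s.
   This definition of "precedence relation" is an assumption. *)
let precedence_of_string s =
  let len = String.length s in
  let acc = ref [] in
  for i = 0 to len - 2 do
    for j = i + 1 to len - 1 do
      acc := (s.[i], s.[j]) :: !acc
    done
  done;
  dedup !acc

(* The "grammar" of a word list is taken to be the set of items
   observed in at least one word. *)
let ngram_grammar n words =
  dedup (List.concat (List.map (ngrams_of_string n) words))

let precedence_grammar words =
  dedup (List.concat (List.map precedence_of_string words))

(* Example run over a two-word list. *)
let () =
  let words = ["abab"; "ba"] in
  List.iter print_endline (ngram_grammar 2 words);
  List.iter
    (fun (a, b) -> Printf.printf "%c < %c\n" a b)
    (precedence_grammar words)
```

The sketch uses only functions from the standard List, String, and Printf modules. A string is accepted by such a grammar when every n-gram (or precedence pair) it contains is in the set, which is what makes a finite state representation possible.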