csli.util.nlp
Class Lemmatiser

java.lang.Object
  extended by csli.util.nlp.Lemmatiser
All Implemented Interfaces:
Serializable
Direct Known Subclasses:
DictionaryLemmatiser

public abstract class Lemmatiser
extends Object
implements Serializable

An abstract lemmatiser.

See Also:
Serialized Form

Constructor Summary
Lemmatiser()
           
 
Method Summary
 List<Lemma> getLemma(List<String> wordList)
          Produce a list of lemmas (pairs of root form & part-of-speech) from a list of word strings.
abstract  Lemma getLemma(String wordString)
          Produce a lemma (pair of root form & part-of-speech) from a word string.
abstract  String getWord(Lemma lemma)
          Generate a morphological word string from a lemma (pair of root form & part-of-speech).
 List<String> getWord(List<Lemma> lemmaList)
          Produce a list of morphological word strings from a list of lemmas (pairs of root form & part-of-speech).
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

Lemmatiser

public Lemmatiser()
Method Detail

getLemma

public abstract Lemma getLemma(String wordString)
Produce a lemma (pair of root form & part-of-speech) from a word string. Should return NULL on error.

Parameters:
wordString - the word string to lemmatise
Returns:
the corresponding lemma

getLemma

public List<Lemma> getLemma(List<String> wordList)
Produce a list of lemmas (pairs of root form & part-of-speech) from a list of word strings. Should return NULL on error.

Parameters:
wordList - the list of word strings to lemmatise
Returns:
the corresponding list of lemmas

getWord

public abstract String getWord(Lemma lemma)
Generate a morphological word string from a lemma (pair of root form & part-of-speech). Should return NULL on error.

Parameters:
lemma - the lemma from which to generate
Returns:
the corresponding word string

getWord

public List<String> getWord(List<Lemma> lemmaList)
Produce a list of morphological word strings from a list of lemmas (pairs of root form & part-of-speech). Should return NULL on error.

Parameters:
lemmaList - the list of lemmas from which to generate
Returns:
the corresponding list of word strings