How to use
EnglishMorphAnalyzer
in
edu.emory.mathcs.nlp.component.morph.english

Best Java code snippets using edu.emory.mathcs.nlp.component.morph.english.EnglishMorphAnalyzer (Showing top 8 results out of 315)

public MorphologicalAnalyzer(Language language)
{
  analyzer = new EnglishMorphAnalyzer();
}

@Override
public String lemmatize(String simplifiedWordForm, String pos)
{
  String lemma = StringUtils.toLowerCase(simplifiedWordForm), t;
  
  if ((t = getAbbreviation(lemma, pos)) != null || (t = getBaseFormFromInflection(lemma, pos)) != null)
    lemma = t;
  
  if      (isCardinal(lemma))	return MetaConst.CARDINAL;
  else if (isOrdinal (lemma))	return MetaConst.ORDINAL;
  
  return lemma;
}

/** Constructs an English morphological analyzer from the dictionary in resource. */
public EnglishMorphAnalyzer()
{
  Element inflection = XMLUtils.getDocumentElement(IOUtils.getInputStreamsFromResource(INFLECTION_SUFFIX));
  Element derivationN2V = XMLUtils.getDocumentElement(IOUtils.getInputStreamsFromResource(DERIVATION_SUFFIX_N2V));
  
  try
  {
    inf_verb      = getInflectionRules(inflection, VERB     , VERB_POS);
    inf_noun      = getInflectionRules(inflection, NOUN     , NOUN_POS);
    inf_adjective = getInflectionRules(inflection, ADJECTIVE, ADJECTIVE_POS);
    inf_adverb    = getInflectionRules(inflection, ADVERB   , ADVERB_POS);
    
    der_n2v = getDerivationalRules(derivationN2V, NOUN);
    
    base_cardinal     = DSUtils.createStringHashSet(IOUtils.getInputStreamsFromResource(CARDINAL_BASE));
    base_ordinal      = DSUtils.createStringHashSet(IOUtils.getInputStreamsFromResource(ORDINAL_BASE));
    rule_abbreviation = getAbbreviationMap(IOUtils.getInputStreamsFromResource(ABBREVIATOIN_RULE));
  }
  catch (IOException e) {e.printStackTrace();}
}

/** Called by {@link #EnglishLemmatizer()}. */
private EnglishInflection getInflectionRules(Element eInflection, String type, String basePOS) throws IOException
{
  Element     eAffixes        = XMLUtils.getFirstElementByTagName(eInflection, type);
  InputStream baseStream      = IOUtils.getInputStreamsFromResource(ROOT + type + EXT_BASE);
  InputStream exceptionStream = IOUtils.getInputStreamsFromResource(ROOT + type + EXT_EXCEPTION);
  
  return getInflection(baseStream, exceptionStream, eAffixes, basePOS);
}

/** Constructs an English morphological analyzer from the dictionary in resource. */
public EnglishMorphAnalyzer()
{
  Element inflection = XMLUtils.getDocumentElement(IOUtils.getInputStreamsFromResource(INFLECTION_SUFFIX));
  Element derivationN2V = XMLUtils.getDocumentElement(IOUtils.getInputStreamsFromResource(DERIVATION_SUFFIX_N2V));
  
  try
  {
    inf_verb      = getInflectionRules(inflection, VERB     , VERB_POS);
    inf_noun      = getInflectionRules(inflection, NOUN     , NOUN_POS);
    inf_adjective = getInflectionRules(inflection, ADJECTIVE, ADJECTIVE_POS);
    inf_adverb    = getInflectionRules(inflection, ADVERB   , ADVERB_POS);
    
    der_n2v = getDerivationalRules(derivationN2V, NOUN);
    
    base_cardinal     = DSUtils.createStringHashSet(IOUtils.getInputStreamsFromResource(CARDINAL_BASE));
    base_ordinal      = DSUtils.createStringHashSet(IOUtils.getInputStreamsFromResource(ORDINAL_BASE));
    rule_abbreviation = getAbbreviationMap(IOUtils.getInputStreamsFromResource(ABBREVIATOIN_RULE));
  }
  catch (IOException e) {e.printStackTrace();}
}

/** Called by {@link #EnglishLemmatizer()}. */
private EnglishInflection getInflectionRules(Element eInflection, String type, String basePOS) throws IOException
{
  Element     eAffixes        = XMLUtils.getFirstElementByTagName(eInflection, type);
  InputStream baseStream      = IOUtils.getInputStreamsFromResource(ROOT + type + EXT_BASE);
  InputStream exceptionStream = IOUtils.getInputStreamsFromResource(ROOT + type + EXT_EXCEPTION);
  
  return getInflection(baseStream, exceptionStream, eAffixes, basePOS);
}

@Override
public String lemmatize(String simplifiedWordForm, String pos)
{
  String lemma = StringUtils.toLowerCase(simplifiedWordForm), t;
  
  if ((t = getAbbreviation(lemma, pos)) != null || (t = getBaseFormFromInflection(lemma, pos)) != null)
    lemma = t;
  
  if      (isCardinal(lemma))	return MetaConst.CARDINAL;
  else if (isOrdinal (lemma))	return MetaConst.ORDINAL;
  
  return lemma;
}

public MorphologicalAnalyzer(Language language)
{
  analyzer = new EnglishMorphAnalyzer();
}

Most used methods

<init>
Constructs an English morphological analyzer from the dictionary in resource.
getAbbreviation
Called by #analyze(DEPNode).
getAbbreviationMap
getBaseFormFromInflection
getDerivationalRules
Called by #EnglishMPAnalyzer(ZipFile).
getInflection
getInflectionRules
Called by #EnglishLemmatizer().
isCardinal
isOrdinal

Popular in Java

Reading from database using SQL prepared statement
startActivity (Activity)
orElseThrow (Optional)
Return the contained value, if present, otherwise throw an exception to be created by the provided s
onCreateOptionsMenu (Activity)
Charset (java.nio.charset)
A charset is a named mapping between Unicode characters and byte sequences. Every Charset can decode
Queue (java.util)
A collection designed for holding elements prior to processing. Besides basic java.util.Collection o
ServletException (javax.servlet)
Defines a general exception a servlet can throw when it encounters difficulty.
GridLayout (java.awt)
The GridLayout class is a layout manager that lays out a container's components in a rectangular gri
Loader (org.hibernate.loader)
Abstract superclass of object loading (and querying) strategies. This class implements useful common
Location (org.springframework.beans.factory.parsing)
Class that models an arbitrary location in a Resource.Typically used to track the location of proble
Top plugins for Android Studio

How to useEnglishMorphAnalyzer in edu.emory.mathcs.nlp.component.morph.english

Best Java code snippets using edu.emory.mathcs.nlp.component.morph.english.EnglishMorphAnalyzer (Showing top 8 results out of 315)

How to use
EnglishMorphAnalyzer
in
edu.emory.mathcs.nlp.component.morph.english