How to use the getProbabilities method in com.optimaize.langdetect.NgramFrequencyData

Best Java code snippets using com.optimaize.langdetect.NgramFrequencyData.getProbabilities (Showing top 2 results out of 315)

/**
 * Updates the language probabilities with an n-gram string (N=1,2,3).
 * @param count 1-n: how often the gram occurred.
 * @return false if no probabilities are available for the n-gram, true otherwise.
 */
private boolean updateLangProb(@NotNull double[] prob, @NotNull String ngram, int count, double alpha) {
  // Per-language probabilities for this n-gram; may be null.
  double[] langProbMap = ngramFrequencyData.getProbabilities(ngram);
  if (langProbMap == null) {
    return false;
  }
  if (logger.isTraceEnabled()) {
    logger.trace(ngram + "(" + Util.unicodeEncode(ngram) + "):" + Util.wordProbToString(langProbMap, ngramFrequencyData.getLanguageList()));
  }
  // Base weight, scaled for n-grams that start or end with a space.
  double weight = alpha / BASE_FREQ;
  if (ngram.length() > 1) {
    if (prefixFactor != 1.0 && ngram.charAt(0) == ' ') {
      weight *= prefixFactor;
    } else if (suffixFactor != 1.0 && ngram.charAt(ngram.length() - 1) == ' ') {
      weight *= suffixFactor;
    }
  }
  // Multiply each language's score once per occurrence of the n-gram.
  for (int i = 0; i < prob.length; ++i) {
    for (int amount = 0; amount < count; amount++) {
      prob[i] *= (weight + langProbMap[i]);
    }
  }
  return true;
}
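
The update step in the snippet above can be tried in isolation. The following is a minimal sketch, not the library's code: the class `LangProbUpdateSketch` and its `update` method are hypothetical names, the weight is passed in directly instead of being derived from `alpha`, `BASE_FREQ`, and the prefix/suffix factors, and the probability arrays are made-up values.

```java
import java.util.Arrays;

/** Hypothetical stand-alone sketch; not part of the langdetect library. */
class LangProbUpdateSketch {

    /**
     * Multiplies each language score by (weight + ngramProbs[i]),
     * once per occurrence of the n-gram.
     * Returns false and leaves prob untouched when ngramProbs is null.
     */
    static boolean update(double[] prob, double[] ngramProbs, int count, double weight) {
        if (ngramProbs == null) {
            return false;
        }
        for (int i = 0; i < prob.length; i++) {
            for (int n = 0; n < count; n++) {
                prob[i] *= (weight + ngramProbs[i]);
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Two languages with equal starting scores (made-up values).
        double[] prob = {0.5, 0.5};
        // The n-gram is assumed to be seen only in the first language.
        double[] ngramProbs = {0.25, 0.0};
        update(prob, ngramProbs, 2, 0.25);
        System.out.println(Arrays.toString(prob)); // prints [0.125, 0.03125]
    }
}
```

Because the weight is added to every entry, a language in which the n-gram never occurs is scaled down rather than set to zero.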

Javadoc

Don't modify this data structure! (Can't make array immutable...)
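
The Javadoc warning suggests that the returned array is shared internal state rather than a copy. A caller that needs to change the values can copy the array first. The sketch below illustrates this with a hypothetical stand-in class, `SharedArraySketch`; it is not the library class, and its table contents are made up.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a frequency table; not the library class. */
class SharedArraySketch {
    private final Map<String, double[]> table = new HashMap<>();

    SharedArraySketch() {
        table.put("ab", new double[] {0.75, 0.25}); // made-up values
    }

    /** Returns the internal array itself, or null for an unknown n-gram. */
    double[] getProbabilities(String ngram) {
        return table.get(ngram);
    }

    /** Caller-side copy that is safe to modify; passes null through. */
    static double[] copyOf(double[] shared) {
        return shared == null ? null : Arrays.copyOf(shared, shared.length);
    }

    public static void main(String[] args) {
        SharedArraySketch data = new SharedArraySketch();
        double[] copy = copyOf(data.getProbabilities("ab"));
        copy[0] = 0.0; // changes only the copy
        System.out.println(data.getProbabilities("ab")[0]); // prints 0.75
    }
}
```

Copying costs an allocation per lookup, so read-only callers such as the update loop can use the returned array directly.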
