How to use the pTopicGivenTerm method in org.apache.mahout.clustering.lda.cvb.TopicModel

Best Java code snippets using org.apache.mahout.clustering.lda.cvb.TopicModel.pTopicGivenTerm (Showing top 3 results out of 315)

public void trainDocTopicModel(Vector original, Vector topics, Matrix docTopicModel) {
 // first calculate p(topic|term,document) for all terms in original, and all topics,
 // using p(term|topic) and p(topic|doc)
 pTopicGivenTerm(original, topics, docTopicModel);
 normalizeByTopic(docTopicModel);
 // now multiply, term-by-term, by the document, to get the weighted distribution of
 // term-topic pairs from this document.
 for (Element e : original.nonZeroes()) {
  for (int x = 0; x < numTopics; x++) {
   Vector docTopicModelRow = docTopicModel.viewRow(x);
   docTopicModelRow.setQuick(e.index(), docTopicModelRow.getQuick(e.index()) * e.get());
  }
 }
 // now recalculate p(topic|doc) by summing contributions from all of pTopicGivenTerm
 topics.assign(0.0);
 for (int x = 0; x < numTopics; x++) {
  topics.set(x, docTopicModel.viewRow(x).norm(1));
 }
 // now renormalize so that sum_x(p(x|doc)) = 1
 topics.assign(Functions.mult(1 / topics.norm(1)));
}
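
The steps after the pTopicGivenTerm call (weight by the document's term counts, sum per topic, renormalize) can be sketched without Mahout using plain arrays. This is only an illustrative sketch, not Mahout's code: the class DocTopicUpdateSketch and the method updateDocTopics are hypothetical names, and it assumes the input matrix has already been normalized so that each term's values sum to 1 across topics. Unlike the snippet above, it does not modify its input in place.

```java
// Hypothetical, Mahout-free sketch of the last steps of trainDocTopicModel.
class DocTopicUpdateSketch {

    /**
     * docTopicModel[x][a] is assumed to hold p(topic x | term a, document),
     * already normalized over topics x for every term a.
     * termCounts[a] is the weight of term a in the document.
     * Assumes at least one term has a non-zero weight.
     */
    static double[] updateDocTopics(double[][] docTopicModel, double[] termCounts) {
        int numTopics = docTopicModel.length;
        double[] topics = new double[numTopics];
        double total = 0.0;
        for (int x = 0; x < numTopics; x++) {
            // weight each term's contribution by its count, then take the L1 norm of the row
            double rowSum = 0.0;
            for (int a = 0; a < termCounts.length; a++) {
                rowSum += Math.abs(docTopicModel[x][a] * termCounts[a]);
            }
            topics[x] = rowSum;
            total += rowSum;
        }
        // renormalize so that the topic weights sum to 1
        for (int x = 0; x < numTopics; x++) {
            topics[x] /= total;
        }
        return topics;
    }

    public static void main(String[] args) {
        double[][] model = {{0.25, 0.5}, {0.75, 0.5}};
        double[] counts = {2.0, 1.0};
        System.out.println(java.util.Arrays.toString(updateDocTopics(model, counts)));
    }
}
```

Summing each weighted row and dividing by the grand total corresponds to the norm(1) calls in the snippet, since every entry is non-negative.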

Javadoc

Computes p(topic x | term a, document i) distributions given input document i. pTGT[x][a] is the (un-normalized) p(x|a,i), or if docTopics is null, p(a|x) (also un-normalized).
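
To make the Javadoc concrete, here is a minimal sketch of the quantity it describes, written with plain arrays instead of Mahout's Vector and Matrix. It is an assumption-laden illustration rather than the library's implementation: the class TopicGivenTermSketch and both method names are hypothetical, the smoothing terms a real model would apply are omitted, and the un-normalized p(x|a,i) is taken to be simply p(a|x) * p(x|i). The second method assumes that normalizing "by topic" means rescaling each term's values so they sum to 1 across topics.

```java
// Hypothetical, Mahout-free sketch of the values described in the Javadoc.
class TopicGivenTermSketch {

    /**
     * termGivenTopic[x][a] = p(term a | topic x).
     * docTopics[x] = p(topic x | document i), or null to fall back to p(a|x).
     * termIndices lists the terms with non-zero weight in the document.
     * Returns pTGT, where pTGT[x][a] is the un-normalized p(x|a,i).
     */
    static double[][] pTopicGivenTerm(double[][] termGivenTopic, double[] docTopics, int[] termIndices) {
        int numTopics = termGivenTopic.length;
        int numTerms = termGivenTopic[0].length;
        double[][] pTGT = new double[numTopics][numTerms];
        for (int x = 0; x < numTopics; x++) {
            // with no document-topic distribution, every topic gets weight 1.0
            double topicWeight = docTopics == null ? 1.0 : docTopics[x];
            for (int a : termIndices) {
                pTGT[x][a] = termGivenTopic[x][a] * topicWeight;
            }
        }
        return pTGT;
    }

    /** Rescales each listed term's column so its values sum to 1 across topics. */
    static void normalizeOverTopics(double[][] pTGT, int[] termIndices) {
        for (int a : termIndices) {
            double sum = 0.0;
            for (double[] row : pTGT) {
                sum += row[a];
            }
            for (double[] row : pTGT) {
                row[a] /= sum;
            }
        }
    }

    public static void main(String[] args) {
        double[][] termGivenTopic = {{0.5, 0.5}, {0.9, 0.1}};
        double[] docTopics = {0.2, 0.8};
        int[] terms = {0, 1};
        double[][] pTGT = pTopicGivenTerm(termGivenTopic, docTopics, terms);
        normalizeOverTopics(pTGT, terms);
        System.out.println(java.util.Arrays.deepToString(pTGT));
    }
}
```

Passing null for docTopics leaves pTGT[x][a] equal to p(a|x), which matches the fallback described in the Javadoc.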
