How to use
ShallowIdentityStateMapping
in
burlap.mdp.auxiliary.common

Best Java code snippets using burlap.mdp.auxiliary.common.ShallowIdentityStateMapping (Showing top 4 results out of 315)

/**
 * Initializes
 * @param domain the learning domain
 * @param gamma the discount factor
 * @param vfa the value function approximation to use
 */
public ApproximateQLearning(SADomain domain, double gamma, ParametricFunction.ParametricStateActionFunction vfa) {
  this(domain, gamma, vfa, new ShallowIdentityStateMapping());
}

/**
 * Initializes with a default 0.1 epsilon greedy policy/strategy
 * @param d the domain in which the agent will act
 * @param discount the discount factor
 * @param learningRate the learning rate
 * @param qInitizalizer the Q-value initialization method
 * @param hashFactory the state hashing factory
 */
public SGNaiveQLAgent(SGDomain d, double discount, double learningRate, QFunction qInitizalizer, HashableStateFactory hashFactory) {
  this.init(d);
  this.discount = discount;
  this.learningRate = new ConstantLR(learningRate);
  this.hashFactory = hashFactory;
  this.qInit = qInitizalizer;
  
  this.qMap = new HashMap<HashableState, List<QValue>>();
  stateRepresentations = new HashMap<HashableState, State>();
  this.policy = new EpsilonGreedy(this, 0.1);
  
  this.storedMapAbstraction = new ShallowIdentityStateMapping();
}

/**
 * Initializes with a default 0.1 epsilon greedy policy/strategy
 * @param d the domain in which the agent will act
 * @param discount the discount factor
 * @param learningRate the learning rate
 * @param defaultQ the default to which all Q-values will be initialized
 * @param hashFactory the state hashing factory
 */
public SGNaiveQLAgent(SGDomain d, double discount, double learningRate, double defaultQ, HashableStateFactory hashFactory) {
  this.init(d);
  this.discount = discount;
  this.learningRate = new ConstantLR(learningRate);
  this.hashFactory = hashFactory;
  this.qInit = new ConstantValueFunction(defaultQ);
  
  this.qMap = new HashMap<HashableState, List<QValue>>();
  stateRepresentations = new HashMap<HashableState, State>();
  this.policy = new EpsilonGreedy(this, 0.1);
  
  this.storedMapAbstraction = new ShallowIdentityStateMapping();
}

/**
 * Initializes with a default Q-value of 0 and a 0.1 epsilon greedy policy/strategy
 * @param d the domain in which the agent will act
 * @param discount the discount factor
 * @param learningRate the learning rate
 * @param hashFactory the state hashing factory
 */
public SGNaiveQLAgent(SGDomain d, double discount, double learningRate, HashableStateFactory hashFactory) {
  this.init(d);
  this.discount = discount;
  this.learningRate = new ConstantLR(learningRate);
  this.hashFactory = hashFactory;
  this.qInit = new ConstantValueFunction(0.);
  
  this.qMap = new HashMap<HashableState, List<QValue>>();
  stateRepresentations = new HashMap<HashableState, State>();
  this.policy = new EpsilonGreedy(this, 0.1);
  
  this.storedMapAbstraction = new ShallowIdentityStateMapping();
}

Javadoc

A StateAbstraction class the input state without copying it.

Most used methods

<init>

Popular in Java

Finding current android device location
getSupportFragmentManager (FragmentActivity)
getSharedPreferences (Context)
orElseThrow (Optional)
Return the contained value, if present, otherwise throw an exception to be created by the provided s
InputStream (java.io)
A readable source of bytes.Most clients will use input streams that read data from the file system (
MessageFormat (java.text)
Produces concatenated messages in language-neutral way. New code should probably use java.util.Forma
TreeMap (java.util)
Walk the nodes of the tree left-to-right or right-to-left. Note that in descending iterations, next
AtomicInteger (java.util.concurrent.atomic)
An int value that may be updated atomically. See the java.util.concurrent.atomic package specificati
HttpServletRequest (javax.servlet.http)
Extends the javax.servlet.ServletRequest interface to provide request information for HTTP servlets.
Scheduler (org.quartz)
This is the main interface of a Quartz Scheduler. A Scheduler maintains a registry of org.quartz.Job
From CI to AI: The AI layer in your organization

How to useShallowIdentityStateMapping in burlap.mdp.auxiliary.common

Best Java code snippets using burlap.mdp.auxiliary.common.ShallowIdentityStateMapping (Showing top 4 results out of 315)

How to use
ShallowIdentityStateMapping
in
burlap.mdp.auxiliary.common