org.deeplearning4j.nn.conf.NeuralNetConfiguration$Builder.updater Java code examples

.activation(Activation.LEAKYRELU)
.weightInit(WeightInit.XAVIER)
.updater(new Nesterovs(0.1))// To configure: .updater(Nesterovs.builder().momentum(0.9).build())
.l2(1e-4)
.list()

.l2(0.001)
.weightInit(WeightInit.XAVIER)
.updater(new RmsProp(0.1))
.list()
.layer(0, new LSTM.Builder().nIn(CHAR_TO_INT.size()).nOut(lstmLayerSize).activation(Activation.TANH).build())

.updater(new Nesterovs(0.02))// To configure: .updater(Nesterovs.builder().momentum(0.9).build())
.l2(1e-4)
.list()

/** Returns the network configuration: 2 hidden DenseLayers of size numHiddenNodes (100).
 */
private static MultiLayerConfiguration getDeepDenseLayerNetworkConfiguration() {
  final int numHiddenNodes = 100;
  return new NeuralNetConfiguration.Builder()
      .seed(seed)
      .optimizationAlgo(OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT)
      .weightInit(WeightInit.XAVIER)
      .updater(new Nesterovs(learningRate, 0.9))
      .list()
      .layer(0, new DenseLayer.Builder().nIn(numInputs).nOut(numHiddenNodes)
          .activation(Activation.RELU).build())
      .layer(1, new DenseLayer.Builder().nIn(numHiddenNodes).nOut(numHiddenNodes)
          .activation(Activation.RELU).build())
      .layer(2, new OutputLayer.Builder(LossFunctions.LossFunction.MSE)
          .activation(Activation.IDENTITY)
          .nIn(numHiddenNodes).nOut(numOutputs).build())
      .pretrain(false).backprop(true).build();
}

public MultiLayerConfiguration conf() {
  MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
          .optimizationAlgo(OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT).iterations(1)
          .learningRate(0.01).seed(12345).regularization(true).l2(0.001).weightInit(WeightInit.XAVIER)
          .updater(new RmsProp()).list()
          .layer(0, new GravesLSTM.Builder().nIn(inputShape[1]).nOut(256).activation(Activation.TANH)
                  .build())
          .layer(1, new GravesLSTM.Builder().nOut(256).activation(Activation.TANH).build())
          .layer(2, new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
                  .activation(Activation.SOFTMAX) //MCXENT + softmax for classification
                  .nOut(totalUniqueCharacters).build())
          .backpropType(BackpropType.TruncatedBPTT).tBPTTForwardLength(50).tBPTTBackwardLength(50)
          .pretrain(false).backprop(true).build();
  return conf;
}

/**
 * Gradient updater. For example, Updater.SGD for standard stochastic gradient descent,
 * Updater.NESTEROVS for Nesterov momentum, Updater.RMSPROP for RMSProp, etc.<br>
 * Note: default hyperparameters are used with this method. Use {@link #updater(IUpdater)} to configure
 * the updater-specific hyperparameters.
 *
 * @see Updater
 */
public Builder updater(Updater updater) {
  this.updater = updater;
  return updater(updater.getIUpdaterWithDefaultConfig());
}
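The delegation in the method above (the enum overload forwarding to the object overload with a default configuration) can be sketched in plain Java. This is only an illustration of the pattern: `UpdaterConfig`, `UpdaterKind`, `SketchBuilder`, and the default rates are hypothetical names and values, not DL4J classes or DL4J defaults.

```java
/** Hypothetical stand-in for an updater configuration; this is not DL4J's IUpdater. */
final class UpdaterConfig {
    final String name;
    final double learningRate;

    UpdaterConfig(String name, double learningRate) {
        this.name = name;
        this.learningRate = learningRate;
    }
}

/** Hypothetical enum: each constant can produce a default-configured UpdaterConfig. */
enum UpdaterKind {
    SGD(0.1), NESTEROVS(0.1), RMSPROP(0.1); // default rates are illustrative only

    private final double defaultLearningRate;

    UpdaterKind(double defaultLearningRate) {
        this.defaultLearningRate = defaultLearningRate;
    }

    UpdaterConfig withDefaultConfig() {
        return new UpdaterConfig(name(), defaultLearningRate);
    }
}

/** Hypothetical builder with two overloads of updater(...). */
final class SketchBuilder {
    private UpdaterConfig updater;

    /** Convenience overload: the caller picks a kind and gets its default hyperparameters. */
    SketchBuilder updater(UpdaterKind kind) {
        return updater(kind.withDefaultConfig());
    }

    /** Full overload: the caller supplies an already configured object. */
    SketchBuilder updater(UpdaterConfig config) {
        this.updater = config;
        return this;
    }

    UpdaterConfig configuredUpdater() {
        return updater;
    }
}
```

With this shape, `new SketchBuilder().updater(UpdaterKind.NESTEROVS)` ends up with the default rate, while `new SketchBuilder().updater(new UpdaterConfig("NESTEROVS", 0.02))` keeps the caller's value. That is the same trade-off the Javadoc describes between the enum and object overloads.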

.l2(0.0005)
.weightInit(WeightInit.XAVIER)
.updater(new Nesterovs.Builder().learningRate(.01).build())
.biasUpdater(new Nesterovs.Builder().learningRate(0.02).build())
.list()

.seed(12345)
.l2(0.001) //l2 regularization on all layers
.updater(new AdaGrad.Builder().learningRate(0.04).build())
.list()
.layer(0, new ConvolutionLayer.Builder(10, 10)

.l2(0.001)
.weightInit(WeightInit.XAVIER)
.updater(new RmsProp.Builder().learningRate(0.1).build())
.list()
.layer(0, new LSTM.Builder().nIn(iter.inputColumns()).nOut(lstmLayerSize)

.updater(new Adam.Builder().learningRate(2e-2).build())
.l2(1e-5)
.weightInit(WeightInit.XAVIER)

.convolutionMode(ConvolutionMode.Same)
.l2(1e-4)
.updater(new AMSGrad(lrSchedule))
.weightInit(WeightInit.RELU)
.graphBuilder()

.weightInit(WeightInit.RELU)
.activation(Activation.LEAKYRELU)
.updater(Updater.ADADELTA)
.convolutionMode(ConvolutionMode.Same)
.regularization(true).dropOut(0.2)

public static MultiLayerNetwork lenetModel() {
  /**
   * Revised LeNet model approach developed by ramgo2; achieves slightly above random accuracy.
   * Reference: https://gist.github.com/ramgo2/833f12e92359a2da9e5c2fb6333351c5
   **/
  MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
      .seed(seed)
      .l2(0.005) // tried 0.0001, 0.0005
      .activation(Activation.RELU)
      .weightInit(WeightInit.XAVIER)
      .optimizationAlgo(OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT)
      .updater(new Nesterovs(0.0001,0.9))
      .list()
      .layer(0, new ConvolutionLayer.Builder(new int[]{5, 5}, new int[]{1, 1}, new int[]{0, 0}).name("cnn1")
          .nIn(channels).nOut(50).biasInit(0).build())
      .layer(1, new SubsamplingLayer.Builder(new int[]{2,2}, new int[]{2,2}).name("maxpool1").build())
      .layer(2, new ConvolutionLayer.Builder(new int[]{5,5}, new int[]{5, 5}, new int[]{1, 1}).name("cnn2")
          .nOut(100).biasInit(0).build())
      .layer(3, new SubsamplingLayer.Builder(new int[]{2,2}, new int[]{2,2}).name("maxpool2").build())
      .layer(4, new DenseLayer.Builder().nOut(500).build())
      .layer(5, new OutputLayer.Builder(LossFunctions.LossFunction.NEGATIVELOGLIKELIHOOD)
          .nOut(4)
          .activation(Activation.SOFTMAX)
          .build())
      .backprop(true).pretrain(false)
      .setInputType(InputType.convolutional(height, width, channels))
      .build();
  return new MultiLayerNetwork(conf);
}

.seed(6)
.iterations(1)
.updater(Updater.ADAM)
.learningRate(learningRate)
.weightInit(WeightInit.XAVIER)

MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
    .iterations(1)
    .updater(Updater.NESTEROVS)
    .learningRate(learningRate)
    .weightInit(WeightInit.XAVIER_UNIFORM)

  public static ComputationGraphConfiguration getConf() {
    ComputationGraphConfiguration.GraphBuilder builder = new NeuralNetConfiguration.Builder()
        .seed(12345)
        .updater(new Adam(0.01))
        .weightInit(WeightInit.RELU)
        .graphBuilder()
        .addInputs("in");

    String[] poolNames = new String[ngramFilters.length];
    int i = 0;
    for (int ngram : ngramFilters) {
      String filterName = String.format("ngram%d", ngram);
      poolNames[i] = String.format("pool%d", ngram);
      builder = builder.addLayer(filterName, new Convolution1DLayer.Builder()
          .nOut(numFilters)
          .kernelSize(ngram)
          .activation(Activation.RELU)
          .build(), "in")
          .addLayer(poolNames[i], new GlobalPoolingLayer.Builder(PoolingType.MAX).build(), filterName);
      i++;
    }
    return builder.addVertex("concat", new MergeVertex(), poolNames)
        .addLayer("predict", new DenseLayer.Builder().nOut(numClasses).dropOut(dropoutRetain)
            .activation(Activation.SOFTMAX).build(), "concat")
        .addLayer("loss", new LossLayer.Builder(LossFunctions.LossFunction.MCXENT).build(), "predict")
        .setOutputs("loss")
        .setInputTypes(InputType.recurrent(W2V_VECTOR_SIZE, 1000))
        .build();
  }
}

public static MultiLayerConfiguration lenetModelConf() {
  MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
      .seed(seed)
      .l2(0.005)
      .activation(Activation.RELU)
      .weightInit(WeightInit.XAVIER)
      .optimizationAlgo(OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT)
      .updater(new Nesterovs(0.0001, 0.9))
      .list()
      .layer(0, new ConvolutionLayer.Builder(new int[]{5, 5}, new int[]{1, 1}, new int[]{0, 0}).name("cnn1")
          .nIn(channels).nOut(50).biasInit(0).build())
      .layer(1, new SubsamplingLayer.Builder(new int[]{2,2}, new int[]{2,2}).name("maxpool1").build())
      .layer(2, new ConvolutionLayer.Builder(new int[]{5,5}, new int[]{5, 5}, new int[]{1, 1}).name("cnn2")
          .nOut(100).biasInit(0).build())
      .layer(3, new SubsamplingLayer.Builder(new int[]{2,2}, new int[]{2,2}).name("maxpool2").build())
      .layer(4, new DenseLayer.Builder().nOut(500).build())
      .layer(5, new OutputLayer.Builder(LossFunctions.LossFunction.NEGATIVELOGLIKELIHOOD)
          .nOut(4)
          .activation(Activation.SOFTMAX)
          .build())
      .backprop(true).pretrain(false)
      .setInputType(InputType.convolutional(height, width, channels))
      .build();
  return conf;
}

  private static MultiLayerConfiguration getConfiguration(){
    int lstmLayerSize = 200;   //Number of units in each LSTM layer
    int tbpttLength = 50;      //Length for truncated backpropagation through time, i.e., do parameter updates every 50 characters

    Map<Character, Integer> CHAR_TO_INT = SparkLSTMCharacterExample.getCharToInt();
    int nIn = CHAR_TO_INT.size();
    int nOut = CHAR_TO_INT.size();

    //Set up network configuration:
    MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
      .updater(new Nesterovs(0.1))
      .seed(12345)
      .l2(0.001)
      .weightInit(WeightInit.XAVIER)
      .list()
      .layer(0, new LSTM.Builder().nIn(nIn).nOut(lstmLayerSize).activation(Activation.TANH).build())
      .layer(1, new LSTM.Builder().nIn(lstmLayerSize).nOut(lstmLayerSize).activation(Activation.TANH).build())
      .layer(2, new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT).activation(Activation.SOFTMAX)        //MCXENT + softmax for classification
        .nIn(lstmLayerSize).nOut(nOut).build())
      .backpropType(BackpropType.TruncatedBPTT).tBPTTForwardLength(tbpttLength).tBPTTBackwardLength(tbpttLength)
      .pretrain(false).backprop(true)
      .build();

    return conf;
  }
}

Javadoc

Gradient updater. For example, Updater.SGD for standard stochastic gradient descent, Updater.NESTEROVS for Nesterov momentum, Updater.RMSPROP for RMSProp, etc.
Note: default hyperparameters are used with this method. Use #updater(IUpdater) to configure the updater-specific hyperparameters.
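To make the updater names above concrete, here is a minimal plain-Java sketch of textbook single-parameter update rules for SGD, Nesterov momentum, and RMSProp. It is not DL4J's implementation. The class name `UpdateRules`, the method names, and the specific formulations (one commonly used rewrite of Nesterov momentum, and RMSProp with epsilon added outside the square root) are assumptions chosen for illustration.

```java
/** Illustrative scalar update rules; hypothetical helper, not part of DL4J. */
final class UpdateRules {

    /** Plain SGD: w <- w - lr * g. */
    static double sgd(double w, double g, double lr) {
        return w - lr * g;
    }

    /**
     * Nesterov momentum in a commonly used rewritten form:
     *   vNew = mu * v - lr * g
     *   wNew = w - mu * v + (1 + mu) * vNew
     * Returns {wNew, vNew}.
     */
    static double[] nesterov(double w, double v, double g, double lr, double mu) {
        double vNew = mu * v - lr * g;
        double wNew = w - mu * v + (1.0 + mu) * vNew;
        return new double[] {wNew, vNew};
    }

    /**
     * RMSProp: keep a decaying average of squared gradients and divide the step by its root.
     *   cacheNew = decay * cache + (1 - decay) * g * g
     *   wNew     = w - lr * g / (sqrt(cacheNew) + eps)
     * Returns {wNew, cacheNew}.
     */
    static double[] rmsProp(double w, double cache, double g,
                            double lr, double decay, double eps) {
        double cacheNew = decay * cache + (1.0 - decay) * g * g;
        double wNew = w - lr * g / (Math.sqrt(cacheNew) + eps);
        return new double[] {wNew, cacheNew};
    }
}
```

The state each rule carries (nothing for SGD, a velocity for Nesterov, a squared-gradient average for RMSProp) is why each updater has its own hyperparameters such as momentum or decay. Those are the values that `updater(IUpdater)` lets you set and the enum overload leaves at their defaults.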

Popular methods of NeuralNetConfiguration$Builder

<init>
l2
L2 regularization coefficient for the weights. Use with .regularization(true)
list
Create a ListBuilder (for creating a MultiLayerConfiguration) with the specified layers Usage: .l
weightInit
Weight initialization scheme.
seed
Random number generator seed. Used for reproducibility between runs
optimizationAlgo
Optimization algorithm to use. Most common: OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT
activation
Activation function / neuron non-linearity
iterations
Number of optimization iterations.
learningRate
Learning rate. Defaults to 1e-1
gradientNormalization
Gradient normalization strategy. Used to specify gradient renormalization, gradient clipping etc.
graphBuilder
Create a GraphBuilder (for creating a ComputationGraphConfiguration).
regularization
Whether to use regularization (l1, l2, dropout, etc.)
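Two of the entries above, the L2 coefficient and gradient normalization, act on the gradient before the updater sees it. The plain-Java sketch below illustrates this under stated assumptions: the penalty is taken as 0.5 * l2 * w^2 per weight (so its gradient is l2 * w), and clipping is element-wise by absolute value. `RegularizationSketch` and its methods are hypothetical helpers, not DL4J code, and DL4J's exact conventions may differ.

```java
/** Illustrative gradient pre-processing; hypothetical helper, not part of DL4J. */
final class RegularizationSketch {

    /** Adds the gradient of an assumed 0.5 * l2 * w^2 penalty (that is, l2 * w) to each raw gradient. */
    static double[] addL2(double[] weights, double[] grads, double l2) {
        double[] out = new double[grads.length];
        for (int i = 0; i < grads.length; i++) {
            out[i] = grads[i] + l2 * weights[i];
        }
        return out;
    }

    /** Clips every gradient element into [-threshold, threshold]. */
    static double[] clipElementWise(double[] grads, double threshold) {
        double[] out = new double[grads.length];
        for (int i = 0; i < grads.length; i++) {
            out[i] = Math.max(-threshold, Math.min(threshold, grads[i]));
        }
        return out;
    }
}
```

In this sketch a larger `l2` pulls weights toward zero more strongly, and a smaller `threshold` caps how far any single parameter can move in one step, whichever updater is used afterwards.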
