Extractions

Extraction

class openrec.legacy.modules.extractions.Extraction(train=True, l2_reg=None, scope=None, reuse=False)

A direct inheritance of the Module.

Look Up

class openrec.legacy.modules.extractions.LookUp(embed, ids=None, scope=None, reuse=False)

The LookUp module maps (embeds) input ids into fixed representations. The representations are not be updated during training. The module outputs a tensor with shape shape(ids) + [embedding dimensionality].

Parameters:
  • embed (numpy array) – Fixed embedding matrix.
  • ids (Tensorflow tensor, optional) – List of ids to retrieve embeddings. If None, the whole embedding matrix is returned.
  • scope (str, optional) – Scope for module variables.
  • reuse (bool, optional) – Whether or not to reuse module variables.
_build_shared_graph()

Build shared computational graphs across training and serving (may be overridden).

Identity Mapping

class openrec.legacy.modules.extractions.IdentityMapping(value, scope=None, reuse=False)

The IdentityMapping module executes an identity function.

Parameters:
  • value (Tensorflow tensor) – Input tensor
  • scope (str, optional) – Scope for module variables.
  • reuse (bool, optional) – Whether or not to reuse module variables.
_build_shared_graph()

Build shared computational graphs across training and serving (may be overridden).

Latent Factor

class openrec.legacy.modules.extractions.LatentFactor(shape, init='normal', ids=None, l2_reg=None, scope=None, reuse=False)

The LatentFactor module maps (embeds) input ids into latent representations. The module outputs a tensor with shape shape(ids) + [embedding dimensionality].

Parameters:
  • shape (list) – Shape of the embedding matrix, i.e. [number of unique ids, embedding dimensionality].
  • init (str, optional) – Embedding initialization. ‘zero’ or ‘normal’ (default).
  • ids (Tensorflow tensor, optionl) – List of ids to retrieve embeddings. If None, the whole embedding matrix is returned.
  • l2_reg (float, optional) – Weight for L2 regularization, i.e., weight decay.
  • scope (str, optional) – Scope for module variables.
  • reuse (bool, optional) – Whether or not to reuse module variables.
_build_shared_graph()

Build shared computational graphs across training and serving (may be overridden).

censor_l2_norm_op(censor_id_list=None, max_norm=1)

Limit the norm of embeddings.

Parameters:
  • censor_id_list (list or Tensorflow tensor) – list of embeddings to censor (indexed by ids).
  • max_norm (float, optional) – Maximum norm.
Returns:

An operator for post-training execution.

Return type:

Tensorflow operator

Multi Layer FC

class openrec.legacy.modules.extractions.MultiLayerFC(in_tensor, dims, relu_in=False, relu_mid=True, relu_out=False, dropout_in=None, dropout_mid=None, dropout_out=None, bias_in=True, bias_mid=True, bias_out=True, batch_norm=False, train=True, l2_reg=None, scope=None, reuse=False)

The MultiLayerFC module implements multi-layer perceptrons with ReLU as non-linear activation functions. Each layer is often referred as a fully-connected layer.

Parameters:
  • in_tensor (Tensorflow tensor) – An input tensor with shape [*, feature dimensionality]
  • dims (list) – Specify the feature size of each layer’s outputs. For example, setting dims=[512, 258, 128] to create three fully-connected layers with output shape [*, 512], [*, 256], and [*, 128], respectively.
  • relu_in (bool, optional) – Whether or not to add ReLU to the input tensor.
  • relu_mid (bool, optional) – Whether or not to add ReLU to the outputs of intermediate layers.
  • relu_out (bool, optional) – Whether or not to add ReLU to the final output tensor.
  • dropout_in (float, optional) – Dropout rate for the input tensor. If None, no dropout is used for the input tensor.
  • dropout_mid (float, optional) – Dropout rate for the outputs of intermediate layers. If None, no dropout is used for the intermediate outputs.
  • dropout_out (float, optional) – Dropout rate for the outputs of the final layer. If None, no dropout is used for the final outputs.
  • bias_in (bool, optional) – Whether or not to add bias to the input tensor.
  • bias_mid (bool, optional) – Whether or not to add bias to the outputs of intermediate layers.
  • bias_out (bool, optional) – Whether or not to add bias to the final output tensor.
  • batch_norm (bool, optional) – Whether or not to add batch normalization [1] to each layer’s outputs.
  • train (bool, optionl) – An indicator for training or servining phase.
  • l2_reg (float, optional) – Weight for L2 regularization, i.e., weight decay.
  • scope (str, optional) – Scope for module variables.
  • reuse (bool, optional) – Whether or not to reuse module variables.

References

[1]Ioffe, S. and Szegedy, C., 2015, June. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning (pp. 448-456).
_build_shared_graph()

Build shared computational graphs across training and serving (may be overridden).

SDAE

class openrec.legacy.modules.extractions.SDAE(in_tensor, dims, dropout=None, l2_reconst=1.0, train=True, l2_reg=None, scope=None, reuse=False)

The SDAE module implements Stacked Denoising Autoencoders [bn]. It outputs SDAE’s bottleneck representations (i.e., the encoder outputs).

Parameters:
  • in_tensor (Tensorflow tensor) – An input tensor with shape [*, feature dimensionality]
  • dims (list) – Specify the feature size of each encoding layer’s outputs. For example, setting dims=[512, 258, 128] to create an three-layer encoder with output shape [*, 512], [*, 256], and [*, 128], and a two-layer decoder with output shape [*, 256] and [*, 512].
  • dropout (float, optional) – Dropout rate for the input tensor. If None, no dropout is used for the input tensor.
  • l2_reconst (float, optional) – Weight for reconstruction loss.
  • train (bool, optionl) – An indicator for training or servining phase.
  • l2_reg (float, optional) – Weight for L2 regularization, i.e., weight decay.
  • scope (str, optional) – Scope for module variables.
  • reuse (bool, optional) – Whether or not to reuse module variables.

References

[bn]Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y. and Manzagol, P.A., 2010. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research, 11(Dec), pp.3371-3408.
_build_shared_graph()

Build shared computational graphs across training and serving (may be overridden).

TemporalLatentFactor

class openrec.legacy.modules.extractions.TemporalLatentFactor(shape, mlp_dims, ids, init='normal', mlp_pretrain=True, l2_reg=None, train=True, scope=None, reuse=False)
_build_shared_graph()

Build shared computational graphs across training and serving (may be overridden).

_build_training_graph()

Build training-specific computational graphs (may be overridden).

forward_update_embeddings(sess)

Retrieve update node.

pretrain_mlp_as_identity(sess)