tf.contrib.training.bucket_by_sequence_length(input_length, tensors, batch_size, bucket_boundaries, num_threads=1, capacity=32, shapes=None, dynamic_pad=False, allow_smaller_final_batch=False, keep_input=None, shared_name=None, name=None)
Lazy bucketing of inputs according to their length.
This method calls tf.contrib.training.bucket
under the hood, after first subdividing the bucket boundaries into separate buckets and identifying which bucket the given input_length
belongs to. See the documentation for which_bucket
for details of the other arguments.
Args:
-
input_length
:int32
scalarTensor
, the sequence length of tensors. -
tensors
: The list or dictionary of tensors, representing a single element, to bucket. Nested lists are not supported. -
batch_size
: The new batch size pulled from the queue (python int or int32 scalar). -
bucket_boundaries
: int list, increasing non-negative numbers. The edges of the buckets to use when bucketing tensors. Two extra buckets are created, one forinput_length < bucket_boundaries[0]
and one forinput_length >= bucket_boundaries[-1]
. -
num_threads
: An integer. The number of threads enqueuingtensors
. -
capacity
: An integer. The maximum number of minibatches in the top queue, and also the maximum number of elements within each bucket. -
shapes
: (Optional) The shapes for each example. Defaults to the inferred shapes fortensors
. -
dynamic_pad
: Boolean. Allow variable dimensions in input shapes. The given dimensions are padded upon dequeue so that tensors within a batch have the same shapes. -
allow_smaller_final_batch
: (Optional) Boolean. IfTrue
, allow the final batches to be smaller if there are insufficient items left in the queues. -
keep_input
: (Optional). Abool
scalar Tensor. If provided, this tensor controls whether the input is added to the queue or not. If it evaluatesTrue
, thentensors
are added to the bucket; otherwise they are dropped. This tensor essentially acts as a filtering mechanism. The default behavior is to assumekeep_input=True
. -
shared_name
: (Optional). If set, the queues will be shared under the given name across multiple sessions. -
name
: (Optional) A name for the operations.
Returns:
A tuple (sequence_length, outputs)
where sequence_length
is a 1-D Tensor
of size batch_size
and outputs
is a list or dictionary of batched, bucketed, outputs corresponding to elements of tensors
.
Raises:
-
TypeError
: ifbucket_boundaries
is not a list of python integers. -
ValueError
: ifbucket_boundaries
is empty or contains non-increasing values.
Please login to continue.