Image Encoding using nvImageCodec

The image batch encoder saves image tensors to disk as JPG images. The encoding itself is done in batches using the nvImageCodec library. The encoder is generic enough to be used across the sample applications. The code for this class can be found in the samples/common/python/nvcodec_utils.py file.

The image batch encoder is a relatively simple class. Here is how its __init__ method is defined.

Once initialization is complete, the images are encoded in the __call__ method. Because a Batch object is passed in, the method has access to the data, its batch index, and the original file names the data was read from.

def __call__(self, batch):
    self.cvcuda_perf.push_range("encoder.nvimagecodec")

    assert isinstance(batch.data, torch.Tensor)

    image_tensors_nchw = batch.data
    # Create an empty list to store filenames
    filenames = []
    chwtensor_list = []
    # Iterate through each image to prepare the filenames
    for img_idx in range(image_tensors_nchw.shape[0]):
        img_name = os.path.splitext(os.path.basename(batch.fileinfo[img_idx]))[0]
        results_path = os.path.join(self.output_path, f"out_{img_name}.jpg")
        self.logger.info(f"Preparing to save the image to: {results_path}")
        # Add the filename to the list
        filenames.append(results_path)
        # Slice one CHW image out of the stacked NCHW batch tensor and add it
        # to the list. Each tensor exposes the CUDA Array Interface (CAI).
        chwtensor_list.append(image_tensors_nchw[img_idx].cuda())

    # Pass the image tensors and filenames to the encoder.
    self.encoder.write(filenames, chwtensor_list)
    self.cvcuda_perf.pop_range()
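
The file-naming step in the loop above can be tried on its own, without a GPU. The following is a minimal sketch under stated assumptions, not the sample's actual code: the Batch dataclass is a hypothetical stand-in that carries only the three fields the text mentions (data, batch index, file names), and output_paths is an illustrative helper name that does not exist in nvcodec_utils.py.

```python
import os
from dataclasses import dataclass
from typing import Any, List


@dataclass
class Batch:
    """Hypothetical stand-in for the samples' Batch object (assumption)."""

    batch_idx: int
    data: Any  # a torch.Tensor in the sample; unused in this sketch
    fileinfo: List[str]  # original input path of each image in the batch


def output_paths(batch: Batch, output_path: str) -> List[str]:
    """Map each input file name to out_<name>.jpg inside output_path."""
    paths = []
    for src in batch.fileinfo:
        # Drop the directory and the extension, keeping only the base name.
        stem = os.path.splitext(os.path.basename(src))[0]
        paths.append(os.path.join(output_path, f"out_{stem}.jpg"))
    return paths


# Illustrative paths only; nothing is read from or written to disk.
batch = Batch(batch_idx=0, data=None, fileinfo=["/data/in/cat.png", "dog.jpeg"])
paths = output_paths(batch, "/tmp/out")
```

Because only the base name of each input survives, two inputs from different directories that share a base name (for example a/cat.png and b/cat.jpg) map to the same output file, and the later write would replace the earlier one.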