v0.9.0-beta
Release Highlights
CV-CUDA v0.9.0 includes the following changes:
New Features:
Improved Resize performance (up to 4x for u8 inputs, up to 3x for RGB8)
Improved performance of cubic interpolation, eg in Rotate, WarpAffine and WarpPerspective (up to 2x faster)
Added optional scaling to ResizeCropConvertReformat fused operator
Improved structure of Python documentation and optimized its generation (>5min to <30s) by removing the Exhale index
Added 64bit stride support to various operators
limited to 32bit strides to avoid performance regressions: AdaptiveThreshold, AdvCvtColor, AverageBlur, BilateralFilter, BrightnessContrast, ColorTwist, BoxBlur, CenterCrop, ConvertTo, CopyMakeBorder, CustomCrop, GaussianNoise, Gaussian, Flip, HistogramEq, JointBilateralFilter, Laplacian, Morphology, Normalize, RandomResizedCrop, Reformat, Remap, Resize, Rotate, SIFT, WarpAffine, WarpPerspective
Bug Fixes:
Added exception handling on CApi in Python: now forward C/C++exceptions to Python
Fixed coordinate rounding bug in Resize operator with nearest neighbor interpolation
Compatibility and Known Limitations
Documentation built on Ubuntu 20.04 needs an up-to-date version of sphinx (pip install –upgrade sphinx) as well as explicitly parsing the system’s default python version ./ci/build_docs path/to/build -DPYTHON_VERSIONS=”<py_ver>”.
Python bindings installed via Debian packages and Python tests fail with Numpy 2.0. We recommend using an older version of Numpy (e.g. 1.26) until we have implemented a fix.
The Resize and RandomResizedCrop operators incorrectly interpolate pixel values near the boundary of an image or tensor when using linear and cubic interpolation. This will be fixed in an upcoming release.
See main README on CV-CUDA GitHub.
License
CV-CUDA is licensed under the Apache 2.0 license.
Resources
Acknowledgements
CV-CUDA is developed jointly by NVIDIA and the ByteDance Machine Learning team.