RAPIDS cuDF Accelerates pandas up to 30x on an 8GB Text Dataset

Published: 08 August 2024
Channel: NVIDIA Developer

Watch RAPIDS cuDF accelerate pandas code that processes tabular text data by up to 30x. This demo highlights the impact of GPU acceleration on data with large string fields, which often slow down CPU-only pandas processing. Support for large strings is a new feature of RAPIDS cuDF.

Try the notebook: https://nvda.ws/3LXTerT

RAPIDS cuDF can now process up to 2.1 billion rows of tabular text data. To use the latest version of RAPIDS cuDF, run the command pip install --extra-index-url=https://pypi.nvidia.com cudf-cu12==24.8.* before running your code.
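As a rough illustration of the kind of string-heavy pandas code this applies to, here is a minimal sketch. The toy DataFrame, its column names, and the summarize helper are hypothetical and are not taken from the demo notebook. The script uses only standard pandas calls, so it runs unchanged on the CPU.

```python
import pandas as pd

# Hypothetical toy data standing in for a large tabular text dataset.
reviews = pd.DataFrame({
    "product": ["a", "b", "a", "c"],
    "text": ["Great GPU, fast", "Too slow on CPU", "great value", "Average"],
})


def summarize(df: pd.DataFrame) -> pd.DataFrame:
    """Per-product text statistics using ordinary pandas string methods."""
    out = df.copy()
    # Length of each text field in characters.
    out["n_chars"] = out["text"].str.len()
    # Case-insensitive literal match for the substring "gpu".
    out["mentions_gpu"] = out["text"].str.lower().str.contains("gpu", regex=False)
    return out.groupby("product", as_index=False).agg(
        mean_chars=("n_chars", "mean"),
        gpu_mentions=("mentions_gpu", "sum"),
    )


summary = summarize(reviews)
print(summary)
```

With cuDF installed on a machine with a supported NVIDIA GPU, a script like this can be run through the pandas accelerator mode without code changes, for example with python -m cudf.pandas script.py, or in a notebook by running %load_ext cudf.pandas before importing pandas. Any speedup depends on the data and hardware. A dataset this small will not show one.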

Join the NVIDIA Developer Program: https://nvda.ws/3OhiXfl

Read and subscribe to the NVIDIA Technical Blog: https://nvda.ws/3XHae9F

#datascience #dataanalytics #pandas #machinelearning #RAPIDS #largestrings #textdata #consumerinternet

RAPIDS cuDF, pandas, real-time analytics, dataframe, NVIDIA, Data Analytics, Data Science, CPU/GPU Interoperability, GPU-Accelerated, GPU, tabular text data