NVIDIA Collaborates with Open-source Community to Bring GPU Acceleration to Apache Spark 3.0

15 May 2020
Ray Sharma

Image Credit: NVIDIA

NVIDIA said it is collaborating with the open-source community to bring end-to-end GPU acceleration to Apache Spark 3.0, an analytics engine for big data processing.

With the release of Spark 3.0, anticipated in late spring, data scientists and machine learning engineers will for the first time be able to apply GPU acceleration to the ETL (extract, transform and load) data processing workloads widely conducted using SQL database operations, the company said.

In another first, AI model training can run on the same Spark cluster, instead of as separate processes on separate infrastructure. This enables high-performance data analytics across the entire data science pipeline, accelerating workloads of tens to thousands of terabytes from data lake to model training, without changes to the existing code of Spark applications running on premises and in the cloud.

Building on its strategic AI partnership with NVIDIA, Adobe is one of the first companies working with a preview release of Spark 3.0 running on Databricks. NVIDIA claims that Adobe has achieved a 7x performance improvement and 90 percent cost savings in an initial test, using GPU-accelerated data analytics for product development in Adobe Experience Cloud and supporting features that power digital businesses.


Ray Sharma

Ray is a news editor at The Fast Mode, bringing with him more than 10 years of experience in the wireless industry.

