News

The Spark streaming analytics engine is one of the most popular open source tools for weaving big data into modern applications architectures with over 800 contributors from 200 organizations. It ...
Fast, flexible, and developer-friendly, Apache Spark is the leading platform for large-scale SQL, batch processing, stream processing, and machine learning.
Google is adding another product in its range of big data services on the Google Cloud Platform today. The new Google Cloud Dataproc service sits between managing the Spark data processing engine ...
Writing Spark applications Spark, written in Scala, provides a unified abstraction layer for data processing, making it a great environment for developing data applications.
The Hadoop processing engine Spark has risen to become one of the hottest big data technologies in a short amount of time. And while Spark has been a Top-Level Project at the Apache Software ...
Varun Sharma, an enterprise solutions architect, proposes a tiered big data reference architecture for capitalizing on the power and potential of data while ensuring security and governance.
These real-world struggles inspired me to write the book, Hands-On Big Data Engineering: From Architecture to Deployment, to guide companies on building scalable, AI-ready data systems.