List of Google Cloud Dataproc Customers
Mountain View, 94043, CA,
United States
Since 2010, our global team of researchers has been studying Google Cloud Dataproc customers around the world, aggregating massive amounts of data points that form the basis of our forecast assumptions and perhaps the rise and fall of certain vendors and their products on a quarterly basis.
Each quarter our research team identifies companies that have purchased Google Cloud Dataproc for Analytics and BI from public (Press Releases, Customer References, Testimonials, Case Studies and Success Stories) and proprietary sources, including the customer size, industry, location, implementation status, partner involvement, LOB Key Stakeholders and related IT decision-makers contact details.
Companies using Google Cloud Dataproc for Analytics and BI include: Suzano, a Brazil based Manufacturing organisation with 37000 employees and revenues of $8.59 billion, Outbrain, a United States based Professional Services organisation with 1016 employees and revenues of $1.02 billion, Manash Lifestyle, a India based Distribution organisation with 258 employees and revenues of $4.0 million and many others.
Contact us if you need a completed and verified list of companies using Google Cloud Dataproc, including the breakdown by industry (21 Verticals), Geography (Region, Country, State, City), Company Size (Revenue, Employees, Asset) and related IT Decision Makers, Key Stakeholders, business and technology executives responsible for the software purchases.
The Google Cloud Dataproc customer wins are being incorporated in our Enterprise Applications Buyer Insight and Technographics Customer Database which has over 100 data fields that detail company usage of software systems and their digital transformation initiatives. Apps Run The World wants to become your No. 1 technographic data source!
Apply Filters For Customers
| Logo | Customer | Industry | Empl. | Revenue | Country | Vendor | Application | Category | When | SI | Insight |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
Manash Lifestyle | Distribution | 258 | $4M | India | Google Cloud Dataproc | Analytics and BI | 2017 | Mediaagility |
In 2017 Manash Lifestyle operating the Purplle e-commerce site implemented Google Cloud Dataproc as a core component of its Analytics and BI environment, with Mediaagility engaged to support the deployment in India. The initial deployment targeted data ingestion, ELT pre-processing, and model training workflows that feed Purplle’s recommendation and personalization pipeline for e-commerce.
Google Cloud Dataproc was configured to handle scalable data processing and iterative model training, supporting batch and near real time pre-processing of behavioral and catalog data, and to operationalize recommendation model training as part of the analytics workflow. Functional capabilities implemented included data ingestion orchestration, ELT staging and transformation, feature preparation for machine learning, and automated model training pipelines to produce ranking and recommendation artifacts.
The Dataproc deployment was integrated into Purplle’s personalization pipeline and e-commerce operational flows, enabling runtime model scoring that serves personalized content to the storefront. Operational scope was India focused, affecting analytics, data science, personalization engineering, and e-commerce merchandising teams, and supporting high volume request loads for personalization.
Governance and process changes emphasized a shift from ETL to ELT, with Mediaagility assisting the rollout and operational handover to Purplle’s analytics team. The implementation of Google Cloud Dataproc reduced data processing time by approximately 60 percent and supported up to 8–24 million personalized requests per day, outcomes that directly influenced conversion and revenue performance.
|
|
|
|
Outbrain | Professional Services | 1016 | $1.0B | United States | Google Cloud Dataproc | Analytics and BI | 2017 | DoiT International |
In 2017 Outbrain planned a migration of its research Hadoop/Spark cluster to Google Cloud Dataproc. The migration to Google Cloud Dataproc, executed starting January 2018, placed the deployment squarely in the Analytics and BI category to support data science and research analytics workloads.
The implementation centered on running Spark research jobs on managed Dataproc clusters with autoscaling enabled and use of preemptible nodes to optimize cost and capacity. Google Cloud Dataproc hosted ephemeral clusters for interactive and batch research workloads, enabling researchers to spin up Spark runtimes and scale compute independently from storage.
Operational scope focused on Outbrain research and data science teams, with a noted acceleration of research project turnaround in Israel. The migration reduced infrastructure costs by approximately 40 percent, an outcome recorded during the post migration period.
Implementation and rollout were driven by DoiT International, which led configuration, provisioning, and operational handoff beginning January 2018 after late 2017 planning. Governance emphasized cloud cost controls and autoscaling policies for Dataproc clusters, aligning platform configuration with research analytics workflows and operational ownership by the data science organization.
|
|
|
|
Suzano | Manufacturing | 37000 | $8.6B | Brazil | Google Cloud Dataproc | Analytics and BI | 2021 | n/a |
Suzano implemented Google Cloud Dataproc in 2021 as part of a Google Cloud-based data lake to ingest and transform plant and industrial data for analytics and data science across the organisation. The Brazil deployment began in January 2021 and rapidly ingested initial databases within a month, establishing core extraction and batch processing pipelines for operational and sensor-derived data.
The implementation leveraged Google Cloud Dataproc to run managed Spark and Hadoop workloads for ETL and transformation, populating a centralized Google Cloud Storage data lake and enabling downstream analytics and data science workflows. Functional capabilities implemented included ingestion pipelines, transformation jobs, scheduled batch processing, and self-service access for reporting and analytic teams, aligning the deployment with the Analytics and BI application category.
Operational scope centered on plant and industrial data across Suzano Brazil operations, serving industrial process analytics, reporting, and data science functions. Governance emphasis targeted democratized data access and standardized pipelines to accelerate reporting, and the deployment positioned Suzano Google Cloud Dataproc Analytics and BI relationship to support organization-wide analytic consumption and improved decision-making in industrial process analytics.
|
Buyer Intent: Companies Evaluating Google Cloud Dataproc
Discover Software Buyers actively Evaluating Enterprise Applications
| Logo | Company | Industry | Employees | Revenue | Country | Evaluated | ||
|---|---|---|---|---|---|---|---|---|
| No data found | ||||||||