List of Amazon EMR Customers
Seattle, 98109-5210, WA,
United States
Since 2010, our global team of researchers has been studying Amazon EMR customers around the world, aggregating massive amounts of data points that form the basis of our forecast assumptions and perhaps the rise and fall of certain vendors and their products on a quarterly basis.
Each quarter our research team identifies companies that have purchased Amazon EMR for Application Hosting and Computing Services from public (Press Releases, Customer References, Testimonials, Case Studies and Success Stories) and proprietary sources, including the customer size, industry, location, implementation status, partner involvement, LOB Key Stakeholders and related IT decision-makers contact details.
Companies using Amazon EMR for Application Hosting and Computing Services include: Elevance Health, formerly Anthem, Inc, a United States based Insurance organisation with 104900 employees and revenues of $171.34 billion, KKR, a United States based Banking and Financial Services organisation with 4834 employees and revenues of $21.88 billion, Expedia, a United States based Professional Services organisation with 16500 employees and revenues of $13.69 billion, Airbnb, a United States based Professional Services organisation with 7300 employees and revenues of $11.10 billion, Coinbase, a United States based Banking and Financial Services organisation with 3772 employees and revenues of $6.56 billion and many others.
Contact us if you need a completed and verified list of companies using Amazon EMR, including the breakdown by industry (21 Verticals), Geography (Region, Country, State, City), Company Size (Revenue, Employees, Asset) and related IT Decision Makers, Key Stakeholders, business and technology executives responsible for the software purchases.
The Amazon EMR customer wins are being incorporated in our Enterprise Applications Buyer Insight and Technographics Customer Database which has over 100 data fields that detail company usage of software systems and their digital transformation initiatives. Apps Run The World wants to become your No. 1 technographic data source!
Apply Filters For Customers
| Logo | Customer | Industry | Empl. | Revenue | Country | Vendor | Application | Category | When | SI | Insight |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
Airbnb | Professional Services | 7300 | $11.1B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2008 | n/a |
In 2008, Airbnb implemented Amazon EMR. Amazon EMR is used as a managed cluster-based data processing platform within Airbnb's AWS environment, "".
Airbnb configured Amazon EMR to support large scale batch ETL and analytics workloads that underpin customer-facing functions. The deployment is aligned to common Amazon EMR capabilities such as distributed processing for log ingestion, scalable data transformations, and analytics orchestration to feed downstream services. Amazon EMR is explicitly cited by Airbnb executives as part of the infrastructure supporting payments, search, reservations, messaging and content moderation workloads.
Operationally the Amazon EMR deployment is integrated into Airbnb's broader AWS footprint, and it coexists with other AWS services the company has consumed since 2008. The company disclosed a multiyear hosting commitment with its primary cloud provider, originally $1.2 billion over five years and adjusted to an eight year term to average $150 million per year rather than $240 million per year, a change disclosed in its IPO prospectus. Airbnb attributes a $63.5 million year over year decrease in hosting costs in the nine months ended Sept. 30 to improved contract management and utilization of third party cloud services.
Governance around the Amazon EMR use has emphasized contract and capacity management, with reporting in public filings tying cloud spend terms to operational continuity. The filing links the hosting agreement adjustment to cost management during travel demand disruptions and notes the company has leveraged vendor negotiations and utilization controls to reduce hosting expense and cost of revenue.
|
|
|
Baptist Health | Healthcare | 27000 | $4.0B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2020 | n/a |
In 2020, Baptist Health implemented Amazon EMR, Apps Category , to standardize Spark-based ETL and analytics workloads across its data engineering organization. The Amazon EMR deployment was positioned as the primary cluster platform for PySpark, SparkSQL and Spark Streaming jobs that replaced long-running Hive queries and supported real-time Kafka ingestion patterns.
Implementation centered on Amazon EMR cluster automation and Spark application modernization, with teams converting Hive and HiveQL workloads into PySpark and Spark Scala jobs running on Amazon EMR. The implementation included creating external Hive tables, encoding and decoding JSON using PySpark, authoring Spark UDFs, and using Spark for event joins and aggregations prior to persisting to HDFS or object storage. Apache Airflow was installed for job orchestration and scheduling, with Apache Oozie used for legacy Hadoop job control in some flows.
Integrations spanned on-premises and cloud tooling that directly interacted with Amazon EMR, including Kafka for streaming ingestion, Amazon S3 and Amazon Glacier for data storage, DynamoDB updated via Python and the boto3 library, and CI/CD pipelines using GitHub and Jenkins. The environment also interfaced with Snowflake for fact and dimension table loads using Airflow operators, and Tableau for reporting over Snowflake, while comparative pipelines leveraged Google Cloud components such as BigQuery and Dataflow during proof of concept work.
Operational governance emphasized agile delivery, formal requirements capture, use case and design documentation, and guidance from senior data engineers to development teams on PySpark ETL patterns. Deployment architecture considerations included cluster lifecycle automation with DevOps to create and destroy EMR clusters, continuous monitoring, and performance tuning of Spark jobs using benchmarking and profiling techniques. The Amazon EMR implementation supported centralized ETL orchestration, developer workflow standardization, and cross-cloud benchmarking activities at Baptist Health.
|
|
|
Blackline Safety | Professional Services | 477 | $55M | Canada | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2020 | Onica by Rackspace Technology |
In 2020, Blackline Safety implemented Amazon EMR on Amazon Web Services, engaging Onica by Rackspace Technology to re-architect its high-volume streaming and processing infrastructure, aligned with the Data apps category. The project was driven by the need to ingest and enrich increased telemetry from G7 wearable devices and to deliver real-time analytics for industrial contact tracing, mapping of gas leaks, and equipment usage reporting.
The implementation centers on Amazon EMR as the primary processing layer, with Amazon Kinesis used for high-throughput ingestion, Delta Lake for scalable cohesive streaming and storage, and Amazon Redshift for analytical reporting. Configuration work included constructing a data lake staging and enrichment pipeline, implementing message routing and assembly logic to produce high-value records, and adjusting ingest cadence to support a shift from one message every five minutes to one every ten seconds per device.
Operational coverage extended to Blackline Vision data science workflows and to customer-facing reporting use cases across manufacturing, oil and gas, and public utilities. The solution accepted streams from G7 wearable devices via cellular and satellite connectivity and produced outputs for contact tracing visualizations, alert type dashboards, calibration and compliance views, and geospatial reports.
Onica assisted with AWS load balancer configuration and online troubleshooting with AWS partners, and helped establish strict user access controls to preserve client data boundaries during rollout. The collaboration enabled rapid deployment of an industrial contact tracing reporting capability and increased data-rate reporting from connected wearables, supporting Blackline Safety's continued delivery of cloud-hosted software and analytics.
|
|
|
Coinbase | Banking and Financial Services | 3772 | $6.6B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2015 | n/a |
In 2015, Coinbase implemented Amazon EMR as part of its Application Hosting and Computing Services architecture on Amazon Web Services, embedding managed Hadoop processing into its production data platform. The deployment supported the Coinbase Exchange and wallet services that run inside the company production environment on AWS, and Coinbase launched the U.S. Coinbase Exchange on AWS in February 2015 while later expanding to serve European users.
Amazon EMR was used to process large, growing datasets for downstream analytics and warehousing, specifically to transform transactional stores into structured Amazon Redshift tables. The platform architecture includes a custom transactional data engine alongside Amazon Relational Database Service instances running PostgreSQL, and Amazon Elastic Compute Cloud instances that power exchange compute. Amazon EMR processed batch and large scale map reduce style workloads to feed the managed petabyte scale Redshift data warehouse.
The EMR implementation is tightly integrated with a real time streaming insight pipeline that uses Amazon Kinesis for operations analytics and event streaming. Audit and observability flows use AWS CloudTrail logs delivered to Amazon S3, then processed by AWS Lambda and routed into Kinesis containers based on Docker images, creating indexed audit logs. Network and deployment infrastructure is provisioned through AWS CloudFormation templates, and Amazon Virtual Private Cloud endpoints are used to optimize throughput to Amazon S3. Amazon WorkSpaces is used to provision cloud desktops for on demand contractor access to network slices.
Governance and access control were established with AWS Identity and Access Management to create an auditable self service security foundation, enabling engineers to operate within controlled boundaries. CloudFormation templates allowed version control of network topology and exact duplication for on demand development and staging environments, while CloudTrail and the indexed audit log path provided transparent, searchable auditability. The streaming pipeline also mirrors data into a separate disaster recovery environment for resilience.
Operational outcomes called out in the implementation include distribution of applications natively across multiple AWS Availability Zones for reliable global delivery, real time alerts when security groups or network access controls are modified, and a high volume data throughput where approximately 1 TB of data, about 1 billion events, flows daily through the analytics and audit pipeline. Amazon EMR is described as the component that crunches Coinbase databases into actionable Redshift data used to inform company performance and operational steering.
|
|
|
Elevance Health, formerly Anthem, Inc | Insurance | 104900 | $171.3B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2020 | n/a |
In 2020, Elevance Health implemented Amazon EMR as a core component of an Application Hosting and Computing Services deployment to centralize large scale analytics and machine learning workloads. The implementation was part of a cloud-first architecture designed to consolidate disparate clinical, claims, and EHR data into a HIPAA compliant centralized data lake to enable faster query, processing, and model training.
The deployment uses Amazon EMR alongside Amazon S3 for storage and Amazon SageMaker for model training and inference, with Amazon EMR providing distributed data processing and analytics pipelines to standardize structured and unstructured clinical and claims data. Functional capabilities implemented include large batch and near real time ingestion, ETL and data standardization pipelines, ML model training and scoring workflows, and automated claims processing orchestration using cloud compute clusters and data pipelines. Amazon EMR is explicitly used to run scalable analytics jobs that feed predictive models and downstream services.
Integrations are centered on bidirectional data exchange with electronic medical records and claims repositories, nationwide admission discharge and transfer feeds, and the companys Health OS platform that shares medical information between payer and provider teams. Operational coverage spans payer analytics, claims operations, and provider-facing care teams, and supports collaboration across thousands of physician and hospital partners in roughly 90 percent of the United States. The architecture emphasizes secure data access controls, centralized storage on Amazon S3, and pipeline orchestration so clinical insights can be surfaced at the point of care.
Governance and process changes highlighted by Elevance Health include establishing a single source of truth, investing in data quality controls, and rolling out data literacy training across clinical and claims teams. The organization adopted a DIAL framework for continuous model learning and operationalization, enabling ML driven predictive patient insights, improved data access for care teams, and a reduction in avoidable admissions as described by the company. Those outcomes were enabled by Amazon EMR powered compute and the broader Application Hosting and Computing Services stack on AWS.
|
|
|
|
Professional Services | 16500 | $13.7B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2010 | n/a |
|
|
|
|
Professional Services | 16800 | $5.4B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2016 | n/a |
|
|
|
|
Banking and Financial Services | 4834 | $21.9B | United States | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2020 | n/a |
|
|
|
|
Media | 173 | $10M | Japan | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2019 | n/a |
|
|
|
|
Retail | 20550 | $170M | Philippines | Amazon Web Services (AWS) | Amazon EMR | Application Hosting and Computing Services | 2022 | n/a |
|
Buyer Intent: Companies Evaluating Amazon EMR
- Teamlok France, a France based Distribution organization with 10 Employees
- Bund Verlag, a Germany based Media company with 50 Employees
Discover Software Buyers actively Evaluating Enterprise Applications
| Logo | Company | Industry | Employees | Revenue | Country | Evaluated |
|---|---|---|---|---|---|---|
| Teamlok France | Distribution | 10 | $1M | France | 2025-10-23 | |
| Bund Verlag | Media | 50 | $5M | Germany | 2024-09-03 |