Apache Spark
Open jobs: 3184
Companies looking for Spark: 2819
Apache Spark is an open-source distributed computing engine for large-scale data processing. It keeps intermediate data in memory where possible, which generally makes iterative and interactive workloads faster than disk-based Hadoop MapReduce. Its unified engine covers batch analytics, machine learning, and streaming, offers APIs in Scala, Python, Java, and R, and connects to a wide range of data sources. Resilient distributed datasets (RDDs) and the higher-level DataFrame abstraction let developers express complex transformations without managing distribution and fault tolerance by hand. Spark still demands substantial compute resources and tuning expertise to run well. With its machine learning library (MLlib) and streaming support, it is a common choice for enterprises processing petabyte-scale datasets in distributed environments.
Jobs using Apache Spark
data engineer
Lead Software Engineer, Back End - L5 (Bangkok based - Relocation provided) @ agoda
AU | 2025-12-27
The role at Agoda is a high-stakes opportunity for a data engineer with deep experience in building mission-critical systems. The job requires expertise in scalable back-end development, with a...
API, CI/CD, Scala, Kafka, Spark, Agile/Scrum, Go, C, Java, Computer Science

data engineer
Software Engineer II @ the-trade-desk
AU | 2025-12-26
Trade Desk's Measurement Upper Funnel role promises end-to-end ownership across a multi-cloud, microservices-heavy stack while chasing petabyte-scale insights from 600 billion queries a day. It's...
Funnel, Cloud Computing, JavaScript, React, API, Docker, Kubernetes, Big Data, Spark, Flink, Terraform, GitLab, AWS, Azure, Databricks, SQL Server, Vertica, ClickHouse, AI/ML, C, Java, SQL, Agile/Scrum, RDBMS, Data Analytics, Python, NumPy, Pandas

data engineer
Senior AWS Data Engineer @ asx
AU | 2025-12-26
This role is a compelling opportunity for a senior data engineer with deep AWS expertise, but it’s a steep climb in a highly competitive field. The position requires a hands-on engineer with...
Agile/Scrum, AWS, API, Data Engineering, Data Management, Analytics, Data Modelling, DWH, Data Streaming, Confluence, Kafka, Airflow, AWS Glue, Iceberg, Athena, Redshift, DataViz, Tableau, Power BI, QuickSight, Python, Spark, Terraform, Terragrunt

data engineer
Data Engineer - Azure & Databricks @ n2sglobal
AU | 2025-12-23
This position calls for a data engineer skilled in Azure Databricks to build scalable data pipelines with the added twist that integrating diverse data sources like REST APIs and SFTP is a...
Databricks, ETL/ELT, ADF, Spark, API, Cloud Storage, Management, Azure, Data Lake, SQL, Delta, Data Quality, Data Governance, Computer Science, Data Engineering, Python, Synapse, DWH, Big Data, Airflow, Kafka, AI/ML

data engineer
Data Engineer (SSE / Staff Engineer) - Python & Spark & AWS @ n2sglobal
AU | 2025-12-23
A Senior Software Engineer / Staff Data Engineer role focuses on designing and maintaining large-scale data pipelines with expertise in Python, Spark, and AWS. The standout feature is the...
Python, Spark, AWS, Analytics, Data Governance, Data Engineering, ETL/ELT, Data Quality, Big Data, Hadoop, Hive, Kafka, Data Streaming, Cloud Computing, S3, AWS Glue, Redshift, Athena, API, DevOps, CI/CD, Jenkins, Git, Docker, Kubernetes, Management, Agile/Scrum, Scala, Data Modelling, Cyber Security, Data Lake, DWH, AI/ML