Apache Spark

big data, distributed computing, analytics

Open jobs

3184

Companies looking for Spark

2819

Apache Spark is an open-source distributed computing framework for large-scale data processing that keeps working data in memory where it can. Designed for complex analytics, machine learning, and streaming workloads, it typically outperforms traditional disk-based MapReduce, particularly on iterative jobs. Its unified analytics engine supports Scala, Python, Java, and R and integrates with a wide range of data sources. Resilient distributed datasets (RDDs) and the higher-level DataFrame abstraction let users express complex data transformations without managing the distribution of work themselves. The trade-off is that Spark demands significant computational resources and expertise to tune well. Its machine learning library (MLlib) and streaming capabilities make it a common choice for enterprises processing petabyte-scale datasets across distributed environments.

Lead Software Engineer, Back End - L5 (Bangkok based - Relocation provided) @ agoda
data engineer | AU | Full time | Listed 2025-12-27 (original listing 2025-09-12)

Summary (generated content): The role at Agoda is a high-stakes opportunity for a data engineer with deep experience in building mission-critical systems. The job requires expertise in scalable back-end development, with a focus on Kafka, Spark, and distributed systems. The team emphasizes innovation, collaboration, and technical leadership, but the emphasis on large-scale systems and the lack of clear salary details make the role somewhat opaque. While the opportunity is substantial, the challenges include managing complex, high-traffic systems and maintaining a culture of continuous improvement. The position stands out for its emphasis on engineering fundamentals and real-world impact, but the lack of specific salary info and the heavy focus on large-scale systems may limit its appeal to those looking for a more traditional role. The company’s commitment to diversity and inclusion is commendable, but the technical demands and lack of clarity around compensation may not align with the expectations of a technical audience seeking clarity and specificity.

Technology used: API, CI/CD, Scala, Kafka, Spark, Agile/Scrum, Go, C, Java, Computer Science

Software Engineer II @ the-trade-desk
data engineer | AU | Full time | Listed 2025-12-26 (original listing 2025-11-13)

Summary (generated content): Trade Desk's Measurement Upper Funnel role promises end-to-end ownership across a multi-cloud, microservices-heavy stack while chasing petabyte-scale insights from 600 billion queries a day. It's technically appealing: .NET Core and React front-to-back, gRPC and REST APIs, Docker and Kubernetes, plus Spark and Flink powering data processing, with Databricks and SQL-based stores like Vertica and ClickHouse. The stated culture prizes learning, mentorship, and autonomous delivery, but the reality is a sprawling toolkit and a global, time-shifted crew that must stay in sync via asynchronous chat and 30-minute standups. The job tests both breadth and depth: you’ll need solid C# or Java, robust SQL, cloud chops across AWS, Azure or Aliyun, and a grasp of data architectures from row-based RDBMS to NoSQL. It’s strong on tech realism, but beware the breadth can swallow focus without disciplined scope.

Technology used: Funnel, Cloud Computing, JavaScript, React, API, Docker, Kubernetes, Big Data, Spark, Flink, Terraform, GitLab, AWS, Azure, Databricks, SQL Server, Vertica, ClickHouse, AI/ML, C, Java, SQL, Agile/Scrum, RDBMS, Data Analytics, Python, NumPy, Pandas

Senior AWS Data Engineer @ asx
data engineer | AU | Full time | Listed 2025-12-26 (original listing 2025-07-28)

Summary (generated content): This role is a compelling opportunity for a senior data engineer with deep AWS expertise, but it’s a steep climb in a highly competitive field. The position requires a hands-on engineer with experience in data pipelines, cloud infrastructure, and data product design, but it lacks clarity on compensation and the company’s culture. The role emphasizes technical excellence and innovation, but the description is vague on the company’s values and how they align with the candidate’s background. While the team is diverse and the role is technically rigorous, the lack of explicit salary details and limited mention of the company’s unique strengths makes it hard to assess fully. The position is ideal for someone passionate about data engineering and cloud architecture, but it’s not clear how it fits within the broader market landscape or the company’s long-term goals.

Technology used: Agile/Scrum, AWS, API, Data Engineering, Data Management, Analytics, Data Modelling, DWH, Data Streaming, Confluence, Kafka, Airflow, AWS Glue, Iceberg, Athena, Redshift, DataViz, Tableau, Power BI, QuickSight, Python, Spark, Terraform, Terragrunt

Data Engineer - Azure & Databricks @ n2sglobal
data engineer | AU | Listed 2025-12-23

Summary: This position calls for a data engineer skilled in Azure Databricks to build scalable data pipelines with the added twist that integrating diverse data sources like REST APIs and SFTP is a...

Technology used: Databricks, ETL/ELT, ADF, Spark, API, Cloud Storage, Management, Azure, Data Lake, SQL, Delta, Data Quality, Data Governance, Computer Science, Data Engineering, Python, Synapse, DWH, Big Data, Airflow, Kafka, AI/ML

Data Engineer (SSE / Staff Engineer) - Python & Spark & AWS @ n2sglobal
data engineer | AU | Contract | Listed 2025-12-23 (original listing 2025-12-11)

Summary (generated content): A Senior Software Engineer / Staff Data Engineer role focuses on designing and maintaining large-scale data pipelines with expertise in Python, Spark, and AWS. The standout feature is the integration of cloud and big data technologies, while a notable risk is keeping up with evolving security and compliance demands; no salary details are provided. The job emphasizes scalable data solutions, automation, and security, which are essential but not necessarily unique. It’s a position that demands cross-functional collaboration in an agile context, with a reasonable chance of encountering unforeseen technical hurdles.

Technology used: Python, Spark, AWS, Analytics, Data Governance, Data Engineering, ETL/ELT, Data Quality, Big Data, Hadoop, Hive, Kafka, Data Streaming, Cloud Computing, S3, AWS Glue, Redshift, Athena, API, DevOps, CI/CD, Jenkins, Git, Docker, Kubernetes, Management, Agile/Scrum, Scala, Data Modelling, Cyber Security, Data Lake, DWH, AI/ML
