Open jobs

145

Companies looking for Parquet

141

Back in the day, when data lakes were threatening to drown us all in unstructured chaos, came Apache Parquet. The promise? Columnar storage to rescue our queries from the abysmal depths of full table scans. It’s essentially a clever way of organizing data on disk so you only read what you need, not everything. Quite clever, really, though one does wonder if everyone truly understood the implications for schema evolution.

The reality? It's become a de facto standard for anything touching Spark, Hive, or Presto. Forget CSV; Parquet’s where the cool kids hang out. It's not a silver bullet, mind you – small files can be a nightmare, and it’s not ideal for every workload. But compared to row-oriented formats or even older columnar solutions, it offers a compelling balance of compression, performance, and ecosystem support. Today, if you're not using it, you're probably doing it wrong, or at least making your data engineers weep softly into their lattes.

Used together with Parquet

Additional Resources

Compare to other file formats
Jobs (this month)

145

Companies with Jobs

141

Jobs in using Apache Parquet for but please no

chief

Assistant Director - Analytics & Modeling @ moodys-corporation

US | 2025-12-28 | USD 139900 - 202750 / year
Moody’s is hiring into the Model Certification Products team to prototype and certify complex catastrophe, climate, and cyber models, with an emphasis on terabyte-scale streams and batch data. The... read more »
AI/ML, R, Python, C, C++, Rust, Arrow, Parquet, SQL, Amazon RDS, LLM, Data Science, Computer Science, API, Analytics
data engineer

Java Software Engineer @ imc-trading

AU | 2025-12-27
IMC seeks a data engineer experienced in Java 11+ to build scalable, low-latency, high-throughput data processing systems using Kafka, Avro, and Parquet, with a unique focus on transforming... read more »
Java, DataViz, Data Engineering, Analytics, Kafka, Avro, Parquet, Data Streaming, Docker, Kubernetes, Linux
promoted

O'Reilly Power BI Learning

Master Power BI fast: build basic-to-advanced visuals, apply smart design, use dynamic slicers/filters & new cards, choose the right chart, spotlight key insights, brand and theme reports, and seamlessly publish/share in the Service.

data engineer

Junior Data Engineer @ burson

GB | 2025-12-26
A Junior Data Engineer role at Burson involves supporting data pipelines and AI models in a hybrid London setting. The position emphasizes Python scripting, Azure cloud infrastructure, and... read more »
AI/ML, DevOps, Python, Azure, Computer Science, BI, Agile/Scrum, Git, API, Power BI, Azure DevOps, Java, R, DAX, NoSQL, SQL, MongoDB, Parquet
data engineer

Director, Principal Data Architect @ tiffany-and-co

US | 2025-12-26
This role is a highly demanding and complex position requiring deep expertise in GCP data architecture, DevOps, and cloud-native engineering. The candidate must lead platform enablement, mentor... read more »
Management, GCP, Analytics, Cloud Computing, Big Data, AI/ML, Data Streaming, Data Lake, DWH, Data Engineering, AWS, Azure, SaaS, Data Management, DevOps, CI/CD, Marketing, Agile/Scrum, BigQuery, Cloud Composer, Fivetran, Power BI, Dataiku, SQL, ETL/ELT, Jira, Azure DevOps, GitHub, Computer Science, Cyber Security, Data Quality, API, Parquet, Avro, JSON, XML, CSV, dbt, Python, Data Science
data engineer

Director, Principal Data Architect @ tiffany-and-co

FR | 2025-12-26
The role is an ambitious opportunity for a seasoned data engineer with deep expertise in GCP, but it lacks specificity in defining responsibilities and expectations. The candidate must lead a... read more »
Management, GCP, Analytics, Cloud Computing, Big Data, AI/ML, Data Streaming, Data Lake, DWH, Data Engineering, AWS, Azure, SaaS, Data Management, DevOps, CI/CD, Marketing, Agile/Scrum, BigQuery, Cloud Composer, Fivetran, Power BI, Dataiku, SQL, ETL/ELT, Jira, Azure DevOps, GitHub, Computer Science, Cyber Security, Data Quality, API, Parquet, Avro, XML, CSV, dbt, Python, Data Science
analyst

Senior Data Analyst @ drw

GB | 2025-12-25
The role at DRW is a high-impact, fast-paced data analyst position requiring expertise in financial data processing, cloud workflows, and data quality engineering. The job emphasizes hands-on data... read more »
Management, Data Quality, SQL, Cloud Storage, Airflow, PySpark, Pandas, Git, Parquet, Linux, Bash
promoted

O'Reilly Power BI Learning

Master Power BI fast: build basic-to-advanced visuals, apply smart design, use dynamic slicers/filters & new cards, choose the right chart, spotlight key insights, brand and theme reports, and seamlessly publish/share in the Service.

data engineer

Data Engineer -601/602 @ ptrglobal

US | 2025-12-24 | USD 65 - 70 / hour
A data engineer role focused on AWS, Python, and Spark within a bank's consumer division; requires designing data pipelines and models, with a differentiator being expertise in cloud data lake... read more »
AWS, Python, Spark, Data Lake, Agile/Scrum, Data Collection, Analytics, SQL, NoSQL, Data Quality, Data Governance, Data Engineering, PySpark, GenAI, API, Cloud Computing, Data Lakehouse, Databricks, Hadoop, PostgreSQL, Oracle, Cassandra, DynamoDB, MongoDB, Snowflake, Redshift, Airflow, Unix, Avro, Protobuf, Parquet, Iceberg, Data Streaming, Data Modelling, Data Vault, dimensional modeling, CI/CD
data engineer

Data Engineer, Active Grid Response @ gridwareinc

US | 2025-12-23
Gridware seeks a data engineer to develop ETL/ELT pipelines for its Active Grid Response platform, emphasizing high-precision sensor data integration and real-time processing, with a notable focus... read more »
Management, Data Lakehouse, Analytics, ETL/ELT, Data Lake, Python, SQL, Databricks, Data Quality, Data Science, Cloud Computing, Big Data, Spark, Airflow, Dagster, Prefect, Data Streaming, Kafka, Kinesis, Data Modelling, IoT, Protobuf, Avro, Parquet, Grafana
NL | 2025-12-23
Snowflake pitches Amsterdam as a hub for post-sales brilliance: a Solutions Architect role that blends enterprise data platform design with hands-on coaching, governance, and multi-vendor... read more »
Snowflake, AI/ML, Cloud Computing, Data Management, Management, Teradata, Spark, Databricks, Hadoop, Oracle, SQL Server, Data Vault, Fabric, Data Governance, Data Lake, DWH, dimensional modeling, 3NF, AWS, Azure, SQL, dbt, Talend, Informatica, Python, PySpark, Parquet, Avro, Iceberg, Delta, Analytics, Tableau, Power BI, Thoughtspot, SAS, NLP, Marketing, Data Analytics
data engineer

Senior Data Engineer @ autodesk

US | 2025-12-23 | USD 130600 - 211200 / year
Seeking a Principal Data Engineer to lead data infrastructure design supporting ML, personalization, and search systems. A key differentiator is the opportunity to influence strategic initiatives... read more »
AI/ML, Data Science, RAG, Data Engineering, Agile/Scrum, Analytics, Kafka, Flink, SQL, NoSQL, Vector DB, Python, Java, Big Data, Spark, Parquet, Iceberg, Delta, ETL/ELT, Cloud Computing, AWS, Azure, GCP, DWH, Snowflake, Redshift, Data Modelling, Computer Science, PhD, Pinecone, ELK, Data Streaming, MLOps
data engineer

Senior Data Engineer - (Genetics) Maternity Cover - 12 months FTC @ our-future-health-uk

GB | 2025-12-22
This role is for a Senior Data Engineer specializing in genetic data processing, with responsibilities involving building and maintaining robust pipelines for data storage and release. The... read more »
Data Engineering, CI/CD, Agile/Scrum, Cloud Computing, Python, Unix, Azure, Parquet, Delta, Docker, Kubernetes, Spark, Databricks, Git, GitHub