Apache Parquet
Open jobs
145
Companies looking for Parquet
141
Back in the day, when data lakes were threatening to drown us all in unstructured chaos, came Apache Parquet. The promise? Columnar storage to rescue our queries from the abysmal depths of full table scans. It’s essentially a clever way of organizing data on disk so you only read what you need, not everything. Quite clever, really, though one does wonder if everyone truly understood the implications for schema evolution.
The reality? It's become a de facto standard for anything touching Spark, Hive, or Presto. Forget CSV; Parquet’s where the cool kids hang out. It's not a silver bullet, mind you – small files can be a nightmare, and it’s not ideal for every workload. But compared to row-oriented formats or even older columnar solutions, it offers a compelling balance of compression, performance, and ecosystem support. Today, if you're not using it, you're probably doing it wrong, or at least making your data engineers weep softly into their lattes.
Used together with Parquet
Additional Resources
Compare to other file formatsJobs (this month)
145
Companies with Jobs
141
Jobs in using Apache Parquet for but please no
|
chief
Assistant Director - Analytics & Modeling @ moodys-corporation
US | 2025-12-28
| USD
139900 - 202750
/ year
Moody’s is hiring into the Model Certification Products team to prototype and certify complex catastrophe, climate, and cyber models, with an emphasis on terabyte-scale streams and batch data. The...
read more »
|
AI/ML, R, Python, C, C++, Rust, Arrow, Parquet, SQL, Amazon RDS, LLM, Data Science, Computer Science, API, Analytics | ||
|---|---|---|---|
|
data engineer
Java Software Engineer @ imc-trading
AU | 2025-12-27
IMC seeks a data engineer experienced in Java 11+ to build scalable, low-latency, high-throughput data processing systems using Kafka, Avro, and Parquet, with a unique focus on transforming...
read more »
|
Java, DataViz, Data Engineering, Analytics, Kafka, Avro, Parquet, Data Streaming, Docker, Kubernetes, Linux | ||
|
promoted
O'Reilly Power BI LearningMaster Power BI fast: build basic-to-advanced visuals, apply smart design, use dynamic slicers/filters & new cards, choose the right chart, spotlight key insights, brand and theme reports, and seamlessly publish/share in the Service. |
|
||
|
data engineer
Junior Data Engineer @ burson
GB | 2025-12-26
A Junior Data Engineer role at Burson involves supporting data pipelines and AI models in a hybrid London setting. The position emphasizes Python scripting, Azure cloud infrastructure, and...
read more »
|
AI/ML, DevOps, Python, Azure, Computer Science, BI, Agile/Scrum, Git, API, Power BI, Azure DevOps, Java, R, DAX, NoSQL, SQL, MongoDB, Parquet | ||
|
data engineer
Director, Principal Data Architect @ tiffany-and-co
US | 2025-12-26
This role is a highly demanding and complex position requiring deep expertise in GCP data architecture, DevOps, and cloud-native engineering. The candidate must lead platform enablement, mentor...
read more »
|
Management, GCP, Analytics, Cloud Computing, Big Data, AI/ML, Data Streaming, Data Lake, DWH, Data Engineering, AWS, Azure, SaaS, Data Management, DevOps, CI/CD, Marketing, Agile/Scrum, BigQuery, Cloud Composer, Fivetran, Power BI, Dataiku, SQL, ETL/ELT, Jira, Azure DevOps, GitHub, Computer Science, Cyber Security, Data Quality, API, Parquet, Avro, JSON, XML, CSV, dbt, Python, Data Science | ||
|
data engineer
Director, Principal Data Architect @ tiffany-and-co
FR | 2025-12-26
The role is an ambitious opportunity for a seasoned data engineer with deep expertise in GCP, but it lacks specificity in defining responsibilities and expectations. The candidate must lead a...
read more »
|
Management, GCP, Analytics, Cloud Computing, Big Data, AI/ML, Data Streaming, Data Lake, DWH, Data Engineering, AWS, Azure, SaaS, Data Management, DevOps, CI/CD, Marketing, Agile/Scrum, BigQuery, Cloud Composer, Fivetran, Power BI, Dataiku, SQL, ETL/ELT, Jira, Azure DevOps, GitHub, Computer Science, Cyber Security, Data Quality, API, Parquet, Avro, XML, CSV, dbt, Python, Data Science | ||
| Management, Data Quality, SQL, Cloud Storage, Airflow, PySpark, Pandas, Git, Parquet, Linux, Bash | |||
|
promoted
O'Reilly Power BI LearningMaster Power BI fast: build basic-to-advanced visuals, apply smart design, use dynamic slicers/filters & new cards, choose the right chart, spotlight key insights, brand and theme reports, and seamlessly publish/share in the Service. |
|
||
|
data engineer
Data Engineer -601/602 @ ptrglobal
US | 2025-12-24
| USD
65 - 70
/ hour
A data engineer role focused on AWS, Python, and Spark within a bank's consumer division; requires designing data pipelines and models, with a differentiator being expertise in cloud data lake...
read more »
|
AWS, Python, Spark, Data Lake, Agile/Scrum, Data Collection, Analytics, SQL, NoSQL, Data Quality, Data Governance, Data Engineering, PySpark, GenAI, API, Cloud Computing, Data Lakehouse, Databricks, Hadoop, PostgreSQL, Oracle, Cassandra, DynamoDB, MongoDB, Snowflake, Redshift, Airflow, Unix, Avro, Protobuf, Parquet, Iceberg, Data Streaming, Data Modelling, Data Vault, dimensional modeling, CI/CD | ||
|
data engineer
Data Engineer, Active Grid Response @ gridwareinc
US | 2025-12-23
Gridware seeks a data engineer to develop ETL/ELT pipelines for its Active Grid Response platform, emphasizing high-precision sensor data integration and real-time processing, with a notable focus...
read more »
|
Management, Data Lakehouse, Analytics, ETL/ELT, Data Lake, Python, SQL, Databricks, Data Quality, Data Science, Cloud Computing, Big Data, Spark, Airflow, Dagster, Prefect, Data Streaming, Kafka, Kinesis, Data Modelling, IoT, Protobuf, Avro, Parquet, Grafana | ||
|
analytics engineer
Solutions Architect @ snowflake-computing
NL | 2025-12-23
Snowflake pitches Amsterdam as a hub for post-sales brilliance: a Solutions Architect role that blends enterprise data platform design with hands-on coaching, governance, and multi-vendor...
read more »
|
Snowflake, AI/ML, Cloud Computing, Data Management, Management, Teradata, Spark, Databricks, Hadoop, Oracle, SQL Server, Data Vault, Fabric, Data Governance, Data Lake, DWH, dimensional modeling, 3NF, AWS, Azure, SQL, dbt, Talend, Informatica, Python, PySpark, Parquet, Avro, Iceberg, Delta, Analytics, Tableau, Power BI, Thoughtspot, SAS, NLP, Marketing, Data Analytics | ||
|
data engineer
Senior Data Engineer @ autodesk
US | 2025-12-23
| USD
130600 - 211200
/ year
Seeking a Principal Data Engineer to lead data infrastructure design supporting ML, personalization, and search systems. A key differentiator is the opportunity to influence strategic initiatives...
read more »
|
AI/ML, Data Science, RAG, Data Engineering, Agile/Scrum, Analytics, Kafka, Flink, SQL, NoSQL, Vector DB, Python, Java, Big Data, Spark, Parquet, Iceberg, Delta, ETL/ELT, Cloud Computing, AWS, Azure, GCP, DWH, Snowflake, Redshift, Data Modelling, Computer Science, PhD, Pinecone, ELK, Data Streaming, MLOps | ||
|
data engineer
Senior Data Engineer - (Genetics) Maternity Cover - 12 months FTC @ our-future-health-uk
GB | 2025-12-22
This role is for a Senior Data Engineer specializing in genetic data processing, with responsibilities involving building and maintaining robust pipelines for data storage and release. The...
read more »
|
Data Engineering, CI/CD, Agile/Scrum, Cloud Computing, Python, Unix, Azure, Parquet, Delta, Docker, Kubernetes, Spark, Databricks, Git, GitHub |