Data Engineer (Python & PySpark)

Remote · USA · Full-time

Key Responsibilities

Pipeline Development: Design, develop, and maintain end-to-end ETL/ELT pipelines using Python and PySpark.

Big Data Processing: Build large-scale data processing frameworks to handle structured and unstructured data, ensuring high performance and reliability.

Cloud Infrastructure: Architect and manage data solutions within the GCP ecosystem, focusing on cost-efficiency and security.

Data Modeling: Design and implement robust data warehouse models (Star/Snowflake schemas) and data lake architectures.

Optimization: Identify, design, and implement internal process improvements, such as automating manual processes and optimizing data delivery for greater scalability.

Collaboration: Work closely with stakeholders to understand data requirements and translate them into technical specifications.

Technical Qualifications

Core Programming: Strong proficiency in Python, including experience with libraries like Pandas and NumPy and with logging frameworks.

Big Data: 3+ years of hands-on experience with Apache Spark (PySpark) for distributed data processing.

GCP Ecosystem: Practical experience with Google Cloud services, specifically BigQuery (optimization, partitioning, clustering); Cloud Dataproc or Dataflow; Cloud Storage (GCS) and Cloud Functions; and Cloud Composer (Apache Airflow) for orchestration.

Data Warehousing: Solid understanding of relational databases and SQL (PostgreSQL, MySQL) as well as NoSQL environments.

DevOps & Tools: Experience with Git, Docker, and CI/CD pipelines. Familiarity with Terraform or other IaC tools is a significant plus.
