Jobs

Python Developer with Pyspark


Job details
  • N Consulting Ltd
  • 4 days ago

Job Title:Python Developer with PySpark

Location:Northompton

Job Type:Contract

About the Role:
We are seeking a skilled Python Developer with expertise in PySpark to join our dynamic team. The ideal candidate will have strong experience in building and optimizing large-scale data processing pipelines and a deep understanding of distributed data systems. You will play a key role in designing and implementing data solutions that drive critical business decisions.

Key Responsibilities:

  • Develop, optimize, and maintain large-scale data pipelines using PySpark and Python.
  • Collaborate with data engineers, analysts, and stakeholders to gather requirements and implement data solutions.
  • Perform ETL (Extract, Transform, Load) processes on large datasets and ensure efficient data workflows.
  • Analyze and debug data processing issues to ensure accuracy and reliability of pipelines.
  • Work with distributed computing frameworks to handle large datasets efficiently.
  • Develop reusable components, libraries, and frameworks for data processing.
  • Optimize PySpark jobs for performance and scalability.
  • Integrate data pipelines with cloud platforms like AWS, Azure, or Google Cloud (if applicable).
  • Monitor and troubleshoot production data pipelines to minimize downtime and data issues.

Key Skills and Qualifications:

Technical Skills:

  • Strong programming skills inPythonwith hands-on experience inPySpark.
  • Experience with distributed data processing frameworks (e.g., Spark).
  • Proficiency in SQL for querying and transforming data.
  • Understanding of data partitioning, serialization formats (Parquet, ORC, Avro), and data compression techniques.
  • Familiarity with Big Data technologies such as Hadoop, Hive, and Kafka (optional but preferred).

Cloud Platforms (Preferred):

  • Hands-on experience with AWS services like S3, EMR, Glue, or Redshift.
  • Knowledge of Azure Data Lake, Databricks, or Google BigQuery is a plus.

Additional Tools and Frameworks:

  • Familiarity with CI/CD pipelines and version control tools (Git, Jenkins).
  • Experience with orchestration tools like Apache Airflow or Luigi.
  • Understanding of containerization and orchestration tools like Docker and Kubernetes (preferred).

Experience:

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
  • 5+ years of experience in Python programming.
  • 4+ years of hands-on experience with PySpark.
  • Experience with Big Data ecosystems and tools.

Sign up for our newsletter

The latest news, articles, and resources, sent to your inbox weekly.

Similar Jobs

Python Developer with Pyspark

Job Title:Python Developer with PySparkLocation:NorthomptonJob Type:ContractAbout the Role:We are seeking a skilled Python Developer with expertise in PySpark to join our dynamic team. The ideal candidate will have strong experience in building and optimizing large-scale data processing pipelines and a deep understanding of distributed data systems. You will play a...

N Consulting Ltd

DBT Developer

About Us:We are Two Circles. We are a Sports & Entertainment Marketing business. We grow audiences and revenues. We do that by knowing fans best. We work with clients to help them understand & influence what their fans are doing – the way fans spend their money, the events that...

Two Circles London

Tech4 | Senior Data Engineer

Senior Data Engineer - Python / Data Pipelines / Data Platform / AWS - is required by fast growing, highly successful and and tech focused organisation.About the jobYou will play a crucial role in designing, building, and maintaining their data platform, with a strong emphasis on streaming data, cloud infrastructure,...

Tech4 Edinburgh

MBN Solutions | Data Engineer

Data EngineerUp to £70,000 + 10% Pension + BenefitsEdinburgh (Hybrid - 2 days p/w)About the OpportunityMBN Solutions are delighted to partner with a Tier 1 UK Retail Bank to support the build of a brand new Data team based in the UK.We are looking for a Data Engineer who is...

MBN Solutions Edinburgh

Premier Group Recruitment | Azure Data Engineer

My Client a leading powerhouse within London are looking to add anAzure Data Engineerto their team.The Azure Data Engineer will support our software developers, PowerBI developers, data analysts and data scientists on data initiatives and will ensure optimal data delivery architecture is consistent throughout ongoing projects.Job Title:Lead Data EngineerLocation: London...

Premier Group Recruitment London

Senior Data Analyst

Job Purpose and Background in summaryCDP’s Data Analytics Team are looking for a skilled and enthusiastic data analyst who is passionate to use data to drive companies, investors, and governments to build a thriving economy which works for people and planet.About CDPCDP is a not-for-profit charity that runs the global...

CDP London