Role :: Python Spark AWS // Columbus, OH

Columbus, OH Columbus OH 43287

Date : Jul-22-24

Columbus, OH

Jul-22-24

Work Authorization

US Citizen
GC
H1B
TN EAD, H4 EAD, L2 EAD

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

Senior

Rate/Salary ($)

Market

Duration

1 year

Sp. Area

Python, Open Source

Sp. Skills

Python

Consulting / Contract

Third Party OK

Required Skills :

Python, Scala, Big Data, DynamoDB, MySQL, Oracle, SQL

Preferred Skills :

Domain :

Work Authorization

US Citizen
GC
TN EAD, H4 EAD, L2 EAD
H1B

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

Senior

Rate/Salary ($)

Market

Duration

1 year

Sp. Area

Python, Open Source

Sp. Skills

Python

Consulting / Contract

Third Party OK

Required Skills :

Python, Scala, Big Data, DynamoDB, MySQL, Oracle, SQL

Preferred Skills :

Domain :

Pride Veterans Staffing
Hoboken, NJ
Post Resume to
View Contact Details &
Apply for Job

Job Description :

JOB: Python Spark AWS

Location: Columbus, OH

Duration: long-term (C2C)

Visa: H1b/TN/h4/L2/E3

JOB Description

Develop and maintain data platforms using Python, Spark, and PySpark.
Handle migration to PySpark on AWS.
Design and implement data pipelines.
Work with AWS and Big Data.
Produce unit tests for Spark transformations and helper methods.
Create Scala/Spark jobs for data transformation and aggregation.
Write Scaladoc-style documentation for code.
Optimize Spark queries for performance.
Integrate with SQL databases (e.g., Microsoft, Oracle, Postgres, MySQL).
Understand distributed systems concepts (CAP theorem, partitioning, replication, consistency, and consensus).

Skills:
Proficiency in Python, Scala (with a focus on functional programming), and Spark.
Familiarity with Spark APIs, including RDD, DataFrame, MLlib, GraphX, and Streaming.
Experience working with HDFS, S3, Cassandra, and/or DynamoDB.
Deep understanding of distributed systems.
Experience with building or maintaining cloud-native applications.
Familiarity with serverless approaches using AWS Lambda is a plus