Under Development Features: Talent Bank/Pool, VMS Intigration, Analytics, Social Integration, Reports, API Integration, Resource and Timesheets Management, Company Admin

Under Development Features: Talent Bank/Pool, VMS, Analytics, Social/API Integration, Company Admin

Data Engineer with Pyspark (1022 views)

NYC, NY
January 21, 2020

*** Direct Client Requirement *****

Title:  Data Engineer with Pyspark

Location: NYC, NY

Duration: Long Term

Rate: DOE

Interview Type: Phone and F2F

Work Status:  Successful applicants must be legally authorized to work in the U.S.

Job Type: C2C,C2H ,W2

Experience: 6  YEARS

Need local profiles

 

Required/Preferred Skills:

  • Experience with Data/ETL pipelines
  • Python experience is critical
  • Some Unix/Shell scripting experience preferred (we exclusively use the AWS CLI to submit jobs to our EMR clusters for InfoSec reasons)
  • Experience with EMR/PySpark preferred
  • Airflow experience would be a plus but not mandatory
  • Any other AWS Experience such as with EC2, Athena, Redshift Spectrum would be a plus but not mandatory
  • Basic Understanding of HDFS/distributed computing
  • Basic SQL skills
  • Understanding of columnar file formats such as parquet
  • Experience with automating API calls would be a plus but not mandatory

Thanks

Siva

siva@sohanit.com

PH:402-241-9606

Apply here or Please send to resumes@sohanit.com

Position Keywords: Data/ETL pipelines,EMR/PySpark,

Pay Rate: DOE

Job Duration: 12 Months

% Travel Required: None

Job Posted by: Consulting Services

Job ID: TM 91

Work Authorization: Successful applicants must be legally authorized to work in the U.S

Don't have time now?
Get a reminder in your inbox