Preferred

Selecting and integrating Big Data tools and frameworks required to provide requested capabilities
Implementing Data ingestion and ETL processes on Hadoop
Monitoring performance and advising any necessary infrastructure changes
Defining data retention policies
Design and build data processing pipelines for structured and unstructured data using tools and frameworks in the Hadoop ecosystem
Develop applications that are scalable to handle millions of events/records
Design and launch scalable, reliable and efficient processes to move, transform and report on large amounts of data
Participate in meetings with business (account/product management, data scientists) to obtain new requirements
Follow our Agile software development process with daily scrums and monthly Sprints
Ability to work collaboratively on a cross-functional team with a wide range of experience levels

QUALIFICATIONS

Bachelor's degree and 8+ years relevant experience or Master’s degree and 6+ years of relevant experience
4+ years in industry implementing big data solutions on Hadoop
Proficient understanding of distributed computing principles
Proficiency with Hadoop v2, MapReduce, HDFS
Experience with building stream-processing systems, using solutions such as Storm or Kafka and Spark-Streaming
Good knowledge of Big Data querying tools, such as Pig, Hive, Phoenix
Experience with Spark
Experience with integration of data from multiple data sources
Experience with 1 or 2 NoSQL/Graph databases, such as HBase, Cassandra, MongoDB, Neo4j
Proficiency in a programming languages like SCALA, Java,Python
Experience with Linux OS, shell scripting
Experience with relational databases (SQL)
Experience in working with real-time data feeds
Experience in working with unstructured data
Experience in implementing Scoop Jobs to Import/Export data from Hadoop
Knowledge of various ETL techniques and frameworks, such as Pig, Hive, or Flume
Experience with various messaging systems, such as Kafka or RabbitMQ
Experience with Big Data ML toolkits, such as Mahout, SparkML, or H2O
Good understanding of Lambda Architecture, along with its advantages and drawbacks
Experience with Hortonworks Hadoop Data Platform (HDP)
Experience with all or some of the following supporting Hadoop administration and security frameworks: HCatalog, Drill, NiFi, Oozie, Falcon, Ranger, Ambari, Zeplin.

•Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education
•At least 2 years of experience with Information Technology

** U.S. Citizens and those who are authorized to work independently in the United States are encouraged to apply. We are unable to sponsor at this time.
This is a Full-Time / Permanent job opportunity.
Only for US Citizen and Green Card Holder
** All your information will be kept confidential according to EEO guidelines.

Sr. Hadoop Data Engineer

Job Description

See Your Match Score

More jobs at Sonoma Consulting Inc.

More jobs at Sonoma Consulting Inc.