MassMutual seeks an exceptional data engineer to build and maintain big data infrastructure that supports the development of a strategic data asset and the execution of data science projects.

Company: MassMutual
Location: Boston, MA
Web: www.massmutual.com

MassMutual's advanced analytics group is seeking an exceptional data engineer to build and maintain big data infrastructure. This infrastructure supports the development of a strategic data asset and the execution of data science projects, which include complex applications of machine learning, visualization, and data mining.

This is an opportunity to join a small but growing team of talented individuals with backgrounds in applied math, computer science, and physics. The team uses novel big data technology and data science to answer fundamental questions that directly affect the direction of the company and the industry at large.

Essential Responsibilities
  • Design, build and measure data ingestion and integration pipelines for large volumes of temporal data from different sources. Examples include database extracts, application server logs, scanned images, voice recordings, Twitter streams, websites, and health sensor data.
  • Design, build, and measure complex ETL jobs to process disparate, dirty data sources and form a high-integrity, high-quality, clean data asset.
  • Support and scale big data infrastructure (Cloudera CDH 5, MongoDB, and HP Vertica)
  • Monitor and track data quality and data flow dynamics
  • Design and build web-based APIs to facilitate easy access to data

Desired Skills and Experience
  • 5+ years industry experience working with data
  • Undergraduate degree in Computer Science or Engineering (M.S. or Ph.D. preferred)
  • 3+ years of overall experience developing and administering large data systems
  • 3+ years of coding and scripting (Python/Java/Scala/Go/Bash/Git) and design experience. Solid CS fundamentals in algorithms and data structures.
  • 3+ years of data modeling and administration of NoSQL and SQL databases.
  • 3+ years of experience with at least one of these: Hadoop, MapReduce, HDFS, HBase, Hive, Flume, Sqoop, Spark, Vertica, SQL, data warehouses. (Certifications in one or more of the above tools preferred)
  • Familiarity with web programming and web app frameworks like Flask, Django or Play!

Apply online via Indeed.

