Data Engineer

Company: Altcoin Advisors

Location: New York, NY, USA

Role overview

We are a quantitative trading firm dedicated to achieving excellent returns in the cryptosphere, and we are looking for a Data Engineer. We take a scientific approach to investing, using statistical arbitrage models, machine learning, and advanced optimization techniques.


A Data Engineer creates and manages a company’s Big Data infrastructure and tools, and knows how to get results from vast amounts of data quickly.

The exact definition of this role varies and often overlaps with that of a Data Scientist. This role will focus on the engineering side, but will interact with systems based on statistics and machine learning.


We are looking for a Data Engineer who will work on collecting, storing, processing, and analyzing huge data sets (financial, blockchain, and social). The primary focus will be on choosing optimal solutions for these purposes, then implementing, maintaining, and monitoring them. You will also be responsible for integrating them with the architecture used across the company.


Responsibilities

Selecting and integrating any Big Data tools and frameworks required to provide requested capabilities

Implementing ETL processes

Monitoring performance and advising on any necessary infrastructure changes

Creating tools to analyze social data and correlate it with the market (NLP, entity extraction, clustering, sentiment analysis)

Skills and Qualifications

2-5 years of experience in data engineering

Strong understanding of distributed computing principles

Experience managing data clusters and databases

Experience with building stream-processing systems

Experience with integration of data from multiple data sources

Knowledge of various ETL techniques and frameworks

Good understanding of Lambda Architecture, along with its advantages and drawbacks

Experience working on AWS or another cloud provider

Proficient in Python

Preferred Qualifications

Knowledge of Big Data querying tools, such as Pig, Hive, and Impala

Experience with NoSQL databases, such as HBase, Cassandra, MongoDB

Experience with Big Data ML toolkits, such as Mahout, SparkML, or H2O

Experience working with unstructured and text data

Blockchain knowledge


New York is the most populous city in the US, with more than 8 million inhabitants, including immigrants from over 180 countries. Travelers are usually attracted to its cosmopolitanism and energy. New York consists of five boroughs: Manhattan, Brooklyn, Queens, the Bronx, and Staten Island. The city is also well known for landmarks such as the Empire State Building, Central Park, and the Statue of Liberty. It is a center for fashion, media, culture, and research, and has a booming tech scene.

The tech scene in New York has changed a lot in recent years. Much of the initial excitement has given way to a stable trend of successful businesses. In both 2014 and 2015, a New York City tech company went public with a valuation of over $1 billion. New York is now the #2 startup ecosystem in the world, with startups raising $11.5 billion in venture capital in 2017. NYC is also the HQ of choice for Spotify, Consensys, and WeWork.