In this role you will build very large, scalable platforms using cutting edge data technologies. This is not a “maintain existing platform” or “make minor tweaks to current code base” kind of role. We are effectively building from the ground up and plan to leverage the most recent Big Data technologies. If you enjoy building new things without being constrained by technical debt, this is the job for you!
You will help define company data assets (data model), spark, sparkSQL and hiveSQL jobs to populate data models
You will help define and design data integrations, data quality frameworks and design and evaluate open source/vendor tools for data lineage
You will work closely with Dropbox business units and engineering teams to develop strategy for long term Data Platform architecture
BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience
4+ years of Python or Java development experience
4+ years of SQL experience (No-SQL experience is a plus)
4+ years of experience with schema design and dimensional data modeling
Proven ability in regards to managing and communicating data warehouse plans to internal clients.
Experience designing, building and maintaining data processing systems
Experience working with either a Map Reduce or a MPP system on any size/scale