Data Engineer

Dec. 22, 2022

Remote, EU

Tessian Tessian protects every business’s mission by securing the human layer 💫 We're building intelligent security that works for human beings as they are, not how security policies would like them to be. Using machine learning technology, Tessian automatically predicts and eliminates advanced threats on email caused by human error - like data exfiltration, accidental data loss, business email compromise and phishing attacks - with minimal disruption to employees' workflow. As a result, employees are empowered to do their best work, without security getting in their way. You can read more about Human Layer Security here. Tessian has raised a $74m Series C led by renowned security investors, March Capital (Crowdstrike, Knowbe4), with follow-on from Sequoia, Accel, Balderton, Latitude, Okta, Sozo, Citi & Schroder Adveq, to further our mission to secure the human layer. Founded in 2013, Tessian is backed by world-class venture capital investors, has hubs in London, San Francisco, Boston, and Austin, and London, and is one of the Top-3 2021 Best Places to Work for Women. As a high-growth scale-up, our email datasets are growing at an exponential rate. This is a great problem to have as it allows us to train best-in-class machine learning models to prevent previously unpreventable data breaches. We are facing interesting challenges with scaling our data processing pipelines and MLOps. You will work as part of  a multidisciplinary team of Data Scientists and back end engineers to build out infrastructure and data pipelines empowering them to perform iterative machine learning research across terabytes of data. We view this role as a hugely impactful, high-leverage role. By providing our Data Science teams with better tools and processes to reduce the friction involved in creating data for ML research, we can deliver more value to our clients through the data breaches we prevent.Some interesting projects we’re working on:Creating raw email data collection and transformation pipelines to support our ML researchBuilding feature generation pipelines which can scale to terabytes of email data to enable Data Science teams to perform iterative machine learning researchDesigning our next generation data-lake to handle massive future scale using modern formats like Apache IcebergCreating a framework allowing us to standardise how we deploy all our ML models to production Responsibilities:Develop scalable and efficient data pipelines which can collect and transform our raw email datasets to enable our Data Scientists to rapidly iterate on machine learning R&DBuilding feature processing pipelines which can be used to create and test new features as part of machine learning experimentsEnsuring experiments are reproducible by ensuring data lineage across machine learning pipelines to track data and model versions from data ingestion through to deployed modelsAutomating model training and evaluation workflows to enable us to rapidly test improvements to Tessian’s machine learning threat detection modelsStreamlining model deployment and monitoring of machine learning models in productionIncorporate labelling processes into our data collection workflow Qualifications:You are a highly-skilled developer who understands software engineering best practices (git, CI/CD, testing, reviewing, etc) and infrastructure as code principles.You have experience working with distributed data processing systems such as SparkYou have designed and deployed data pipelines and ETL systems for data-at-scaleYou have a deep knowledge of the AWS ecosystem and have managed AWS production environmentsYou have experience with workflow systems such as Apache AirflowYou understand what is involved in deploying machine learning solutions, and have ideally been involved in machine learning projects from automated training through to deploymentIdeally you have worked closely as part of or with a data science team, to provide robust and scalable data to power the research they are conductingHas an ability to break down complex problems into concrete, manageable components and think through optimal solutionsEnjoys “getting their hands dirty” by digging into complex operationsTakes a high degree of ownership over their workIs a clear communicator with professional presenceHas strong listening skills; open to input from other team members and departments Why we think you'll love it here 😍....  It’s important to us that all Tessians are part of the journey we’re on, so we offer equity options with every role and benchmark to provide above market rate salaries - there’s plenty more too….  Be at your best, both inside and outside of work  • 25 days of paid holiday (plus 8 bank holidays, and an additional day for every year you've worked at Tessian!) • Private health insurance provided through Vitality Health and mental health support through our Employee Assistance Program • Up to 60 days of working abroad, limited to 30 days per trip a year• Spill - employee mental health support through Slack• Classpass - subsided access to gym time and classes all across London • Choice First: Do your best work, in the way that works best for you• Flexible working hours and working from home (if you're not already remote!) • Enhanced pension contributions, matched up to 5% • We’re family friendly, with policies built to support you in all stages of life • High-quality tech kit provided for you to work on, plus Tessian ANC headphones • If you're relocating to join the team, we'll provide a contribution to help with your costs • Fertility support via Carrot covering adoption, surrogacy support, fertility treatments, support during pregnancy and more Beyond work  • Elite membership of the Tessian House System... • Every other Wednesday we get together to share team updates and drinks • Monthly team socials & a big, whole company extravaganza every quarter • Never-ending ping-pong tournaments ,,,and here are another 200 reasons! Equality & diversity ⚖️ Our mission to empower and protect people is a reflection of two of our values: Human First and We Do the Right Thing. For us, Diversity, Equity and Inclusion is also a reflection of these core values.  As a human first company, we are committed to creating a diverse, equitable and inclusive environment where all our Tessians have the opportunity to thrive. We strive for a better Tessian, and a better world. We're working inside and outside Tessian to improve diversity and equity in our industry, and foster an environment where everyone feels a sense of belonging. Our strategy touches each part of a Tessian’s life cycle, from applicant to employee, ensuring that we keep DEI at the core of every point in our candidate and employee experience. Read more about our DEI commitments here. Obligatory small print Please note that we do not accept applications or résumés from recruiters. Any unsolicited CVs, profiles, or names, submitted in any format, by any channel, to any of our team, will be deemed to fall outside any terms and/or conditions with either the person submitting the information or their company of employment/representation. By submitting your application to Tessian, you consent to Tessian retaining your information and contacting you about future job opportunities, that may be of interest, for up to 2 years in accordance with our Privacy Policy Please note, that any job offers will be subject to the candidate passing background screening checks. We're a #LI-Remote company offering Choice First working practices where possible.

Built with ❤️ for the ML Community by Dom © 2023 RemoteML