Company Overview
Casetext has led innovation in legal AI since 2013, applying cutting-edge AI to the law to create solutions that enable attorneys to provide higher-quality representation to more clients, enhance efficiency and accuracy, and gain a competitive advantage. Our leadership and contributions in legal AI have been recognized worldwide, including receipt of the World Economic Forum’s Technology Pioneer award for the development of AI-powered brief analysis tool CARA AI. Today, over 10,000 law firms—from solos and small practices to more than 40 Am Law 200 firms—rely on Casetext to elevate the quality of their law practice. For more information visit www.casetext.com.
About the Role
Casetext is revolutionizing the field of legal research by combining state-of-the-art machine learning and natural language processing technologies. Our platform helps millions of users access high-quality, affordable legal information and insights. We aim to disrupt the traditional legal research landscape and improve access to justice. We're looking for a Senior Machine Learning Researcher who is passionate about leveraging cutting-edge technologies in natural language understanding, neural information retrieval, graphical deep learning, and Large Language Models (LLMs) to better our justice system. This role is entirely remote, but you must be based in the U.S. and authorized to work in the U.S.
As the Sr. ML Researcher You Will:
- Research & Development: Understand challenges in the legal tech sector and design machine learning-based solutions to tackle them. Stay updated with the latest in ML/NLP research, and identify promising avenues for improvement in Casetext’s offerings, including interaction with Large Language Models.
- Experimentation: Design and execute experiments to quantitatively assess improvements over existing baselines in search algorithms and generative AI technologies for legal applications.
- Modeling & Deployment: Work closely with our engineering team to implement, train, and deploy machine learning models in production, including those based on Large Language Models. Evaluate and vet solutions offered by external vendors to improve our AI offerings.
- Team Collaboration: Guide, mentor, and develop less experienced colleagues. Work cross-functionally with team members, including those in Product Management, Engineering, and Marketing.
- Publication & Outreach: Publish novel research and findings in top ML/NLP venues, and maintain a presence in the scientific community.
- Data Management: Oversee data collection and annotation pipelines and work with messy structured and unstructured datasets.
About You:
- Proficient in SQL, SciKitLearn, PyTorch, Huggingface, and other standard ML, NLP, and deep learning libraries
- Strong proficiency in scripting languages (Python & Bash)
- Track record of solving real-world problems and previous publications in ML/NLP conferences or journals
- Familiarity with Large Language Models and techniques for interacting with them (e.g., advanced prompting)
- Experience deploying distributed machine learning systems on AWS, Google Cloud Platform, or similar platforms
- Comfortable working with messy structured and unstructured datasets and experimenting in settings with limited labeled data
- Able to work iteratively and quickly within our fast-paced product development cycles and maintain a results-oriented mindset
- Familiarity with software engineering practices and writing clean code
- Prefer someone with publications in top ML conferences & journals, but not required
- Prefer someone proficient in a compiled language, such as Scala, Java, Rust or C++
- Looking for someone with excellent communication skills who is able to articulate technical concepts clearly to both technical and non-technical stakeholders
- Team player, open to giving and receiving constructive feedback, and experience in mentoring junior team members
- Self-starter/Driven: capable of independently identifying, pursuing, and delivering on research directions
- Someone with a sincere interest and passion for NLP, generative AI, and knowledge retrieval, especially for the legal domain
- A Ph.D. in Computer Sciences, Data Science, Machine Learning, Math, Physics, or a related quantitative field. Alternatively, an MS with a strong technical background and research experience (i.e., publications).
Salary Range: $170-$190k
Casetext Benefits
- Competitive compensation
- Exciting and meaningful work with an ambitious and passionate team
- You’ll be a leader at a fast-growing start-up, take on a lot of responsibility, and play a substantial role in the future of the company
- Medical, dental, and vision insurance is covered for you, and we cover 50% for spouses and dependents
- Health FSA & Dependent Care FSA
- Short-Term & Long-Term Disability
- Professional Development Budget
- Annual Wellness Budget
- One-time Technology Budget
- Flexible, remote-first work culture
- Generous parental leave
- Unlimited PTO
- We’re a close-knit team of smart, driven people who really enjoy working together
On August 17, 2023 Thomson Reuters acquired Casetext, Inc. Over the coming months, Casetext and Thomson Reuters will work together to integrate the business and employees into Thomson Reuters. More will be communicated in the coming months about the transition.
Casetext is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All aspects of employment, including the decision to hire, promote, discipline, or discharge, will be based on merit, competence, performance, and business needs. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, veteran status, or any other status protected under federal, state, or local law.