Overview

Our mission is to make payments safer and easier for everyone. We started with a consumer product (privacy.com) that helps people spend more safely using virtual payment cards. Then we launched a simple, modern API to make our payment and card issuing infrastructure available to other startups, fintechs, and brands. Today, that infrastructure is known as Lithic, and it powers billions of dollars in payments for some of the most innovative companies in the world.

Lithic is a remote-first company and has a distributed team with an office in New York. That means that if you want to work remotely, you can! If you want to drop by the office or work fully in-person, you can do that, too! We’ve raised $100M+ from top-tier investors including Index, Bessemer, Stripes, and Tusk Venture Partners, with a recent Series C that will help us scale.

Lithic is hiring a Site Reliability Engineer who will be simultaneously operating an API growing by double digits month-over-month and standing up the infrastructure to carry us forward through orders of magnitude growth in scale. You will be building the foundational systems in a new SRE practice as well as authoring internal tooling to support our growing business.

The day-to-day work is a blend of operational & project work, with occasional incident management. We run our business from a blend of on-prem & AWS using Terraform & Salt. Our services are implemented in C++ & Python and we are experimenting with Rust. We’re in the process of containerizing & we’re moving towards a service mesh with Nomad, Consul & Traefik. If you enjoy the challenge of building out infrastructure while the load it supports grows by leaps & bounds, we’d like to meet you! We encourage you to apply even if you don’t meet every requirement listed below!

Qualifications: 

  • Demonstrated years of professional software engineering experience in DevOps or SRE roles
  • Bachelor's degree in Computer Science, Computer Systems, or equivalent experience
  • Deep understanding of Linux systems and networking
  • Experience with relational database cluster management (preferably PostgreSQL)
  • Familiarity with Python, Bash, or other scripting languages
  • Ability to write clear, maintainable, thoughtfully commented code
  • Experience with Jenkins, AWS, and Docker
  • Experience with Ansible (or other configuration management tools), Kubernetes, and other CI/CD toolsets is a plus
  • Knowledge of strongly typed languages such as Java and C++ is a plus
  • Understanding of distributed computing concepts including replication and quorum is a plus
  • Experience with REST and other aspects of API development is a plus

Benefits:

  • Health, vision, and dental insurance
  • Unlimited PTO
  • 401(k) match
  • Fully covered membership to One Medical (dependent on location)
  • 1-year membership to Talkspace
  • ClassPass credit