Software Systems Architect - DNA Data Storage Operations

Catalog is a developer of a data conversion and storage platform used to offer the next generation of digital data archives.

CATALOG is a well-funded, rapidly expanding, Boston-basedp on a mission to enable new paradigms for data preservation and computation. We are looking for a seasoned systems architect to work within an interdisciplinary team of biologists, engineers, and computer scientists.  You will be responsible for optimizing data storage and computing to enable a complex high-throughput workflow: encoding data files into DNA-based code, automated execution of DNA synthesis, DNA sequencing, and decoding DNA sequence data.  As part of our rapidly expanding group, you will help to optimize processes and pipeline data management for the generation of molecular datasets at a scale enabling the world’s first DNA storage and computing platform.
 
In this role you will:
-       Design the CATALOG software stack that integrates core pipelines up into external-facing APIs under SWT and DCT input
-       Optimize and support infrastructure enabling Hybrid Cloud HPC capabilities
-      Prototype and communicate specific solutions for optimizing workflows
-       Provide guidance to existing and potential users on how to use the HPC cluster and cloud computing for specific projects or workflows: scripting, programming, debugging services, workflow management, software installation, and data management
-       Design and maintain systems to ensure security, reliability, and resiliency
-       Oversee a Jr Software Engineer for DevOps, in charge of maintaining core pipelines and keeping them up, updated, and supporting users, probably under TOT
-       Oversee strategies for managing data, scaling capabilities, implementing telemetry, and continuously improving pipeline processes

We are looking for someone driven by:
-       Creating an infrastructure, team, and processes that will efficiently scale to meet future throughput demands
-       An environment demanding fast-paced collaborative problem solving
-      The opportunity to help bring a ground-breaking technology to market 

You will also need to be able to:
-       Understand user requirements and translate those into reliable and scalable system designs
-       Train colleagues as necessary to drive technology adoption and best practices
-       Handle a variety of tasks, effectively solve problems with numerous and complex variables, and be able to shift priorities rapidly

Ideally, we are looking for someone with:
-       A BS or higher degree in computer science or software engineering
-       Proven track record designing secure, reliable, scalable, and maintainable systems through the full product lifecycle
-       Expertise in cloud architecture, networking, system administration, and security best practices
-       Experience integrating, optimizing, troubleshooting and debugging complex systems
-       Experience in designing & building high performance full-stack applications dealing with terabytes of data with APIs backed by scalable databases
-       Ability to design and write technical documents, diagrams, and runbooks
-       Experience with a public cloud environment (eg. GCP, Azure, AWS)
-       Proficiency with a wide range of programming languages, API and UI frameworks, including Python, C/C++, GRPC, etc.
-       Experience directing junior software engineers
-       Ability to develop a technical roadmap that evolves from simple MVP to full-featured systems and get buy-in from stakeholders 
-       Ability to communicate effectively with a diverse team including scientists and engineers

Preferred candidates will have experience with many of the following:
-       Application deployments and improvements utilizing cloud native services
-       Continuous Integration and Continuous Delivery (CI/CD) processes and toolsets such as Puppet, Chef, Ansible, Jenkins or other automation tools
-       Iterative template creation experience with Terraform or similar
-       Version control (Git, Github)
-       Automated software build environments
-       Containerization (Docker, Singularity/Apptainer)
-       Cloud orchestration and parallelization using Kubernetes or equivalent
-       HPC distributed file systems, SLURM, and parallel programming
-       Familiarity with ML/AI
 
A career at CATALOG will challenge you to think critically and work creatively in order to conquer new problems in an unexplored and groundbreaking field. To do this, we believe in building an inclusive team with different points of view. CATALOG supports and celebrates diversity in our workforce. We provide competitive compensation packages and benefits including medical, dental, and vision insurance, 401(k) with company matching, and unlimited PTO. 
 
Visa sponsorship is not available for this position.  

Apply now
Please let 
CATALOG
 know you found this job on Climate Jobs List. This will help us grow!
Apply now
Apply now
Please let 
CATALOG
 know you found this job on Climate Jobs List. This will help us grow!
Apply now
About this role
Job posted
Job type
Experience level
We contribute 5% of every payment to remove CO2 from the atmosphere with Stripe Climate