Sr. Software Engineer (AI Systems & Infrastructure)
On-site · Columbia, Maryland, United States
Job Summary
Lead the design and development of scalable LLM-powered applications and services. Architect infrastructure solutions that support rapid iteration and deployment of AI features. Collaborate directly with product teams to translate user needs into technical solutions. Build and maintain the platforms that enable your team to ship AI features quickly and reliably. Develop and manage automation tools to improve system reliability and development efficiency. Implement and maintain monitoring, alerting, and logging systems. Conduct capacity planning and performance tuning for AI workloads. Lead and participate in incident response and post-mortem analyses. Mentor junior team members and contribute to the overall growth of the engineering team. Continuously identify and implement improvements to our systems and development processes.
Required Qualifications
- 12+ years of software engineering with a focus on scalable systems
- SWE with AI experience (LLM, RAG, MCP)
- Active and current TS.SCI w FSP
- Strong full-stack development experience
- Programming skills in Python, Go, or Java
- Extensive experience with cloud platforms (AWS, GCP, Azure)
- Proficiency with Docker and Kubernetes
- Experience with infrastructure-as-code tools (Terraform, Ansible, Puppet)
- Expertise in monitoring/observability tools (Prometheus, Grafana, ELK)
- Familiarity with CI/CD pipelines
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.