‹ Back

Senior AI Infrastructure Engineer

JOB SUMMARY

GermanyPosted on 1/24/2026

Skills & Technologies

Languages:Python
ML/AI:MLflow
Cloud/DevOps:AWSDockerKubernetes
Tools:CI/CD
Apply

Job details

Senior Software Engineer – AI InfrastructureAbout ArangoAt Arango, we believe the first generation of enterprise AI missed something essential: context. LLM models are powerful, but they didn’t understand the context needed to deliver accurate answers. Arango provides a trusted data foundation for the next wave of Enterprise AI with graph-based Contextual AI - transforming enterprise data into a System of Context that truly represents the business, so LLMs can deliver better outcomes with unlimited scale and cost efficiency.

The Arango AI Data Platform gives developers a single, integrated environment to build and scale AI-powered applications without the complexity of stitching together multiple databases and tools. At its core is a massively scalable multi-model database that unifies graph, vector, document, and key-value data with full-text, geospatial, and vector search — creating the System of Context, the bridge between enterprise data and LLMs.

We’re a global team based in California and Cologne, united by curiosity, collaboration, and a passion for helping developers, data engineers, and technology leaders innovate faster and smarter with AI. Trusted by NVIDIA, HPE, the London Stock Exchange, the U. S. Air Force, NIH, and Articul8, Arango powers enterprise AI with context, confidence, and scale.

We are a proud member of the NVIDIA Inception Program and the AWS ISV Accelerate Program.

If you’re excited about shaping the future of Contextual AI, come build with us.

Location** Only candidates in Europe will be considered. **About the RoleWe are looking for a senior, hands-on software engineer to help maintain, stabilize, and debug our AI infrastructure.

This role has high ownership and requires deep technical problem-solving skills in production environments.

The most important qualification is not a specific language, but a strong understanding of how complex software systems actually behave in production. Key Responsibilities:Maintain, develop and stabilize AI infrastructure servicesArchitect and implement foundational services and shared libraries that scale our entire AI infrastructure eco

System. Debug complex, non-obvious production issues across application, process, network, and memory layers

Systematically analyze, isolate, and resolve failures in unclear scenariosImprove observability, profiling, and tracing across servicesWork with distributed microservices and internal platformsOperate and improve systems running on Docker, Kubernetes, and HelmSupport CI/CD pipelines (CircleCI), testing, and security-relevant componentsWork with MLflow, Triton, and distributed Python modulesCore

Requirements:Senior-level experience (minimum 5+ years)Passion for working with cutting-edge technologies in a fast-moving AI environment, including LLM-based workflows and pipelinesComfortable working independently in a fast-paced, evolving environmentStrong debugging skills in complex, distributed systems with the ability to identify root causes when failures are ambiguousSolid understanding of how software behaves in production environmentsExperience designing and operating microservices, distributed systems, and databasesProven experience building and scaling high-availability servicesCore Technology StackPython (primary language) - 5+ yearsDocker, Kubernetes, Helm - 5+ yearsCI/CD pipelines and testing frameworks (e. g. , CircleCI)Observability tools: metrics, logs, tracing, and profilingExposure to AI/ML infrastructure, including tools such as MLflow and TritonNice to Have:Customer-facing experience, including proofs of concept (PoCs) or technical demosExperience with AI/ML infrastructure and orchestration, i. e. , MLflow and TritonCross-language debugging experienceFamiliarity with RustExperience working with databases, NoSQL, multi-model, or graph databasesKnowledge of Retrieval-Augmented Generation (RAG) and GraphRAG conceptsUnderstanding of graph algorithms and graph-based data modelingWhy Join ArangoDBOur headquarters is in San Francisco (US) and we have an office in Cologne (Germany), but most of our diverse team works remotely worldwide. So, do you prefer your desk at home or do you want to join us at one of our

locations.

Your choice.

The global minds of Arango team comes from 5 different continents and more than 20 countries. Diverse backgrounds enable us to see new solutions.

We invite people from every culture, national origin, religion, sexual orientation, gender identity or expression, and of every age to apply to our positions. All employment decisions are based on business needs, job

requirements, and individual

qualifications. Arango is committed to a workplace free of discrimination and harassment based on any of these characteristics.

We love this diversity and encourage everyone curious and visionary to join the multi-model movement.

Discover the company

Explore other offers from this company or learn more about ArangoDB.

The company

A
ArangoDB
Germany