[h2][b]About Futureproof Labs[/b][/h2]Futureproof Labs is an AI-native product studio building human-first AI products designed for real-world speed and impact.

You’ll be joining a team focused on transforming raw AI capability into polished, commercially deployable products. [h2][b]Role Summary[/b][/h2]We are seeking a Mid to Senior AI Engineer specializing in Small Language Models (SLMs) to design, finetune, and optimize lightweight models for fast, efficient inference in production environments.

This is a focused ML engineering role, Your primary responsibility will be improving model performance, efficiency, deployment, and reliability for Futureproof Labs’ AI products.

We are looking to add the absolute best engineering talent to bolster our team as we build AI products for the US and Canadian markets.

You'll get to work with experienced and established founders in North America as you build cutting-edge solutions that can have a global impact. [h2][b]Key Responsibilities[/b][/h2][ml][ul][li indent=0 align=left]Design, train, finetune, and evaluate small/efficient language models for targeted tasks. [/li][li indent=0 align=left]Optimize models for speed, memory efficiency, and cost-effective inference on local or cloud infrastructure. [/li][li indent=0 align=left]Implement techniques like quantization, distillation, pruning, and specialized training pipelines. [/li][li indent=0 align=left]Build robust evaluation pipelines: benchmarks, accuracy tests, latency profiling, and regression checks. [/li][li indent=0 align=left]Work closely with product teams to align model behavior with real-world use cases. [/li][li indent=0 align=left]Integrate SLMs with existing backend systems and APIs. [/li][li indent=0 align=left]Ensure versioning, monitoring, safety checks, and performance tracking of deployed models. [/li][li indent=0 align=left]Keep up with emerging SLM research and propose improvements. [/li][/ul][/ml][h2][b]Required Skills Experience[/b][/h2][ml][ul][li indent=0 align=left]At least 3–5+ years of hands-on ML/AI engineering experience. [/li][li indent=0 align=left]Strong experience training or fine tuning small or efficient[/li][li indent=0 align=left]Solid understanding of optimization techniques: quantization, pruning, distillation, low-rank finetuning (LoRA/QLoRA). [/li][li indent=0 align=left]Strong Python skills and familiarity with PyTorch or JAX. [/li][li indent=0 align=left]Experience deploying models in production with attention to speed, cost, and reliability. [/li][li indent=0 align=left]Ability to build clean, maintainable ML pipelines (training, inference, monitoring). [/li][/ul][/ml][h2][b]Nice to Have[/b][/h2][ml][ul][li indent=0 align=left]Experience with retrieval-augmented pipelines (RAG) optimized for small models. [/li][li indent=0 align=left]Familiarity with on-device ML or edge deployment. [/li][li indent=0 align=left]Knowledge of evaluation frameworks, safety testing, and alignment techniques. [/li][li indent=0 align=left]Understanding of GPU/accelerator performance tuning. [/li][/ul][/ml].