webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models. We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI.

About the Role:

We are seeking an AI Systems Engineer to support our Public Sector initiatives focused on building and optimizing production ready AI systems for secure and distributed environments. This role sits at the intersection of machine learning, systems engineering, and deployment optimization, bridging research and real world implementation.

You will be responsible for transforming prototype models into scalable, efficient, and reliable production systems that operate seamlessly across a spectrum of hardware from government cloud infrastructure to edge devices in restricted or disconnected environments.

Responsibilities:

Productionize AI models from research prototypes into scalable, deployable systems used in real world applications.
Develop, fine tune, and optimize models using PyTorch, TensorFlow, or Hugging Face Transformers, adapting both open and closed source models.
Implement model optimization techniques such as quantization, pruning, distillation, and hardware specific acceleration.
Engineer systems for dynamic model adaptation using low rank adaptation (LoRA), parameter efficient fine tuning (PEFT), and on device inference strategies.
Build and maintain Retrieval Augmented Generation (RAG) pipelines, including vector database integration for contextual retrieval.
Work with multi modal AI systems across computer vision, audio, and natural language domains.
Employ synthetic data generation and digital twinning techniques (GANs, diffusion models, or simulation based) to create robust datasets for edge cases.
Develop GPU accelerated and low level system code in C, C++, or Rust for performance critical operations.
Optimize model execution for distributed and resource constrained environments, ensuring reliability under variable connectivity conditions.
Collaborate cross functionally with Infrastructure, MLOps, and Security teams to deliver secure, compliant, and high performance AI solutions for government partners.

Qualifications:

Active US Security clearance or eligibility and willingness to obtain a US Security clearance
5+ years of experience in applied AI, ML engineering, or production AI systems.
Deep proficiency in PyTorch, TensorFlow, or Hugging Face Transformers.
Proven experience deploying AI models across cloud, edge, and mobile hardware environments.
Expertise in model compression and optimization (quantization, pruning, distillation).
Strong understanding of GPU computing, CUDA, and performance profiling.
Experience building RAG pipelines and integrating vector databases (e.g., FAISS, Milvus, Pinecone).
Familiarity with multi modal models and synthetic data generation methods.
Low level programming experience in C, C++, or Rust with understanding of computer memory and concurrency.
Strong algorithmic and problem solving skills, especially in distributed or constrained compute environments.

Preferred Skills

Experience with edge AI, federated learning, or offline inference systems.
Familiarity with distributed training frameworks such as DeepSpeed or Ray.
Understanding of AI governance and compliance frameworks relevant to public sector deployments.
Experience integrating models into large scale distributed systems or microservice architectures.
Excellent communication and technical documentation skills for collaboration across multi disciplinary teams.

We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following:

Truth - Emphasizing transparency and honesty in every interaction and decision.
Ownership - Taking full responsibility for one’s actions and decisions, demonstrating commitment to the success of our clients.
Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement.
Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.

Benefits:

Competitive salary and performance-based incentives.
Comprehensive health, dental, and vision benefits package.
401k Match (US-based only)
$200/mos Health and Wellness Stipend
$400/year Continuing Education Credit
$500/year Function Health subscription (US-based only)
Free parking, for in-office employees
Unlimited Approved PTO
Parental Leave for Eligible Employees
Supplemental Life Insurance

webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.

This job is no longer accepting applications

See open jobs at webAI.See open jobs similar to "Senior Machine Learning Engineer" G2 Venture Partners.

See more open positions at webAI

Powered by Getro.com

Privacy policy Cookie policy