Back to jobs
Job Description
- Architect and manage Large Language Model (LLM) deployments across on-premises (NVIDIA/AMD) and cloud (cloud computing platform, Google Cloud platform (GCP) environments. Audit multi-agent orchestration, agent construction, and vector databases to map data flows and enforce privilege boundaries.
- Use Docker and Kubernetes to orchestrate scalable inference and training environments, optimizing Graphics Processing Unit (GPU) utilization and resource isolation.
- Protect model weights, secure data ingestion, and harden inference endpoints across the Machine Learning operations (MLOps) lifecycle. Investigate and mitigate AI-specific threats (e.g., prompt injection, jailbreaking, data poisoning). Map testing findings to MITRE ATLAS, OWASP for LLMs, and STRIDE models.
- Bridge local high-compute clusters and cloud AI services while maintaining a consistent security posture.
