Back to jobs
Job Description
- Drive 0-to-1 technical engagements, designing agent reasoning and guardrails while personally refining retrieval-augmented generation (RAG) pipelines, prompt chains, and application programming interface (API) integrations.
- Design evaluation pipelines using "gold datasets" and automated "judge" LLM frameworks to benchmark latency, accuracy, and brand fidelity.
- Bridge non-deterministic LLM outputs with deterministic systems like Salesforce and ServiceNow to ensure safe, reliable, and mission-critical operations.
- Deploy and stress-test pre-GA features in real-world environments, providing field intelligence to influence Google cloud product and development roadmaps.
- Author architectural patterns and prompt development guides while implementing ingestion pipelines for conversational data using Gemini and Vertex AI.
