Lead Software & Ai Architect

Description:

We are looking for a candidate who will be part of the Architecture team within the EY Client Technology's Engineering Architecture Practice. The candidate will be responsible for defining and delivering architectures and designs for business-critical global products and platforms, with focus on scalable distributed systems, cloud-native solutions, modern enterprise architectures, and agentic AI systems. You'll architect solutions hosted in a variety of infrastructure/environments considering reusability, scalability, reliability, observability, and modern design patterns in alignment with EY's mission and strategy. As an architect, you'll embody the principles of the architecture elevator—connecting strategic business vision with hands-on technical implementation while bridging the gap between executive leadership and engineering teams.

You Will

Build relationships across the business to understand their requirements and strategy, helping them build product/platform roadmaps including agentic AI solutions. Demonstrate architecture leadership to translate business requirements into architecture blueprints and innovative designs incorporating modern distributed systems patterns, microservices architectures, agent orchestration patterns, multi-agent collaboration frameworks, and cloud-native solutions.
Own the overall technology, architecture and design landscape of products/platforms from solutioning to delivery to adoption phases, with deep understanding of distributed systems, containerization, orchestration platforms, agent frameworks (LangGraph, Microsoft Agent Framework), agent communication protocols (MCP, A2A), reasoning techniques, and enterprise security patterns.
Design and implement enterprise-scale distributed and agentic AI solutions using modern protocols and frameworks, with expertise in web APIs (REST, GraphQL), Model Context Protocol (MCP), Agent2Agent Protocol (A2A), containerization with Kubernetes, event-driven architectures, and robust access control mechanisms using OAuth 2.0/OpenID Connect, along with AI guardrails using frameworks like NeMo Guardrails, and Guardrails AI.
Potentially lead engineering teams in a portfolio or large product, set the overall technology and architecture direction and own its end-to-end delivery in a lead architect's capacity. Act as the critical glue between Engineering, Product Management, AI Safety, and Operations teams, ensuring seamless collaboration and alignment across all stakeholders.
Solve complex problems where scalable solutions may not currently exist, necessitating the ideation and incubation of new distributed system approaches, multi-agent approaches, advanced reasoning patterns, modern architectural patterns, and cutting-edge technologies. Conduct technical feasibility assessments, buy/build analysis for platforms, vendor product evaluations, proof of concepts (with hands-on development) and present various solution options to the business with appropriate recommendations.
Work closely with multi-disciplinary teams including engineers, AI engineers, prompt engineers, AI safety specialists, data scientists, platform teams, and infosec to deliver modern architectures using optimized delivery approaches, ensuring compliance with enterprise standards, AI governance standards, security policies, and governance guidelines (OWASP GenAI Top 10).
Provide leadership in product delivery to build and consume reusable functional and technical assets. Identify, define and implement reusable components, API libraries, containerized services, agent templates, reasoning patterns, tool integration libraries, evaluation frameworks, and architectural patterns that can be leveraged across the enterprise.
Architect and design distributed systems and multi-agent systems that can seamlessly interact with multiple enterprise systems, APIs, and data sources while maintaining security through proper authentication implementation and AI safety through guardrail implementation, scalability through containerization and Kubernetes orchestration, and reliability through event-driven architecture patterns and observability via Open Telemetry evolving AI standards
Provide support to product and project managers in building plans, forecasts, estimations and costs for new products/platforms, including considerations for infrastructure resources, LLM API costs, token usage optimization, containerization costs, and ongoing platform operational requirements.
Provide coaching, mentoring and support to development teams during the delivery phase across the ecosystem, frameworks and platforms. Ensure that developed artifacts comply with envisioned architecture and design, expected quality, security standards, AI safety standards, and recommended development practices through code reviews, prompt reviews, agent workflow validation, and architecture validation processes.
Can work across multiple projects with varied stakeholders, operating as the architecture elevator that connects boardroom strategy with server room implementation. An architect sets architectural direction for scalable systems, builds consensus around technology strategy, mediates conflicts regarding implementation approaches, and provides technical leadership and advisory services while ensuring alignment between Engineering, Product Management, AI Safety, and Operations teams.
Demonstrate excellent interpersonal communication and organizational skills required to operate as a leading member of global, distributed engineering teams. Excel at translating complex technical concepts (including agentic AI concepts) to business stakeholders and business requirements to engineering teams.
Cultivate lasting relationships across business, IT, technology vendors, and industry experts to maintain insight into the broader technology landscape and agentic AI landscape as well as emerging trends. Understand new trends and emerging technologies in multi-agent systems, reasoning techniques, AI safety, agent protocols, and how they apply to EY's business context.

To Qualify for the Role, You Must Have

Architecture & Leadership

Led and governed global engineering teams and architects, driving the technology and architecture landscape of large and complex portfolios of products/solutions/projects, achieving measurable success in implementation and enterprise adoption. Demonstrated experience as an architecture elevator, connecting strategic business vision with hands-on technical execution.
Expert-level proficiency in at least one of the following programming languages: C#, Java, JavaScript, TypeScript, or Python, with demonstrated ability to architect and implement enterprise-scale solutions including AI-powered systems using your chosen language stack.
Deep hands-on experience with modern protocols and frameworks including web APIs (REST, GraphQL), containerization technologies (Docker, Kubernetes), event-driven architecture patterns, enterprise security implementations using OAuth 2.0/OpenID Connect, and emerging AI protocols such as Model Context Protocol (MCP) and Agent-to-Agent (A2A) communication frameworks. Proven ability to design and implement enterprise-scale distributed solutions.
Architecture leadership skills with advanced understanding and practical experience in modern architecture patterns, with proven track record in complex, large-scale implementations serving user bases in the hundreds of thousands or more across the entire product lifecycle. Experience leading engineering teams and acting as the critical connector between Engineering, Product Management, and Operations.
Hands-on deep experience in architecture including application integration, cloud infrastructure architecture, data pipelines, security architectures, and successful delivery of both functional and non-functional aspects of platforms using cloud-native services, full-stack applications, DevSecOps, security practices, and automated deployment pipelines.
Comprehensive knowledge of modern enterprise technologies including web APIs, microservices architectures, containerization and orchestration platforms (Docker, Kubernetes), event-driven systems, message queuing, and enterprise security patterns including OAuth 2.0, OpenID Connect, and identity management solutions.
Deep understanding of distributed system architectures, multi-service systems, API gateway patterns, and the ability to design scalable systems that can handle complex enterprise workloads with proper load balancing, caching strategies, and fault tolerance across enterprise environments.
Expert knowledge of data management including relational and NoSQL databases, data streaming platforms, real-time data processing, and data governance. Understanding of platform lifecycle management, A/B testing strategies, and performance monitoring.
Strong background in security best practices, enterprise compliance requirements, and regulatory standards in enterprise environments.

Agentic AI & Advanced AI Systems

2+ years of hands-on experience architecting and implementing enterprise-scale agentic AI systems, with proven track record in designing and deploying multi-agent systems and AI orchestration platforms serving significant user bases.
Expert-level proficiency in major agent frameworks including LangChain/LangGraph (stateful graph-based agent workflows), Microsoft Ecosystem (AutoGen for multi-agent systems, Semantic Kernel, migration to Microsoft Agent Framework). Experience with at least 2+ different agent orchestration frameworks in production environments.
Expertise in agent communication protocols and standards: Model Context Protocol (MCP), Agent2Agent Protocol (A2A), AG-UI event-based protocols, OpenAPI, JSON-RPC, Server-Sent Events (SSE), and WebSockets for agent integration.
Knowledge of advanced reasoning techniques with production implementation experience: ReAct (Reasoning + Acting), Chain-of-Thought (CoT), and Tree of Thoughts (ToT). Expert knowledge of planning patterns including Multi-Plan Selection, External Module-Aided Planning, Memory-Augmented Planning, and Task Decomposition.
Hands-on experience with AI safety frameworks and guardrail implementation: NeMo Guardrails (NVIDIA), Guardrails AI, Azure AI Content Safety or equivalent enterprise platforms. Proven experience implementing multi-layer guardrail strategies including input guardrails (prompt injection detection, jailbreak prevention, PII masking), output guardrails (hallucination detection, toxicity screening, fact-checking), runtime guardrails (token-level monitoring, confidence scoring, circuit breakers), and interaction-level guardrails (tool use restrictions, decision caps, human-in-the-loop gates).
Experience with LLM evaluation frameworks: DeepEval, RAGAS, LangSmith. Knowledge of evaluation methodologies including LLM-as-a-Judge patterns, Human-in-the-Loop (HITL) annotation and feedback systems, and A/B testing for agent/prompt variants. Deep understanding of key metrics including performance, efficiency, quality, and safety metrics.
Production observability implementation: Span-level tracking and distributed tracing for multi-agent systems, cost attribution and token counting, prompt version control, real-time quality monitoring, and user feedback loops.
Expert-level skills in RAG (Retrieval-Augmented Generation) architectures: Agentic RAG, GraphRAG, Multi-Modal RAG. Production experience with two or more vector databases (MongoDB Atlas Vector Search, Redis, Pinecone, Chroma). Deep understanding of memory architectures for agents including working memory, persistent memory, semantic memory, and episodic memory.
Proven experience designing and implementing multi-agent systems: Agent coordination mechanisms, inter-agent messaging, shared knowledge bases, collaborative memory, task delegation and work distribution algorithms. Mastery of orchestration patterns including sequential, concurrent, and hierarchical orchestration, dynamic handoff and context-based routing, human-in-the-loop integration, and event-driven agent orchestration.
Cost optimization expertise: Token usage optimization, model selection strategies, and ROI analysis for agentic AI systems.
Demonstrated passion for AI technologies and their potential to transform enterprise solutions, with eagerness to stay current with AI advancements and integrate them thoughtfully into architectural decisions.
Strong background in modern software architecture patterns including event-driven architectures, data streaming, real-time processing, and microservices-based platforms.

Ideally, You'll Also Have

Bachelor's or Master's Degree in Engineering, Computer Science, Data Science, AI/ML, Mathematics, or related technical field.
15+ years overall IT industry experience, and 5+ years in architect roles, including experience with enterprise platform implementations, distributed system deployments, multi-agent system deployments, and leading cross-functional engineering teams.
Achieved one or more relevant certifications such as Microsoft Azure Solutions Architect, Microsoft Azure AI Engineer Associate, AWS Solutions Architect, Google Cloud Professional Architect, Google Cloud Professional AI Engineer, TOGAF, Kubernetes certifications (CKA, CKAD), DeepLearning.AI specializations, or emerging distributed systems architecture certifications.
Experience with modern development frameworks and tools such as container orchestration platforms, API management tools, service mesh technologies, or similar distributed systems development ecosystems.
Thought leadership in technology and agentic AI - presenting ideas, products, research papers, and concept papers to varied audiences at AI conferences, building technology points of view and presenting them in industry forums/events, contributing to open-source projects, and published research on AI.
Experience in designing and implementing business use-cases using low-code/no-code platforms, data reporting, data management and governance technologies, mobile applications, and deep cloud security concepts.
Experience with specialized agentic AI capabilities: Multi-modal agents (text, vision, audio, video processing), code generation agents (automated programming, code review, bug detection), and domain-specific agents (finance, healthcare, legal, customer service).
Knowledge of enterprise security including OAuth 2.0/OpenID Connect implementation, API security, container security, and secure deployment practices.

Organization	EY
Industry	IT / Telecom / Software Jobs
Occupational Category	AI Architect
Job Location	Texas,USA
Shift Type	Morning
Job Type	Full Time
Gender	No Preference
Career Level	Experienced Professional
Experience	15 Years
Posted at	2025-12-25 3:59 pm
Expires on	2026-03-13