Bulge Bracket Investment Banks

Posted 4 days ago

No clicks

**Role: Applied AI ML Director, Palo Alto, CA, USA** Build, optimize, and scale production-grade Large Language Model (LLM) systems serving hundreds of thousands of professionals daily. Lead a cross-functional team to deliver innovative, real business-impact AI solutions. Key responsibilities include architecting robust, reusable APIs, agentic workflows, and processing millions of documents while automating complex financial tasks. Requirements: - PhD (or equivalent experience) in Computer Science, Mathematics, Statistics, or related field - Extensive ML engineering experience, with a proven track record shipping production AI systems - Deep expertise in NLP, computer vision, and/or multimodal LLM algorithms, with a strong foundation in statistics, optimization, and ML theory - Practical experience implementing distributed, multi-threaded, and scalable applications using frameworks like Ray, Horovod, DeepSpeed - Exceptional communication skills with the ability to build trust and influence stakeholders at all levels Preferred Qualifications: - Advanced proficiency in designing and deploying production ML pipelines using DAG frameworks - Expertise in architecting and implementing high-throughput, low-latency microservices with gRPC, REST, and GraphQL - Hands-on experience with parameter-efficient fine-tuning, model quantization, and quantization-aware training for LLMs at scale - Deep knowledge of distributed training strategies, memory optimization, and inference acceleration for large-scale multimodal models - Experience with advanced agentic workflow orchestration and integration with enterprise event-driven architectures

Compensation: Not specified USD
City: Palo Alto
Country: United States

Full Job Description

Location: Palo Alto, CA, United States

At the heart of JP Morgans AI transformation is the Chief Analytics Office (CAO) the team responsible for driving the firmwide adoption of artificial intelligence and advanced analytics. The CAO leads the strategy, governance, and delivery of AI/ML products across the company, ensuring that innovation is balanced with responsibility, security, and ethical best practices. By overseeing the build, adoption, and maintenance of AI solutions, the CAO empowers every line of business to leverage AI/ML at scale and unlock new value for clients and stakeholders.

As a Generative AI Executive Director within our CAO organization, youll be at the forefront of building and optimizing production-grade LLM systems (LLM Suite) that serve hundreds of thousands of professionals every day. Youll architect and scale robust, reusable APIs and agentic workflows that process millions of documents and automate complex financial tasks. Collaboration is key: youll work closely with teams across ML Engineering, Product Management, and Cloud Engineering to deliver solutions that are measured, budgeted, and built for real business impact.

Your technical decisions will directly shape how JP Morgan leverages AI at enterprise scale. Youll ensure LLM Suite is designed for reliability, scalability, and performance enabling other teams to build on top of our infrastructure and accelerate innovation firmwide. This is not a research sandbox; its production infrastructure with executive visibility and measurable ROI.

If youre ready to lead engineering teams, ship production AI, and make decisions that influence the future of financial technology, we invite you to join us and help define the next era of enterprise AI at JP Morgan.

Job Responsibilities

Architect and deliver production LLM based systems (text, image, speech, video) powering mission-critical LLM Suite products.

Own end-to-end delivery, performance, and continuous improvement of individual LLM Suite products.

Bridge advanced AI research with robust engineering to build innovative, production-ready solutions.

Drive results with an entrepreneurial mindset in a fast-paced, high-impact environment.

Required qualifications, capabilities, and skills

PhD or equivalent experience in Computer Science, Mathematics, Statistics, or a related quantitative discipline.

Extensive hands-on experience as an individual contributor in ML engineering, with a proven track record of shipping production AI systems.

Deep expertise in NLP, Computer Vision, and/or Multimodal LLM algorithms, with a strong foundation in statistics, optimization, and ML theory.

Practical experience implementing distributed, multi-threaded, and scalable applications using frameworks such as Ray, Horovod, DeepSpeed, etc.

Exceptional communication skills, able to convey complex technical concepts and build trust with stakeholders at all levels.

Preferred qualifications, capabilities, and skills

Advanced proficiency in designing and deploying production ML pipelines using DAG frameworks, including custom operator development and pipeline optimization.

Expertise in architecting and implementing high-throughput, low-latency microservices with gRPC, REST, and GraphQL, including protocol buffer schema design, streaming endpoints, and load balancing.

Hands-on experience with parameter-efficient fine-tuning (LoRA, QLoRA, IA3), model quantization (INT8, FP16, GPTQ), and quantization-aware training for LLMs at scale.

Deep knowledge of distributed training strategies (data/model/pipe parallelism), memory optimization, and inference acceleration for large-scale multimodal models.

Experience with advanced agentic workflow orchestration, including multi-agent coordination, stateful task management, and integration with enterprise event-driven architectures.

FEDERAL DEPOSIT INSURANCE ACT: This position is subject to Section 19 of the Federal Deposit Insurance Act. As such, an employment offer for this position is contingent on JPMorganChases review of criminal conviction history, including pretrial diversions or program entries.

#LI-RB3

Lead delivery of production generative AI products and APIs that scale firmwide and drive measurable business outcomes.

Full Job Description

Location: Palo Alto, CA, United States

Job Responsibilities

Architect and deliver production LLM based systems (text, image, speech, video) powering mission-critical LLM Suite products.

Own end-to-end delivery, performance, and continuous improvement of individual LLM Suite products.

Bridge advanced AI research with robust engineering to build innovative, production-ready solutions.

Drive results with an entrepreneurial mindset in a fast-paced, high-impact environment.

Required qualifications, capabilities, and skills

PhD or equivalent experience in Computer Science, Mathematics, Statistics, or a related quantitative discipline.

Extensive hands-on experience as an individual contributor in ML engineering, with a proven track record of shipping production AI systems.

Deep expertise in NLP, Computer Vision, and/or Multimodal LLM algorithms, with a strong foundation in statistics, optimization, and ML theory.

Practical experience implementing distributed, multi-threaded, and scalable applications using frameworks such as Ray, Horovod, DeepSpeed, etc.

Exceptional communication skills, able to convey complex technical concepts and build trust with stakeholders at all levels.

Preferred qualifications, capabilities, and skills

Advanced proficiency in designing and deploying production ML pipelines using DAG frameworks, including custom operator development and pipeline optimization.

Expertise in architecting and implementing high-throughput, low-latency microservices with gRPC, REST, and GraphQL, including protocol buffer schema design, streaming endpoints, and load balancing.

Hands-on experience with parameter-efficient fine-tuning (LoRA, QLoRA, IA3), model quantization (INT8, FP16, GPTQ), and quantization-aware training for LLMs at scale.

Deep knowledge of distributed training strategies (data/model/pipe parallelism), memory optimization, and inference acceleration for large-scale multimodal models.

Experience with advanced agentic workflow orchestration, including multi-agent coordination, stateful task management, and integration with enterprise event-driven architectures.

#LI-RB3

Lead delivery of production generative AI products and APIs that scale firmwide and drive measurable business outcomes.

Applied AI ML Director

Full Job Description

SIMILAR OPPORTUNITIES

Applied AI ML Lead

Director Applied Artificial Intelligence Engineering

Assistant Director - Applied AI Engineer

Executive Director AI Engineering

Director of AI Products

Applied AI ML Director

Full Job Description

SIMILAR OPPORTUNITIES

Applied AI ML Lead

Director Applied Artificial Intelligence Engineering

Assistant Director - Applied AI Engineer

Executive Director AI Engineering

Director of AI Products