Canary Wharfian - Online Investment Banking & Finance Community.

Lead Site Reliability Engineer | Production Infrastructure

Experienced | No visa sponsorship

at Jump

Proprietary Trading

Posted 10 days ago

**Lead Site Reliability Engineer (Production Infrastructure)** at Jump Trading. Manage/mentor engineers, balance leadership with hands-on work. Design/build high-performance monitoring, alerts, real-time analysis tools, and automation. Oversee incident/change management, reduce operational toil, collaborate globally with tech/business teams. Proven leadership, Python/Go skills, strategic thinking required. Up to $200,000 annual base salary.

Compensation
$175,000 – $200,000 USD

Currency: $ (USD)

City
Not specified
Country
Not specified

Full Job Description

Jump Trading Group is committed to world-class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting-edge research to global financial markets. Our culture is unique. Constant innovation requires fearlessness, creativity, intellectual honesty, and a relentless competitive streak. We believe in winning together and unlocking unique individual talent by incenting collaboration and mutual respect. At Jump, research outcomes drive more than superior risk-adjusted returns. We design, develop, and deploy technologies that change our world, fund start-ups across industries, and partner with leading global research organizations and universities to solve problems.

CORE (Central Ops and Reliability Engineering) is the Production Infrastructure team responsible for operating and improving Jump's production trading environment. The team combines deep operational ownership with software and reliability engineering practices to support production systems, drive incident and change management, improve observability and deployment workflows, and reduce operational toil across a fast-moving global trading platform.

What You'll Do:

As Lead Site Reliability Engineer in CORE, you will both manage and mentor engineers across teams and contribute directly to key projects, balancing leadership responsibilities with hands-on work.

  • Design & Build: Architect and implement high-performance monitoring and alerting systems, real-time packet/flow analysis tooling, and automation frameworks for managing Jump's global production footprint.
  • Lead Operational Maturity: Oversee and improve incident management, change management, and post-incident review processes to increase resilience and reduce downtime.
  • Drive Efficiency: Identify and eliminate sources of operational toil through automation and tooling.
  • Collaborate Globally: Partner with engineering, networking, and trading teams in multiple regions to align technical priorities with business objectives.
  • Debug Deeply: Investigate low-level performance issues across complex software stacks, optimizing for ultra-low latency and high throughput.
  • Shape the Roadmap: Influence the strategic direction of production tooling, infrastructure scaling, and vendor partnerships.

Skills You'll Need:

  • Proven leadership experience, including managing people across distributed teams.
  • Demonstrated history of solving reliability challenges in large-scale production environments.
  • Experience demonstrating strategic thinking and maturity in tackling complex problems involving people, technology, and processes.
  • Strong programming skills in Python, Go, or equivalent.

Benefits

   - Discretionary bonus eligibility
   - Medical, dental, and vision insurance
   - HSA, FSA, and Dependent Care options
   - Employer Paid Group Term Life and AD&D Insurance
   - Voluntary Life & AD&D insurance
   - Paid vacation plus paid holidays
   - Retirement plan with employer match
   - Paid parental leave
   - Wellness Programs

Annual Base Salary Range
$175,000 – $200,000 USD
