Software Engineer, ML Infrastructure - Training Platform (New York) Job at Scale AI, New York, NY

  • Scale AI
  • New York, NY

Job Description

Software Engineer, ML Infrastructure - Training Platform

Scale is looking for an AI/ML Infrastructure Engineer to join our Machine Learning Infrastructure team to build out our Training Platform. You will partner closely with Machine Learning researchers to understand their requirements and apply your own domain expertise and our compute resources to accelerate experimentation throughput.

The ideal candidate has strong fundamentals in machine learning and backend system design, along with prior ML infrastructure experience. You should also be comfortable with infrastructure and large-scale system design, as well as diagnosing both model performance issues and system failures.

You will:

  • Build highly available, observable, performant, and cost-effective APIs for model training.
  • Participate in our team's on-call process to ensure the availability of our services.
  • Own projects end-to-end, from requirements, scoping, and design through implementation, in a highly collaborative and cross-functional environment.
  • Exercise good taste in building systems and tools and know when to make build vs. buy tradeoffs, with an eye for cost efficiency.

Ideally you'd have:

  • 4+ years of experience building machine learning training pipelines or inference services in a production setting.
  • Experience with distributed training frameworks and techniques such as DeepSpeed and FSDP.
  • Experience building, deploying, and monitoring complex microservice architectures.
  • Experience with Python, Docker, Kubernetes, and infrastructure as code (e.g. Terraform).

Nice to haves:

  • Experience with LLM inference latency optimization techniques, e.g. kernel fusion, quantization, and dynamic batching.
  • Experience working with a cloud technology stack (e.g. AWS or GCP).

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.

About Us:

At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity, or veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com.

Job Tags

Full time, Shift work
