Job Description
Introduction
Nexa AI is an on-device AI research and deployment company. We specialize in:
- Tiny, multimodal models (e.g., Octopus v2, OmniVLM, OmniAudio)
- Local on-device inference framework (e.g., nexa-sdk)
- Model optimization techniques (e.g., NexaQuant)
Our work has been recognized by industry leaders like Google, Hugging Face, AMD, and more. We partner with enterprises and SMBs to bring local intelligence to every device.
Responsibilities
- Build on-device ML infrastructure at scale
- Assist in developing and optimizing Large Language Models (LLMs) for on-device deployment
- Support on-device AI research efforts
- Contribute to the development of our SDKs across multiple platforms, including Windows, macOS, Android, iOS, and Linux
Candidate Requirements
You May Be a Good Fit If You:
- Hold a minimum of a BS or MS in Computer Science
- Are familiar with PyTorch
- Have an excellent understanding of computer science fundamentals, including data structures, algorithms, and coding
- Possess knowledge of operating system internals, compilers, and low-power/mobile optimization
- Have experience with low-level programming in C and frameworks such as CUDA and OpenCL
- Are proficient in multithreading and performance optimization
Logistics
- Part-Time: Remote, 20+ hours/week
- Full-Time: Based in Cupertino, California
How To Apply
Send your resume to career@nexa4ai.com