[NetsPresso] AI Software Engineer

Job group

R&D

Experience Level

Experienced 2 years or more

Job Types

Full-time

Locations

NotaNota Inc. (16F, Parnas Tower), 521, Teheran-ro, Gangnam-gu, Seoul, Republic of Korea, 파르나스타워 16층 Nota

👋 About the Team

The XPU Enabler team ensures AI models run smoothly across a wide range of devices. Whenever a new AI model or hardware device emerges, we address compatibility issues and build automated performance profiling services to create seamless connections between models and devices. Our team brings together specialists from various domains—ML Engineers, Embedded Engineers, Backend/Frontend/Mobile Engineers, and AI Software Engineers—working closely together to solve complex technical challenges by leveraging their individual areas of expertise.

📌 What You’ll Do at This Position

You’ll collaborate with top-tier NPU vendors and clients looking to optimize AI models for deployment on various NPUs. Your focus will be on LLMs and similar models, building a strong foundation in core technologies such as static graph transformation, optimization, quantization, and device-level profiling. Through this, you’ll gain a deep understanding of NPU-specific optimization methods and acquire hands-on experience in model deployment—developing your capabilities as a highly skilled AI engineer.

✅ Key Responsibilities

Profile NPU-optimized LLM models on edge devices in terms of accuracy, memory usage, and latency
Develop and maintain pipelines to deploy and run these models on edge hardware

✅ Requirements

2+ years of experience in relevant fields
Experience accelerating AI models on NPUs
Experience developing C++ applications for AI inference on edge devices
Understanding of LLM input/output structures, caching mechanisms, and decoding algorithms
No restrictions on international travel

✅ Pluses

Hands-on experience with LLM quantization and implementation
Extensive experience modifying AI models to export static graphs (e.g., via JIT, ONNX)
Experience using Executorch for model quantization, compilation, and inference

✅ Hiring Process

Document Screening → Screening Interview → Assignment → 1st Interview → 2nd Interview

🤓 A Message from the Team

The XPU Enabler Team takes full ownership of the entire pipeline—from AI model development to real-world device deployment. We don’t limit ourselves to specific frameworks or devices. Instead, we thrive on flexibility and a strong problem-solving mindset as we work across diverse models and NPU environments.

Our team values technical depth, cross-functional respect, and shared growth. If you enjoy exploring new technologies and tackling unfamiliar challenges with curiosity and creativity, you'll feel right at home with us.

Please Check Before Applying! 👀

This job posting is open continuously, and it may close early upon completion of the hiring process.
Please ensure that sensitive personal information, such as salary details, ID number etc., is not included in your resume.
Providing false information in the submitted materials may result in the cancellation of the application.
Please be aware that references will be checked before finalizing the hiring decision.
Compensation will be discussed separately upon successful completion of the final interview.
There will be a probationary period after joining, and there will be no discrimination in the treatment during this period.
Veterans and individuals with disabilities will receive preferential treatment in accordance with relevant regulations.

🔎 Helpful materials

[NetsPresso] AI Software Engineer

👋 About the Team

📌 What You’ll Do at This Position

✅ Key Responsibilities

Profile NPU-optimized LLM models on edge devices in terms of accuracy, memory usage, and latency
Develop and maintain pipelines to deploy and run these models on edge hardware

✅ Requirements

2+ years of experience in relevant fields
Experience accelerating AI models on NPUs
Experience developing C++ applications for AI inference on edge devices
Understanding of LLM input/output structures, caching mechanisms, and decoding algorithms
No restrictions on international travel

✅ Pluses

Hands-on experience with LLM quantization and implementation
Extensive experience modifying AI models to export static graphs (e.g., via JIT, ONNX)
Experience using Executorch for model quantization, compilation, and inference

✅ Hiring Process

Document Screening → Screening Interview → Assignment → 1st Interview → 2nd Interview

🤓 A Message from the Team

The XPU Enabler Team takes full ownership of the entire pipeline—from AI model development to real-world device deployment. We don’t limit ourselves to specific frameworks or devices. Instead, we thrive on flexibility and a strong problem-solving mindset as we work across diverse models and NPU environments.

Our team values technical depth, cross-functional respect, and shared growth. If you enjoy exploring new technologies and tackling unfamiliar challenges with curiosity and creativity, you'll feel right at home with us.

Please Check Before Applying! 👀

This job posting is open continuously, and it may close early upon completion of the hiring process.
Please ensure that sensitive personal information, such as salary details, ID number etc., is not included in your resume.
Providing false information in the submitted materials may result in the cancellation of the application.
Please be aware that references will be checked before finalizing the hiring decision.
Compensation will be discussed separately upon successful completion of the final interview.
There will be a probationary period after joining, and there will be no discrimination in the treatment during this period.
Veterans and individuals with disabilities will receive preferential treatment in accordance with relevant regulations.

🔎 Helpful materials

👋 About ​the ​Team

✅ Key Responsibilities

✅ Requirements

✅ Pluses

✅ Hiring Process

🤓 A Message from the Team

Please Check Before Applying! 👀

👋 About ​the ​Team

✅ Key Responsibilities

✅ Requirements

✅ Pluses

✅ Hiring Process

🤓 A Message from the Team

Please Check Before Applying! 👀

👋 About the Team

👋 About the Team