[NetsPresso] AI Platform Engineer
Job group
R&D
Experience Level
Experienced 7 years or more
Job Types
Full-time
Locations
Nota Inc., 16F, Parnas Tower, 521, Teheran-ro, Gangnam-gu, Seoul, Republic of Korea

👋 About the Team

The NP Software Development Team ensures that Nota’s optimization modules are developed, deployed, and delivered with high stability and polish across various product formats, including SDKs, Cloud-SaaS platforms, and On-Premise desktop applications. We enable the latest deep learning models to run with optimal performance on a wide range of semiconductor hardware, and we build scalable software architectures that allow these technical advancements to be rapidly integrated into our products.


By joining our team, you will help drive the development of robust, scalable AI products that respond swiftly to the fast-evolving AI landscape. We are looking for a senior software engineer to join us on this exciting journey.



📌 What You’ll Do in This Position

You will be responsible for implementing and productizing core model-compression technologies so that state-of-the-art models—including LLMs, LVMs, and CV models—can be optimized and delivered across devices with diverse computational characteristics.


You will help simplify complex optimization workflows into Python packages, while also extending key features into standalone services. These optimization technologies are delivered across SDK, Cloud-SaaS, and On-Premise environments, and we are advancing our architecture to ensure the scalability and reliability required for GenAI models in real-world applications.


Through this role, you will gain end-to-end experience in enabling AI models to perform efficient inference in practical computing environments.




✅ Key Responsibilities

  • Review and introduce technical stacks during early project stages
  • Rapidly design, develop, and deploy PoCs for model-optimization solutions based on business requirements and customer feedback
  • Design, manage, and operate systems that monitor performance and accuracy metrics for AI models and optimization modules
  • Continuously integrate, improve, and operate optimization technologies within our products
  • Design and implement stable, scalable service architectures



✅ Requirements

  • 7+ years of backend development experience, or equivalent proficiency
  • Proficiency in at least one programming language such as Python, Java, or C++
  • Strong collaboration skills using Git for version control
  • Experience building CI/CD pipelines and deploying services with Docker
  • Experience with system-resource monitoring, bottleneck analysis, and code-level performance tuning
  • Foundational knowledge of OS, networking, and databases
  • 2+ years developing and operating AI/ML-based services in production environments
  • Strong problem-solving, critical thinking, and strategic reasoning skills
  • Ability to collaborate with AI researchers, engineers, and PMs to define requirements and design software accordingly



✅ Pluses

  • Experience operating infrastructure using Kubernetes or Docker
  • Experience building MLOps pipelines
  • Technical writing and internal knowledge-sharing experience
  • Strong ownership mindset and a proactive, positive approach to problem-solving
  • Experience with deep learning frameworks such as TensorFlow or PyTorch
  • Experience with GPU programming environments such as CUDA
  • Experience contributing to open-source projects



✅ Hiring Process

  • Document Screening → Assignment → 1st Interview → 2nd Interview

(Additional assignments may be included during the process.)




🤓 A Message from the Team

The NP Software Development Team is committed to building a flexible and collaborative development culture that enables us to deliver diverse products quickly and reliably. We go beyond simple implementation—leading discussions on productizing optimization technologies, ensuring operational stability, and pursuing technical excellence and scalability. From early technical evaluations to system design, automation, metric monitoring, and customer-feedback integration, you will experience the full product lifecycle with us.

The NP project is closely tied to large-scale B2B and B2G initiatives targeting not only the domestic market but also the global stage. This provides an ideal environment for gaining hands-on experience in deploying state-of-the-art AI models into real services. As we optimize GenAI models for various device environments, we tackle architectural and scalability challenges together. If you want to build products that balance cutting-edge technology with practical service stability in a rapidly changing AI landscape, we encourage you to apply.



Please Check Before Applying! 👀

  • This job posting is open continuously, and it may close early upon completion of the hiring process.
  • Resumes that include sensitive personal information, such as salary details, may be excluded from the review process.
  • Providing false information in the submitted materials may result in the cancellation of the application.
  • Please be aware that references will be checked before finalizing the hiring decision.
  • Compensation will be discussed separately upon successful completion of the final interview.
  • There will be a probationary period after joining, and employees will not be treated differently during this period.
  • To support the employment of persons with disabilities, you may optionally submit a copy of your disability registration certificate under “Additional Documents,” if administrative verification is required. Submission is optional and does not affect the evaluation process.
  • Veterans and individuals with disabilities will receive preferential treatment in accordance with relevant regulations.


