The NetsPresso Platform Team at Nota AI designs and implements core platforms and software that transform AI model lightweighting and optimization research into real-world products.
Our organization consists of Model Representation, Quantization, Graph Optimization, Model Engineering, and SW Engineering units. Specifically, the Model Representation Part unifies models from various deep learning frameworks into our proprietary NPIR (NetsPresso Intermediate Representation). This foundational step enables the application of key optimization technologies such as quantization, graph optimization, and compression.
By integrating diverse framework models into a single NPIR, we achieve technical versatility, scale support for the latest AI models and hardware, and maximize inference efficiency by optimizing the logical structure of deep learning models for target runtime environments.
In this position, you will tackle real-world challenges across various industrial domains using AI technology. You will participate in the entire lifecycle of evolving a model into a field-ready AI solution. Through technical deep-dives based on the latest research papers and performance analysis, you will gain a balanced experience between AI research and practical development. You will also collaborate with experienced AI application developers to build production-grade AI systems.
IR Implementation & Maintenance
Expanding Support for Latest AI Models & Diverse Hardware
(Additional assignments may be included during the process.)
You will be part of productizing the unified IR that serves as the foundation of NetsPresso. You will contribute to designing and developing our unique NPIR while analyzing the characteristics of various frameworks and hardware. In this role, you will see how on-device AI lightweighting and optimization techniques are applied to actual products and solve technical hurdles to ensure latest models run efficiently. As an AI Engineer, you will find great satisfaction in implementing theoretical research into a real-world software stack.
The NetsPresso Platform Team at Nota AI designs and implements core platforms and software that transform AI model lightweighting and optimization research into real-world products.
Our organization consists of Model Representation, Quantization, Graph Optimization, Model Engineering, and SW Engineering units. Specifically, the Model Representation Part unifies models from various deep learning frameworks into our proprietary NPIR (NetsPresso Intermediate Representation). This foundational step enables the application of key optimization technologies such as quantization, graph optimization, and compression.
By integrating diverse framework models into a single NPIR, we achieve technical versatility, scale support for the latest AI models and hardware, and maximize inference efficiency by optimizing the logical structure of deep learning models for target runtime environments.
In this position, you will tackle real-world challenges across various industrial domains using AI technology. You will participate in the entire lifecycle of evolving a model into a field-ready AI solution. Through technical deep-dives based on the latest research papers and performance analysis, you will gain a balanced experience between AI research and practical development. You will also collaborate with experienced AI application developers to build production-grade AI systems.
IR Implementation & Maintenance
Expanding Support for Latest AI Models & Diverse Hardware
(Additional assignments may be included during the process.)
You will be part of productizing the unified IR that serves as the foundation of NetsPresso. You will contribute to designing and developing our unique NPIR while analyzing the characteristics of various frameworks and hardware. In this role, you will see how on-device AI lightweighting and optimization techniques are applied to actual products and solve technical hurdles to ensure latest models run efficiently. As an AI Engineer, you will find great satisfaction in implementing theoretical research into a real-world software stack.