The NetsPresso Platform Team designs and implements core platforms and software that transform Nota AI’s model lightweighting and optimization research into real-world products.
Our organization consists of Model Representation, Quantization, Graph Optimization, Model Engineering, and SW Engineering units. Among them, the Quantization Part researches NetsPresso’s core optimization technology—quantization—and integrates proprietary techniques into our products to accelerate deep learning inference across diverse hardware (HW) environments.
We research algorithms to minimize performance degradation caused by quantization, support optimization tailored to various HW and backend constraints, and convert models into forms that enable hardware acceleration.
As a key contributor, you will research and productize quantization technologies that sit at the heart of NetsPresso. You will design Nota’s unique quantization methods by studying state-of-the-art (SOTA) algorithms and optimizing them for specific model architectures and HW characteristics. You will gain hands-on experience with cutting-edge models and optimization techniques for On-device AI.
Document Screening → Assignment → 1st Interview → 2nd Interview
(Additional assignments may be included during the process.)
We value high interest in new technologies and the execution power to turn ideas into reality. This position goes beyond pure research; you will develop proprietary quantization technologies directly linked to NetsPresso services. As each module is organically connected, we prioritize active communication and a proactive attitude. If you enjoy diving deep into complex technical problems and growing through collaboration, you will thrive in this team.
The NetsPresso Platform Team designs and implements core platforms and software that transform Nota AI’s model lightweighting and optimization research into real-world products.
Our organization consists of Model Representation, Quantization, Graph Optimization, Model Engineering, and SW Engineering units. Among them, the Quantization Part researches NetsPresso’s core optimization technology—quantization—and integrates proprietary techniques into our products to accelerate deep learning inference across diverse hardware (HW) environments.
We research algorithms to minimize performance degradation caused by quantization, support optimization tailored to various HW and backend constraints, and convert models into forms that enable hardware acceleration.
As a key contributor, you will research and productize quantization technologies that sit at the heart of NetsPresso. You will design Nota’s unique quantization methods by studying state-of-the-art (SOTA) algorithms and optimizing them for specific model architectures and HW characteristics. You will gain hands-on experience with cutting-edge models and optimization techniques for On-device AI.
Document Screening → Assignment → 1st Interview → 2nd Interview
(Additional assignments may be included during the process.)
We value high interest in new technologies and the execution power to turn ideas into reality. This position goes beyond pure research; you will develop proprietary quantization technologies directly linked to NetsPresso services. As each module is organically connected, we prioritize active communication and a proactive attitude. If you enjoy diving deep into complex technical problems and growing through collaboration, you will thrive in this team.