Quantisation
A model optimisation technique that reduces the numerical precision of neural network parameters to decrease memory usage and improve inference speed.
Glossary Hub
Explore all AI and translation terms beginning with Q.
A model optimisation technique that reduces the numerical precision of neural network parameters to decrease memory usage and improve inference speed.
Systematic checks ensuring accuracy, consistency, and compliance with project requirements.