model-optimization

Here are 90 public repositories matching this topic...

onnx / ir-py

Efficient in-memory representation for ONNX, in Python

machine-learning computation-graph intermediate-representation compilation onnx graph-transformation model-optimization large-language-models

Updated Jun 1, 2026
Python

umitkacar / awesome-tinyml

Star

TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.

Updated Nov 10, 2025
Python

lusinlu / gradient-variance-loss

Star

Code of the ICASSP 2022 paper "Gradient Variance Loss for Structure Enhanced Super-Resolution"

training machine-learning deep-learning super-resolution loss-functions upscaling loss model-optimization

Updated Feb 27, 2022
Python

haoran-ni / ralph-loop-optimizer

Star

Ralph Loop Optimizer: an AI-driven framework that turns any evaluatable codebase into a self-improving optimization loop for strategies, models, prompts, and workflows

experiment-tracking model-optimization coding-agents agentic-ai self-improving-ai strategy-optimization ralph-loop ralph-loop-optimizer

Updated Apr 30, 2026
Python

MaitreChen / openvino-lenet-sample

Star

本仓库包含了完整的深度学习应用开发流程，以经典的手写字符识别为例，基于LeNet网络构建。推理部分使用torch、onnxruntime以及openvino框架💖

deep-learning deployment pytorch lenet mnist-handwriting-recognition openvino onnxruntime model-optimization

Updated Apr 20, 2026
Python

bnabis93 / vision-language-examples

Star

Vision-lanugage model example code.

tutorial example pytorch transformer embedding-models model-acceleration vision-language model-optimization vision-language-model

Updated Sep 6, 2023
Python

TCLResearchEurope / ptdeco

Star

ptdeco is a library for model optimization by matrix decomposition built on top of PyTorch

deep-learning pytorch model-compression model-optimization model-optimisation

Updated May 7, 2025
Python

lattice-ai / Compressed-DNNs-Forget

Star

Minimal Reproducibility Study of (https://arxiv.org/abs/1911.05248). Experiments with Compression of Deep Neural Networks

deep-neural-networks sparsity deep-learning neural-network tensorflow pruning deeplearning celeba celeba-dataset tensorflow-lite tflite sparsity-optimization model-optimization neural-network-pruning tracker-misc

Updated Jun 4, 2021
Python

da2so / DA2Lite

Star

DA2Lite is an automated model compression toolkit for PyTorch.

python deep-learning pytorch image-classification pruning quantization knowledge-distillation model-compression on-device model-optimization filter-decomposition

Updated Mar 15, 2022
Python

umitkacar / awesome-mobile-ai

Star

Mobile AI: iOS CoreML, Android TFLite, on-device inference, ONNX, TensorRT, and ML deployment for smartphones.

quantization mlkit tensorrt mnn edge-computing coreml ncnn onnx tensorflow-lite openvino mobile-ai mobile-inference pytorch-mobile model-optimization neural-engine android-ml on-device-inference ios-ml smartphone-ai

Updated Nov 10, 2025
Python

tphakala / birdnet-onnx-converter

Sponsor

Star

Convert and optimize BirdNET models for ONNX Runtime inference on GPUs, CPUs, and embedded devices

raspberry-pi machine-learning ai artificial-intelligence onnx birdnet model-optimization bird-identification bioacustics birdnet-go

Updated May 16, 2026
Python

umitkacar / onnx-tensorrt-optimization

Star

40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment

Updated Nov 14, 2025
Python

Midhilesh29 / PostTrainingQuantization

Star

compares different pretrained object classification with per-layer and per-channel quantization using pytorch

quantization quantization-algorithms pytorch-quantize model-optimization

Updated Jun 12, 2020
Python

Shoko-official / Pytorch-TurboQuant

Star

Pytorch-TurboQuant: High-performance weight-only quantization for PyTorch. Optimized for fast inference and reduced memory footprint.

machine-learning deep-learning pytorch quantization inference-acceleration model-optimization

Updated Apr 4, 2026
Python

shyamsridhar123 / Quantization

Star

Model quantization techniques for efficient LLM inference. Experiments with INT8, INT4, and mixed-precision quantization.

python machine-learning inference quantization model-optimization llm

Updated May 27, 2025
Python

Venkat-023 / Kcet-Rank-Prediction-College-Recommandation

Star

This project presents a robust machine learning solution to predict KCET (Karnataka Common Entrance Test) ranks based on marks and provide personalized college recommendations. The system aids students in estimating their competitive rank prior to official results and assists in selecting suitable colleges based on predicted ranks and branch.

machine-learning-algorithms hyperparameter-tuning model-deployment model-optimization recommandation-system

Updated Dec 8, 2025
Python

Kronbii / thermal-super-resolution

Star

First thermal super-resolution system to achieve 34.2 dB PSNR at 229+ FPS using novel IMDN architecture with specialized thermal adaptations. Features breakthrough RGB→thermal transfer learning, thermal-aware multi-component loss, and real-time inference (2x: 270.6 FPS, 3x: 256.1 FPS, 4x: 250.9 FPS). Production-ready PyTorch + CUDA implementation

real-time research computer-vision deep-learning cuda pytorch thermal-imaging super-resolution real-time-processing image-enhancement production-machine-learning imdn model-optimization

Updated Mar 29, 2026
Python

Franco7Scala / FLOPpy

Star

A hardware-agnostic profiler for tracking the FLOPs and BOPs of Machine and Deep Learning algorithms.

machine-learning deep-learning scikit-learn pytorch profiling performance-monitoring flops huggingface wandb model-optimization hardware-agnostic bops computational-cost

Updated May 19, 2026
Python

sumeyye-agac / har-to-tflite

Star

Tools and experiments for converting Human Activity Recognition (HAR) models to TensorFlow Lite for efficient on-device inference on mobile and wearable devices.

python deep-learning human-activity-recognition tensorflow-lite tf-lite embedded-ai edge-ai on-device-ml mobile-ai model-optimization model-conversion

Updated Mar 5, 2026
Python

hilmansw / Spam-Detection-App

Star

This project is built to detect spam messages using a Long Short-Term Memory (LSTM) model combined with Word2Vec as the word embedding technique. The model has been optimized using Grid Search, achieving a best accuracy of 95.65%.

natural-language-processing deep-learning tensorflow classification spam-detection hyperparameter-tuning streamlit model-optimization

Updated Nov 25, 2025
Python

Improve this page

Add a description, image, and links to the model-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the model-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model-optimization

Here are 90 public repositories matching this topic...

onnx / ir-py

umitkacar / awesome-tinyml

lusinlu / gradient-variance-loss

haoran-ni / ralph-loop-optimizer

MaitreChen / openvino-lenet-sample

bnabis93 / vision-language-examples

TCLResearchEurope / ptdeco

lattice-ai / Compressed-DNNs-Forget

da2so / DA2Lite

umitkacar / awesome-mobile-ai

tphakala / birdnet-onnx-converter

umitkacar / onnx-tensorrt-optimization

Midhilesh29 / PostTrainingQuantization

Shoko-official / Pytorch-TurboQuant

shyamsridhar123 / Quantization

Venkat-023 / Kcet-Rank-Prediction-College-Recommandation

Kronbii / thermal-super-resolution

Franco7Scala / FLOPpy

sumeyye-agac / har-to-tflite

hilmansw / Spam-Detection-App

Improve this page

Add this topic to your repo