machine-learning-security

The Anti-Virus for AI Artifacts & RAG Firewall. A static analysis tool scanning Models and Notebooks for RCE, Datasets and RAG docs for Data Poisoning, PII, and Prompt Injections. Secure your AI Supply Chain.

Updated May 10, 2026
Python

Lab700xOrg / aisbom

Star

AI SBOM: AI Software Bill of Materials - The Supply Chain for Artificial Intelligence

pytorch malware-detection machine-learning-security mlops sbom cyclonedx supply-chain-security cyclonedx-sbom

Updated May 22, 2026
Python

jay-johnson / train-ai-with-django-swagger-jwt

Star

Train AI (Keras + Tensorflow) to defend apps with Django REST Framework + Celery + Swagger + JWT - deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

citizenjosh / ai-security-training-lab

Star

Hands-on lessons for attacking and defending AI systems, starting with the OWASP Top 10 for LLM Applications.

docker owasp ethical-hacking adversarial-attacks machine-learning-security ai-security cybersecurity-education prompt-injection llm-security

Updated Jun 22, 2025
Python

mmalekzadeh / honest-but-curious-nets

Star

Honest-but-Curious Nets: Sensitive Attributes of Private Inputs Can Be Secretly Coded into the Classifiers' Outputs (ACM CCS'21)

machine-learning privacy deep-neural-networks entropy deep-learning information-theory pytorch data-privacy mutual-information adversarial-machine-learning celeba-dataset machine-learning-security utkface

Updated Jan 11, 2023
Python

fallen-angel-systems / fas-judgement-oss

Star

Open-source prompt injection attack console. Test AI security by firing categorized attacks at any endpoint.

game python cli owasp cybersecurity penetration-testing gamification ctf capture-the-flag security-training red-team machine-learning-security ai-security prompt-injection llm-security adversarial-ai

Updated Mar 19, 2026
Python

karloks2005 / JailbreakLab

Star

Test and evaluate Large Language Models against prompt injections, jailbreaks, and adversarial attacks with a web-based interactive lab.

react docker kubernetes jailbreak model-alignment machine-learning-security ai-security fastapi huggingface prompt-injection llm-security llm-safety security-research-tool ai-evaluation-framework adversarial-ai prompt-defense llm-red-teaming

Updated Mar 27, 2026
Python

jay-johnson / antinex-datasets

Star

Datasets for training deep neural networks to defend software applications

Updated Jun 4, 2018
Python

emmanuelgjr / GenAI-Security-Literature-Review

Star

Comprehensive, auto-updating literature review of GenAI & LLM security research, standards, tools, and resources. 100+ curated entries with interactive webapp.

owasp cybersecurity jailbreaking ai-safety literature-review red-teaming machine-learning-security ai-security adversarial-ml mitre-atlas prompt-injection llm-security genai-security agentic-ai nist-ai-rmf

Updated May 19, 2026
Python

perfecxion-ai / banana-backdoor-demo

Star

Educational research demonstrating weight manipulation attacks in SafeTensors models. Proves format validation alone is insufficient for AI model security.

research ai defensive-security machine-learning-security ai-security llm backdoor-detection safetensors ml-security tinyllama model-security

Updated Feb 17, 2026
Python

DeliciousBuding / DiffAudit-Research

Star

Reproducible research scaffolding for privacy-risk auditing of diffusion models.

python research reproducibility machine-learning-security diffusion-models membership-inference privacy-auditing

Updated May 23, 2026
Python

sichgate / sichgate-methodology

Star

Open methodology for systematic adversarial evaluation of small language models in regulated industry deployments

Updated Apr 3, 2026
Python

sandeepmothukuri / PromptSentinel

Star

Enterprise-grade prompt injection detection and AI firewall for LLM applications

cybersecurity soc security-tools jailbreak-detection red-teaming machine-learning-security ai-security prompt-injection llm-security llm-firewall genai-security ai-firewall prompt-defense

Updated May 20, 2026
Python

Framartin / adversarial-logistic

Star

Adversarial perturbation intensity strategy achieving chosen intra-technique transferability level for logistic regression

machine-learning logistic-regression adversarial-example machine-learning-security

Updated Jan 6, 2018
Python

AmiraGuesmi-mls / Stochastic-Input-Transformation

Star

A stochastic input pre-processing technique based on a process of down-sampling/up-sampling using convolution and transposed convolution layers. Defending convolutional neural network against adversarial attacks.

adversarial-machine-learning adversarial-attacks machine-learning-security defense-methods

Updated Aug 4, 2021
Python

aliuyar1234 / selective-revocation-replay

Star

Research artifact, paper, and frozen evaluation outputs for selective revocation and replay after persistent indirect prompt injection in memory-augmented LLM agents.

provenance reproducibility machine-learning-security prompt-injection llm-security llm-agents agent-memory research-artifact indirect-prompt-injection memory-augmented-llms

Updated Apr 18, 2026
Python

Improve this page

Add a description, image, and links to the machine-learning-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the machine-learning-security topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

machine-learning-security

Here are 33 public repositories matching this topic...

1Konny / FGSM

alexdevassy / Machine_Learning_CTF_Challenges

jackaduma / SecBERT

whyisyoung / CADE

arsbr / Veritensor

Lab700xOrg / aisbom

jay-johnson / train-ai-with-django-swagger-jwt

citizenjosh / ai-security-training-lab

mmalekzadeh / honest-but-curious-nets

fallen-angel-systems / fas-judgement-oss

karloks2005 / JailbreakLab

jay-johnson / antinex-datasets

emmanuelgjr / GenAI-Security-Literature-Review

perfecxion-ai / banana-backdoor-demo

DeliciousBuding / DiffAudit-Research

sichgate / sichgate-methodology

sandeepmothukuri / PromptSentinel

Framartin / adversarial-logistic

AmiraGuesmi-mls / Stochastic-Input-Transformation

aliuyar1234 / selective-revocation-replay

Improve this page

Add this topic to your repo