🔐 Prompt Injection Attack Simulator

A simple local LLM security project that simulates prompt injection attacks and evaluates how well a defense system can block them.

Built using local models via Ollama — no external APIs required.

🚀 What This Project Does

This system:

Simulates prompt injection attacks (e.g., "ignore previous instructions")
Sends them to a local LLM
Applies a defense filter
Measures how many attacks are blocked

👉 In short: Attack → Defense → Result → Evaluation

🧱 Tech Stack

Python
FastAPI (backend)
Streamlit (dashboard)
Ollama (local LLM runtime)

⚙️ Setup (One-Time)

1. Install Ollama

Download and install Ollama.

Then run:

ollama pull llama3
ollama serve

Keep this running.

2. Clone the Repo

git clone https://github.com/Sahojit/Prompt-Injection-Attack-Simulator.git
cd Prompt-Injection-Attack-Simulator

3. Create Virtual Environment

python -m venv .venv
source .venv/bin/activate   # Mac/Linux

# Windows:
.venv\Scripts\activate

4. Install Dependencies

pip install -r requirements_simple.txt

▶️ Run the Project

Start Backend

uvicorn src.api.main:app --reload

Start Dashboard

streamlit run dashboard/app.py

🌐 Access the App

API Docs → http://localhost:8000/docs
Dashboard → http://localhost:8501

👉 Open the dashboard in browser to see:

attacks
blocked vs successful
evaluation results

👥 How Teammates Can Use It

Each teammate should:

Install Ollama
Pull model:
```
ollama pull llama3
```
Clone repo
Install requirements
Run backend + dashboard

⚠️ Important:

Ollama must be running locally
Ports 8000 and 8501 should be free

📊 Example Flow

System sends attack prompt
Defense checks it
If safe → goes to LLM
If malicious → blocked
Results are logged and shown on dashboard

🧠 Project Idea

This project demonstrates:

How prompt injection attacks work
How simple defenses can reduce risk
How to measure LLM security

⚠️ Limitations

Uses simple rule-based defense
Not fully secure (for learning/demo purposes)
Can be extended with ML-based detection

📌 Future Improvements

Add ML-based classifier
Improve detection rules
Add more attack types

🧑‍💻 Author

Sahojit Karmakar

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
data		data
.DS_Store		.DS_Store
ANALYSIS.md		ANALYSIS.md
README.md		README.md
app.py		app.py
attacks.py		attacks.py
defense.py		defense.py
evaluator.py		evaluator.py
llm.py		llm.py
ml_classifier.py		ml_classifier.py
output_filter.py		output_filter.py
requirements.txt		requirements.txt
simulator.py		simulator.py
updater.py		updater.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔐 Prompt Injection Attack Simulator

🚀 What This Project Does

🧱 Tech Stack

⚙️ Setup (One-Time)

1. Install Ollama

2. Clone the Repo

3. Create Virtual Environment

4. Install Dependencies

▶️ Run the Project

Start Backend

Start Dashboard

🌐 Access the App

👥 How Teammates Can Use It

📊 Example Flow

🧠 Project Idea

⚠️ Limitations

📌 Future Improvements

🧑‍💻 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🔐 Prompt Injection Attack Simulator

🚀 What This Project Does

🧱 Tech Stack

⚙️ Setup (One-Time)

1. Install Ollama

2. Clone the Repo

3. Create Virtual Environment

4. Install Dependencies

▶️ Run the Project

Start Backend

Start Dashboard

🌐 Access the App

👥 How Teammates Can Use It

📊 Example Flow

🧠 Project Idea

⚠️ Limitations

📌 Future Improvements

🧑‍💻 Author

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages