Agent4Edu: Generating Learner Response Data by LLM-based Agents for Intelligent Education Systems (AAAI 2025)
Updated May 10, 2025 - Python
[ECCV 2024] Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging
Foundational tools for BCG X's data science packages.
Model-driven synthetic test data for CI/CD and analytics - deterministic, privacy-preserving, and domain-aware. Includes Python APIs, XML pipelines, and MCP/IDE integration to orchestrate realistic datasets for finance, healthcare, and other regulated environments.
Cryptocurrency reddit sentiment analysis application.
A program that simulates a crowd's answers to multiple-choice questions with one or more correct answers and writes the results to a CSV file.
Software to simulate compendium-wide gene expression data using a VAE.
Schema-aware synthetic data for databases, APIs, and pipelines. Realistic, relational, privacy-safe.
Code for reproducing my thesis results.
A sample database with a random data model and automatic reporting. (Documentation in Polish.)
Enterprise Tax Platform Simulator demonstrating ERP transaction generation, middleware ETL pipelines, Vertex-style tax determination, tax reporting dashboards, system monitoring, and an AI Copilot for enterprise tax architecture explanation.
An application for randomly generating telecommunication payment data.
Rocket Flight Simulation project
Market monitoring simulator with automatic data collection and SQL analytics (MAX, MIN, AVG). 📈🗄
This project simulates user behavior on a SaaS learning platform and analyzes product growth metrics using Python. The analysis focuses on understanding how users move through the product funnel, identifying drop-off points, and evaluating experiments that aim to improve conversion to paid users. The project also includes an A/B testing simulation.
Generates a synthetic dataset by simulating the physical properties of SEM.
High-performance, multi-stream data ingestion simulator built for testing real-time pipelines, PB-scale throughput, and stream processing systems like Kafka, Flink, FastAPI, and Iceberg.
This repository contains projects and exercises I completed during my "Big Data Architecture" course. It reflects the concepts I’ve learned about data processing using Apache Spark and PySpark.