Synthetic Data SDK ✨
Updated May 8, 2026 - Python
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Project page of SynthText3D
Create files with fake data. In many formats. With no effort.
(SIGCOMM '22) Practical GAN-based Synthetic IP Header Trace Generation using NetShare
Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.
Synthetic Data Engine 💎
🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.
SDNist: Benchmark data and evaluation tools for data synthesizers.
The Synthetic Data API. Generate privacy-safe synthetic data with 5 lines of code.
Multidimensional cluster generation in Python
Multidimensional cluster generation in Julia
DARPA Lift Challenge Simulator
Generate realistic synthetic enterprise data for Spark, Pandas, testing, demos, benchmarking, learning, and research.
Multidimensional cluster generation in R
Multidimensional cluster generation in MATLAB/Octave
tenderness is a fast library for synthetic, deterministic document rendering from text and images
Verisim generates whole, coherent Pydantic domain objects instead of unrelated random fields. A generated person can have a name, username, email, phone, address, job, company, bio, website, and social profiles that all make sense together.
Calibrate a depth camera with another sensor giving odometry
🏥 Comprehensive synthetic data generator for Indian surgical emergency patients - Perfect for MBBS education, surgery training, and NEET-PG preparation