A free and open-source library to generate and validate datasets with full transparency.
Transform your ideas into workflows using custom blocks and validate data with ease.
Add custom blocks in minutes with auto-discovery. Drop your file in user_blocks/ and it's automatically available, with no configuration needed.
Visual pipeline builder eliminates boilerplate code. Connect blocks and they automatically share data through accumulated state.
Intuitive drag-and-drop interface, no training required. Build complex data generation workflows without writing orchestration code.
Complete execution traces for debugging. See exactly how each result was generated with full visibility into every pipeline step.
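To make "accumulated state" and "execution traces" concrete, here is a minimal sketch of the idea in plain Python. It does not use this library's actual API: the block functions, `run_pipeline`, and the trace fields are hypothetical names chosen for illustration. It assumes a block is a function that reads the state so far and returns new keys to add.

```python
from typing import Any, Callable

# Hypothetical block signature (an assumption, not the library's API):
# a block reads the accumulated state and returns the new keys it contributes.
Block = Callable[[dict[str, Any]], dict[str, Any]]


def extract_topic(state: dict[str, Any]) -> dict[str, Any]:
    # Use the first word of the seed text as a toy "topic".
    return {"topic": state["text"].split()[0].lower()}


def write_question(state: dict[str, Any]) -> dict[str, Any]:
    # Reads a key that an earlier block added to the state.
    return {"question": f"What is {state['topic']}?"}


def run_pipeline(
    blocks: list[Block], seed: dict[str, Any]
) -> tuple[dict[str, Any], list[dict[str, Any]]]:
    state = dict(seed)  # copy so the seed is left untouched
    trace = []
    for block in blocks:
        output = block(state)
        state.update(output)  # each block's output accumulates in the state
        trace.append(
            {
                "block": block.__name__,
                "output": output,
                "state_keys": sorted(state),
            }
        )
    return state, trace


state, trace = run_pipeline(
    [extract_topic, write_question],
    {"text": "Photosynthesis converts light into energy."},
)
```

Each trace entry records which block ran, what it produced, and which keys were available afterwards, so you can follow a result back through every step.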
Starting from a seed file, build or customize a pipeline to generate the desired data for your use case.
Start with text content that your pipeline will process.
Design your workflow using drag-and-drop blocks. Each block adds data to the accumulated state.
Review your results with keyboard shortcuts, and configure the view to show the data you need.
Export your data in JSONL format, filtered by status (accepted, rejected, pending).
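As a rough picture of what a status-filtered JSONL export looks like, here is a short sketch using only the Python standard library. The record fields (`id`, `status`, `output`) and the `export_jsonl` helper are assumptions made for illustration. The real export schema may differ.

```python
import json


def export_jsonl(records: list[dict], status: str) -> str:
    """Serialize records with the given status as JSONL (one JSON object per line)."""
    selected = [r for r in records if r.get("status") == status]
    return "".join(json.dumps(r, ensure_ascii=False) + "\n" for r in selected)


# Hypothetical reviewed records; field names are illustrative only.
records = [
    {"id": 1, "status": "accepted", "output": "What is photosynthesis?"},
    {"id": 2, "status": "rejected", "output": "???"},
    {"id": 3, "status": "pending", "output": "What is osmosis?"},
    {"id": 4, "status": "accepted", "output": "What is mitosis?"},
]

accepted_jsonl = export_jsonl(records, "accepted")
```

Because every line is an independent JSON object, the file can be read back one record at a time with `json.loads` on each line.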
You can start now, locally or using Docker with just a few commands:
That's it! No complex configuration required. Free and open source.
Found a bug or have an idea? We welcome contributions from the community!