FINETUNE_PREP.AI
v1.0.0
[SYS] OpenAI fine-tuning dataset preparation system: Multi-format converter + JSONL processor
CORE_FEATURES:
Multi Format Processing
> Tone of voice datasets
> Instruction-following datasets
> Function-calling datasets
Tool Formatting
> Function call JSON structure
> Tool response mapping
> Multi-turn conversations
JSONL Generation
> OpenAI spec-compliant format
> Proper line formatting
> Validation & error checking
Variable Substitution
> System prompt templating
> Response variable mapping
> Dynamic content insertion
EXAMPLE_OUTPUT:
Workflow Input:
"Fine-tune Type": "Reg_Func (Function Calling)",
"Format Type": "Multi-turn conversations",
"Input Dataset": "customer_service_examples.xlsx",
"Tool Definitions": "Automatically extracted"
Finetune Dataprep - JSONL Example
Document Metadata
- Source: Customer Service Dataset
- Format: JSONL with Tool Calls
- Tags: #CustomerService #ToolCalling #FunctionCalling #OpenAI #FineTuning
Finetune Dataprep Output Examples
Input Formats
1. TOV (Tone of Voice) - For customizing model's tone and style
2. Reg (Regular Instruction) - For general instruction-following capabilities
3. Reg_Func (Function Calling) - For training models to use tools/functions
Example Function Calling Spreadsheet Input
Below is an example of how you would format your data in a spreadsheet for function calling fine-tuning:
JSONL Output Format
After processing your spreadsheet data, the workflow generates a properly formatted JSONL file that looks like this:
Tool Definitions
Workflow Process
The Finetune Dataprep workflow processes your data through these key steps:
- INPUT PROCESSING: The workflow accepts XLS or CSV files with customer service conversations organized in columns.
- FORMAT SELECTION: Based on the job type (TOV, Reg, or Reg_Func), the workflow applies appropriate data transformation rules.
- TOOL DEFINITIONS: For function calling datasets, tool definitions are automatically extracted and formatted according to OpenAI's schema.
- CONVERSATION FORMATTING: Conversations are properly structured with user, assistant, and tool messages in the correct sequence.
- JSONL GENERATION: The workflow outputs a properly formatted JSONL file with each JSON object on its own line, ready for upload to OpenAI's fine-tuning API.
- Proper message sequence and role attribution
- Correct JSON structure for function/tool calls
- Tool definition schema validation
- Multi-turn conversation support with contextual awareness
This is an example of a fine-tuning JSONL file created with our template
$ system_requirements
MODELS: none required
STORAGE: google sheets
SERVICES: none required
OUTPUT: downloadable .jsonl file
PRICING: google sheets - free
EST. PER RUN COST: free
PROCESS_FLOW:
AUTOMATION_BENEFITS:
- > Save hours of manual data formatting
- > Error-free JSONL conversion
- > Support for complex tool/function calling
- > Process multi-turn conversations
- > OpenAI compliant data structure
* Compatible with all n8n installations v1.0.0+
*Superflowz is a subsidiary of CARDUME ESBELTO UNIP. LDA. Your purchase will be from, and your receipt will list, CARDUME ESBELTO