Qwen Image 2512 Trainer

general trainer

fal-ai/qwen-image-2512-trainer

Train LoRAs for Qwen Image 2512, the current Qwen base.

Trains on Qwen Image 2512, the improved base with better text rendering and more realistic people. This is the Qwen trainer to default to: its output runs directly on the qwen-image-2512/lora endpoint.

Open in fal playground ↗Official API docs ↗

What goes in the zip

Flat zip of images with name.txt captions. Uncaptioned images use default_caption; missing both fails the run.

Good starting point

steps: 1000learning_rate: 0.0005

Parameters

Schema facts come straight from the fal API; the notes are ours.

Required

image_data_urlstringrequired

URL to a zip archive of your training images, optionally with matching .txt caption files.

In the atelier: The album you hand the painter. It is the single biggest factor in what the LoRA becomes.

Tip: 15 to 30 sharp, varied images beat 200 sloppy ones. Vary angle, lighting and background; keep the subject consistent.

Watch out: Duplicate or near-duplicate images push the LoRA toward memorizing instead of learning.

Raw schema description

URL to the input data zip archive for text-to-image training. The zip should contain images with their corresponding text captions: image.EXT and image.txt For example: photo.jpg and photo.txt The text file contains the caption/prompt describing the target image. If no text file is provided for an image, the default_caption will be used. If no default_caption is provided and a text file is missing, the training will fail.

Optional

learning_ratenumberdefault: 5e-4

How big each learning update is. Controls how aggressively the model changes per step.

In the atelier: The painter's eagerness. A high rate is frantic practice: fast but sloppy, and it can wreck habits he already had. A low rate is careful practice: slow, but precise.

Tip: Stay near the trainer's default unless you have a reason. If results look fried or oversaturated, lower it. If the subject barely shows after many steps, raise it slightly or add steps.

Watch out: Learning rate and steps trade off against each other. Doubling both at once is how datasets get burned.

Raw schema description

Learning rate for LoRA parameters.

stepsintegerdefault: 1000100 – 30000

How many training iterations the model runs on your dataset. More steps means the LoRA sees your images more times.

In the atelier: Practice repetitions. Too few and the painter never picks up the skill. Too many and he stops learning and starts memorizing your exact photos.

Tip: Around 1000 is a solid default for a 15 to 30 image subject dataset. Small datasets need fewer steps, not more.

Watch out: If outputs start reproducing your training photos almost exactly (same pose, same background), you overtrained. Go back down.

Raw schema description

Number of steps to train for

default_captionstring

Caption used for any image that has no .txt caption file in the zip.

In the atelier: The note the painter assumes when a photo in the album has no note attached.

Tip: For edit trainers this often carries the whole instruction, like 'turn this sketch into a finished painting'.

Raw schema description

Default caption to use when caption files are missing. If None, missing captions will cause an error.

Call it

import { fal } from "@fal-ai/client";

const result = await fal.subscribe("fal-ai/qwen-image-2512-trainer", {
  input: {
    "image_data_url": "https://your-cdn.com/dataset.zip",
    "steps": 1000,
    "learning_rate": 0.0005
  },
  logs: true,
});
console.log(result.data);

Run the result with

Qwen Image 2512

fal-ai/qwen-image-2512/lora