Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-06 13:47:06 +00:00

Author	SHA1	Message	Date
rockerBOO	c149cf283b	Add parser args for other trainers.	2025-08-03 00:58:25 -04:00
rockerBOO	9fde0d7972	Handle tuple return from generate_dataset_group_by_blueprint	2025-01-08 18:38:20 -05:00
Kohya S	cc11989755	fix: refactor huber-loss calculation in multiple training scripts	2024-12-01 21:20:28 +09:00
recris	420a180d93	Implement pseudo Huber loss for Flux and SD3	2024-11-27 18:37:09 +00:00
Kohya S	5fba6f514a	Merge branch 'dev' into sd3	2024-10-25 19:03:27 +09:00
catboxanon	e1b63c2249	Only add warning for deprecated scaling vpred loss function	2024-10-21 08:12:53 -04:00
catboxanon	8fc30f8205	Fix training for V-pred and ztSNR 1) Updates debiased estimation loss function for V-pred. 2) Prevents now-deprecated scaling of loss if ztSNR is enabled.	2024-10-21 07:34:33 -04:00
Kohya S	2500f5a798	fix latents caching not working closes #1696	2024-10-15 07:16:34 +09:00
kohya-ss	c80c304779	Refactor caching in train scripts	2024-10-12 20:18:41 +09:00
Kohya S	f2bc820133	support weighted captions for SD/SDXL	2024-10-11 08:48:55 +09:00
Kohya S	886f75345c	support weighted captions for sdxl LoRA and fine tuning	2024-10-10 08:27:15 +09:00
Plat	a823fd9fb8	Improve wandb logging (#1576 ) * fix: wrong training steps were recorded to wandb, and no log was sent when logging_dir was not specified * fix: checking of whether wandb is enabled * feat: log images to wandb with their positive prompt as captions * feat: logging sample images' caption for sd3 and flux * fix: import wandb before use	2024-09-11 22:21:16 +09:00
Kohya S	ea9242653c	Merge branch 'dev' into sd3	2024-08-24 21:24:44 +09:00
Kohya S.	4ca29edbff	Merge pull request #1505 from liesened/patch-2 Add v-pred support for SDXL train	2024-08-24 21:16:53 +09:00
liesen	1e8108fec9	Handle args.v_parameterization properly for MinSNR and changed prediction target	2024-08-24 01:38:17 +03:00
Kohya S	41dee60383	Refactor caching mechanism for latents and text encoder outputs, etc.	2024-07-27 13:50:05 +09:00
Kohya S	56bb81c9e6	add grad_hook after restore state closes #1344	2024-06-12 21:39:35 +09:00
Kohya S	da6fea3d97	simplify and update alpha mask to work with various cases	2024-05-19 21:26:18 +09:00
u-haru	db6752901f	画像のアルファチャンネルをlossのマスクとして使用するオプションを追加 (#1223 ) * Add alpha_mask parameter and apply masked loss * Fix type hint in trim_and_resize_if_required function * Refactor code to use keyword arguments in train_util.py * Fix alpha mask flipping logic * Fix alpha mask initialization * Fix alpha_mask transformation * Cache alpha_mask * Update alpha_masks to be on CPU * Set flipped_alpha_masks to Null if option disabled * Check if alpha_mask is None * Set alpha_mask to None if option disabled * Add description of alpha_mask option to docs	2024-05-19 19:07:25 +09:00
Kohya S	38e4c602b1	Merge pull request #1277 from Cauldrath/negative_learning Allow negative learning rate	2024-05-19 18:55:50 +09:00
Kohya S	c68baae480	add `--log_config` option to enable/disable output training config	2024-05-19 17:21:04 +09:00
Kohya S	47187f7079	Merge pull request #1285 from ccharest93/main Hyperparameter tracking	2024-05-19 16:31:33 +09:00
Kohya S	607e041f3d	chore: Refactor optimizer group	2024-05-12 14:16:41 +09:00
Kohya S	b56d5f7801	add experimental option to fuse params to optimizer groups	2024-05-06 21:35:39 +09:00
Maatra	2c9db5d9f2	passing filtered hyperparameters to accelerate	2024-04-20 14:11:43 +01:00
Cauldrath	fc374375de	Allow negative learning rate This can be used to train away from a group of images you don't want As this moves the model away from a point instead of towards it, the change in the model is unbounded So, don't set it too low. -4e-7 seemed to work well.	2024-04-18 23:29:01 -04:00
2kpr	4f203ce40d	Fused backward pass	2024-04-14 09:56:58 -05:00
kabachuha	90b18795fc	Add option to use Scheduled Huber Loss in all training pipelines to improve resilience to data corruption (#1228 ) * add huber loss and huber_c compute to train_util * add reduction modes * add huber_c retrieval from timestep getter * move get timesteps and huber to own function * add conditional loss to all training scripts * add cond loss to train network * add (scheduled) huber_loss to args * fixup twice timesteps getting * PHL-schedule should depend on noise scheduler's num timesteps * 2 multiplier to huber loss cause of 1/2 a^2 conv. The Taylor expansion of sqrt near zero gives 1/2 a^2, which differs from a^2 of the standard MSE loss. This change scales them better against one another add option for smooth l1 (huber / delta) * unify huber scheduling * add snr huber scheduler --------- Co-authored-by: Kohya S <52813779+kohya-ss@users.noreply.github.com>	2024-04-07 13:54:21 +09:00
ykume	cd587ce62c	verify command line args if wandb is enabled	2024-04-05 08:23:03 +09:00
Kohya S	ab1e389347	Merge branch 'dev' into masked-loss	2024-03-26 19:39:30 +09:00
BootsofLagrangian	d9456020d7	Fix most of ZeRO stage uses optimizer partitioning - we have to prepare optimizer and ds_model at the same time. - pull/1139#issuecomment-1986790007 Signed-off-by: BootsofLagrangian <hard2251@yonsei.ac.kr>	2024-03-20 20:52:59 +09:00
Kohya S	fbb98f144e	Merge branch 'dev' into deep-speed	2024-03-20 18:15:26 +09:00
Kohya S	9b6b39f204	Merge branch 'dev' into masked-loss	2024-03-20 18:14:36 +09:00
Kohya S	3419c3de0d	common masked loss func, apply to all training script	2024-03-17 19:30:20 +09:00
gesen2egee	095b8035e6	save state on train end	2024-03-10 23:33:38 +08:00
Kohya S	a9b64ffba8	support masked loss in sdxl_train ref #589	2024-02-27 21:43:55 +09:00
Kohya S	e3ccf8fbf7	make deepspeed_utils	2024-02-27 21:30:46 +09:00
Kohya S	eefb3cc1e7	Merge branch 'deep-speed' into deepspeed	2024-02-27 18:57:42 +09:00
Kohya S	f4132018c5	fix to work with cpu_count() == 1 closes #1134	2024-02-24 19:25:31 +09:00
BootsofLagrangian	4d5186d1cf	refactored codes, some function moved into train_utils.py	2024-02-22 16:20:53 +09:00
Kohya S	baa0e97ced	Merge branch 'dev' into dev_device_support	2024-02-17 11:54:07 +09:00
Kohya S	93bed60762	fix to work `--console_log_xxx` options	2024-02-12 14:49:29 +09:00
Kohya S	358ca205a3	Merge branch 'dev' into dev_device_support	2024-02-12 13:01:54 +09:00
Kohya S	e24d9606a2	add clean_memory_on_device and use it from training	2024-02-12 11:10:52 +09:00
BootsofLagrangian	03f0816f86	the reason not working grad accum steps found. it was becasue of my accelerate settings	2024-02-09 17:47:49 +09:00
Kohya S	055f02e1e1	add logging args for training scripts	2024-02-08 21:16:42 +09:00
BootsofLagrangian	62556619bd	fix full_fp16 compatible and train_step	2024-02-07 16:42:05 +09:00
BootsofLagrangian	3970bf4080	maybe fix branch to run offloading	2024-02-05 22:40:43 +09:00
BootsofLagrangian	2824312d5e	fix vae type error during training sdxl	2024-02-05 20:13:28 +09:00
Yuta Hayashibe	5f6bf29e52	Replace print with logger if they are logs (#905 ) * Add get_my_logger() * Use logger instead of print * Fix log level * Removed line-breaks for readability * Use setup_logging() * Add rich to requirements.txt * Make simple * Use logger instead of print --------- Co-authored-by: Kohya S <52813779+kohya-ss@users.noreply.github.com>	2024-02-04 18:14:34 +09:00

1 2

96 Commits