Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-08 22:35:09 +00:00

Author	SHA1	Message	Date
Kohya S.	5f793fb0f4	Log d*lr for ProdigyPlusScheduleFree (#2289 )	2026-03-29 18:47:09 +09:00
woctordho	343c929e39	Log d*lr for ProdigyPlusScheduleFree	2026-03-21 11:09:56 +08:00
woctordho	1cd95b2d8b	Add `skip_image_resolution` to deduplicate multi-resolution dataset (#2273 ) * Add min_orig_resolution and max_orig_resolution * Rename min_orig_resolution to skip_image_resolution; remove max_orig_resolution * Change skip_image_resolution to tuple * Move filtering to __init__ * Minor fix	2026-03-19 08:43:39 +09:00
Kohya S.	34e7138b6a	Add/modify some implementation for anima (#2261 ) * fix: update extend-exclude list in _typos.toml to include configs * fix: exclude anima tests from pytest * feat: add entry for 'temperal' in extend-words section of _typos.toml for Qwen-Image VAE * fix: update default value for --discrete_flow_shift in anima training guide * feat: add Qwen-Image VAE * feat: simplify encode_tokens * feat: use unified attention module, add wrapper for state dict compatibility * feat: loading with dynamic fp8 optimization and LoRA support * feat: add anima minimal inference script (WIP) * format: format * feat: simplify target module selection by regular expression patterns * feat: kept caption dropout rate in cache and handle in training script * feat: update train_llm_adapter and verbose default values to string type * fix: use strategy instead of using tokenizers directly * feat: add dtype property and all-zero mask handling in cross-attention in LLMAdapterTransformerBlock * feat: support 5d tensor in get_noisy_model_input_and_timesteps * feat: update loss calculation to support 5d tensor * fix: update argument names in anima_train_utils to align with other archtectures * feat: simplify Anima training script and update empty caption handling * feat: support LoRA format without `net.` prefix * fix: update to work fp8_scaled option * feat: add regex-based learning rates and dimensions handling in create_network * fix: improve regex matching for module selection and learning rates in LoRANetwork * fix: update logging message for regex match in LoRANetwork * fix: keep latents 4D except DiT call * feat: enhance block swap functionality for inference and training in Anima model * feat: refactor Anima training script * feat: optimize VAE processing by adjusting tensor dimensions and data types * fix: wait all block trasfer before siwtching offloader mode * feat: update Anima training guide with new argument specifications and regex-based module selection. Thank you Claude! * feat: support LORA for Qwen3 * feat: update Anima SAI model spec metadata handling * fix: remove unused code * feat: split CFG processing in do_sample function to reduce memory usage * feat: add VAE chunking and caching options to reduce memory usage * feat: optimize RMSNorm forward method and remove unused torch_attention_op * Update library/strategy_anima.py Use torch.all instead of all. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update library/safetensors_utils.py Fix duplicated new_key for concat_hook. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update anima_minimal_inference.py Remove unused code. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update anima_train.py Remove unused import. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update library/anima_train_utils.py Remove unused import. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix: review with Copilot * feat: add script to convert LoRA format to ComfyUI compatible format (WIP, not tested yet) * feat: add process_escape function to handle escape sequences in prompts * feat: enhance LoRA weight handling in model loading and add text encoder loading function * feat: improve ComfyUI conversion script with prefix constants and module name adjustments * feat: update caption dropout documentation to clarify cache regeneration requirement * feat: add clarification on learning rate adjustments * feat: add note on PyTorch version requirement to prevent NaN loss --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-02-13 08:15:06 +09:00
Kohya S	9a61d61b22	feat: avoid unet type casting when fp8_scaled	2025-09-12 22:18:29 +09:00
Kohya S	209c02dbb6	feat: HunyuanImage LoRA training	2025-09-12 21:40:42 +09:00
Kohya S	7f983c558d	feat: block swap for inference and initial impl for HunyuanImage LoRA (not working)	2025-09-11 22:15:22 +09:00
Kohya S	6edbe00547	feat: update libraries, remove warnings	2025-08-16 20:07:03 +09:00
rockerBOO	c149cf283b	Add parser args for other trainers.	2025-08-03 00:58:25 -04:00
rockerBOO	bf0f86e797	Add sai_model_spec to train_network.py	2025-08-02 21:35:45 -04:00
Kohya S	c28e7a47c3	feat: add regex-based rank and learning rate configuration for FLUX.1 LoRA	2025-07-26 19:35:42 +09:00
rockerBOO	0145efc2f2	Merge branch 'sd3' into lumina	2025-06-09 18:13:06 -04:00
rockerBOO	b4a89c3cdf	Fix None	2025-05-01 02:03:22 -04:00
rockerBOO	f62c68df3c	Make grad_norm and combined_grad_norm None is not recording	2025-05-01 01:37:57 -04:00
Kohya S	5a18a03ffc	Merge branch 'dev' into sd3	2025-04-07 21:55:17 +09:00
青龍聖者@bdsqlsz	9f1892cc8e	Merge branch 'sd3' into lumina	2025-04-06 16:13:43 +08:00
Kohya S	d0b5c0e5cf	chore: formatting, add TODO comment	2025-03-30 21:15:37 +09:00
Kohya S.	59d98e45a9	Merge pull request #1974 from rockerBOO/lora-ggpo Add LoRA-GGPO for Flux	2025-03-30 21:07:31 +09:00
Kohya S.	93a4efabb5	Merge branch 'sd3' into resize-interpolation	2025-03-30 19:30:56 +09:00
DKnight54	381303d64f	Update train_network.py	2025-03-29 02:26:18 +08:00
rockerBOO	0181b7a042	Remove progress bar avg norms	2025-03-27 03:28:33 -04:00
rockerBOO	3647d065b5	Cache weight norms estimate on initialization. Move to update norms every step	2025-03-18 14:25:09 -04:00
rockerBOO	ea53290f62	Add LoRA-GGPO for Flux	2025-03-06 00:00:38 -05:00
rockerBOO	70403f6977	fix cache text encoder outputs if not using disk. small cleanup/alignment	2025-02-26 23:33:50 -05:00
rockerBOO	7b83d50dc0	Merge branch 'sd3' into lumina	2025-02-26 22:13:56 -05:00
Kohya S	1fcac98280	Merge branch 'sd3' into val-loss-improvement	2025-02-26 21:09:10 +09:00
Kohya S.	6e90c0f86c	Merge pull request #1909 from rockerBOO/progress_bar Move progress bar to account for sampling image first	2025-02-24 18:57:44 +09:00
rockerBOO	48e7da2d4a	Add sample batch size for Lumina	2025-02-23 20:19:24 -05:00
rockerBOO	025cca699b	Fix samples, LoRA training. Add system prompt, use_flash_attn	2025-02-23 01:29:18 -05:00
Kohya S	efb2a128cd	fix wandb val logging	2025-02-21 22:07:35 +09:00
rockerBOO	ca1c129ffd	Fix metadata	2025-02-19 14:20:40 -05:00
rockerBOO	7729c4c8f9	Add metadata	2025-02-19 14:20:40 -05:00
Kohya S	4a36996134	modify log step calculation	2025-02-18 22:05:08 +09:00
Kohya S	dc7d5fb459	Merge branch 'sd3' into val-loss-improvement	2025-02-18 21:34:30 +09:00
rockerBOO	1aa2f00e85	Fix validation epoch loss to check epoch average	2025-02-17 12:07:23 -05:00
rockerBOO	44782dd790	Fix validation epoch divergence	2025-02-17 12:07:22 -05:00
rockerBOO	4671e23778	Fix validation epoch loss to check epoch average	2025-02-16 01:42:44 -05:00
Kohya S	63337d9fe4	Merge branch 'sd3' into val-loss-improvement	2025-02-15 21:41:07 +09:00
rockerBOO	ab88b431b0	Fix validation epoch divergence	2025-02-14 11:14:38 -05:00
Kohya S	76b761943b	fix: simplify validation step condition in NetworkTrainer	2025-02-11 21:53:57 +09:00
Kohya S	177203818a	fix: unpause training progress bar after vaidation	2025-02-11 21:42:46 +09:00
Kohya S	344845b429	fix: validation with block swap	2025-02-09 21:25:40 +09:00
Kohya S	0911683717	set python random state	2025-02-09 20:53:49 +09:00
Kohya S	c5b803ce94	rng state management: Implement functions to get and set RNG states for consistent validation	2025-02-04 21:59:09 +09:00
rockerBOO	de830b8941	Move progress bar to account for sampling image first	2025-01-29 00:02:45 -05:00
Kohya S	45ec02b2a8	use same noise for every validation	2025-01-27 22:10:38 +09:00
Kohya S	0778dd9b1d	fix Text Encoder only LoRA training	2025-01-27 22:03:42 +09:00
Kohya S	0750859133	validation: Implement timestep-based validation processing	2025-01-27 21:56:59 +09:00
Kohya S	29f31d005f	add network.train()/eval() for validation	2025-01-27 21:35:43 +09:00
Kohya S	b6a3093216	call optimizer eval/train fn before/after validation	2025-01-27 21:22:11 +09:00

1 2 3 4 5 ...

423 Commits