BootsofLagrangian
dfe08f395f
support deepspeed
2024-02-04 03:12:42 +09:00
Kohya S
cd19df49cd
Merge pull request #1085 from kohya-ss/dev
...
Dev
v0.8.3
2024-01-27 18:32:06 +09:00
Kohya S
736365bdd5
update README.md
2024-01-27 18:31:01 +09:00
Kohya S
6ceedb9448
Merge branch 'main' into dev
2024-01-27 18:23:52 +09:00
Kohya S
930a3912a7
Merge pull request #1084 from fireicewolf/devel
...
Fix network multiplier cause crashed while use multi-GPUs
2024-01-27 18:22:00 +09:00
Kohya S
cf790d87c4
Merge pull request #1079 from feffy380/fix/fp8savestate
...
Update safetensors to fix a crash with `--fp8_base --save_state`
2024-01-26 22:34:35 +09:00
DukeG
4e67fb8444
test
2024-01-26 20:22:49 +08:00
DukeG
50f631c768
test
2024-01-26 20:02:48 +08:00
DukeG
85bc371ebc
test
2024-01-26 18:58:47 +08:00
feffy380
322ee52c77
Update requirements.txt
...
Update safetensors to fix a crash when using `--fp8_base --save_state`
2024-01-25 19:15:53 +01:00
Kohya S
c576f80639
Fix ControlNetLLLite training issue #1069
2024-01-25 18:43:07 +09:00
Kohya S
d5ab97b69b
Merge pull request #1067 from kohya-ss/dev
...
Dev
v0.8.2
2024-01-23 21:04:16 +09:00
Kohya S
7cb44e4502
update readme
2024-01-23 21:02:40 +09:00
Kohya S
7a20df5ad5
Merge pull request #1064 from KohakuBlueleaf/fix-grad-sync
...
Avoid grad sync on each step even when doing accumulation
2024-01-23 20:33:55 +09:00
Kohya S
bea4362e21
Merge pull request #1060 from akx/refactor-xpu-init
...
Deduplicate ipex initialization code
2024-01-23 20:25:37 +09:00
Kohya S
6805cafa9b
fix TI training crashes in multigpu #1019
2024-01-23 20:17:19 +09:00
Kohaku-Blueleaf
711b40ccda
Avoid always sync
2024-01-23 11:49:03 +08:00
Kohya S
696dd7f668
Fix dtype issue in PyTorch 2.0 for generating samples in training sdxl network
2024-01-22 12:43:37 +09:00
Kohya S
e0a3c69223
update readme
2024-01-20 18:47:10 +09:00
Kohya S
c59249a664
Add options to reduce memory usage in extract_lora_from_models.py closes #1059
2024-01-20 18:45:54 +09:00
Kohya S
fef172966f
Add network_multiplier for dataset and train LoRA
2024-01-20 16:24:43 +09:00
Kohya S
5a1ebc4c7c
format by black
2024-01-20 13:10:45 +09:00
Kohya S
2a0f45aea9
update readme
2024-01-20 11:08:20 +09:00
Kohya S
1f77bb6e73
fix to work sample generation in fp8 ref #1057
2024-01-20 10:57:42 +09:00
Kohya S
a7ef6422b6
fix to work with torch 2.0
2024-01-20 10:00:30 +09:00
Kohaku-Blueleaf
9cfa68c92f
[Experimental Feature] FP8 weight dtype for base model when running train_network (or sdxl_train_network) ( #1057 )
...
* Add fp8 support
* remove some debug prints
* Better implementation for te
* Fix some misunderstanding
* as same as unet, add explicit convert
* better impl for convert TE to fp8
* fp8 for not only unet
* Better cache TE and TE lr
* match arg name
* Fix with list
* Add timeout settings
* Fix arg style
* Add custom seperator
* Fix typo
* Fix typo again
* Fix dtype error
* Fix gradient problem
* Fix req grad
* fix merge
* Fix merge
* Resolve merge
* arrangement and document
* Resolve merge error
* Add assert for mixed precision
2024-01-20 09:46:53 +09:00
Aarni Koskela
6f3f701d3d
Deduplicate ipex initialization code
2024-01-19 18:07:36 +02:00
Kohya S
d2a99a19d4
Merge pull request #1056 from kohya-ss/dev
...
fix vram usage in LoRA training
v0.8.1
2024-01-17 21:41:36 +09:00
Kohya S
0395a35543
Merge branch 'main' into dev
2024-01-17 21:39:13 +09:00
Kohya S
987d4a969d
update readme
2024-01-17 21:38:49 +09:00
Kohya S
976d092c68
fix text encodes are on gpu even when not trained
2024-01-17 21:31:50 +09:00
Kohya S
e6b15c7e4a
Merge pull request #1053 from akx/sdpa
...
Fix typo `--spda` (it's `--sdpa`)
2024-01-16 21:50:45 +09:00
Aarni Koskela
ef50436464
Fix typo --spda (it's --sdpa)
2024-01-16 14:32:48 +02:00
Kohya S
26d35794e3
Merge pull request #1052 from kohya-ss/dev
...
merge dev
v0.8.0
2024-01-15 21:39:02 +09:00
Kohya S
dcf0eeb5b6
update readme
2024-01-15 21:35:26 +09:00
Kohya S
32b759a328
Add wandb_run_name parameter to init_kwargs #1032
2024-01-14 22:02:03 +09:00
Kohya S
09ef3ffa8b
Merge branch 'main' into dev
2024-01-14 21:49:25 +09:00
Kohya S
aab265e431
Fix an issue with saving as diffusers sd1/2 model close #1033
2024-01-04 21:43:50 +09:00
Kohya S
716bad188b
Update dependencies ref #1024
2024-01-04 19:53:25 +09:00
Kohya S
4f93bf10f0
Merge pull request #1032 from hopl1t/wandb_session_name_support
...
Added cli argument for wandb session name
v0.7.1
2024-01-04 11:10:31 +09:00
Kohya S
07bf2a21ac
Merge pull request #1024 from p1atdev/main
...
Add support for `torch.compile`
2024-01-04 10:49:52 +09:00
Kohya S
8ac2d2a92f
Merge pull request #1030 from Disty0/dev
...
Update IPEX Libs
2024-01-04 10:46:07 +09:00
Kohya S
76aee71257
Merge branch 'main' into dev
2024-01-04 10:42:16 +09:00
Kohya S
1db5d790ed
Merge pull request #1029 from kohya-ss/dependabot/github_actions/crate-ci/typos-1.16.26
...
Bump crate-ci/typos from 1.16.15 to 1.16.26
2024-01-04 10:41:07 +09:00
Kohya S
663b481029
fix TI training with full_fp16/bf16 ref #1019
2024-01-03 23:22:00 +09:00
Kohya S
1ab6493268
Merge branch 'main' into dev
2024-01-03 21:36:31 +09:00
Nir Weingarten
ab716302e4
Added cli argument for wandb session name
2024-01-03 11:52:38 +02:00
Disty0
b9d2181192
Cleanup
2024-01-02 11:51:29 +03:00
Disty0
49148eb36e
Disable Diffusers slicing if device is not XPU
2024-01-02 11:50:08 +03:00
Disty0
479bac447e
Fix typo
2024-01-01 12:51:23 +03:00