SeedVR2 v2.5 is the latest version of the ComfyUI integration for the AI-based video restoration model developed by ByteDance. This update brings substantial improvements in speed, memory efficiency, quality, and usability, and in particular makes the model practical on low-spec GPUs. Earlier versions suffered from out-of-memory errors and unstable processing; v2.5 adopts a modular design and manages to run the 7-billion-parameter model on a GPU with 8GB of VRAM. As a result, professional-grade video upscaling becomes possible on consumer hardware, which makes the tool more practical for VFX and content production work.
Video upscaling is the process of converting low-resolution footage to a higher resolution while adding detail. SeedVR2 is built on a diffusion model and performs restoration in a single step, which makes it more efficient than conventional multi-step methods. The v2.5 release incorporates community feedback and, by integrating block swapping and torch.compile, shortens processing time while maintaining quality. For long videos, for example, a streaming architecture was introduced that keeps memory usage stable and prevents VRAM from accumulating. This lets users build flexible workflows that are less constrained by their hardware.
In addition, improved RGBA support makes transparency handling more natural, and edge-guided upscaling produces clean results. Resolution handling is more flexible too: any resolution divisible by 2 is supported. This version also follows ComfyUI's V3 migration and uses a stateless node design. Overall, SeedVR2 v2.5 makes this kind of AI tooling accessible to a wide range of users, from professionals to hobbyists. The sections below cover the details in order.
Overview
SeedVR2 is a diffusion-transformer-based video restoration model proposed by a ByteDance research team. It builds on existing video generation pipelines and aims to handle videos of arbitrary resolution and length. In v2.5, the ComfyUI integration was redesigned from the ground up around a modular architecture. Loading of the DiT (Diffusion Transformer) model and the VAE (Variational Autoencoder) is now separated, so users can configure each one independently.
The core of this version is optimization for low-VRAM environments. For example, the 7-billion-parameter model can run on an 8GB GPU, and GGUF quantization significantly reduces memory consumption. Processing is divided into four phases (encode, upscale, decode, and post-process), and resources are released at the end of each phase. This keeps VRAM usage stable even for long videos and prevents memory leaks.
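The phase separation described above can be sketched roughly as follows. This is an illustrative outline only: `run_pipeline`, the stand-in phase functions, and the `release` hook are hypothetical names chosen for this example, not part of the SeedVR2 code base.

```python
from typing import Any, Callable, List, Tuple

Phase = Tuple[str, Callable[[Any], Any]]

def run_pipeline(data: Any, phases: List[Phase], release: Callable[[str], None]) -> Any:
    """Run the phases in order, releasing each phase's resources before the next starts."""
    for name, fn in phases:
        try:
            data = fn(data)
        finally:
            # In a real pipeline this is where a model would be moved off
            # the GPU and cached memory freed (assumption for illustration).
            release(name)
    return data

# Stand-in phases that only tag the data, so the ordering is visible.
phases: List[Phase] = [
    ("encode", lambda x: x + ["encoded"]),
    ("upscale", lambda x: x + ["upscaled"]),
    ("decode", lambda x: x + ["decoded"]),
    ("postprocess", lambda x: x + ["postprocessed"]),
]

released: List[str] = []
result = run_pipeline(["frames"], phases, released.append)
```

Because `release` runs in a `finally` clause, a phase that fails still gives its resources back, which is the property that keeps memory from leaking across long runs.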
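On the quality side, LAB color matching is the default, and additional methods such as HSV and wavelet adaptation are also supported. Hann window blending is implemented to preserve temporal consistency, smoothing the transitions between batches. The CLI (command-line interface) has also been strengthened with batch processing and multi-GPU support. These changes came from community contributions and have made SeedVR2 a more practical tool.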
Installation
SeedVR2 v2.5 is easy to install through ComfyUI Manager. First, search for "SeedVR2" in ComfyUI's custom node manager and select the AInVFX version. It is the one with the most stars; confirm that the version is 2.5 or later. After installation, restart ComfyUI and check the console output for errors. The nodes work even if Flash Attention and Triton are not detected, but both improve inference speed.
If you need to install manually, go to the official GitHub repository and follow its documentation: install the dependencies and resolve any environment conflicts. Model files are downloaded automatically from Hugging Face; to obtain them manually, use the AInVFX, numz, or cmeka repositories and place the models in ComfyUI's models/SeedVR2 folder.
To use a custom directory, edit extra_model_paths.yaml and specify the model path. New models are recognized after ComfyUI restarts. Handling of interrupted downloads has been added: file integrity is checked when a download resumes, which makes setup more reliable. You can then load a template workflow and test right away.
Model Variations
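SeedVR2 models come in two variations, 3 billion parameters (3B) and 7 billion parameters (7B), each with quantization options. The 3B model is lightweight, suits low-VRAM environments, and is the better choice when fast processing is the priority. The 7B model is capable of more detailed restoration, and choosing its Sharp variant improves edge clarity.
The supported quantization options are FP8, FP16, and GGUF (4-bit Q4_K_M and 8-bit Q8_0). GGUF is memory-efficient and is the key to running the 7B model in 8GB of VRAM. The 3B model has 36 blocks and the 7B model has 66, and block swapping uses this structure to optimize memory.
The table below compares the main models: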
| Model | Parameters | Quantization options | VRAM requirement (minimum) | Strengths | Weaknesses |
|---|---|---|---|---|---|
| 3B | 3 billion | FP8, FP16, GGUF | 5GB or more | Fast processing, low memory consumption | Less detailed restoration than 7B |
| 7B | 7 billion | FP8, FP16, GGUF | 8GB or more (with GGUF) | High quality, Sharp variant available | Large model with high memory load |
| 7B Sharp | 7 billion | FP16 | 8GB or more | Edge enhancement | Sharper than standard 7B, but may increase noise |
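The table supports choosing a model by use case. For example, the GGUF-quantized 3B model is recommended for consumer GPUs, with the 7B model reserved for cases that demand higher quality. According to the specifications listed in the official Hugging Face repository, the models support arbitrary resolutions (divisible by 2).
Take your hardware into account when choosing a model. There is a single VAE, but tiling can be enabled for it, and combining it with torch.compile improves efficiency.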
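Features:
- 3B: 36 blocks; lightweight and aimed at mobile devices.
- 7B: 66 blocks; excels at adding detail.
- GGUF: 4-bit/8-bit quantization reduces VRAM to one third.
Pros:
- A variety of quantization options gives high flexibility.
- The Sharp variant improves visual impact.
Cons:
- The 7B model tends to run out of VRAM at high resolutions.
- Quantization may cause slight quality degradation.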
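As a rough check on the quantization options in this section, weight storage can be estimated as parameter count times bits per weight. The sketch below uses nominal bit widths (16, 8, 4) as a simplifying assumption; real GGUF formats also store per-block scale data, and activations and the VAE need memory on top of the weights, so actual usage is higher.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (10^9 bytes), weights only."""
    return params_billion * bits_per_weight / 8

# Nominal bit widths; an assumption that ignores quantization overhead.
for label, bits in [("FP16", 16), ("FP8 / 8-bit", 8), ("4-bit", 4)]:
    print(f"7B {label}: ~{weight_memory_gb(7, bits):.1f} GB, "
          f"3B {label}: ~{weight_memory_gb(3, bits):.1f} GB")
```

Under these assumptions the 7B weights drop from about 14 GB at FP16 to about 3.5 GB at 4 bits, which is consistent with the claim that quantization is what brings the 7B model within reach of an 8GB card.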
Performance Optimization
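The biggest advance in v2.5 is performance optimization. With torch.compile support, DiT speed improves by 20-40% and VAE speed by 15-25%, because the whole graph is compiled and the CUDA kernels are optimized. However, the first compilation takes 2-5 minutes, so it is best suited to long videos and batch processing; turning it off is recommended for short videos.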
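The trade-off between compile time and per-batch speedup can be made concrete with a small break-even calculation. This is back-of-the-envelope arithmetic, not a measurement: the per-batch time is a made-up input, and "speedup" is interpreted here as a throughput gain, so a compiled batch is assumed to take `batch_time_s / (1 + speedup)`.

```python
import math

def break_even_batches(compile_overhead_s: float, batch_time_s: float, speedup: float) -> int:
    """Number of batches needed for the time saved to cover the compile overhead.

    speedup is a fractional throughput gain (0.25 means 25% faster).
    """
    saved_per_batch = batch_time_s * speedup / (1 + speedup)
    return math.ceil(compile_overhead_s / saved_per_batch)

# Hypothetical inputs: 3 minutes of compilation, 10 s per uncompiled batch, 25% speedup.
batches_needed = break_even_batches(180, 10, 0.25)
```

With these hypothetical numbers each batch saves 2 seconds, so compilation only pays off after 90 batches, which illustrates why it is recommended for long videos and batch jobs rather than short clips.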
Block swapping splits the model into small chunks and offloads them to the CPU. Swapping 36 blocks on the 7B model keeps peak VRAM within 2GB. Installing Flash Attention speeds up inference by 10%. Tiling has also been improved, with independent control over encoding and decoding.
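The idea behind block swapping can be illustrated with a small simulation that counts how many transformer blocks are resident on the GPU at once. It is a toy model under stated assumptions: which blocks get swapped, and the policy of loading a block just before it runs and offloading it right after, are choices made for this sketch and may differ from the actual node.

```python
def simulate_block_swap(num_blocks: int, blocks_to_swap: int) -> int:
    """Return the peak number of blocks resident on the GPU during one forward pass."""
    if not 0 <= blocks_to_swap <= num_blocks:
        raise ValueError("blocks_to_swap must be between 0 and num_blocks")
    # Assumption: the first `blocks_to_swap` blocks live on the CPU,
    # the remaining blocks stay on the GPU permanently.
    on_gpu = set(range(blocks_to_swap, num_blocks))
    peak = len(on_gpu)
    for i in range(num_blocks):
        swapped_in = i not in on_gpu
        if swapped_in:
            on_gpu.add(i)        # load the block just before it runs
        peak = max(peak, len(on_gpu))
        # ... the block's forward pass would run here ...
        if swapped_in:
            on_gpu.remove(i)     # offload it again to free VRAM
    return peak

# For a 36-block stack: no swapping, partial swapping, and full swapping.
peaks = [simulate_block_swap(36, k) for k in (0, 16, 36)]
```

Swapping more blocks lowers the peak number held in VRAM at the cost of extra CPU-to-GPU transfers, which is the speed-for-memory trade described above.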
Model caching shares models across multiple upscalers. Cached models update automatically when settings change, bringing load time close to zero. In the CLI, multi-GPU support distributes the workload and blends the results using temporal overlap.
These optimizations make 4K upscaling possible on consumer hardware. For example, setting the batch size to 5 works on most GPUs and shortens processing time.
Features:
- torch.compile: adjustable mode (from default to max-autotune).
- Block swapping: adaptive memory clearing (5% threshold).
Pros:
- Large time savings on long videos.
- Scales across multiple GPUs.
Cons:
- Compilation overhead makes it unsuitable for short tasks.
- Slower when Flash Attention is not installed.
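Memory Management
Improved memory management is central to v2.5. It adopts a streaming architecture that prevents the VRAM accumulation seen in earlier versions by offloading each batch to CPU RAM. Offload devices can be set independently: for the DiT, the VAE, and tensors, you choose CPU, GPU, or none.
The four-phase pipeline releases resources at each stage, and peak VRAM tracking reports usage per phase. GGUF quantization reduces VRAM enough to run the 7B model in 8GB.
For long videos, enabling offloading holds VRAM at around 11GB. CPU RAM must still hold the final output, but streaming avoids duplicate copies and keeps usage efficient.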
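The streaming pattern described above, where each batch is processed and its result moved off the GPU before the next batch starts, can be sketched as follows. Plain Python lists stand in for tensors here, and `stream_process` and `batched` are hypothetical helper names used only for illustration.

```python
from typing import Callable, Iterator, List, Tuple

def batched(frames: List[int], batch_size: int) -> Iterator[List[int]]:
    """Yield consecutive slices of at most batch_size frames."""
    for start in range(0, len(frames), batch_size):
        yield frames[start:start + batch_size]

def stream_process(
    frames: List[int],
    batch_size: int,
    process: Callable[[List[int]], List[int]],
) -> Tuple[List[int], int]:
    """Process batch by batch; return all results and the largest batch held at once."""
    cpu_results: List[int] = []
    peak_in_flight = 0
    for batch in batched(frames, batch_size):
        out = process(batch)                       # stands in for GPU work
        peak_in_flight = max(peak_in_flight, len(out))
        cpu_results.extend(out)                    # stands in for offloading to CPU RAM
    return cpu_results, peak_in_flight

results, peak = stream_process(list(range(12)), 5, lambda b: [f * 2 for f in b])
```

The working set is bounded by the batch size rather than the video length, while the accumulated output grows in CPU RAM, matching the note that CPU RAM must hold the final result.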
Features:
- Offloading: independent for DiT, VAE, and tensors.
- Peak tracking: detailed output in the log.
Pros:
- Stable processing of long videos.
- Works on low-VRAM GPUs.
Cons:
- Limited by insufficient CPU RAM.
- Slower when offloading is enabled.
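Quality Improvements
On the quality side, LAB color correction is the default, and its perceptual color transfer improves accuracy. HSV saturation matching and wavelet adaptation have been added. Generation is deterministic, so results are reproducible from a seed.
Temporal consistency comes from Hann window blending, which smooths batch transitions. RGBA support upscales the alpha channel with edge guidance, and resolution padding avoids losing any of the image.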
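Hann window blending across a batch boundary amounts to a crossfade whose weights follow the rising half of a Hann window. The sketch below is a minimal illustration of that idea, not the node's implementation: frames are reduced to single float values for brevity, and the function names and the choice of sampling only interior window points are assumptions.

```python
import math
from typing import List

def hann_fade_in(n: int) -> List[float]:
    """Rising half of a Hann window sampled at n interior points, each strictly between 0 and 1."""
    return [0.5 * (1 - math.cos(math.pi * (i + 1) / (n + 1))) for i in range(n)]

def blend_overlap(prev_tail: List[float], next_head: List[float]) -> List[float]:
    """Crossfade the overlapping frames of two consecutive batches."""
    if len(prev_tail) != len(next_head):
        raise ValueError("overlap regions must have the same length")
    weights = hann_fade_in(len(prev_tail))
    # Early overlap frames favor the previous batch, late ones the next batch.
    return [(1 - w) * a + w * b for w, a, b in zip(weights, prev_tail, next_head)]

blended = blend_overlap([1.0, 1.0, 1.0], [0.0, 0.0, 0.0])
```

Because the two weights always sum to 1, regions where both batches agree pass through unchanged, and the smooth cosine ramp avoids the visible jump a hard cut between batches would produce.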
Features:
- Color correction: LAB, HSV, wavelet.
- Blending: Hann window.
Pros:
- Natural color reproduction.
- Transparency channel support.
Cons:
- Alpha processing is still under development.
- Fine noise at high resolutions.
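CLI Features
The CLI is geared toward batch processing. It processes entire folders and uses model caching for efficiency. The output format (MP4 or PNG) is detected automatically. With multiple GPUs, the workload is distributed and temporal overlaps are blended.
Parameters are consistent with ComfyUI. Example: resolution 1080, maximum resolution 1920, batch size 21, block swap 16.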
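To see how the example values of resolution 1080 and maximum resolution 1920 might interact, the sketch below computes an output size from an input size. The semantics are an assumption made for illustration: `resolution` is treated as the target for the shorter edge, `max_resolution` as a cap on the longer edge (0 meaning no cap), and both output dimensions are rounded down to even numbers to satisfy the divisible-by-2 requirement. `target_size` is a hypothetical helper, not a function from the CLI.

```python
from typing import Tuple

def target_size(width: int, height: int, resolution: int, max_resolution: int = 0) -> Tuple[int, int]:
    """Scale so the shorter edge hits `resolution`, capped by `max_resolution` on the longer edge."""
    scale = resolution / min(width, height)
    if max_resolution > 0:
        scale = min(scale, max_resolution / max(width, height))

    def even(value: float) -> int:
        # Round to the nearest integer, then down to an even number (minimum 2).
        return max(2, int(round(value)) // 2 * 2)

    return even(width * scale), even(height * scale)

standard = target_size(640, 360, 1080, 1920)   # 16:9 input, cap not limiting
wide = target_size(1280, 360, 1080, 1920)      # very wide input, cap applies
```

For the 16:9 input both constraints agree on a 3x scale, while for the wide input the cap on the longer edge takes over and the shorter edge ends up below the requested 1080.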
Features:
- Batch processing: mixed images and videos.
- Caching: DiT and VAE.
Pros:
- Suited to production pipelines.
- Thorough help output.
Cons:
- A learning curve for users unfamiliar with ComfyUI.
- Error handling needs further strengthening.
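Usage Examples
A good starting point is single-image upscaling. Load a template in ComfyUI, set the DiT to 7B FP16, and enable tiling on the VAE. Fix the seed and adjust the resolution.
For video, set the batch size to a value of the form 4n+1 (for example, 5) and offload to the CPU. To process a folder with the CLI, run a command such as: python inference_cli.py media_folder/ --output processed/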
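The 4n+1 rule for batch sizes is easy to get wrong when picking a value by hand. A small helper like the one below, which is a hypothetical convenience function and not part of the CLI, rounds a requested size down to the nearest valid value.

```python
def nearest_valid_batch_size(requested: int) -> int:
    """Round down to the nearest value of the form 4n + 1 (minimum 1)."""
    if requested < 1:
        raise ValueError("batch size must be at least 1")
    return (requested - 1) // 4 * 4 + 1

# Values already of the form 4n + 1 are unchanged; others are rounded down.
examples = {size: nearest_valid_batch_size(size) for size in (1, 4, 5, 8, 21, 24)}
```

Rounding down rather than up keeps the adjusted batch within whatever memory budget the original request was chosen for.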
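This makes it possible to upscale commercially usable material, such as Pixabay images, to high quality. Use debug mode to monitor memory and tune the settings.
SeedVR2 v2.5 moves AI video restoration forward. High-quality processing becomes possible on consumer hardware, which should improve efficiency in the VFX industry. A balanced approach is to start with the 3B model and move to 7B as needed. More broadly, tools like this lower the barrier to creative work and should lead to more innovative content.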
