AI that generates images automatically has become part of everyday life. With tools such as Midjourney, Stable Diffusion, and DALL-E, anyone can produce polished illustrations or photo-like images in moments from a text description alone. In Japan in particular, these tools are widely used for anime-style characters, landscape images, and product design prototypes. Yet existing methods have run into real walls: generation takes too long, quality drops at high resolution, and detailed instructions (prompts) are not followed well. Creators and everyday users raise these complaints often.
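Against this backdrop, ByteDance, the Chinese parent company of TikTok, announced a new model called "BitDance" in February 2026. It rethinks from the ground up the image generation framework known as autoregressive (AR) modeling. According to the paper, BitDance reaches top-tier quality with far less compute than earlier AR models, generation can be more than 30 times faster than before in some cases, and a high-resolution 1024×1024-pixel image can be produced in a matter of seconds. Because the code and models have been released as open source, researchers and developers around the world can try it right away.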
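BitDance's defining feature is its own way of representing images, called "binary tokens." The image is split into small pieces (tokens), and each piece is written as a string of 0s and 1s, so a single piece can take an astronomical number of variations (up to 2 to the power of 256). This makes it possible to reproduce very fine detail with a small number of pieces. The catch is that with so many choices, prediction becomes hard; BitDance addresses this with a technique borrowed from diffusion models.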
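This article explains how BitDance works while avoiding jargon as much as possible, so that general readers can enjoy the "aha" moments too. We start with the problems of earlier techniques, then cover BitDance's key ideas, its measured performance, practical examples of text-to-image generation, and finally its possible impact on Japanese society. Let's take a look at this new era of AI image generation together.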
The history of AI image generation and the challenges so far
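The technology for making images with AI accelerated rapidly from the late 2010s. At first the mainstream method was the GAN (generative adversarial network), which produced realistic face photos and the like. Then, in the 2020s, "diffusion models" arrived. They create a clean image by removing noise a little at a time, and they gave rise to popular tools such as Stable Diffusion. These models produce high quality, but their weakness was that generation takes time.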
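Autoregressive (AR) models, on the other hand, generate an image by predicting the next piece in order, in the same way that ChatGPT writes text by predicting the next word. Their advantage is that they pair well with language models and understand text strongly, but they had two major weaknesses.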
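The first is limited token expressiveness. The common approach was to split the image into small blocks and pick each one from a dictionary called a "codebook," but when the dictionary has a limited number of entries, fine colors and textures are easily lost. The second is slow generation. Because pieces are predicted one at a time, higher resolutions require hundreds or thousands of steps, and generation becomes too slow to wait for.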
ByteDance's research team took on these problems directly. The two points where earlier AR models struggled, "how to represent the pieces of an image" and "how to predict them efficiently," were tackled with an entirely new approach. The result is BitDance.
The core of BitDance: representing images smartly in bits with binary tokens
BitDance's central invention is representing each piece of an image as a "binary token" (a string of 0s and 1s). Earlier methods picked a single number, as in "this piece is entry 123 in the dictionary," whereas BitDance represents one piece with 256 bits (a row of 256 zeros or ones).
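To make the contrast concrete, here is a minimal sketch of the two ways of turning one image piece into a token. It is an illustration, not BitDance's actual tokenizer: the sign-based rule in `binary_quantize`, the codebook size of 1024, and the random feature vector are all assumptions introduced for this example. Only the 256 bits per token comes from the article.

```python
import random

def codebook_quantize(vec, codebook):
    """Conventional approach: return the index of the nearest codebook entry."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: sq_dist(vec, codebook[i]))

def binary_quantize(vec):
    """Binary-token approach (assumed sign-based here): one bit per channel."""
    return [1 if x > 0 else 0 for x in vec]

random.seed(0)
NUM_BITS = 256        # bits per token, as stated in the article
CODEBOOK_SIZE = 1024  # hypothetical size for the conventional codebook

# A made-up feature vector standing in for one encoded image piece.
feature = [random.gauss(0.0, 1.0) for _ in range(NUM_BITS)]
codebook = [[random.gauss(0.0, 1.0) for _ in range(NUM_BITS)]
            for _ in range(CODEBOOK_SIZE)]

index = codebook_quantize(feature, codebook)  # one number out of 1024 choices
bits = binary_quantize(feature)               # 256 independent 0/1 decisions

print(index)
print(len(bits))  # 256
```

The codebook version collapses the piece to one of 1024 labels, while the binary version keeps a separate yes/no decision for every channel.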
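What makes the 256-bit representation remarkable is that the number of variations a single token can take becomes 2 to the power of 256, an almost unimaginably large number. That is far more than the number of atoms on Earth, which leaves room to capture very fine image features. You can picture it as choosing the best option from a nearly unlimited set.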
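The size of that number is easy to check with a few lines of arithmetic. The codebook size of 16,384 and the figure of roughly 10^50 atoms on Earth are assumptions used only for comparison (the latter is a rough order-of-magnitude estimate).

```python
import math

NUM_BITS = 256                   # bits per token, as stated in the article
binary_choices = 2 ** NUM_BITS   # Python integers have arbitrary precision

CODEBOOK_SIZE = 16384            # hypothetical conventional codebook size (2**14)
ATOMS_ON_EARTH = 10 ** 50        # rough order-of-magnitude estimate

print(len(str(binary_choices)))                      # 78 decimal digits
print(math.log2(CODEBOOK_SIZE))                      # 14.0 bits per conventional token
print(binary_choices > ATOMS_ON_EARTH)               # True
print(binary_choices // CODEBOOK_SIZE == 2 ** 242)   # True
```

Seen this way, a conventional token from such a codebook carries 14 bits of choice, while a binary token carries 256. It also hints at the difficulty: a classifier with one output per possible token would need 2^256 outputs.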
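Compression efficiency is also excellent. Compared with the conventional continuous representation (the method known as a VAE), reconstruction accuracy (measured by metrics called PSNR and SSIM) is equal or better, while the data size stays small. In the paper's experiments, fine details such as hair texture and shadow gradients were well preserved even when images were compressed to 1/16 or 1/32.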
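For readers curious about what a reconstruction metric measures, the sketch below computes PSNR using its standard definition, 10·log10(MAX² / MSE). The pixel values are made-up examples, not data from the paper, and real evaluations run over full images rather than four pixels.

```python
import math

def psnr(original, reconstructed, max_value=255.0):
    """Peak signal-to-noise ratio in decibels; higher means a closer match."""
    if len(original) != len(reconstructed):
        raise ValueError("inputs must have the same length")
    mse = sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)

# Made-up pixel values for illustration only.
pixels = [12, 80, 200, 255]
close_copy = [13, 79, 201, 254]   # off by 1 everywhere
rough_copy = [32, 60, 180, 235]   # off by 20 everywhere

print(psnr(pixels, close_copy) > psnr(pixels, rough_copy))  # True
```

A reconstruction that stays closer to the original scores higher, which is why PSNR is used to compare how much detail a tokenizer preserves.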
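Here is an analogy that may help. The conventional method is like choosing LEGO bricks from a limited set of colors and shapes. BitDance is like expanding the colors and shapes almost without limit and having the best combination suggested automatically. As a result, lifelike images can be built from fewer bricks, and storage is saved as well. For creators, it points toward the appealing combination of finer control with lighter processing.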
巚倧ãªéžæè¢ãè³¢ãéžã¶ïŒãã€ããªæ¡æ£ãããã®ä»çµã¿
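However, having too many options creates a problem of its own. Picking the "right bit string" out of 2 to the power of 256 possibilities is practically impossible with ordinary prediction methods. So BitDance introduces a new prediction component called the "binary diffusion head."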
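Diffusion models are known for adding noise to an image and then gradually removing it. BitDance applies this idea to bit strings. It makes its predictions in a continuous space (the gray zone between 0 and 1) and then adds a final step that settles each value to 0 or 1. The process resembles a sharp picture gradually emerging from a blurry silhouette.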
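This makes accurate prediction possible even in the enormous space where the conventional "classification head" (the softmax method) struggled. In the paper's experiments, using this head substantially improved sampling accuracy and made the generated images look noticeably more natural. Even without a technical background, you can picture an AI that works toward the best answer a little at a time.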
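The "refine in a continuous space, then settle on 0 or 1" idea can be illustrated with a toy loop. This is a simplified sketch and not the paper's binary diffusion head: `toy_denoiser` is a hypothetical stand-in that already knows the clean values and merely adds a small error, whereas a real model would use a learned network conditioned on the noisy input, the time step, and the preceding tokens. The step count of 8 is also an assumption.

```python
import random

def toy_denoiser(x, t, clean_signal, rng):
    """Hypothetical stand-in for a learned network.

    A real network would use x and t; this one returns an imperfect
    guess of the clean values (which lie at -1.0 or +1.0).
    """
    return [c + rng.uniform(-0.3, 0.3) for c in clean_signal]

def sample_bits(clean_signal, num_steps=8, seed=0):
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in clean_signal]  # start from pure noise
    for i in range(num_steps):
        t = i / num_steps
        guess = toy_denoiser(x, t, clean_signal, rng)
        remaining = num_steps - i
        # Move a fraction of the way toward the current guess;
        # on the final step the values land on the guess itself.
        x = [xi + (gi - xi) / remaining for xi, gi in zip(x, guess)]
    # Final step: settle each continuous value to a hard bit.
    return [1 if xi > 0 else 0 for xi in x]

target_bits = [1, 0, 0, 1, 1, 0, 1, 0]
clean = [1.0 if b else -1.0 for b in target_bits]
print(sample_bits(clean))  # [1, 0, 0, 1, 1, 0, 1, 0]
```

The values start as noise, drift toward the gray-zone guess over several steps, and are rounded to bits only at the very end, so no single step has to pick one option out of 2^256.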
Generating in batches: a dramatic speedup with next-patch diffusion
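BitDance also attacks slow generation at its root with a mechanism called "next-patch diffusion." Conventional AR predicts one piece at a time in order, but in an image, neighboring pieces are strongly related (for example, the blue of the sky is tied to the clouds around it).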
So BitDance predicts in units of "patches" (small groups of blocks), generating 4 or 16 pieces at once in a single step. Thanks to the diffusion head, the relationships among the pieces within a patch are also taken into account.
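As a result, the number of generation steps drops sharply. According to the paper, even a small model with 260M (260 million) parameters delivers better quality than earlier models with 1.4 billion parameters, at 8.7 times the speed. For high-resolution 1024×1024 images, where earlier models took hundreds of seconds, generation finishes in a little over ten seconds. It is like moving from drawing a picture stroke by stroke to filling in whole areas at once.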
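The drop in step count is simple arithmetic, sketched below. The 16x downsampling factor per side is an assumption for this example (it makes a 256×256 image 16×16 = 256 tokens, which lines up with the 256-step and 64-step rows in the comparison table later in this article); `ar_steps` is a hypothetical helper, not a function from the BitDance code.

```python
def ar_steps(image_size, downsample, tokens_per_step):
    """Number of sequential prediction steps needed for a square image."""
    side = image_size // downsample
    num_tokens = side * side
    return -(-num_tokens // tokens_per_step)  # ceiling division

# Assumed 16x downsampling per side.
print(ar_steps(256, 16, 1))    # 256  one token per step
print(ar_steps(256, 16, 4))    # 64   four tokens per step
print(ar_steps(256, 16, 16))   # 16   sixteen tokens per step
print(ar_steps(1024, 16, 1))   # 4096 one token per step at high resolution
print(ar_steps(1024, 16, 16))  # 256  sixteen tokens per step
```

Under these assumptions, a 1024×1024 image needs thousands of sequential steps one token at a time, and predicting a patch of 16 tokens per step cuts that by a factor of 16.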
Together, these changes bring closer a future in which high-quality images can be generated at practical speeds even on smartphones and ordinary PCs.
The real thing: benchmark comparisons and striking results
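BitDance's strength is backed by objective numbers. On ImageNet, a standard benchmark (256×256-pixel image generation), the 1B (1 billion) parameter model achieved an FID score of 1.24. Because lower FID is better, this is the best score reported for an AR model, and it is on par with diffusion models.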
The table below compares BitDance with the main alternatives (a simplified version based on data from the paper).
| Model | Parameters | FID (lower is better) | Generation steps | Throughput (images/s) | Notes |
|---|---|---|---|---|---|
| BitDance-B-4x | 260M | 1.69 | 64 | 24.18 | Small and fast |
| BitDance-H-1x | 1B | 1.24 | 256 | - | Best quality among AR models |
| RandAR-XXL | 1.4B | 2.15 | 88 | 10.39 | Previous SOTA parallel AR |
| VAR-d24 | 1B | 2.09 | 10 | 47.22 | Multi-stage VAR |
| PAR-XXL | 1.4B | 2.35 | 147 | 5.17 | Parallel AR |
As the table shows, BitDance comes out ahead on FID with fewer parameters. The 260M model outperforming 1.4B-parameter models reflects the combined effect of binary tokens and parallel prediction.
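BitDance is also strong at text-to-image (T2I) generation. The 14B (14 billion) parameter model scored 88.28 on DPG-Bench (among the best for AR models) and 0.86 on GenEval. It follows complex instructions faithfully (for example, "a girl playing guitar under a cherry tree"), and its text rendering (such as Japanese on signboards) looks natural. The paper's samples include Doraemon-style illustrations, realistic portraits, and artistic landscapes, at a level of quality likely to appeal to Japanese audiences. A speedup of more than 30 times for 1024×1024 generation is large enough to change what is practical for commercial use.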
From text to high-resolution images: practical examples and possibilities in Japan
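BitDance understands text very well. Because it is built on the large language model Qwen, it also responds accurately to Japanese prompts such as "Tokyo Tower at dusk together with cherry blossoms, realistic photo style." The generated images follow prompts closely and handle spatial arrangement (what is in front and what is behind) and artistic styles well.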
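There are many possible uses in Japan. Anime studios could generate rough character designs quickly. Advertising agencies could prototype product visuals on the spot. Individual creators could bring hobby illustrations closer to professional quality. Applications such as completing CT images in medicine or producing 3D previews in architecture are also conceivable.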
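There is an environmental advantage as well. Because high quality is achieved with less compute, data center power consumption can be reduced, which may contribute to decarbonization. Being open source matters too. Japanese companies and universities can customize the model easily, which should accelerate the development of domestic models.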
The future of AI image generation opened up by BitDance, and what it means for society
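The arrival of BitDance adds a new page to the history of AI image generation. Its greatest achievement is addressing the long-standing "quality or speed" trade-off on both fronts at once, through binary tokens and smarter prediction. It can be seen as the result of ByteDance applying the large-scale data processing know-how it built up with TikTok to image generation.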
If the 14B model is scaled up further, the approach may extend to video generation and 3D model creation. Combined with Japan's strength in anime and manga culture, it might even lead to world-leading creative AI tools originating in Japan. For example, a model built on BitDance and specialized for Japanese-style art could spur many new styles, from ukiyo-e looks to contemporary anime.
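At the same time, the social challenges must not be forgotten. High-quality, high-speed generation may raise the risk of deepfakes. The Japanese government and companies should move quickly on technology for watermarking generated images and on ethical guidelines. Copyright (the rights to works used as training data) is another important issue. BitDance being open source improves transparency, but users should still make a point of using it responsibly.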
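On the positive side, BitDance pushes the "democratization of creativity" further. Quality that only professional designers could achieve until now comes within reach of students, homemakers, and seniors. In schools, art classes could become more enjoyable, and the technology should also help support creative expression by people with disabilities.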
Ultimately, BitDance is a reminder that technology is a tool for freeing human imagination. Just as ByteDance's researchers have opened up a future for "AR foundation models," we hope the next generation of innovators will emerge from Japan too. Why not try the model on GitHub yourself? Once you experience how easily a beautiful image can come to life, you may well be captivated by what AI can do.
The wave of creativity set off by this new technology has only just begun. 2026 may well be remembered in the history of AI image generation as "Year One of BitDance." As everyday users, we too can enjoy this progress while putting it to use wisely.
(Reference: the paper "BitDance: Scaling Autoregressive Generative Models with Binary Tokens")









