• ? Æ»¹û°æ-¸£Çåºê·СºúͬµÄÃÃ×Ó½Ðʲô-Áù±ßÐÎսʿM1X¡ª¡ª·çÇåÑï

    ¼øºÚµ£±£Íø

    ɽÎ÷ÐÂÎÅÍø

    ×îÐÂAPP

    ÈÈÃÅAPP

    • ±ÈÑǵÏÐÂÆ·ÅÆ¡°Áì»ã¡±À´ÁË£¬£¬£¬£¬£¬£¬4¿î³µÐÍÆØ¹â£¡ÖªÇéÈËÊ¿£º×¨¹©´óÅúÁ¿²É¹ºÐèÇó

      ÄêÈë10.74ÒÚ £¬£¬£¬£¬£¬£¬Ã«Àû²»Ê亣µ×ÀÌ £¬£¬£¬£¬£¬£¬ÓÖÒ»ÒþÐιھüÒªIPO

      ÕâÊÇÒ»¸ö¹ØÓÚ AI µ×²ãÂß¼­Öع¹µÄʱ¿Ì¡£¡£ºã¾ÃÒÔÀ´ £¬£¬£¬£¬£¬£¬Transformer ¼Ü¹¹±»À§ÔÚÒ»¸öÌÚ¹óµÄã£ÂÛÖУºÎÒÃÇÓÃ×Å×îÏȽøµÄ GPU ËãÁ¦ £¬£¬£¬£¬£¬£¬È¥Èà AI Ä£×Ó " ËÀ¼ÇÓ²±³ " ÄÇЩ²é×Öµä¾ÍÄÜÖªµÀµÄ¾²Ì¬ÖªÊ¶¡£¡£DeepSeek ÁºÎÄ·æÍŶÓÓëÆä±±´óÏàÖúÕßÔÚ½ñÈÕÆÆÏþÐû²¼µÄÖØ°õÂÛÎÄ¡¶Conditional Memory via Scalable Lookup¡· £¬£¬£¬£¬£¬£¬³¹µ×Í»ÆÆÁËÕâÒ»½©¾Ö¡£¡£ËûÃÇÌá³öÁËÒ»ÖÖȫеÄEngram£¨Ó¡¼££©Ä£¿£¿£¿£¿ £¿ £¿é £¬£¬£¬£¬£¬£¬ÔڹŰåµÄ " Ìõ¼þÅÌËã "£¨MoE£©Ö®Íâ £¬£¬£¬£¬£¬£¬¿ª·¢Á˵ڶþÌõÏ£º±»¯Õ½Ïß¡ª¡ª" Ìõ¼þÓ°Ïó "¡£¡£Õâ²»µ«ÊÇÒ»´ÎÊÖÒÕÐÞ²¹ £¬£¬£¬£¬£¬£¬¶øÊÇÒ»³¡¹ØÓÚÄ£×Ó " ÄÔÈÝÁ¿ " µÄ¹©Ó¦²àˢС£¡£Ëü֤ʵÎú£ºµ±ÎÒÃǽ« " Ó°Ïó " ´Ó " ÅÌËã " ÖаþÀë £¬£¬£¬£¬£¬£¬°Ñ¸Ã±³µÄ½»¸ø " ×Öµä " £¬£¬£¬£¬£¬£¬°Ñ¸ÃËãµÄ½»¸ø´óÄÔ £¬£¬£¬£¬£¬£¬AI µÄÍÆÀíÄÜÁ¦½«Ó­À´·´Ö±¾õµÄ±¬·¢Ê½ÔöÌí¡£¡£DeepSeek ÍýÏëÔÚ 2 Ô´º½ÚǰºóÕýʽÐû²¼ V4 £¬£¬£¬£¬£¬£¬¶øÕâÒ»¿Ì»òÐí¾ÍÊÇ DeepSeek V4 ½µÉúµÄǰҹ¡£¡£ ÐòÕ£ºÁù²ãÉñ¾­ÍøÂçµÄ " ÎÞÓù¦ "¹ÊÊÂµÄÆðµã £¬£¬£¬£¬£¬£¬Ô´ÓÚ DeepSeek ÍÅ¶Ó¶Ô Transformer ÄÚ²¿ÔË×÷»úÖÆµÄÒ»´Î " ºË´Å¹²Õñ " ɨÃè¡£¡£ÔÚÈ˹¤ÖÇÄܵĺںÐ×ÓÀï £¬£¬£¬£¬£¬£¬µ±´óÄ£×Ó¿´µ½ "Diana, Princess of Wales"£¨´÷°²ÄÈ £¬£¬£¬£¬£¬£¬Íþ¶ûÊ¿Íõåú£©Õâ¸ö¶ÌÓïʱ £¬£¬£¬£¬£¬£¬ËüµÄÄÚ²¿±¬·¢ÁËÒ»³¡ÁîÈ˷ѽâÇÒ¼«ÆäÌÚ¹óµÄ " ÄÚÚ§ "¡£¡£Ñо¿Ö°Ô±·¢Ã÷ £¬£¬£¬£¬£¬£¬ÎªÁËʶ±ðÕâ¸öÀο¿µÄʵÌå £¬£¬£¬£¬£¬£¬Ä£×Ó¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÍøÂ磺µÚ 1-2 ²ã£ºÄ£×Ó»¹ÔÚ×ÁÄ¥ "Wales" »òÐíÊÇÒ»¸ö¹ú¼Ò£»£»£»£»µÚ 3 ²ã£ºËüÒâʶµ½ÕâÊÇÅ·ÖÞµÄÒ»¸öµØÀí¿´·¨£»£»£»£»µÚ 4 ²ã£ºËü×îÏÈÆ´¼¯³ö "Princess of Wales" ËÆºõÊÇÒ»¸öÍ·ÏΣ»£»£»£»µÚ 5 ²ã£ºËüåÚÏëµ½ÁË " Íþ¶ûÊ¿Ç×ÍõµÄÆÞ×Ó "£»£»£»£»µÚ 6 ²ã£ºÖ±µ½ÕâÀï £¬£¬£¬£¬£¬£¬Ëü²ÅÖÕÓÚÈ·ÈÏ £¬£¬£¬£¬£¬£¬ÕâÊÇÖ¸ÄÇÎ»ÖøÃûµÄ " ´÷°²ÄÈÍõåú "¡£¡£ÔÚһλ׷Çó¼«ÖÂЧÂʵļܹ¹Ê¦ÑÛÖÐ £¬£¬£¬£¬£¬£¬Õâ¼òÖ±ÊÇËãÁ¦µÄ±©éåÌìÎï¡£¡£" ´÷°²ÄÈÍõåú " ÊÇÒ»¸ö¿Í¹Û±£´æµÄ¡¢¾²Ì¬µÄʵÌå £¬£¬£¬£¬£¬£¬Ëü²»»áÓÉÓÚÉÏÏÂÎĵÄת±ä¶ø¸Ä±äÆäʵÖÊ¡£¡£ÎªÁËÌáÈ¡Õâ¸öÔ­À´²é×Öµä¾ÍÄÜÖªµÀµÄÊÂʵ £¬£¬£¬£¬£¬£¬Transformer ¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÉî¶ÈµÄÌÚ¹ó¾ØÕóÔËËãÈ¥ " ÖØÐÞ " Õâ¸ö¿´·¨¡£¡£Õâ¾ÍÏñÊÇÒ»¸ö¾øÊÀÌì²Å £¬£¬£¬£¬£¬£¬ÔÚÈ¥½â¾ö΢»ý·ÖÄÑÌâ֮ǰ £¬£¬£¬£¬£¬£¬Ã¿´Î¶¼µÃÏÈ»¨°ëСʱĬдһ±é¾Å¾Å³Ë·¨±í¡£¡£ ÕâÖÖ " ÒþʽӰÏó " µÄ»úÖÆ £¬£¬£¬£¬£¬£¬ÆÈʹģ×Ó½«Ãû¹óµÄ²ÎÊýÈÝÁ¿ºÍÍøÂçÉî¶È £¬£¬£¬£¬£¬£¬ÆÌÕÅÔÚÁ˼òÆÓµÄģʽƥÅäÉÏ¡£¡£DeepSeek ÔÚÕâÆª³¤´ï 33 Ò³µÄÂÛÎÄÖÐ £¬£¬£¬£¬£¬£¬Ìá³öÁËÒ»¸öÖ±»÷Áé»êµÄ¿½ÎÊ£ºÎªÊ²Ã´²»Ö±½Ó¸ø´óÄ£×ÓÅäÒ»±¾¿ÉÒÔËæ²éËæÓÃµÄ " ³¬µÈ×Öµä "£¿£¿£¿£¿ £¿ £¿ µÚÒ»Õ£º¼Ü¹¹ÖØËÜ¡ª¡ª Engram Ä£¿£¿£¿£¿ £¿ £¿éµÄ±©Á¦ÃÀѧΪÏàʶ¾öÕâ¸öÎÊÌâ £¬£¬£¬£¬£¬£¬DeepSeek Ìá³öÁËÒ»ÖÖÃûΪ "Engram£¨Ìõ¼þÓ°Ïó£©" µÄÈ«ÐÂÄ£¿£¿£¿£¿ £¿ £¿é¡£¡£ÈôÊÇ˵ MoE£¨»ìÏýר¼ÒÄ£×Ó£©ÊÇ°Ñ " ´óÄÔ " ·Ö³ÉÁ˲î±ðµÄÇøÓò £¬£¬£¬£¬£¬£¬Èòî±ðµÄר¼ÒÈÏÕæ²î±ðµÄ˼Ë÷£¨Ìõ¼þÅÌË㣩£»£»£»£»ÄÇô Engram ¾ÍÊǸø´óÄÔÍâ¹ÒÁËÒ»¸öÖØ´óµÄ " º£ÂíÌå " £¬£¬£¬£¬£¬£¬×¨ÃÅÈÏÕæ´æ´¢¾²Ì¬ÖªÊ¶£¨Ìõ¼þÓ°Ï󣩡£¡£1. ¸´Éú "N-gram"£º´Ó¹ÅÀÏÖÇ»ÛÖÐѰÕÒÃÕµ×Engram µÄ½¹µãÁé¸Ð £¬£¬£¬£¬£¬£¬¾¹È»À´×ÔÓÚ NLP£¨×ÔÈ»ÓïÑÔ´¦Àí£©ÁìÓòµÄ " ÉϹÅÉñÆ÷ " ¡ª¡ª N-gram¡£¡£ÔÚÉî¶ÈѧϰͳÖÎÌìÏÂ֮ǰ £¬£¬£¬£¬£¬£¬ÎÒÃǾÍÊÇ¿¿Í³¼Æ "N ¸ö´Êͬʱ·ºÆðµÄ¸ÅÂÊ " À´Ã÷È·ÓïÑԵġ£¡£DeepSeek ½«ÕâÒ»¾­µä¿´·¨¾ÙÐÐÁËÏÖ´ú»¯µÄħ¸Ä£º¹Å°åµÄ Transformer£ºÖªÊ¶ÊèÉ¢ÔÚÉñ¾­ÔªµÄÈ¨ÖØ£¨Weights£©Àï £¬£¬£¬£¬£¬£¬Ìáȡ֪ʶÐèÒª¾­ÓÉÖØ´óµÄÏßÐÔ²ãÅÌËã £¬£¬£¬£¬£¬£¬ÖØÆ¯ºó¸ß¡£¡£Engram Ä£¿£¿£¿£¿ £¿ £¿é£ºËüÊÇÒ»¸öÖØ´óµÄ¡¢¿ÉÀ©Õ¹µÄǶÈë±í£¨Embedding Table£©¡£¡£µ±Ä£×Ó¶Áµ½ " ÕÅÖÙ¾° " »òÕß " ËÄ´ó·¢Ã÷ " ÕâÖÖÀο¿´îÅ䣨N-gram£©Ê± £¬£¬£¬£¬£¬£¬²»ÐèÒª¶¯ÓôóÄÔÆ¤²ãÈ¥ÍÆÀí £¬£¬£¬£¬£¬£¬Ö±½Óͨ¹ý¹þÏ£Ë÷Òý £¬£¬£¬£¬£¬£¬ÔÚÄÚ´æ±íÖÐ " ²é " ³ö¶ÔÓ¦µÄÏòÁ¿¡£¡£ÕâÒ»Àú³ÌµÄʱ¼äÖØÆ¯ºóÊÇO ( 1 ) ¡ª¡ªÕâÒâζ×ÅÎÞÂÛ֪ʶ¿âÅòÕ͵½¶à´ó£¨ÄÄÅÂÊÇ 1000 ÒÚ²ÎÊý£© £¬£¬£¬£¬£¬£¬²éÕÒËÙÂÊÏÕЩÎÈ¹Ì £¬£¬£¬£¬£¬£¬ÇÒ¼«¿ì¡£¡£2. Èý´óÊÖÒÕ»¤³ÇºÓ¼ÈÈ»²é±íÕâôºÃ £¬£¬£¬£¬£¬£¬ÎªÊ²Ã´ÒÔǰûÈË×ö£¿£¿£¿£¿ £¿ £¿ÓÉÓÚÓÐÈý¸öÀ¹Â·»¢£º´æ´¢±¬Õ¨¡¢¶àÒå´Ê³åÍ»¡¢²ÎÊý·ÖÅä¡£¡£DeepSeek ¸ø³öÁ˽̿ÆÊé¼¶µÄ½â¾ö·½°¸£ºA. ´Ê±íѹËõ£º¼«ÖµÄÈ¥ÖØÌìÏÂÉϵĴÊ×é×éºÏÊÇÌìÎÄÊý×Ö¡£¡£DeepSeek Ê×ÏÈ×öÁËÒ»²½ " ÎÞËðѹËõ "¡£¡£ÔÚ·Ö´ÊÆ÷£¨Tokenizer£©²ãÃæ £¬£¬£¬£¬£¬£¬Ëü½«ÓïÒåÏàͬµ«Ð´·¨²î±ðµÄ´Ê¾ÙÐÐÁ˹éÒ»»¯¡£¡£ÀýÈç £¬£¬£¬£¬£¬£¬"Apple"£¨Ê××Öĸ´óд£©ºÍ "apple"£¨Ð¡Ð´£©ÔÚÓïÒåÉÏͨ³£Ö¸Í³Ò»¸ö¹¤¾ß¡£¡£Í¨¹ýÓ³ÉäºÏ²¢ £¬£¬£¬£¬£¬£¬ÓÐÓôʱíÖ±½ÓËõСÁË 23%¡£¡£Õâ²»µ«½ÚÔ¼ÁË¿Õ¼ä £¬£¬£¬£¬£¬£¬¸üÈÃ֪ʶµÄÃܶȴó·ùÌáÉý¡£¡£B. ¶àÍ·¹þÏ££º½â¾ö " ¹þÏ£³åÍ» "²»¿ÉÄܰÑËùÓÐ N-gram ¶¼´æÏÂÀ´¡£¡£Engram ʹÓÃÁË " ¶àÍ·¹þÏ££¨Multi-Head Hashing£©" ÊÖÒÕ¡£¡£Í¨¹ý¶à¸ö¹þÏ£º¯Êý £¬£¬£¬£¬£¬£¬½«ÎÞÏÞµÄ N-gram Ó³Éäµ½ÓÐÏÞµÄÄÚ´æ²ÛλÖС£¡£ËäÈ»»áÓйþÏ£³åÍ»£¨¼´Á½¸ö²î±ðµÄ´Ê±»Ó³Éäµ½ÁËͳһ¸öλÖã© £¬£¬£¬£¬£¬£¬µ«Í¨¹ý " ¶àÍ· " Éè¼Æ £¬£¬£¬£¬£¬£¬Ä£×Ó¿ÉÒÔ´Ó¶à¸öºòѡЧ¹ûÖÐÆ´¼¯³ö׼ȷµÄÐÅÏ¢ £¬£¬£¬£¬£¬£¬¼«´óµØÌá¸ßÁ˳°ôÐÔ¡£¡£C. ÉÏÏÂÎÄÃſأº¸øÓ°ÏóÅä¸ö " ²ÃÅÐ "ÕâÊÇ×ÃîµÄÒ»±Ê¡£¡£²é±íÊÇËÀµÄ £¬£¬£¬£¬£¬£¬ÓïÑÔÊÇ»îµÄ¡£¡£ºÃ±È " Æ»¹û " Õâ¸ö´Ê¡£¡£ÔÚ " ³ÔÆ»¹û " µÄÓᄈϠ£¬£¬£¬£¬£¬£¬Ëüָˮ¹û£»£»£»£»ÔÚ " Æ»¹ûÐû²¼»á " µÄÓᄈϠ£¬£¬£¬£¬£¬£¬ËüÖ¸¿Æ¼¼¹«Ë¾¡£¡£Ö±½Ó²é±í¿ÉÄÜ»áÒýÈëÔëÉù¡£¡£DeepSeek Éè¼ÆÁËÒ»¸ö " ÉÏÏÂÎĸÐÖªÃÅ¿Ø "£¨Context-aware Gating£©¡£¡£Query£¨ÅÌÎÊ£©£ºÄ¿½ñÉÏÏÂÎĵÄÒþ²Ø×´Ì¬£¨Hidden State£©¡£¡£Key/Value£¨¼üÖµ£©£º²é±í»ñµÃµÄ¾²Ì¬ÏòÁ¿¡£¡£Õâ¸öÃſؾÍÏñÒ»¸ö²ÃÅС£¡£ÈôÊDzé³öÀ´µÄ " ¾²Ì¬ÖªÊ¶ " ºÍÄ¿½ñµÄ " ÉÏÏÂÎÄ " ²»´î £¬£¬£¬£¬£¬£¬²ÃÅоͻá°ÑÈ¨ÖØÑ¹µÍ£¨Gate ÖµÇ÷Ïò 0£© £¬£¬£¬£¬£¬£¬ÈÃÄ£×ÓºöÂÔÕâ¸öÔëÉù£»£»£»£»ÈôÊÇÍêÉÆÆõºÏ£¨ºÃ±È " É˺®ÔÓ²¡ÂÛ " ºóËæ×Å " ÕÅÖÙ¾° "£© £¬£¬£¬£¬£¬£¬²ÃÅоͻá°Ñ´óÃÅ·­¿ª£¨Gate ÖµÇ÷Ïò 1£© £¬£¬£¬£¬£¬£¬Ö±½Ó°Ñ֪ʶעÈëÄ£×Ó¡£¡£ µÚ¶þÕ£º»Æ½ð±ÈÀý¡ª¡ª·¢Ã÷ AI Ä£× "U ÐÍÇúÏß "¼Ü¹¹Éè¼ÆºÃÁË £¬£¬£¬£¬£¬£¬½ÓÏÂÀ´µÄÎÊÌâÊÇ£ºÔõô·Ö¾Ó²ú£¿£¿£¿£¿ £¿ £¿¼ÙÉèÎÒÃÇÏÔ¿¨ÀïµÄÏÔ´æÊÇÓÐÏÞµÄ £¬£¬£¬£¬£¬£¬×ܲÎÊýÔ¤ËãÒ²ÊÇÀο¿µÄ¡£¡£ÎÒÃÇÓ¦¸Ã°Ñ¼¸¶à²ÎÊý·ÖÅ䏸 MoE µÄ " ר¼Ò "£¨ÈÏÕæÅÌË㣩 £¬£¬£¬£¬£¬£¬¼¸¶à²ÎÊý·ÖÅ䏸 Engram µÄ " ×Öµä "£¨ÈÏÕæÓ°Ï󣩣¿£¿£¿£¿ £¿ £¿ÕâÊÇÒ»¸öµä·¶µÄ×ÊÔ´ÉèÖò©ÞÄ¡£¡£DeepSeek ÍŶӾÙÐÐÁËÒ»³¡´ó¹æÄ£µÄÏûÈÚʵÑé £¬£¬£¬£¬£¬£¬É¨ÃèÁË´Ó 0% µ½ 100% µÄ·ÖÅä±ÈÀý £¬£¬£¬£¬£¬£¬Ð§¹û»­³öÁËÒ»ÌõÍêÉÆµÄ "U ÐÍ Scaling Law ÇúÏß "¡£¡£ÕâÕÅͼչÏÖÁË AI Ä£×ÓÉè¼ÆµÄµ×²ã¼ÍÂÉ£º×ó²à¼«¶Ë£¨´¿ Engram£©£ºÈôÊǰѲÎÊýÈ«¸ø×Öµä £¬£¬£¬£¬£¬£¬Loss ºÜ¸ß¡£¡£ÓÉÓÚÄ£×ÓÄð³ÉÁË " Êé°×³Õ " £¬£¬£¬£¬£¬£¬¹âÓÐËÀ¼ÇÓ²±³ £¬£¬£¬£¬£¬£¬Ã»ÓÐÂß¼­ÍÆÀíÄÜÁ¦¡£¡£ÓҲ༫¶Ë£¨´¿ MoE£©£ºÈôÊǰѲÎÊýÈ«¸ø×¨¼Ò £¬£¬£¬£¬£¬£¬Loss Ò²ºÜ¸ß¡£¡£ÓÉÓÚר¼ÒÃDZ»ÆÈ°Ñ¾«Éñ¶¼»¨ÔÚ±³Ê飨ӰÏó¾²Ì¬ÖªÊ¶£©ÉÏ £¬£¬£¬£¬£¬£¬Ã»¿Õ¸ÉÕýÊ¡£¡£»£»£»£»Æ½ðÖ§½âµã£¨¦Ñ ¡Ö 75%-80%£©£ºµ±ÎÒÃǽ«Ô¼20%-25% µÄÏ£º±²ÎÊýÔ¤Ëã·Ö¸ø Engram £¬£¬£¬£¬£¬£¬Ê£Ïµĸø MoE ʱ £¬£¬£¬£¬£¬£¬Ä£×ÓµÄÑéÖ¤¼¯ Loss ½µµ½ÁË×îµÍµã¡£¡£ÕâÊÇÒ»¸ö¼«¾ßÖ¸µ¼ÒâÒåµÄ·¢Ã÷£º¹ØÓÚ¼¸°ÙÒÚ²ÎÊýµÄ´óÄ£×ÓÀ´Ëµ £¬£¬£¬£¬£¬£¬´¿´â¶ÑÆöÅÌË㵥루MoE ר¼Ò£©ÒѾ­ÊDZ߼ÊЧӦµÝ¼õÁË £¬£¬£¬£¬£¬£¬±ØÐèÒýÈëרÃŵľ²Ì¬Ó°ÏóÄ£¿£¿£¿£¿ £¿ £¿éÀ´ÊµÏÖ " ´æËãÆ½ºâ "¡£¡£ µÚÈýÕ£º·´Ö±¾õµÄ±¬·¢¡ª¡ªÎªÊ²Ã´ " ²é×Öµä " ÄÜÌá¸ß " ÊýѧЧ¹û "£¿£¿£¿£¿ £¿ £¿ÈôÊÇ Engram ½ö½öÊÇÈÃÄ£×Ó " ¼ÇÐÔ¸üºÃ " £¬£¬£¬£¬£¬£¬ÕâÆªÂÛÎĵķÖÁ¿»¹È±·¦ÒÔÕð¾ªÉçÇø¡£¡£ÊÂʵ £¬£¬£¬£¬£¬£¬RAG£¨¼ìË÷ÔöÇ¿ÌìÉú£©Ò²Äܽâ¾ö֪ʶÎÊÌâ¡£¡£ÕæÕýÈÃÒµ½ç¸ÐÓ¦Õðº³µÄ £¬£¬£¬£¬£¬£¬ÊÇʵÑéЧ¹ûÖÐÄÇЩÒâÁÏÖ®ÍâµÄÊÕÒæ¡£¡£DeepSeek ¹¹½¨ÁËÈý¸ö±ÈÕÕÄ£×Ó £¬£¬£¬£¬£¬£¬ÑÏ¿á¿ØÖÆ¼¤»î²ÎÊýÄ¿£¨3.8B£©ºÍѵÁ·Êý¾ÝÁ¿£¨262B tokens£©ÍêȫһÖ£ºDense-4B£º¹Å°åµÄŨÃÜÄ£×Ó¡£¡£MoE-27B£º´¿ MoE Ä£×Ó£¨72 ¸öר¼Ò£©¡£¡£Engram-27B£º»ìÏýÄ£×Ó£¨55 ¸öר¼Ò + 5.7B Engram ²ÎÊý£©¡£¡£Ð§¹ûÁîÈË´óµøÑÛ¾µ£º1. ÒâÁÏÖ®ÖУºÖªÊ¶ÀàʹÃü°Ô°ñÔÚ MMLU£¨×ÛºÏ֪ʶ£©ÉÏ £¬£¬£¬£¬£¬£¬Engram Ä£×ÓÌáÉýÁË3.4 ·Ö£»£»£»£»ÔÚ CMMLU£¨ÖÐÎÄ֪ʶ£©ÉÏ £¬£¬£¬£¬£¬£¬ÌáÉýÁË4.0 ·Ö¡£¡£ÕâºÜºÃÃ÷È· £¬£¬£¬£¬£¬£¬Íâ¹ÒÁË×Öµä £¬£¬£¬£¬£¬£¬ÖªÊ¶×ÔÈ»¸üºÃÁË £¬£¬£¬£¬£¬£¬»Ã¾õ¸üÉÙÁË¡£¡£2. ÒâÁÏÖ®Í⣺Âß¼­¡¢´úÂë¡¢ÊýѧÖÜÈ«±©Õǰ´Àí˵ £¬£¬£¬£¬£¬£¬" ²é×Öµä " ºÍ " ×öÊýѧÌâ " û¹ØÏµ¡£¡£µ«ÔÚ BBH£¨×ÛºÏÍÆÀí£©ÉÏ £¬£¬£¬£¬£¬£¬Engram-27B ¾¹È»±Èͬ²ÎÊýµÄ´¿ MoE »ùÏßÌáÉýÁËÕûÕû5.0 ·Ö£¡MATH£¨Êýѧ£©£ºÌáÉý2.4 ·Ö¡£¡£HumanEval£¨´úÂëÌìÉú£©£ºÌáÉý3.0 ·Ö¡£¡£ARC-Challenge£¨ÖØ´óÍÆÀí£©£ºÌáÉý3.7 ·Ö¡£¡£3. Éî¶ÈÆÊÎö£ºÓÐÓÃÉî¶È£¨Effective Depth£©ÀíÂÛΪʲô£¿£¿£¿£¿ £¿ £¿Ò»¸ö " ËÀ¼ÇÓ²±³ " µÄÄ£¿£¿£¿£¿ £¿ £¿é £¬£¬£¬£¬£¬£¬ÎªÊ²Ã´ÄÜÌá¸ßÖÇÉÌ£¿£¿£¿£¿ £¿ £¿DeepSeek ÍŶÓʹÓÃLogitLensºÍ "CKA£¨ÖÐÐÄºË¶ÔÆë£©" ÊÖÒÕ £¬£¬£¬£¬£¬£¬¶ÔÄ£×ÓÄÚ²¿¾ÙÐÐÁË " ÆÊ½â "¡£¡£ËûÃÇ·¢Ã÷ÁËÒ»¸ö¾ªÈ˵ÄÕ÷Ï󣺻¹¼ÇµÃ¿ªÍ·µÄ " ´÷°²ÄÈÍõåú " Â𣿣¿£¿£¿ £¿ £¿ÔÚ´¿ MoE Ä£×ÓÖÐ £¬£¬£¬£¬£¬£¬Ç°¼¸²ãÍøÂç¶¼ÔÚæ×Å " Æ´¼¯¿´·¨ "¡£¡£¶øÔÚ Engram Ä£×ÓÖÐ £¬£¬£¬£¬£¬£¬ÓÉÓÚµÚ 2 ²ã¾Í²åÈëÁË Engram Ä£¿£¿£¿£¿ £¿ £¿é £¬£¬£¬£¬£¬£¬¾²Ì¬ÖªÊ¶µÄ¼ìË÷ÔÚ¼«ÔçµÄ½×¶Î¾ÍÍê³ÉÁË¡£¡£ÕâÒâζ×Å £¬£¬£¬£¬£¬£¬Ô­±¾ÓÃÓÚ " ËÀ¼ÇÓ²±³ " µÄǰ¼¸²ãÍøÂç±»½â·ÅÁË£¡ÕâÏ൱ÓÚ¸øÄ£×Ó " ÐéÔö " ÁËÉî¶È¡£¡£ ÄÇЩ±»ÊͷųöÀ´µÄÍøÂç²ãºÍ×¢ÖØÁ¦Í·£¨Attention Heads£© £¬£¬£¬£¬£¬£¬²»ÔÙÐèÒª´¦ÀíààËյľֲ¿ÒÀÀµ£¨ºÃ±Èʶ±ð " ÕÅÖÙ¾° " ÊÇË­£© £¬£¬£¬£¬£¬£¬´Ó¶ø¿ÉÒÔÈ«Éñ¹á×¢µØÍ¶Èëµ½¸üÖØ´óµÄÈ«¾ÖÍÆÀí¡¢³¤³ÌÂß¼­¹¹½¨ºÍ´úÂëÂß¼­ÌìÉúÖÐÈ¥¡£¡£Engram µÄʵÖÊ £¬£¬£¬£¬£¬£¬²»ÊÇ " Ìæ»» " ÍÆÀí £¬£¬£¬£¬£¬£¬¶øÊÇͨ¹ý " ·ÖÁ÷ " ÔÓ»î £¬£¬£¬£¬£¬£¬ÈôóÄÔרעÓÚ¸ü¸ßά¶ÈµÄ˼Ë÷¡£¡£ µÚËÄÕ£º¹¤³ÌÊÂÒµ¡ª¡ªÍ»ÆÆÓ¢Î°´ïµÄ " ÏÔ´æ°ÔȨ "¹ØÓÚ»ª¶û½ÖµÄͶ×ÊÕߺÍËãÁ¦ÖÐÐĵÄÔËάÕßÀ´Ëµ £¬£¬£¬£¬£¬£¬ÕâÆªÂÛÎÄ×îÐԸеĵط½²»ÔÚÓÚ Score £¬£¬£¬£¬£¬£¬¶øÔÚÓÚCost£¨±¾Ç®£©¡£¡£ÔÚ AI ʱ´ú £¬£¬£¬£¬£¬£¬×îÌÚ¹óµÄ×ÊÔ´²»ÊÇËãÁ¦£¨FLOPs£© £¬£¬£¬£¬£¬£¬¶øÊÇÏԴ棨HBM£©¡£¡£Ó¢Î°´ï H100 Ö®ÒÔÊÇ¹ó £¬£¬£¬£¬£¬£¬ºÜºéÁ÷ƽÉÏÊÇÓÉÓÚÄÇϡȱµÄ HBM3e ÄÚ´æ¡£¡£¶ø Engram ´øÀ´ÁËÒ»¸öÇ㸲ÐÔµÄÌØÕ÷£º³¹µ×µÄ´æËãÊèÉ¢¡£¡£1. MoE µÄÍ´µã£ºÏÔ´æÍÌÊÉÕ߹ŰåµÄ MoE Ä£×Ó £¬£¬£¬£¬£¬£¬Æä·ÓÉ»úÖÆ£¨Routing£©ÊǶ¯Ì¬µÄ¡£¡£Ä£×Ó±ØÐèÏÈËã³öÄ¿½ñ Token µÄÌØÕ÷ £¬£¬£¬£¬£¬£¬ËãÍêÕâÒ»²ã £¬£¬£¬£¬£¬£¬²ÅÖªµÀÏÂÒ»²ã¸ÃÕÒÄĸöר¼Ò¡£¡£ÕâÒâζ×Å £¬£¬£¬£¬£¬£¬ËùÓеÄר¼ÒÄ£×Ó±ØÐèʱ¿ÌÔÚÌÚ¹óµÄ GPU ÏÔ´æÀï´ýÃü £¬£¬£¬£¬£¬£¬Ëæ½ÐËæµ½¡£¡£2. Engram µÄÍ»ÆÆ£ºÈ·¶¨µÄÔ¤ÖªEngram µÄ²é±íÂß¼­ÊÇÈ·¶¨ÐԵġ£¡£Ö»ÒªÊäÈëµÄÎı¾È·¶¨ÁË£¨ºÃ±È "A New Axis of Sparsity"£© £¬£¬£¬£¬£¬£¬ÄÇôËü¶ÔÓ¦µÄ N-gram Ë÷Òý¾ÍÈ·¶¨ÁË¡£¡£ÎÒÃÇ»ù´¡²»ÐèÒªµÈÄ£×ÓËãÍêǰһ²ã £¬£¬£¬£¬£¬£¬ÔÚ Token ½øÈëÄ£×ÓµÄÄÇһ˲¼ä £¬£¬£¬£¬£¬£¬ÎÒÃǾÍÖªµÀËüÐèÒª²éÄÄÕűíµÄÄÄÒ»ÐС£¡£3. CPU µÄÄæÏ®£º°Ñ´óÄ£×ÓÈû½øÄÚ´æÌõÕâÒ»ÌØÕ÷´øÀ´ÁËÖØ´óµÄ¹¤³ÌÓ¯Àû£ºÐ¶ÔØ£¨Offload£©£ºÎÒÃÇ¿ÉÒ԰Ѽ¸°ÙÒÚ¡¢ÉõÖÁÉÏǧÒÚ²ÎÊýµÄ Engram ´Ê±í £¬£¬£¬£¬£¬£¬Ö±½ÓÈÓµ½×ÔÖÆ¡¢Á¿´ó¡¢Ò×À©Õ¹µÄ "CPU Äڴ棨DRAM£©" Àï £¬£¬£¬£¬£¬£¬ÉõÖÁ·ÅÔÚ NVMe SSD ÉÏ¡£¡£Ô¤È¡£¡£¨Prefetching£©£ºÔÚ GPU Æ´ÃüÅÌËãǰһ²ã Transformer µÄʱ¼ä £¬£¬£¬£¬£¬£¬CPU ʹÓà PCIe ͨµÀ £¬£¬£¬£¬£¬£¬Òì²½µØ°ÑÏÂÒ»²ãÐèÒªµÄÓ°ÏóÊý¾Ý " Ԥȡ " ³öÀ´ £¬£¬£¬£¬£¬£¬ÍÆË͵½ GPU¡£¡£ÑÚÊÎÑÓ³Ù £¬£¬£¬£¬£¬£¬²¢Ðд¦Àí¡£¡£DeepSeek ʵ²âÊý¾ÝÏÔʾ£º×ÝÈ»¹ÒÔØÁË100B£¨Ç§ÒÚ£©²ÎÊýµÄ Engram ±íµ½ CPU ÄÚ´æ £¬£¬£¬£¬£¬£¬Ïà±ÈÓÚ´¿ GPU ÍÆÀí £¬£¬£¬£¬£¬£¬ÍÌÍÂÁ¿µÄϽµ²»µ½ 3%¡£¡£ÕâÊÇÒ»¸öÈÃËùÓÐÓÉÓÚÂò²»µ½ HBM ¶ø½¹ÂǵÄÈË¿ñϲµÄ½áÂÛ¡£¡£ÕâÒâζ×Å £¬£¬£¬£¬£¬£¬Î´À´µÄ´óÄ£×Ó £¬£¬£¬£¬£¬£¬" Ó°ÏóÈÝÁ¿ " ¿ÉÒԵͳÉÍâµØÎÞÏÞÀ©ÕÅ £¬£¬£¬£¬£¬£¬¶ø²»±Ø±»Ó¢Î°´ïµÄÏԴ濨²±×Ó¡£¡£ µÚÎåÕ£º³¤Îı¾µÄʤÀû¡ª¡ª NIAH ²âÊÔµÄÔ¾Éý³ýÁËͨÓÃÍÆÀí £¬£¬£¬£¬£¬£¬Engram ÔÚ³¤Îı¾£¨Long Context£©ÁìÓòµÄÌåÏÖͬÑù֤ʵÎú " ·Ö¹¤ " µÄ¼ÛÖµ¡£¡£ÔÚ³¤Îı¾´¦ÀíÖÐ £¬£¬£¬£¬£¬£¬×¢ÖØÁ¦»úÖÆ£¨Attention£©µÄ´°¿ÚÊÇÓÐÏ޵ġ£¡£ÈôÊÇ×¢ÖØÁ¦±»´ó×ڵľֲ¿ÐÅÏ¢£¨ÈçÀο¿¶Ì

      ÏÂÔØ

    • ÃÀ¹ú¹úÎñÔºÒªÇóÃÀ¹ú¹«ÃñÁ¬Ã¦ÍÑÀëÒÁÀÊ

      ¡°ÈýÖ»ÑòÍøÂ硱Õýʽ¸´²¥ £¬£¬£¬£¬£¬£¬¾ÞϸÑî¸ç²¢Î´ÏÖÉí £¬£¬£¬£¬£¬£¬¿ª²¥4СʱÏúÊÛ¶î25ÍòÔª

      ¹úƹ¶ª¹Ú»ù´¡Ô­Òò£¿£¿£¿£¿ £¿ £¿Ã½ÌåÈË£ºÄе¥ÊÇÏÔ×ŶϵµÅ®µ¥ÊÇÖ»ÓÐ2¸öÈËÄÜ´ò

      ÏÂÔØ

    • ±«Íþ¶ûÔâÐÌÊÂÊÓ²ì£¬£¬£¬£¬£¬£¬ÏÂÒ»²½»áÔõÑù£¿£¿£¿£¿£¿£¿

      ÎïÀíAIÈȳ±ÏµÄÀä˼Ë÷ £¬£¬£¬£¬£¬£¬Ð¡ÐÄ¡°»ªÇ¿±±Ê½¡±AI

      ³¤¾çÑÝÔ±ÔÚ¶Ì¾çÆ¬³¡¾í·èÁË£¿£¿£¿£¿ £¿ £¿2026¶Ì¹ú½ø»¯ÂÛ

      ÏÂÔØ

    • ÎĿ⡰»»Á³¡±£¬£¬£¬£¬£¬£¬ËÑË÷¡°»»Ãü¡±£¬£¬£¬£¬£¬£¬°Ù¶ÈµÄAIÖØ¹¹

      AlphabetÊÐÖµÊ×´ÎÍ»ÆÆ4ÍòÒÚÃÀÔª

      ΪʲôÉîÛÚÓ²¼þȦ¶¼ÔÚ̸ÂÛǧÎÊ£¿£¿£¿£¿ £¿ £¿

      ÏÂÔØ

    • ÃÀ¹úÓéÀÖ½çÕ¨ÁѳóÎÅ£¡ÂíÁú¡¤°×À¼¶ÈÒÉΪÂõ¿Ë¶û¡¤½Ü¿ËÑ·Ó××ÓÉú¸¸

      ÒÁÀÊÍⳤ£ºÕþ¸®ÒÑÓ뿹ÒéÕßÕö¿ª¶Ô»°Ê±ÊÆÏÖÔÚÍêÈ«»ñµÃ¿ØÖÆ

      ÄêÊÕ11ÒÚ £¬£¬£¬£¬£¬£¬¸ßê²Í¶µÄ²Í¾Æ°ÉÒªIPOÁË

      ÏÂÔØ

    • Ó¢¾üºäÕ¨»ú¿ÕÇÚ±»Òâ¾ü·ý²£¬£¬£¬£¬£¬£¬×ªÔËʱ¾¹ÀÖ³ÉÐ®ÖÆ³ðÈË·É»ú£¬£¬£¬£¬£¬£¬»Ø»ùµØ²îµã±»¡°Åç»ð¡±»÷Âä
    • ½á¹¹8ÄêÖÕ¡°»ñÖ¤¡±£¬£¬£¬£¬£¬£¬¡°²£ÄòËáÅ®Íõ¡±É±ÈëÈâ¶¾ËØºìº£

      ÃÀ¹ú¹úÎñÔºÒªÇóÃÀ¹ú¹«ÃñÁ¬Ã¦ÍÑÀëÒÁÀÊ

      ³ÂÃεçÊÓ¾çÊ×Ð㣡¡¶Ð¡³Ç´óÊ¡·²Ëµ¶´òƹÅÒÄëѹ±í¸ç»ÆÏþÃ÷ £¬£¬£¬£¬£¬£¬·´²îÃȾøÁË

      ÏÂÔØ

    • ¸ñÁêÀ¼µº×ÔÖÎÕþ¸®Ðû²¼ÉùÃ÷£¡

      ¹¤ÐŲ¿£ºÊµÑ鹤ҵ»¥ÁªÍøÓëÈ˹¤ÖÇÄÜÈںϸ³ÄÜÐж¯ £¬£¬£¬£¬£¬£¬ÍƸÐÈ˹¤ÖÇÄÜÊÖÒÕÔÚ¹¤ÒµÈ«Á´ÌõÉøÍ¸

      ½çÏßÖÇÄÜÈëÑ¡±ÏÂíÍþÖйú½ðÈڿƼ¼50Ç¿ £¬£¬£¬£¬£¬£¬ÒÔAIÓëÇø¿éÁ´ÊÖÒÕ¸³ÄÜÈ«Çò½ðÈÚÁ¢Òì

      ÏÂÔØ

    • ÒÁÀ­¿Ë±í´ï²É¹ºJF-17¡°À׵硱ս¶·»úµÄÐËÈ¤Éæ¼°¸üÉîÈëµÄ·ÀÎñÏàÖú
    • Èý´Î²¡Î£Í¨Öª£¬£¬£¬£¬£¬£¬Ã»µÈÀ´ÊÂÒµ£¡Õâλ±»Âî¡°Èí·¹ÄС±µÄTVBÃ÷ÐÇ£¬£¬£¬£¬£¬£¬ÔÚ²¡´²±ßÎÕ½ôÁËÍâÆÅµÄÊÖ
    • ½á¹¹8ÄêÖÕ¡°»ñÖ¤¡±£¬£¬£¬£¬£¬£¬¡°²£ÄòËáÅ®Íõ¡±É±ÈëÈâ¶¾ËØºìº£

      Æ»¹ûÓë¹È¸èÊ»ÈëAIÉîË®Çø

      °¢ÐÅÑݳª»áÓÖˤÁË£¡Î⽨ºÀѸËÙ½«ÆäÀ­Æð £¬£¬£¬£¬£¬£¬´ËǰÔÚÉϺ£³¡ÔøË¤ÏÂÎę̀

      ÏÂÔØ

    • Shopify¹É¼ÛÒòAI¹ºÎï¿´·¨´óÕÇ£¬£¬£¬£¬£¬£¬ÏÖÔÚÕýЯÊÖGeminiÓëCopilot¼Ó´óͶÈë

      ¼¯Ò¸³Éô㺵¤ÂóÊÔÉäÒÔÉ«Áп¨³µÅÚ·¨¹úÆï±ø²½¶Ó¼ÓÈëʵս·´¿¹ÑÝϰ

      ¹§Ï²£¡32ËêTVBÐÂÎÅÅ®ÉñÐû²¼Íê»é £¬£¬£¬£¬£¬£¬ÐÂÀÉÒ²ÊÇTVBÖ÷²¥

      ÏÂÔØ

    • ÖØ°õ£¡Íò×Ö³¤ÎÄ£¬£¬£¬£¬£¬£¬ËµÍ¸ÃÀ¾üµÄѸËÙתÐÍ

      ¡°ÔóÁ¬Ë¹»ùÎÞ´¦¿É²Ø¡±¡ª¡ª¶íÂÞ˹¡°é»Ê÷¡±µ¼µ¯Ê¹µØ±¤ÐÎͬÐéÉè

      200Äêǰ £¬£¬£¬£¬£¬£¬Ëû¾ÍÔ¤ÑÔÁËÃÀ¹úµÄÒ°ÐÄ£¿£¿£¿£¿ £¿ £¿

      ÏÂÔØ

    • ¸ÐµÂÁºÐУº·¿µØ²úÊг¡ÉÐÓÐDZÁ¦REITsÀ©Î§¿ªÆô²»¶¯²ú½ðÈÚ»¯Ð½׶Î

      ÎÚ¿ËÀ¼¾ü·½ÏÖÒÛ×î´«ÆæµÄ2B16ÆÈÁñÅÚµ±¹ý20ÄêËêÄî±®µÄËÕÁªÀϹǶ­

      ¼ß-35ûÓв൯²ÖÈ´¼ÓËÙÁ¿²ú £¬£¬£¬£¬£¬£¬¾¿¾¹´æ²»±£´æÖØ´óÕ½Êõ¶Ì°å£¿£¿£¿£¿ £¿ £¿

      ÏÂÔØ

    • ¡¶»ªÎªÄêÖÕ½±·ÖÅä²½·¥¡·£ºÇ¿µ÷Т˳ÀûÈó£¬£¬£¬£¬£¬£¬¿ÉÏòδÀ´½è½±½ð

      ÐÙÑÀÀûKF41¡°É½Ã¨¡±²½×äÕ½³µ¼ÓÈëÑÝϰ³ÉΪ±±Ô¼×îÏȽøµÄ½¾üÖ®Ò»

      ÁºÎÄ·æÊðÃûÐÂÂÛÎÄ£º¸ø´óÄ£×ÓÅä±¾¡°×ֵ䡱 £¬£¬£¬£¬£¬£¬ÅÌËã¡¢Ó°Ïó·Ö¾ÓºóÖÇÉ̱¬±í £¬£¬£¬£¬£¬£¬¾ç͸DeepSeekV4£¿£¿£¿£¿ £¿ £¿

      ÏÂÔØ

    • ¼ß-35ûÓв൯²ÖÈ´¼ÓËÙÁ¿²ú£¬£¬£¬£¬£¬£¬¾¿¾¹´æ²»±£´æÖØ´óÕ½Êõ¶Ì°å£¿£¿£¿£¿£¿£¿
    • ÖØÇ첩Ö÷¡°´ô´ô¡±ÇóÍøÓÑ×ÊÖúɱÖí£¬£¬£¬£¬£¬£¬¶Ì¶ÌÈýÌì·ÛË¿Á¿´Óȱ·¦2000±©ÕÇÖÁ³¬°ÙÍò

      ×Ô±äÁ¿»úеÈË»ñºìɼ¡¢ÉͶµÈ10ÒÚԪͶ×Ê £¬£¬£¬£¬£¬£¬¹¹½¨ÎïÀíÌìÏ»ù´¡Ä£×Ó

      ±±¾©ÏÖ´ú2025ÄêÏúÁ¿´ï21ÍòÁ¾ £¬£¬£¬£¬£¬£¬Î´À´Á½Äê½øÈëÐÂÄÜÔ´÷缯Ͷ·ÅÆÚ

      ÏÂÔØ

    • ¶Ô²»Æð³ÂÐÇÐñ£¬£¬£¬£¬£¬£¬Õâ´Î±»36Ëê´úÐñÃÔµ¹ÁË

      ÃÀ¹úפÒÁ¡°ÐéÄâ´óʹ¹Ý¡±ÔٴκôÓõÃÀ¹«ÃñÍÑÀëÒÁÀÊ

      ɽ¶«ØøÙ³Á©ÊÕµ½²ðǨ¿îºó±»¿ØÕ©Æ­ £¬£¬£¬£¬£¬£¬±»î¿Ñº707ÌìÒòÖ¤¾Ýȱ·¦»ñÊÍ £¬£¬£¬£¬£¬£¬¹úÅâÖÁ½ñδ×ã¶î·¢·Å

      ÏÂÔØ

    • µ¶ÇÐÂøÍ·£¡¡°´ó¿Ú¾¶ÐÂÐÍ»ðÅڰ桱054AÁÙ·Ú½¢¶õ¶û¶à˹½¢¹ÙÐûÁÁÏà

      µ¶ÇÐÂøÍ·£¡¡°´ó¿Ú¾¶ÐÂÐÍ»ðÅڰ桱054AÁÙ·Ú½¢¶õ¶û¶à˹½¢¹ÙÐûÁÁÏà

      71ËêºÓÄÏÉÌÇðÍËÐÝÎ÷ϯ»ð³µÉÏÓö26ÄêǰѧÉú¾«×¼½Ð×ÅÃû×Ö £¬£¬£¬£¬£¬£¬Ñ§Éú¾ÍµØÂäÀá

      ÏÂÔØ

    • ²ÌÉÙ·ÒɹÕÕÇì×£Íê»éÖÜÄê£¬£¬£¬£¬£¬£¬Í¸Â¶ºÍÕŽúÏ໥ÈÌÁ˶Է½18Äê£¬£¬£¬£¬£¬£¬¶÷°®ÒÀÈ»

      ÑøÖíÒµÁ½´ó¾ÞÍ·ÎÕÊÖ £¬£¬£¬£¬£¬£¬ÕæÕýµÄÉúËÀÏ߲ŸշºÆð

      Èû¸¥¶û˹¿ËÆ«ÏòµÄÕ½°Ü £¬£¬£¬£¬£¬£¬ÎÚ¾ü×Ü˾ÁîÎ÷¶û˹»ù £¬£¬£¬£¬£¬£¬ÊÇÔõÑùÚ¹Ê͵ģ¿£¿£¿£¿ £¿ £¿

      ÏÂÔØ

    ±êÇ©Áбí

    ×îÐÂÁôÑÔ

    ÈÈÃÅÊÖÓÎ

    ×ܽáÈ«Íø764ƪЧ¹û

    °üС½ã24СʱÈ˹¤¿Í·þ

    • Öֱ𣺠ÉúÑÄ·þÎñ
    • ¾Þϸ£¡£º 500.556MB
    • ϵͳ£º Android
    • ¸üУº 2026-01-14 00:25
    • ÈËÆø£º 57633
    • ̸ÂÛ£º 8369
    °²×¿ÏÂÔØ

    Ó¦ÓýéÉÜ

    • °üС½ã24СʱÈ˹¤¿Í·þ
    • °üС½ã24СʱÈ˹¤¿Í·þ
    • °üС½ã24СʱÈ˹¤¿Í·þ
    °Ù¶È°ü¹Ü £¬£¬£¬£¬£¬£¬ÎªÄúËÑË÷»¤º½wAAAABJRU5ErkJggg==

    ×î¼Ñ»Ø¸²

    1¡¢°üС½ã24СʱÈ˹¤¿Í·þ?¡ª¡ªTG:Ôݲ»¹ûÕæ¡ª¡ª?????????????????????????

    2¡¢¶«½¼ÉÏÃÅ498ÐÑÄ¿Âï?¡ª¡ªTG:Ôݲ»¹ûÕæ¡ª¡ª??????????????????????????

    3¡¢?ÖØ°õÐÂÎÅÀ´Ï®£¡??°üС½ã24СʱÈ˹¤¿Í·þ-APPÏÂÔØ?Ö§³Ö:winall/win7/win10/win11?ϵͳÀàÐÍ?:°üС½ã24СʱÈ˹¤¿Í·þ(2025ȫվ)×îа汾IOS/°²×¿¹Ù·½Èë¿ÚN.19.90.78(Ç徲ƽ̨)

    4¡¢?¶À¼Ò£¡???°üС½ã24СʱÈ˹¤¿Í·þ-APPÏÂÔØ?Ö§³Ö:winall/win7/win10/win11?ϵͳÀàÐÍ?:°üС½ã24СʱÈ˹¤¿Í·þ(2025ȫվ)×îа汾IOS/°²×¿¹Ù·½Èë¿ÚN.7.71.23(Ç徲ƽ̨)

    u=675039822,165421409&fm=30&app=106&f=JPEG?w=312&h=208&s=0810EC1219C84CCA4CF524DA0300D0B2 u=634527587,165459665&fm=30&app=106&f=JPEG?w=312&h=208&s=5780FCAE5443B8E358A9E0260300E05B u=2921408703,165430589&fm=30&app=106&f=JPEG?w=312&h=208&s=7922FC5B00174A7512146B3F03007056

    Ö©Öë³ØÖеÄ302Ìø×ªÊ¹Óù淶

    ×÷Ϊһ¸öרҵµÄSEOÐÐÒµÕ¾³¤ £¬£¬£¬£¬£¬£¬Ïàʶ²¢ÕÆÎÕÖ©Öë³Ø³ÌÐòµÄÔ­ÀíºÍÓÃ;ÊǺÜÊÇÖ÷ÒªµÄ¡£¡£Ö©Öë³ØÊÇÒ»ÖÖÓÃÓÚÄ£ÄâËÑË÷ÒýÇæÖ©Ö루spider£©ÅÀÈ¡ÍøÒ³µÄ¹¤¾ß £¬£¬£¬£¬£¬£¬Ëü¿ÉÒÔÄ£Äâ¶à¸öÖ©Öëͬʱ·ÃÎÊÍøÕ¾ £¬£¬£¬£¬£¬£¬²¢ÍøÂçÍøÕ¾ÉϵÄÐÅÏ¢¡£¡£ÔÚSEOÓÅ»¯µÈÁìÓò £¬£¬£¬£¬£¬£¬Ö©Öë³Ø³ÌÐò¿ÉÒÔ×ÊÖúÕ¾³¤¸üºÃµØÏàʶËÑË÷ÒýÇæ¶ÔÍøÕ¾µÄ»á¼ûÇéÐÎ £¬£¬£¬£¬£¬£¬´Ó¶ø×ö³öÏìÓ¦µÄÓÅ»¯¡£¡£

    Ö©Öë³Ø³ÌÐòµÄÔ­Àí

    Ö©Öë³Ø³ÌÐòµÄÔ­ÀíÖ÷ÒªÊÇͨ¹ýÄ£Äâ¶à¸öÖ©Öëͬʱ·ÃÎÊÍøÕ¾ £¬£¬£¬£¬£¬£¬ÍøÂçÍøÕ¾ÉϵÄÐÅÏ¢¡£¡£ÔÚÏÖʵ²Ù×÷ÖÐ £¬£¬£¬£¬£¬£¬Õ¾³¤¿ÉÒÔÉèÖÃÖ©Öë³Ø³ÌÐòÄ£Äâ²î±ðËÑË÷ÒýÇæµÄÖ©Öë £¬£¬£¬£¬£¬£¬ºÃ±ÈGoogle¡¢BingµÈ £¬£¬£¬£¬£¬£¬ÒÔ´ËÀ´Ïàʶ²î±ðËÑË÷ÒýÇæ¶ÔÍøÕ¾µÄ»á¼ûÇé¿ö¡£¡£Í¨¹ýÖ©Öë³Ø³ÌÐòÍøÂçµ½µÄÊý¾Ý £¬£¬£¬£¬£¬£¬Õ¾³¤¿ÉÒÔÆÊÎöÍøÕ¾ÔÚËÑË÷ÒýÇæÖеÄÅÅÃûÇéÐΡ¢ÍøÒ³±»Ë÷ÒýµÄÇéÐÎµÈ £¬£¬£¬£¬£¬£¬´Ó¶ø¸üºÃµØ¾ÙÐÐSEOÓÅ»¯¡£¡£

    Ö©Öë³Ø³ÌÐòµÄÓÃ;

    Ö©Öë³Ø³ÌÐòÔÚSEOÓÅ»¯ÖÐÓÐ×ÅÆÕ±éµÄÓÃ;¡£¡£Ê×ÏÈ £¬£¬£¬£¬£¬£¬Í¨¹ýÖ©Öë³Ø³ÌÐò¿ÉÒÔÊÓ²ìËÑË÷ÒýÇæÖ©Öë¶ÔÍøÕ¾µÄ»á¼ûÇéÐÎ £¬£¬£¬£¬£¬£¬****ÏÖÍøÕ¾±»ÆÁ±Î»ò±»½µÈ¨µÄÇéÐΡ£¡£Æä´Î £¬£¬£¬£¬£¬£¬Ö©Öë³Ø³ÌÐò¿ÉÒÔ¼à¿ØÍøÕ¾µÄË÷ÒýÇéÐÎ £¬£¬£¬£¬£¬£¬****ÏÖÄÄЩÒ³ÃæÎ´±»Ë÷Òý»ò±»ÒÅ©¡£¡£×îºó £¬£¬£¬£¬£¬£¬Ö©Öë³Ø³ÌÐò»¹¿ÉÒÔ¸ú×ÙÍøÕ¾Òªº¦´ÊµÄÅÅÃûÇéÐÎ £¬£¬£¬£¬£¬£¬ÊµÊ±µ÷ÕûÓÅ»¯Õ½ÂÔ¡£¡£

    ×îºó

    ÕâÊÇÒ»¸ö¹ØÓÚ AI µ×²ãÂß¼­Öع¹µÄʱ¿Ì¡£¡£ºã¾ÃÒÔÀ´ £¬£¬£¬£¬£¬£¬Transformer ¼Ü¹¹±»À§ÔÚÒ»¸öÌÚ¹óµÄã£ÂÛÖУºÎÒÃÇÓÃ×Å×îÏȽøµÄ GPU ËãÁ¦ £¬£¬£¬£¬£¬£¬È¥Èà AI Ä£×Ó " ËÀ¼ÇÓ²±³ " ÄÇЩ²é×Öµä¾ÍÄÜÖªµÀµÄ¾²Ì¬ÖªÊ¶¡£¡£DeepSeek ÁºÎÄ·æÍŶÓÓëÆä±±´óÏàÖúÕßÔÚ½ñÈÕÆÆÏþÐû²¼µÄÖØ°õÂÛÎÄ¡¶Conditional Memory via Scalable Lookup¡· £¬£¬£¬£¬£¬£¬³¹µ×Í»ÆÆÁËÕâÒ»½©¾Ö¡£¡£ËûÃÇÌá³öÁËÒ»ÖÖȫеÄEngram£¨Ó¡¼££©Ä£¿£¿£¿£¿ £¿ £¿é £¬£¬£¬£¬£¬£¬ÔڹŰåµÄ " Ìõ¼þÅÌËã "£¨MoE£©Ö®Íâ £¬£¬£¬£¬£¬£¬¿ª·¢Á˵ڶþÌõÏ£º±»¯Õ½Ïß¡ª¡ª" Ìõ¼þÓ°Ïó "¡£¡£Õâ²»µ«ÊÇÒ»´ÎÊÖÒÕÐÞ²¹ £¬£¬£¬£¬£¬£¬¶øÊÇÒ»³¡¹ØÓÚÄ£×Ó " ÄÔÈÝÁ¿ " µÄ¹©Ó¦²àˢС£¡£Ëü֤ʵÎú£ºµ±ÎÒÃǽ« " Ó°Ïó " ´Ó " ÅÌËã " ÖаþÀë £¬£¬£¬£¬£¬£¬°Ñ¸Ã±³µÄ½»¸ø " ×Öµä " £¬£¬£¬£¬£¬£¬°Ñ¸ÃËãµÄ½»¸ø´óÄÔ £¬£¬£¬£¬£¬£¬AI µÄÍÆÀíÄÜÁ¦½«Ó­À´·´Ö±¾õµÄ±¬·¢Ê½ÔöÌí¡£¡£DeepSeek ÍýÏëÔÚ 2 Ô´º½ÚǰºóÕýʽÐû²¼ V4 £¬£¬£¬£¬£¬£¬¶øÕâÒ»¿Ì»òÐí¾ÍÊÇ DeepSeek V4 ½µÉúµÄǰҹ¡£¡£ ÐòÕ£ºÁù²ãÉñ¾­ÍøÂçµÄ " ÎÞÓù¦ "¹ÊÊÂµÄÆðµã £¬£¬£¬£¬£¬£¬Ô´ÓÚ DeepSeek ÍÅ¶Ó¶Ô Transformer ÄÚ²¿ÔË×÷»úÖÆµÄÒ»´Î " ºË´Å¹²Õñ " ɨÃè¡£¡£ÔÚÈ˹¤ÖÇÄܵĺںÐ×ÓÀï £¬£¬£¬£¬£¬£¬µ±´óÄ£×Ó¿´µ½ "Diana, Princess of Wales"£¨´÷°²ÄÈ £¬£¬£¬£¬£¬£¬Íþ¶ûÊ¿Íõåú£©Õâ¸ö¶ÌÓïʱ £¬£¬£¬£¬£¬£¬ËüµÄÄÚ²¿±¬·¢ÁËÒ»³¡ÁîÈ˷ѽâÇÒ¼«ÆäÌÚ¹óµÄ " ÄÚÚ§ "¡£¡£Ñо¿Ö°Ô±·¢Ã÷ £¬£¬£¬£¬£¬£¬ÎªÁËʶ±ðÕâ¸öÀο¿µÄʵÌå £¬£¬£¬£¬£¬£¬Ä£×Ó¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÍøÂ磺µÚ 1-2 ²ã£ºÄ£×Ó»¹ÔÚ×ÁÄ¥ "Wales" »òÐíÊÇÒ»¸ö¹ú¼Ò£»£»£»£»µÚ 3 ²ã£ºËüÒâʶµ½ÕâÊÇÅ·ÖÞµÄÒ»¸öµØÀí¿´·¨£»£»£»£»µÚ 4 ²ã£ºËü×îÏÈÆ´¼¯³ö "Princess of Wales" ËÆºõÊÇÒ»¸öÍ·ÏΣ»£»£»£»µÚ 5 ²ã£ºËüåÚÏëµ½ÁË " Íþ¶ûÊ¿Ç×ÍõµÄÆÞ×Ó "£»£»£»£»µÚ 6 ²ã£ºÖ±µ½ÕâÀï £¬£¬£¬£¬£¬£¬Ëü²ÅÖÕÓÚÈ·ÈÏ £¬£¬£¬£¬£¬£¬ÕâÊÇÖ¸ÄÇÎ»ÖøÃûµÄ " ´÷°²ÄÈÍõåú "¡£¡£ÔÚһλ׷Çó¼«ÖÂЧÂʵļܹ¹Ê¦ÑÛÖÐ £¬£¬£¬£¬£¬£¬Õâ¼òÖ±ÊÇËãÁ¦µÄ±©éåÌìÎï¡£¡£" ´÷°²ÄÈÍõåú " ÊÇÒ»¸ö¿Í¹Û±£´æµÄ¡¢¾²Ì¬µÄʵÌå £¬£¬£¬£¬£¬£¬Ëü²»»áÓÉÓÚÉÏÏÂÎĵÄת±ä¶ø¸Ä±äÆäʵÖÊ¡£¡£ÎªÁËÌáÈ¡Õâ¸öÔ­À´²é×Öµä¾ÍÄÜÖªµÀµÄÊÂʵ £¬£¬£¬£¬£¬£¬Transformer ¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÉî¶ÈµÄÌÚ¹ó¾ØÕóÔËËãÈ¥ " ÖØÐÞ " Õâ¸ö¿´·¨¡£¡£Õâ¾ÍÏñÊÇÒ»¸ö¾øÊÀÌì²Å £¬£¬£¬£¬£¬£¬ÔÚÈ¥½â¾ö΢»ý·ÖÄÑÌâ֮ǰ £¬£¬£¬£¬£¬£¬Ã¿´Î¶¼µÃÏÈ»¨°ëСʱĬдһ±é¾Å¾Å³Ë·¨±í¡£¡£ ÕâÖÖ " ÒþʽӰÏó " µÄ»úÖÆ £¬£¬£¬£¬£¬£¬ÆÈʹģ×Ó½«Ãû¹óµÄ²ÎÊýÈÝÁ¿ºÍÍøÂçÉî¶È £¬£¬£¬£¬£¬£¬ÆÌÕÅÔÚÁ˼òÆÓµÄģʽƥÅäÉÏ¡£¡£DeepSeek ÔÚÕâÆª³¤´ï 33 Ò³µÄÂÛÎÄÖÐ £¬£¬£¬£¬£¬£¬Ìá³öÁËÒ»¸öÖ±»÷Áé»êµÄ¿½ÎÊ£ºÎªÊ²Ã´²»Ö±½Ó¸ø´óÄ£×ÓÅäÒ»±¾¿ÉÒÔËæ²éËæÓÃµÄ " ³¬µÈ×Öµä "£¿£¿£¿£¿ £¿ £¿ µÚÒ»Õ£º¼Ü¹¹ÖØËÜ¡ª¡ª Engram Ä£¿£¿£¿£¿ £¿ £¿éµÄ±©Á¦ÃÀѧΪÏàʶ¾öÕâ¸öÎÊÌâ £¬£¬£¬£¬£¬£¬DeepSeek Ìá³öÁËÒ»ÖÖÃûΪ "Engram£¨Ìõ¼þÓ°Ïó£©" µÄÈ«ÐÂÄ£¿£¿£¿£¿ £¿ £¿é¡£¡£ÈôÊÇ˵ MoE£¨»ìÏýר¼ÒÄ£×Ó£©ÊÇ°Ñ " ´óÄÔ " ·Ö³ÉÁ˲î±ðµÄÇøÓò £¬£¬£¬£¬£¬£¬Èòî±ðµÄר¼ÒÈÏÕæ²î±ðµÄ˼Ë÷£¨Ìõ¼þÅÌË㣩£»£»£»£»ÄÇô Engram ¾ÍÊǸø´óÄÔÍâ¹ÒÁËÒ»¸öÖØ´óµÄ " º£ÂíÌå " £¬£¬£¬£¬£¬£¬×¨ÃÅÈÏÕæ´æ´¢¾²Ì¬ÖªÊ¶£¨Ìõ¼þÓ°Ï󣩡£¡£1. ¸´Éú "N-gram"£º´Ó¹ÅÀÏÖÇ»ÛÖÐѰÕÒÃÕµ×Engram µÄ½¹µãÁé¸Ð £¬£¬£¬£¬£¬£¬¾¹È»À´×ÔÓÚ NLP£¨×ÔÈ»ÓïÑÔ´¦Àí£©ÁìÓòµÄ " ÉϹÅÉñÆ÷ " ¡ª¡ª N-gram¡£¡£ÔÚÉî¶ÈѧϰͳÖÎÌìÏÂ֮ǰ £¬£¬£¬£¬£¬£¬ÎÒÃǾÍÊÇ¿¿Í³¼Æ "N ¸ö´Êͬʱ·ºÆðµÄ¸ÅÂÊ " À´Ã÷È·ÓïÑԵġ£¡£DeepSeek ½«ÕâÒ»¾­µä¿´·¨¾ÙÐÐÁËÏÖ´ú»¯µÄħ¸Ä£º¹Å°åµÄ Transformer£ºÖªÊ¶ÊèÉ¢ÔÚÉñ¾­ÔªµÄÈ¨ÖØ£¨Weights£©Àï £¬£¬£¬£¬£¬£¬Ìáȡ֪ʶÐèÒª¾­ÓÉÖØ´óµÄÏßÐÔ²ãÅÌËã £¬£¬£¬£¬£¬£¬ÖØÆ¯ºó¸ß¡£¡£Engram Ä£¿£¿£¿£¿ £¿ £¿é£ºËüÊÇÒ»¸öÖØ´óµÄ¡¢¿ÉÀ©Õ¹µÄǶÈë±í£¨Embedding Table£©¡£¡£µ±Ä£×Ó¶Áµ½ " ÕÅÖÙ¾° " »òÕß " ËÄ´ó·¢Ã÷ " ÕâÖÖÀο¿´îÅ䣨N-gram£©Ê± £¬£¬£¬£¬£¬£¬²»ÐèÒª¶¯ÓôóÄÔÆ¤²ãÈ¥ÍÆÀí £¬£¬£¬£¬£¬£¬Ö±½Óͨ¹ý¹þÏ£Ë÷Òý £¬£¬£¬£¬£¬£¬ÔÚÄÚ´æ±íÖÐ " ²é " ³ö¶ÔÓ¦µÄÏòÁ¿¡£¡£ÕâÒ»Àú³ÌµÄʱ¼äÖØÆ¯ºóÊÇO ( 1 ) ¡ª¡ªÕâÒâζ×ÅÎÞÂÛ֪ʶ¿âÅòÕ͵½¶à´ó£¨ÄÄÅÂÊÇ 1000 ÒÚ²ÎÊý£© £¬£¬£¬£¬£¬£¬²éÕÒËÙÂÊÏÕЩÎÈ¹Ì £¬£¬£¬£¬£¬£¬ÇÒ¼«¿ì¡£¡£2. Èý´óÊÖÒÕ»¤³ÇºÓ¼ÈÈ»²é±íÕâôºÃ £¬£¬£¬£¬£¬£¬ÎªÊ²Ã´ÒÔǰûÈË×ö£¿£¿£¿£¿ £¿ £¿ÓÉÓÚÓÐÈý¸öÀ¹Â·»¢£º´æ´¢±¬Õ¨¡¢¶àÒå´Ê³åÍ»¡¢²ÎÊý·ÖÅä¡£¡£DeepSeek ¸ø³öÁ˽̿ÆÊé¼¶µÄ½â¾ö·½°¸£ºA. ´Ê±íѹËõ£º¼«ÖµÄÈ¥ÖØÌìÏÂÉϵĴÊ×é×éºÏÊÇÌìÎÄÊý×Ö¡£¡£DeepSeek Ê×ÏÈ×öÁËÒ»²½ " ÎÞËðѹËõ "¡£¡£ÔÚ·Ö´ÊÆ÷£¨Tokenizer£©²ãÃæ £¬£¬£¬£¬£¬£¬Ëü½«ÓïÒåÏàͬµ«Ð´·¨²î±ðµÄ´Ê¾ÙÐÐÁ˹éÒ»»¯¡£¡£ÀýÈç £¬£¬£¬£¬£¬£¬"Apple"£¨Ê××Öĸ´óд£©ºÍ "apple"£¨Ð¡Ð´£©ÔÚÓïÒåÉÏͨ³£Ö¸Í³Ò»¸ö¹¤¾ß¡£¡£Í¨¹ýÓ³ÉäºÏ²¢ £¬£¬£¬£¬£¬£¬ÓÐÓôʱíÖ±½ÓËõСÁË 23%¡£¡£Õâ²»µ«½ÚÔ¼ÁË¿Õ¼ä £¬£¬£¬£¬£¬£¬¸üÈÃ֪ʶµÄÃܶȴó·ùÌáÉý¡£¡£B. ¶àÍ·¹þÏ££º½â¾ö " ¹þÏ£³åÍ» "²»¿ÉÄܰÑËùÓÐ N-gram ¶¼´æÏÂÀ´¡£¡£Engram ʹÓÃÁË " ¶àÍ·¹þÏ££¨Multi-Head Hashing£©" ÊÖÒÕ¡£¡£Í¨¹ý¶à¸ö¹þÏ£º¯Êý £¬£¬£¬£¬£¬£¬½«ÎÞÏÞµÄ N-gram Ó³Éäµ½ÓÐÏÞµÄÄÚ´æ²ÛλÖС£¡£ËäÈ»»áÓйþÏ£³åÍ»£¨¼´Á½¸ö²î±ðµÄ´Ê±»Ó³Éäµ½ÁËͳһ¸öλÖã© £¬£¬£¬£¬£¬£¬µ«Í¨¹ý " ¶àÍ· " Éè¼Æ £¬£¬£¬£¬£¬£¬Ä£×Ó¿ÉÒÔ´Ó¶à¸öºòѡЧ¹ûÖÐÆ´¼¯³ö׼ȷµÄÐÅÏ¢ £¬£¬£¬£¬£¬£¬¼«´óµØÌá¸ßÁ˳°ôÐÔ¡£¡£C. ÉÏÏÂÎÄÃſأº¸øÓ°ÏóÅä¸ö " ²ÃÅÐ "ÕâÊÇ×ÃîµÄÒ»±Ê¡£¡£²é±íÊÇËÀµÄ £¬£¬£¬£¬£¬£¬ÓïÑÔÊÇ»îµÄ¡£¡£ºÃ±È " Æ»¹û " Õâ¸ö´Ê¡£¡£ÔÚ " ³ÔÆ»¹û " µÄÓᄈϠ£¬£¬£¬£¬£¬£¬Ëüָˮ¹û£»£»£»£»ÔÚ " Æ»¹ûÐû²¼»á " µÄÓᄈϠ£¬£¬£¬£¬£¬£¬ËüÖ¸¿Æ¼¼¹«Ë¾¡£¡£Ö±½Ó²é±í¿ÉÄÜ»áÒýÈëÔëÉù¡£¡£DeepSeek Éè¼ÆÁËÒ»¸ö " ÉÏÏÂÎĸÐÖªÃÅ¿Ø "£¨Context-aware Gating£©¡£¡£Query£¨ÅÌÎÊ£©£ºÄ¿½ñÉÏÏÂÎĵÄÒþ²Ø×´Ì¬£¨Hidden State£©¡£¡£Key/Value£¨¼üÖµ£©£º²é±í»ñµÃµÄ¾²Ì¬ÏòÁ¿¡£¡£Õâ¸öÃſؾÍÏñÒ»¸ö²ÃÅС£¡£ÈôÊDzé³öÀ´µÄ " ¾²Ì¬ÖªÊ¶ " ºÍÄ¿½ñµÄ " ÉÏÏÂÎÄ " ²»´î £¬£¬£¬£¬£¬£¬²ÃÅоͻá°ÑÈ¨ÖØÑ¹µÍ£¨Gate ÖµÇ÷Ïò 0£© £¬£¬£¬£¬£¬£¬ÈÃÄ£×ÓºöÂÔÕâ¸öÔëÉù£»£»£»£»ÈôÊÇÍêÉÆÆõºÏ£¨ºÃ±È " É˺®ÔÓ²¡ÂÛ " ºóËæ×Å " ÕÅÖÙ¾° "£© £¬£¬£¬£¬£¬£¬²ÃÅоͻá°Ñ´óÃÅ·­¿ª£¨Gate ÖµÇ÷Ïò 1£© £¬£¬£¬£¬£¬£¬Ö±½Ó°Ñ֪ʶעÈëÄ£×Ó¡£¡£ µÚ¶þÕ£º»Æ½ð±ÈÀý¡ª¡ª·¢Ã÷ AI Ä£× "U ÐÍÇúÏß "¼Ü¹¹Éè¼ÆºÃÁË £¬£¬£¬£¬£¬£¬½ÓÏÂÀ´µÄÎÊÌâÊÇ£ºÔõô·Ö¾Ó²ú£¿£¿£¿£¿ £¿ £¿¼ÙÉèÎÒÃÇÏÔ¿¨ÀïµÄÏÔ´æÊÇÓÐÏÞµÄ £¬£¬£¬£¬£¬£¬×ܲÎÊýÔ¤ËãÒ²ÊÇÀο¿µÄ¡£¡£ÎÒÃÇÓ¦¸Ã°Ñ¼¸¶à²ÎÊý·ÖÅ䏸 MoE µÄ " ר¼Ò "£¨ÈÏÕæÅÌË㣩 £¬£¬£¬£¬£¬£¬¼¸¶à²ÎÊý·ÖÅ䏸 Engram µÄ " ×Öµä "£¨ÈÏÕæÓ°Ï󣩣¿£¿£¿£¿ £¿ £¿ÕâÊÇÒ»¸öµä·¶µÄ×ÊÔ´ÉèÖò©ÞÄ¡£¡£DeepSeek ÍŶӾÙÐÐÁËÒ»³¡´ó¹æÄ£µÄÏûÈÚʵÑé £¬£¬£¬£¬£¬£¬É¨ÃèÁË´Ó 0% µ½ 100% µÄ·ÖÅä±ÈÀý £¬£¬£¬£¬£¬£¬Ð§¹û»­³öÁËÒ»ÌõÍêÉÆµÄ "U ÐÍ Scaling Law ÇúÏß "¡£¡£ÕâÕÅͼչÏÖÁË AI Ä£×ÓÉè¼ÆµÄµ×²ã¼ÍÂÉ£º×ó²à¼«¶Ë£¨´¿ Engram£©£ºÈôÊǰѲÎÊýÈ«¸ø×Öµä £¬£¬£¬£¬£¬£¬Loss ºÜ¸ß¡£¡£ÓÉÓÚÄ£×ÓÄð³ÉÁË " Êé°×³Õ " £¬£¬£¬£¬£¬£¬¹âÓÐËÀ¼ÇÓ²±³ £¬£¬£¬£¬£¬£¬Ã»ÓÐÂß¼­ÍÆÀíÄÜÁ¦¡£¡£ÓҲ༫¶Ë£¨´¿ MoE£©£ºÈôÊǰѲÎÊýÈ«¸ø×¨¼Ò £¬£¬£¬£¬£¬£¬Loss Ò²ºÜ¸ß¡£¡£ÓÉÓÚר¼ÒÃDZ»ÆÈ°Ñ¾«Éñ¶¼»¨ÔÚ±³Ê飨ӰÏó¾²Ì¬ÖªÊ¶£©ÉÏ £¬£¬£¬£¬£¬£¬Ã»¿Õ¸ÉÕýÊ¡£¡£»£»£»£»Æ½ðÖ§½âµã£¨¦Ñ ¡Ö 75%-80%£©£ºµ±ÎÒÃǽ«Ô¼20%-25% µÄÏ£º±²ÎÊýÔ¤Ëã·Ö¸ø Engram £¬£¬£¬£¬£¬£¬Ê£Ïµĸø MoE ʱ £¬£¬£¬£¬£¬£¬Ä£×ÓµÄÑéÖ¤¼¯ Loss ½µµ½ÁË×îµÍµã¡£¡£ÕâÊÇÒ»¸ö¼«¾ßÖ¸µ¼ÒâÒåµÄ·¢Ã÷£º¹ØÓÚ¼¸°ÙÒÚ²ÎÊýµÄ´óÄ£×ÓÀ´Ëµ £¬£¬£¬£¬£¬£¬´¿´â¶ÑÆöÅÌË㵥루MoE ר¼Ò£©ÒѾ­ÊDZ߼ÊЧӦµÝ¼õÁË £¬£¬£¬£¬£¬£¬±ØÐèÒýÈëרÃŵľ²Ì¬Ó°ÏóÄ£¿£¿£¿£¿ £¿ £¿éÀ´ÊµÏÖ " ´æËãÆ½ºâ "¡£¡£ µÚÈýÕ£º·´Ö±¾õµÄ±¬·¢¡ª¡ªÎªÊ²Ã´ " ²é×Öµä " ÄÜÌá¸ß " ÊýѧЧ¹û "£¿£¿£¿£¿ £¿ £¿ÈôÊÇ Engram ½ö½öÊÇÈÃÄ£×Ó " ¼ÇÐÔ¸üºÃ " £¬£¬£¬£¬£¬£¬ÕâÆªÂÛÎĵķÖÁ¿»¹È±·¦ÒÔÕð¾ªÉçÇø¡£¡£ÊÂʵ £¬£¬£¬£¬£¬£¬RAG£¨¼ìË÷ÔöÇ¿ÌìÉú£©Ò²Äܽâ¾ö֪ʶÎÊÌâ¡£¡£ÕæÕýÈÃÒµ½ç¸ÐÓ¦Õðº³µÄ £¬£¬£¬£¬£¬£¬ÊÇʵÑéЧ¹ûÖÐÄÇЩÒâÁÏÖ®ÍâµÄÊÕÒæ¡£¡£DeepSeek ¹¹½¨ÁËÈý¸ö±ÈÕÕÄ£×Ó £¬£¬£¬£¬£¬£¬ÑÏ¿á¿ØÖÆ¼¤»î²ÎÊýÄ¿£¨3.8B£©ºÍѵÁ·Êý¾ÝÁ¿£¨262B tokens£©ÍêȫһÖ£ºDense-4B£º¹Å°åµÄŨÃÜÄ£×Ó¡£¡£MoE-27B£º´¿ MoE Ä£×Ó£¨72 ¸öר¼Ò£©¡£¡£Engram-27B£º»ìÏýÄ£×Ó£¨55 ¸öר¼Ò + 5.7B Engram ²ÎÊý£©¡£¡£Ð§¹ûÁîÈË´óµøÑÛ¾µ£º1. ÒâÁÏÖ®ÖУºÖªÊ¶ÀàʹÃü°Ô°ñÔÚ MMLU£¨×ÛºÏ֪ʶ£©ÉÏ £¬£¬£¬£¬£¬£¬Engram Ä£×ÓÌáÉýÁË3.4 ·Ö£»£»£»£»ÔÚ CMMLU£¨ÖÐÎÄ֪ʶ£©ÉÏ £¬£¬£¬£¬£¬£¬ÌáÉýÁË4.0 ·Ö¡£¡£ÕâºÜºÃÃ÷È· £¬£¬£¬£¬£¬£¬Íâ¹ÒÁË×Öµä £¬£¬£¬£¬£¬£¬ÖªÊ¶×ÔÈ»¸üºÃÁË £¬£¬£¬£¬£¬£¬»Ã¾õ¸üÉÙÁË¡£¡£2. ÒâÁÏÖ®Í⣺Âß¼­¡¢´úÂë¡¢ÊýѧÖÜÈ«±©Õǰ´Àí˵ £¬£¬£¬£¬£¬£¬" ²é×Öµä " ºÍ " ×öÊýѧÌâ " û¹ØÏµ¡£¡£µ«ÔÚ BBH£¨×ÛºÏÍÆÀí£©ÉÏ £¬£¬£¬£¬£¬£¬Engram-27B ¾¹È»±Èͬ²ÎÊýµÄ´¿ MoE »ùÏßÌáÉýÁËÕûÕû5.0 ·Ö£¡MATH£¨Êýѧ£©£ºÌáÉý2.4 ·Ö¡£¡£HumanEval£¨´úÂëÌìÉú£©£ºÌáÉý3.0 ·Ö¡£¡£ARC-Challenge£¨ÖØ´óÍÆÀí£©£ºÌáÉý3.7 ·Ö¡£¡£3. Éî¶ÈÆÊÎö£ºÓÐÓÃÉî¶È£¨Effective Depth£©ÀíÂÛΪʲô£¿£¿£¿£¿ £¿ £¿Ò»¸ö " ËÀ¼ÇÓ²±³ " µÄÄ£¿£¿£¿£¿ £¿ £¿é £¬£¬£¬£¬£¬£¬ÎªÊ²Ã´ÄÜÌá¸ßÖÇÉÌ£¿£¿£¿£¿ £¿ £¿DeepSeek ÍŶÓʹÓÃLogitLensºÍ "CKA£¨ÖÐÐÄºË¶ÔÆë£©" ÊÖÒÕ £¬£¬£¬£¬£¬£¬¶ÔÄ£×ÓÄÚ²¿¾ÙÐÐÁË " ÆÊ½â "¡£¡£ËûÃÇ·¢Ã÷ÁËÒ»¸ö¾ªÈ˵ÄÕ÷Ï󣺻¹¼ÇµÃ¿ªÍ·µÄ " ´÷°²ÄÈÍõåú " Â𣿣¿£¿£¿ £¿ £¿ÔÚ´¿ MoE Ä£×ÓÖÐ £¬£¬£¬£¬£¬£¬Ç°¼¸²ãÍøÂç¶¼ÔÚæ×Å " Æ´¼¯¿´·¨ "¡£¡£¶øÔÚ Engram Ä£×ÓÖÐ £¬£¬£¬£¬£¬£¬ÓÉÓÚµÚ 2 ²ã¾Í²åÈëÁË Engram Ä£¿£¿£¿£¿ £¿ £¿é £¬£¬£¬£¬£¬£¬¾²Ì¬ÖªÊ¶µÄ¼ìË÷ÔÚ¼«ÔçµÄ½×¶Î¾ÍÍê³ÉÁË¡£¡£ÕâÒâζ×Å £¬£¬£¬£¬£¬£¬Ô­±¾ÓÃÓÚ " ËÀ¼ÇÓ²±³ " µÄǰ¼¸²ãÍøÂç±»½â·ÅÁË£¡ÕâÏ൱ÓÚ¸øÄ£×Ó " ÐéÔö " ÁËÉî¶È¡£¡£ ÄÇЩ±»ÊͷųöÀ´µÄÍøÂç²ãºÍ×¢ÖØÁ¦Í·£¨Attention Heads£© £¬£¬£¬£¬£¬£¬²»ÔÙÐèÒª´¦ÀíààËյľֲ¿ÒÀÀµ£¨ºÃ±Èʶ±ð " ÕÅÖÙ¾° " ÊÇË­£© £¬£¬£¬£¬£¬£¬´Ó¶ø¿ÉÒÔÈ«Éñ¹á×¢µØÍ¶Èëµ½¸üÖØ´óµÄÈ«¾ÖÍÆÀí¡¢³¤³ÌÂß¼­¹¹½¨ºÍ´úÂëÂß¼­ÌìÉúÖÐÈ¥¡£¡£Engram µÄʵÖÊ £¬£¬£¬£¬£¬£¬²»ÊÇ " Ìæ»» " ÍÆÀí £¬£¬£¬£¬£¬£¬¶øÊÇͨ¹ý " ·ÖÁ÷ " ÔÓ»î £¬£¬£¬£¬£¬£¬ÈôóÄÔרעÓÚ¸ü¸ßά¶ÈµÄ˼Ë÷¡£¡£ µÚËÄÕ£º¹¤³ÌÊÂÒµ¡ª¡ªÍ»ÆÆÓ¢Î°´ïµÄ " ÏÔ´æ°ÔȨ "¹ØÓÚ»ª¶û½ÖµÄͶ×ÊÕߺÍËãÁ¦ÖÐÐĵÄÔËάÕßÀ´Ëµ £¬£¬£¬£¬£¬£¬ÕâÆªÂÛÎÄ×îÐԸеĵط½²»ÔÚÓÚ Score £¬£¬£¬£¬£¬£¬¶øÔÚÓÚCost£¨±¾Ç®£©¡£¡£ÔÚ AI ʱ´ú £¬£¬£¬£¬£¬£¬×îÌÚ¹óµÄ×ÊÔ´²»ÊÇËãÁ¦£¨FLOPs£© £¬£¬£¬£¬£¬£¬¶øÊÇÏԴ棨HBM£©¡£¡£Ó¢Î°´ï H100 Ö®ÒÔÊÇ¹ó £¬£¬£¬£¬£¬£¬ºÜºéÁ÷ƽÉÏÊÇÓÉÓÚÄÇϡȱµÄ HBM3e ÄÚ´æ¡£¡£¶ø Engram ´øÀ´ÁËÒ»¸öÇ㸲ÐÔµÄÌØÕ÷£º³¹µ×µÄ´æËãÊèÉ¢¡£¡£1. MoE µÄÍ´µã£ºÏÔ´æÍÌÊÉÕ߹ŰåµÄ MoE Ä£×Ó £¬£¬£¬£¬£¬£¬Æä·ÓÉ»úÖÆ£¨Routing£©ÊǶ¯Ì¬µÄ¡£¡£Ä£×Ó±ØÐèÏÈËã³öÄ¿½ñ Token µÄÌØÕ÷ £¬£¬£¬£¬£¬£¬ËãÍêÕâÒ»²ã £¬£¬£¬£¬£¬£¬²ÅÖªµÀÏÂÒ»²ã¸ÃÕÒÄĸöר¼Ò¡£¡£ÕâÒâζ×Å £¬£¬£¬£¬£¬£¬ËùÓеÄר¼ÒÄ£×Ó±ØÐèʱ¿ÌÔÚÌÚ¹óµÄ GPU ÏÔ´æÀï´ýÃü £¬£¬£¬£¬£¬£¬Ëæ½ÐËæµ½¡£¡£2. Engram µÄÍ»ÆÆ£ºÈ·¶¨µÄÔ¤ÖªEngram µÄ²é±íÂß¼­ÊÇÈ·¶¨ÐԵġ£¡£Ö»ÒªÊäÈëµÄÎı¾È·¶¨ÁË£¨ºÃ±È "A New Axis of Sparsity"£© £¬£¬£¬£¬£¬£¬ÄÇôËü¶ÔÓ¦µÄ N-gram Ë÷Òý¾ÍÈ·¶¨ÁË¡£¡£ÎÒÃÇ»ù´¡²»ÐèÒªµÈÄ£×ÓËãÍêǰһ²ã £¬£¬£¬£¬£¬£¬ÔÚ Token ½øÈëÄ£×ÓµÄÄÇһ˲¼ä £¬£¬£¬£¬£¬£¬ÎÒÃǾÍÖªµÀËüÐèÒª²éÄÄÕűíµÄÄÄÒ»ÐС£¡£3. CPU µÄÄæÏ®£º°Ñ´óÄ£×ÓÈû½øÄÚ´æÌõÕâÒ»ÌØÕ÷´øÀ´ÁËÖØ´óµÄ¹¤³ÌÓ¯Àû£ºÐ¶ÔØ£¨Offload£©£ºÎÒÃÇ¿ÉÒ԰Ѽ¸°ÙÒÚ¡¢ÉõÖÁÉÏǧÒÚ²ÎÊýµÄ Engram ´Ê±í £¬£¬£¬£¬£¬£¬Ö±½ÓÈÓµ½×ÔÖÆ¡¢Á¿´ó¡¢Ò×À©Õ¹µÄ "CPU Äڴ棨DRAM£©" Àï £¬£¬£¬£¬£¬£¬ÉõÖÁ·ÅÔÚ NVMe SSD ÉÏ¡£¡£Ô¤È¡£¡£¨Prefetching£©£ºÔÚ GPU Æ´ÃüÅÌËãǰһ²ã Transformer µÄʱ¼ä £¬£¬£¬£¬£¬£¬CPU ʹÓà PCIe ͨµÀ £¬£¬£¬£¬£¬£¬Òì²½µØ°ÑÏÂÒ»²ãÐèÒªµÄÓ°ÏóÊý¾Ý " Ԥȡ " ³öÀ´ £¬£¬£¬£¬£¬£¬ÍÆË͵½ GPU¡£¡£ÑÚÊÎÑÓ³Ù £¬£¬£¬£¬£¬£¬²¢Ðд¦Àí¡£¡£DeepSeek ʵ²âÊý¾ÝÏÔʾ£º×ÝÈ»¹ÒÔØÁË100B£¨Ç§ÒÚ£©²ÎÊýµÄ Engram ±íµ½ CPU ÄÚ´æ £¬£¬£¬£¬£¬£¬Ïà±ÈÓÚ´¿ GPU ÍÆÀí £¬£¬£¬£¬£¬£¬ÍÌÍÂÁ¿µÄϽµ²»µ½ 3%¡£¡£ÕâÊÇÒ»¸öÈÃËùÓÐÓÉÓÚÂò²»µ½ HBM ¶ø½¹ÂǵÄÈË¿ñϲµÄ½áÂÛ¡£¡£ÕâÒâζ×Å £¬£¬£¬£¬£¬£¬Î´À´µÄ´óÄ£×Ó £¬£¬£¬£¬£¬£¬" Ó°ÏóÈÝÁ¿ " ¿ÉÒԵͳÉÍâµØÎÞÏÞÀ©ÕÅ £¬£¬£¬£¬£¬£¬¶ø²»±Ø±»Ó¢Î°´ïµÄÏԴ濨²±×Ó¡£¡£ µÚÎåÕ£º³¤Îı¾µÄʤÀû¡ª¡ª NIAH ²âÊÔµÄÔ¾Éý³ýÁËͨÓÃÍÆÀí £¬£¬£¬£¬£¬£¬Engram ÔÚ³¤Îı¾£¨Long Context£©ÁìÓòµÄÌåÏÖͬÑù֤ʵÎú " ·Ö¹¤ " µÄ¼ÛÖµ¡£¡£ÔÚ³¤Îı¾´¦ÀíÖÐ £¬£¬£¬£¬£¬£¬×¢ÖØÁ¦»úÖÆ£¨Attention£©µÄ´°¿ÚÊÇÓÐÏ޵ġ£¡£ÈôÊÇ×¢ÖØÁ¦±»´ó×ڵľֲ¿ÐÅÏ¢£¨ÈçÀο¿¶Ì

    ±¾ÎÄÁ´½Ó£º?/p/Products/4853156.html

    °Ù¶ÈÔÊÐí£ºÈçÓöÐéαڲƭ £¬£¬£¬£¬£¬£¬ÖúÄú****(Ôð±à£º³ÂÞÈÔ£¡¢µËΰÏè)

    Ïà¹ØÓ¦ÓÃ

    ¡¾ÍøÕ¾µØÍ¼¡¿
    Æ»¹û°æ-¸£Çåºê·СºúͬµÄÃÃ×Ó½Ðʲô-Áù±ßÐÎսʿM1