ÀÖÓãµç¾º



½ÌÓýÐÐÒµA¹ÉIPOµÚÒ»¹É£¨¹ÉƱ´úÂë 003032£©

È«¹ú×Éѯ/ͶËßÈÈÏߣº400-618-4000

È˹¤ÖÇÄÜÖ®¸öÐÔ»¯ÍƼö֮·

¸üÐÂʱ¼ä:2020Äê09ÔÂ14ÈÕ14ʱ43·Ö À´Ô´:ÀÖÓã²¥¿Í ä¯ÀÀ´ÎÊý:

Ïà¹Ø±³¾°

¶ÔÓÚÒ»¼ÒÆ·ÅÆµçÉÌ£¬Á¬½Ó׏©Ó¦É̺ÍÓû§£¬¸ÄÉÆ¹©Ó¦Á´µÄͬʱ¸øÓû§Ìṩ¸ßÆ·ÖÊ¡¢¸ßÖÊÁ¿¡¢ÎÞ´óÅÆÒç¼ÛµÄÉÌÆ·¡£ÓªÏú²àÒµÎñ³¡¾°ÊÇÔÚÏßµçÉ̵ÄÖØÖÐÖ®ÖØ£¬ÃæÁٵij¡¾°ÌôÕ½°üÀ¨£º

·ÈçºÎÓÅ»¯Á÷Á¿·ÖÅä?

·ÈçºÎÔö¼Ó GMV?

·ÈçºÎÌáÉýÓû§¹ºÎïÌåÑé?

»¥ÁªÍøÈ˿ںìÀûÏûʧ´ù¾¡£¬ÔÚ´æÁ¿Õù¶áÓû§Ê±¼äµÄÕ½³¡ÉÏ£¬±ØÐë¸ßЧ³ä·ÖÀûÓÃÿ·ÝÁ÷Á¿¡£Ê¹ÓøöÐÔ»¯Ëã·¨ÊǸ÷¼Ò±ØÓü¼Êõ£¬ÔÚµçÉ̵ÄÓªÏú³¡¾°ÌáÉýЧ¹ûÃ÷ÏÔ£¬µã»÷ÂÊÔ¤¹ÀÌáÉýһС²½£¬Á÷Ë®ÔöÒæÒ»´ó²½¡£¸ÃĿǰÖ÷Òª²ÉÓÃÉî¶ÈÄ£ÐÍÓ¦ÓÃÔÚËÑË÷¡¢ÍƼö¡¢¹ã¸æµÈ³¡¾°£¬¾ßÌåËã·¨·½ÏòÓÐDeepCTR¡¢Embedding¡¢NLP µÈ¡£

ÔÚתÏòÉî¶ÈѧϰµÄÕâÁ½Ä꣬¸Ã·½°¸¸öÐÔ»¯³¡¾°µÄÓû§ÌåÑé¡¢Ö¸±êÊý¾Ý¶¼ÓнϴóÌáÉý(²úÆ·ÐÎʽ¡¢ÏµÍ³ÄÜÁ¦¡¢Ë㷨Ч¹û¶¼ÔÚ³ÖÐøÓÅ»¯ÖÐ)¡£¸öÐÔ»¯³¡¾°×îÖØÒªµÄÁ½¸ö½×¶ÎÊÇÕٻغÍÅÅÐò£¬½ÓÏÂÀ´»á´ÓÕâÁ½·½ÃæÇÐÈë·ÖÏíһЩʵ¼ù¾­ÑéºÍ˼·×ܽá¡£

¸öÐÔ»¯ÍƼö


¼¼Êõ·½°¸

ÄÜÁ¦Í³Ò»ÕÙ»ØÌåϵ

ÕâÀïµÄÕٻظü׼ȷ¶¨ÒåÊÇÕÙ»Ø + ´ÖÅÅ¡£ÕٻصÄÄ¿µÄÊÇ¿ìËÙËõС·¶Î§£¬¼õÇᾫÅŵÄѹÁ¦¡£µ±ÏµÄÌôÕ½²»ÔÚÊý¾Ý¹æÄ£ºÍϵͳѹÁ¦£¬ÎÒÃÇÉè¼ÆÏµÍ³×î¿´ÖØµÄÊǸ´ÓÃÐÔ¡¢Î¬»¤ÐÔºÍÀ©Õ¹ÐÔ¡£Òò´Ë¿¼ÂǶÔÕÙ»ØÄÜÁ¦×ö´óͳһ£¬Éæ¼°ËÑË÷¡¢ÍƼö¡¢¹ã¸æ¡¢¿Í·þ¶à¸öÒµÎñ³¡¾°¡£


ÕٻسéÏó

¶ÔÕÙ»ØÀ´Ëµ£¬ÊäÈëÊä³ö¶¼ÊÇÐÅÏ¢£¬Ö»ÊÇÐÅÏ¢µÄÀà±ð¿ÉÄܲ»Í¬£¬Í¬Ê±±ØÐ붨ÒåÐÅÏ¢¼ä¹ØÏµ¶ÈÁ¿£º$ score = f (InputData,OutputData) ¡£Õٻؽ׶ζÔʱÑӱȽÏÃô¸Ð£¬²»ÄÜÑ¡ÓÃÌ«¹ý¸´ÔÓ¹ØÏµ¶ÈÁ¿Ëã·¨¡£¶ÔÓÚ¶à¸ö³¡¾°£¬ÐÅÏ¢¿ÉÒÔ³éÏóΪ£º

·ËÑË÷(ÊäÈë²éѯ´Ê£¬Êä³öÉÌÆ·Áбí¡¢×¨ÌâÄÚÈÝÁбí)

·ÍƼö(ÊäÈëÓû§£¬Êä³öÉÌÆ·Áбí)

·¹ã¸æ(ÊäÈëÓû§£¬Êä³öËØ²ÄÁбí)

·¿Í·þ(ÊäÈëÎÊÌ⣬Êä³ö´ð°¸Áбí¡¢ÏàËÆÎÊÌâÁбí)

ÐÅÏ¢ÔØÌå°üÀ¨Îı¾ºÍÏòÁ¿£¬Á½¶ËÐÅÏ¢ÔØÌåÏàͬµÄÇé¿öϲÅÄܼÆËã¹ØÏµ¶ÈÁ¿Öµ¡£ ÏÔ¶øÒ×¼û²éѯ´Ê¡¢ÉÌÆ·¡¢ÎÊÌâ / ´ð°¸¡¢×¨ÌâÄÚÈÝÌìÈ»°üº¬Îı¾ÐÅÏ¢£¬¶øÊ¹ÓÃÏòÁ¿±ØÐëÒÀ¿¿Ä£ÐÍ×öÄ¿±êµÄÏòÁ¿»¯±íÕ÷¡£

ÀàËÆ Airbnb[1] Sequence Embedding ˼·£¬ÎÒÃÇ»ùÓÚʵÏÖÉÌÆ·ÏòÁ¿»¯±íÕ÷£¬Í¬Ê±ÒýÈëBatchÄÚ pairwise loss ºÍ side info ½øÒ»²½ÌáÉýÄ£Ð͵ÄÐÔÄܺÍЧ¹û¡£»ùÓÚÉÌÆ·ÏòÁ¿¿ÉÒÔ¹ØÁª±íÕ÷Óû§¡¢²éѯ´Ê¡¢×¨Ìâ¡£ÎÊÌâ / ´ð°¸µÄÏòÁ¿»¯²Î¿¼ sentence embedding ¼¼Êõ£¬ÓÉרÃÅµÄ NLP ͬѧ¸ºÔð¡£

ÕÙ»ØÄÜÁ¦

Èç¹ûÐÅÏ¢ÊÇÎı¾»òÏòÁ¿£¬ÕÙ»ØÄÜÁ¦ÐèÒªÖ§³Ö£º

·ÎÄ×ÖÏà¹ØÐÔ

ÒÀ¿¿ ElasticSearch Ë÷Òý£¬BM25 Ëã·¨¶ÈÁ¿Ïà¹ØÐÔ¡£

·ÓïÒåÏàËÆÐÔ

Pre-train ºÍ fine-tuning µÄ BERT Ä£ÐÍÍÆ¶ÏÏà¹ØÐÔ¡£

·ÏòÁ¿×î½üÁÚ

½üËÆ×î½üÁÚËÑË÷(Approximate Nearest Neighbor)£¬ÓàÏÒ¾àÀë¼ÆËãÏà¹ØÐÔ¡£

×÷ΪͨÓÃÕÙ»ØÌåϵ±ØÐë±£Ö¤µÍÑÓʱ¡¢¸ß¿ÉÓ㬾ßÌå×öϵͳ¼Ü¹¹Ê±ÎÒÃǽ«ÈýÖÖÄÜÁ¦¶¼ÓÃË÷ÒýÀ´Ö§³Å¡£ÎÄ×ÖÏà¹ØÐÔ±¾À´¾ÍÊÇË÷ÒýÌṩµÄÄÜÁ¦;ÏòÁ¿×î½üÁÚ»ùÓÚ LSH(Locality-sensitive hashing)ͶӰºóÔÙ½¨Ë÷Òý£¬»ñÈ¡Ë÷ÒýÕýÅÅÊý¾Ý(ԭʼÏòÁ¿)¼ÆËãÆ¥ÅäµÃ·Ö£¬ÊµÏÖÏòÁ¿µÄÔÚÏß²éѯÄÜÁ¦¡£·Ç GPU »·¾³Ï BERT Ä£ÐͱȽϺÄʱ£¬¿É½«ÈÈÃŲéѯ´ÊºÍ¸ßƵÎÊÌâ(¶þ°Ë¶¨ÂÉ)×öÀëÏßÍÆ¶Ï£¬½á¹ûµÃ·Ö T+1 ͬ²½µ½Ë÷Òý¡£

ÖÁ´ËÈýÖÖÄÜÁ¦¶¼ÕûºÏÔÚË÷ÒýÌåϵÏ£¬Õٻس¡¾°¿ÉÒÔ²¢ÐÐʹÓ᣾ٸöÀý×Ó£º ËÑË÷³¡¾°£ºÊäÈë²éѯ´ÊͬʱʹÓÃÎÄ×ÖÏà¹ØÐÔ£¬ÓïÒåÏàËÆÐÔ(Query ºÍ ItemTitle ÐÎ³É setence pair)£¬ÏòÁ¿×î½üÁÚÕÙ»ØÉÌÆ·(QueryVector µ½ ItemVector)¡£ÕâÑùÉÌÆ·×ÜÁ¿Æ«Ð¡£¬À©Õ¹ÕÙ»ØÄÜÁ¦Ï£ÍûÓû§ÄÜ¿´µ½¸ü¶à¹ØÁªÉÌÆ·¡£

ÍÆ¼ö³¡¾°£ºÊäÈëÍÆ¼öÄ£¿éÉÌÆ·³ØÌõ¼þºÍÓû§ÏòÁ¿£¬Ê¹ÓÃÎÄ×ÖÏà¹ØÐÔ(¶à·ÕٻصÄÉÌÆ·¹ýÂËÌõ¼þ£¬ÊÇͨÓõÄË÷ÒýÌõ¼þ²éѯ)£¬ÏòÁ¿×î½üÁÚ (UserVector µ½ ItemVector) ÕÙ»ØÉÌÆ·¡£

¸÷ÏÔÉñͨ¾«ÅÅ·þÎñ

¾«Åų¡¾°Ã»Óп¼ÂǶÔÄÜÁ¦×ö³éÏóͳһ£¬Ã¿¸öÄ£¿éºËÐÄËßÇóµã»á²»Í¬(ÓÐЩսÂÔµ¼ÏòµÄ³¡¾°¿ÉÄÜÖ»¿¼ÂÇÆØ¹âµã»÷Á¿)¡£ºËÐÄÖ¸±êÉÏ×ۺϿ¼Á¿ÆØ¹âת»¯ÂÊ(CTR*CVR)¡¢UV ¼ÛÖµ¡¢×ÜÌå GMV£¬ÕâЩ¶¼ÊÇËã·¨ÐèҪȫ¾ÖÓÅ»¯µÄÄ¿±ê£¬Í¬Ê±×ÛºÏÒµÎñ·½µÄÈ˹¤²ßÂÔ£¬³ÊÏÖ¸øÓû§×îÖÕµÄÉÌÆ·ä¯ÀÀЧ¹û¡£Ð¡»ï°é¿ÉÒÔ³ä·Ö·¢»Ó´ÏÃ÷²ÅÖÇ£¬×ÔÓÉÑ¡ÔñÌØÕ÷ºÍÄ£Ðͽṹ£¬ÒÔÏßÉÏ AB Test Êý¾ÝÆÀ¹À¼ÛÖµ¡£ÎÒÃǵľ«ÅÅÄ£ÐÍ̽Ë÷Íù´óµÄ˵·ÖΪÁ½·½Ã棺׷Ç󾫶ÈЧ¹ûµÄ¸´ÔÓÉî¶ÈÄ£ÐͺÍʵʱ¸ÐÖªÓû§Æ«ºÃµÄÔÚÏßÄ£ÐÍ£¬ÏÂÎÄ×öÏêϸ½éÉÜ¡£

¸´ÔÓÄ£ÐÍ

ʹÓÃÉî¶ÈÄ£ÐÍ×ö DeepCTR Ô¤¹À£¬Ö÷Ìå˼·ÊÇ Embedding&MLP£¬¶ÔÄ£ÐÍ×ö¸Ã·½°¸ÑÏÑ¡³¡¾°µÄÊÊÅäµ÷Õû¡£ÎÒÃÇÊ׸öÉî¶ÈÄ£ÐÍÊÇ WDL[2]£¬Wide Part ÒÀ¿¿ÏÈÑé֪ʶ¹¹½¨ÌØÕ÷£¬ÌôÑ¡¸ß¸²¸Ç¡¢¸ß¼ÛÖµÊôÐÔÁ½Á½½»²æ¹¹½¨ÌØÕ÷¡£Deep Part ÒÀ¿¿Éî¶ÈÍøÂçʵÏÖÌØÕ÷×éºÏ£¬¶ÔÉÌÆ· Id¡¢ÀàÄ¿ Id¡¢ÊôÐÔ Id Embedding£¬µ«²»Ö±½Ó¶ÔÓû§ Id Embedding£¬¶øÊÇ»ùÓÚÓû§ºÍÉÌÆ·µÄÐÐΪÐòÁбíÕ÷Óû§£¬ÒýÈë×¢ÒâÁ¦Ä£¿é[3] ¼ÆËãÓû§ÏòÁ¿¡£

ºóÐøÔÚ¶¨ÖÆ WDL Ö®ÉϼÓÈë CTR¡¢CVR µÄ¶àÈÎÎñѧϰ[4]£¬¹²Ïí Embedding ²ãÈ¨ÖØ£¬¹¹½¨ CTCVR LOSS£¬ÊµÏÖÆØ¹â½×¶ÎµÄ CVR Ô¤¹À¡£

ÔÚÏßÄ£ÐÍ

Òµ½çÒѾ­Ðγɹ²Ê¶Êý¾ÝÐÂÏʶÈÄܽøÒ»²½ÌáÉýÄ£ÐÍЧ¹û¡£ÎªÊµÊ±¸ÐÖªÓû§Æ«ºÃ£¬Ò»°ãʹÓÃÊµÊ±ÌØÕ÷´«ÈëÉî¶ÈÄ£ÐÍ£¬µ«´æÔÚÁ½¸öÎÊÌ⣺

·ÊµÊ±ÌØÕ÷ÔÚÏß¼ÆËãÓÃÓÚÍÆ¶Ï£¬Êý¾ÝÇåÀíÓÃÓÚѵÁ·;ÌØÕ÷µ÷ÕûµÄά»¤³É±¾ºÍ¼ÆËãÁ¿±È½Ï´ó¡£

·ÊµÊ±ÌØÕ÷ÏßÉϼÆËãÓв»¿É¿¿ÐÔ£¬ÈŶ¯Ä£ÐÍЧ¹û¡£

ÎÒÃÇÑ¡ÔñʵÏֳɱ¾½ÏµÍµÄ·½°¸£ºT+1 ¸´ÔÓÄ£Ð͵Ļù´¡ÉÏÌôÑ¡ TopK Ó¦ÓÃÊµÊ±ÖØÅÅÐò£¬ÔÚ·çÏտɿصÄǰÌáϳ¢ÊÔ¼¤½øÊµÊ±Ëã·¨¡£»ùÓÚÓû§ºÍÉÌÆ·½»»¥Êý¾Ý¼ÆËãÓû§¶àÀàÄ¿µÄʵʱÐËȤÏòÁ¿(Óû§ÏòÁ¿±íÕ÷²Î¿¼ÉÏÎÄÕÙ»ØËµÃ÷)£¬ÔÚ²»Í¬µÄ³¡¾°°´ÐèʹÓá£ÈçÔÚËÑË÷ÍÆ¼ö£¬Ê¹Óõ±Ç°ÉÏÏÂÎÄÐÅÏ¢¼¤»î¶àά¶ÈÓû§ÊµÊ±ÏòÁ¿£¬¼ÆËãÏòÁ¿¾àÀëÖØÅÅÐòÉÌÆ·Áбí´ïµ½ÀàËÆÊµÊ±Ä£Ð͵ÄЧ¹û¡£ÏµÍ³¹¤³ÌÉÏÊÕÒæÒ»¸öÊÇÓû§ºÍÉÌÆ·½»»¥Êý¾ÝÊÇ»ù´¡Í³¼Æ£¬Ã»ÓжîÍâÌØÕ÷ά»¤³É±¾£¬ÁíÒ»¸öʹÓÃÏòÁ¿ÓиüºÃ·º»¯ÐÔºÍÊý¾ÝÈÝ´íÐÔ¡£¸Ã·½°¸ÑÏÑ¡³¡¾°Ñé֤Ч¹ûÓÐÃ÷ÏÔµÄÌáÉý¡£


½áÓï

½ü¼¸ÄêÉî¶ÈѧϰÁìÓòµÄ¼¼Êõ¸ïзdz£¿ì£¬ÐÂÄ£ÐÍÐÂ˼·²ã³ö²»Ç²»ÓɸÐ̾ “ÖÕÉíѧϰ” µÄÖªÐкÏÒ»ÆÄΪ²»Òס£ÓªÏú¶Ë¸öÐÔ»¯ÓзḻµÄÉî¶ÈѧϰӦÓó¡¾°£¬´Óµã»÷ÂÊÔ¤¹À¡¢ÓïÒ寥Åä¡¢ÏòÁ¿»¯µ½ËزĺϳÉ¡¢ÎݸÉú³É¡¢ÈËȺ»­Ïñ£¬ÎÒÃÇÒ»Ö±ÔÚÀ©³äËã·¨µÄÄÜÁ¦±ß½ç¡£


²Î¿¼ÎÄÏ×

[1] Real-time Personalization using Embeddings for Search Ranking at Airbnb

[2] Wide & Deep Learning for Recommender Systems

[3] Deep Interest Network for Click-Through Rate Prediction

[4] Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate


²ÂÄãϲ»¶£º

»¥ÁªÍø²úÆ·ÈçºÎʵÏÖ¸öÐÔ»¯ÍƼö?

Éî¶ÈÏà»úÊÇʲô?Éî¶ÈÏà»ú³£¼û¼¼Êõ

0 ·ÖÏíµ½£º
ºÍÎÒÃÇÔÚÏß½»Ì¸£¡
    ¡¾ÍøÕ¾µØÍ¼¡¿¡¾sitemap¡¿