ÀÖÓãµç¾º


½ÌÓýÐÐÒµA¹ÉIPOµÚÒ»¹É£¨¹ÉƱ´úÂë 003032£©

È«¹ú×Éѯ/ͶËßÈÈÏߣº400-618-4000

Spark¼¯ÈºµÄÔËÐлù±¾Á÷³ÌÊÇÔõÑùµÄ£¿

¸üÐÂʱ¼ä:2022Äê03ÔÂ29ÈÕ10ʱ31·Ö À´Ô´:ÀÖÓãµç¾º ä¯ÀÀ´ÎÊý:

Spark¼¯ÈºµÄÔËÐмܹ¹

SparkÊÇ»ùÓÚÄÚ´æ¼ÆËãµÄ´óÊý¾Ý²¢ÐмÆËã¿ò¼Ü£¬±ÈMapReduce¼ÆËã¿ò¼Ü¾ßÓиü¸ßµÄʵʱÐÔ£¬Í¬Ê±¾ßÓиßЧÈÝ´íÐԺͿÉÉìËõÐÔ£¬ÔÚѧϰSpark²Ù×÷֮ǰ£¬Ê×ÏȽéÉÜSparkÔËÐмܹ¹£¬Èçͼ2-11Ëùʾ¡£

Spark¼¯ÈºµÄÔËÐмܹ¹

ÔÚÉÏͼÖУ¬SparkÓ¦ÓÃÔÚ¼¯ÈºÉÏÔËÐÐʱ£¬°üÀ¨Á˶à¸ö¶ÀÁ¢µÄ½ø³Ì£¬ÕâЩ½ø³ÌÖ®¼äͨ¹ýÇý¶¯³ÌÐò(Driver Program)ÖеÄSparkContext¶ÔÏó½øÐÐЭµ÷£¬SparkContext¶ÔÏóÄܹ»Óë¶àÖÖ¼¯Èº×ÊÔ´¹ÜÀíÆ÷(Cluster Manager)ͨÐÅ£¬Ò»µ©Ó뼯Ⱥ×ÊÔ´¹ÜÀíÆ÷Á¬½Ó£¬Spark»áΪ¸ÃÓ¦ÓÃÔÚ¸÷¸ö¼¯Èº½ÚµãÉÏÉêÇëÖ´ÐÐÆ÷(Executor)£¬ Óà ÓÚ Ö´ ÐÐ ¼Æ Ëã ÈÎ Îñ ºÍ ´æ ´¢ Êý ¾Ý¡£Spark½«Ó¦ÓóÌÐò´úÂë·¢Ë͸øËùÉêÇëµ½µÄÖ´ÐÐÆ÷£¬SparkContext¶ÔÏ󽫷ָî³öµÄÈÎÎñ(Task)·¢Ë͸ø¸÷¸öÖ´ÐÐÆ÷È¥ÔËÐС£

ÐèҪעÒâµÄÊÇ£¬Ã¿¸öSparkÓ¦ÓóÌÐò¶¼ÓÐÆä¶ÔÓ¦µÄ¶à¸öÖ´ÐÐÆ÷½ø³Ì¡£Ö´ÐÐÆ÷½ø³ÌÔÚÕû±±¾©ÊвýÆ½Çø½¨²Ä³ÇÎ÷·½ðÑàÁú°ì¹«Â¥Ò»²ãµç»°£º400-618-400015¸öÓ¦ÓóÌÐòÉúÃüÖÜÆÚÄÚ£¬¶¼±£³ÖÔËÐÐ״̬£¬²¢ÒÔ¶àÏ̷߳½Ê½Ö´ÐÐÈÎÎñ¡£ÕâÑù×öµÄºÃ´¦ÊÇ£¬Ö´ÐÐÆ÷½ø³Ì¿ÉÒÔ¸ôÀëÿ¸öSparkÓ¦Óᣴӵ÷¶È½Ç¶ÈÀ´¿´£¬Ã¿¸öÇý¶¯Æ÷¿ÉÒÔ¶ÀÁ¢µ÷¶È±¾Ó¦ÓóÌÐòµÄÄÚ²¿ÈÎÎñ¡£´ÓÖ´ÐÐÆ÷½Ç¶ÈÀ´¿´£¬²»Í¬SparkÓ¦ÓöÔÓ¦µÄÈÎÎñ½«»áÔÚ²»Í¬µÄJVMÖÐÔËÐС£È»¶øÕâÑùµÄ¼Ü¹¹Ò²ÓÐȱµã£¬¶à¸öSparkÓ¦ÓóÌÐòÖ®¼äÎÞ·¨¹²ÏíÊý¾Ý£¬³ý·Ç°ÑÊý¾Ýдµ½Íⲿ´æ´¢½á¹¹ÖС£

Spark¶Ôµ×²ãµÄ¼¯Èº¹ÜÀíÆ÷Ò»ÎÞËùÖª£¬Ö»ÒªSparkÄܹ»ÉêÇëµ½Ö´ÐÐÆ÷½ø³Ì£¬ÄÜÓë֮ͨÐż´¿É¡£ÕâÖÖʵÏÖ·½Ê½¿ÉÒÔʹSpark±È½ÏÈÝÒ×µÄÔÚ¶àÖÖ¼¯Èº¹ÜÀíÆ÷ÉÏÔËÐУ¬ÀýÈçMesos¡¢Yarn¡£

Çý¶¯Æ÷³ÌÐòÔÚÕû¸öÉúÃüÖÜÆÚÄÚ±ØÐë¼àÌý²¢½ÓÊÜÆä¶ÔÓ¦µÄ¸÷¸öÖ´ÐÐÆ÷µÄÁ¬½ÓÇëÇó£¬Òò´ËÇý¶¯Æ÷³ÌÐò±ØÐëÄܹ»±»ËùÓÐWorker½Úµã·ÃÎʵ½¡£

ÒòΪ¼¯ÈºÉϵÄÈÎÎñÊÇÓÉÇý¶¯Æ÷À´µ÷¶ÈµÄ£¬ËùÒÔÇý¶¯Æ÷Ó¦¸ÃºÍWorker½Úµã¾àÀë½üһЩ£¬×îºÃÔÚͬһ¸ö±¾µØ¾ÖÓòÍøÖУ¬Èç¹ûÐèÒªÔ¶³Ì¶Ô¼¯Èº·¢ÆðÇëÇó£¬×îºÃ»¹ÊÇÔÚÇý¶¯Æ÷½ÚµãÉÏÆô¶¯RPC·þÎñÏìÓ¦ÕâЩԶ³ÌÇëÇó£¬Í¬Ê±°ÑÇý¶¯Æ÷±¾Éí·ÅÔÚÀ뼯ȺWorker½Úµã±È½Ï½üµÄ»úÆ÷ÉÏ¡£

SparkÔËÐлù±¾Á÷³Ì

ͨ¹ýÉÏһС½ÚÁ˽⵽£¬SparkÔËÐмܹ¹Ö÷ÒªÓÉSparkContext¡¢Cluster ManagerºÍWorker×é³É£¬ÆäÖÐCluster Manager¸ºÔðÕû¸ö¼¯ÈºµÄͳһ×ÊÔ´¹ÜÀí£¬Worker½ÚµãÖеÄExecutorÊÇÓ¦ÓÃÖ´ÐеÄÖ÷Òª½ø³Ì£¬ÄÚ²¿º¬Óжà¸öTaskÏß³ÌÒÔ¼°ÄÚ´æ¿Õ¼ä£¬ÏÂÃæÍ¨¹ýͼ2-12ÉîÈëÁ˽âSparkÔËÐлù±¾Á÷³Ì¡£Í¼2-12SparkÔËÐлù±¾Á÷³Ìͼ

SparkÔËÐлù±¾Á÷³Ì

(1)µ±Ò»¸öSparkÓ¦Óñ»Ìύʱ£¬¸ù¾ÝÌá½»²ÎÊýÔÚÏàӦλÖô´½¨Driver½ø³Ì£¬Driver½ø³Ì¸ù¾ÝÅäÖòÎÊýÐÅÏ¢³õʼ»¯SparkContext¶ÔÏ󣬼´SparkÔËÐл·¾³£¬ÓÉSparkContext¸ºÔðºÍCluster ManagerµÄͨÐÅÒÔ¼°×ÊÔ´µÄÉêÇë¡¢ÈÎÎñµÄ·ÖÅäºÍ¼à¿ØµÈ¡£SparkContextÆô¶¯ºó£¬´´½¨DAG Scheduler(½«DAGͼ·Ö½â³ÉStage)ºÍTask Scheduler(Ìá½»ºÍ¼à¿ØTask)Á½¸öµ÷¶ÈÄ£¿é¡£

(2)Driver½ø³Ì¸ù¾ÝÅäÖòÎÊýÏòCluster ManagerÉêÇë×ÊÔ´(Ö÷ÒªÊÇÓÃÀ´Ö´ÐеÄExecutor)£¬Cluster Manager½ÓÊÕµ½Ó¦ÓÃ(Application)µÄ×¢²áÇëÇóºó£¬»áʹÓÃ×Ô¼ºµÄ×ÊÔ´µ÷¶ÈËã·¨£¬ÔÚSpark¼¯ÈºµÄWorker½ÚµãÉÏ£¬Í¨ÖªWorkerΪӦÓÃÆô¶¯¶à¸öExecutor¡£

(3)Executor´´½¨ºó£¬»áÏòCluster Manager½øÐÐ×ÊÔ´¼°×´Ì¬µÄ·´À¡£¬±ãÓÚCluster Manager¶ÔExecutor½øÐÐ״̬¼à¿Ø£¬Èç¹û¼à¿Øµ½Executorʧ°Ü£¬Ôò»áÁ¢¿ÌÖØÐ´´½¨¡£

(4)Executor»áÏòSparkContext·´Ïò×¢²áÉêÇëTask¡£

(5)Task Scheduler½«Task·¢Ë͸øWorker½ø³ÌÖеÄExecutorÔËÐв¢ÌṩӦÓóÌÐò´úÂë¡£

(6)µ±³ÌÐòÖ´ÐÐÍê±ÏºóдÈëÊý¾Ý£¬DriverÏòCluster Manager×¢ÏúÉêÇëµÄ×ÊÔ´¡£





²ÂÄãϲ»¶£º

SparkµÄ¿ò¼ÜÄ£¿éºÍÔËÐÐģʽÊÇʲô£¿

Spark Streaming¹¤×÷Ô­ÀíÊÇʲô£¿

SparkµÄÓ¦Óó¡¾°ÓÐÄÄЩ£¿

SparkÉú̬ϵͳ°üº¬ÄÄЩ×é¼þ£¿¡¾´óÊý¾ÝÅàѵ¡¿

ÀÖÓãµç¾ºPython+´óÊý¾Ý¿ª·¢Åàѵ¿Î³Ì

0 ·ÖÏíµ½£º
ºÍÎÒÃÇÔÚÏß½»Ì¸£¡
¡¾ÍøÕ¾µØÍ¼¡¿¡¾sitemap¡¿