Updated: December 27, 2021, 11:41  Source: 乐鱼电竞
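DStream (Discretized Stream) is the basic data abstraction provided by Spark Streaming. It represents a continuous stream of data, which can be either the input stream received from a source or a processed stream generated by transforming an input stream.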
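A DStream is represented by a continuous series of RDDs, each of which contains the data from one specific time interval, as shown in the figure below. Spark Streaming divides the incoming stream into micro-batches by time interval (seconds, minutes, and so on). Each micro-batch is one RDD, and these RDDs, consecutive in time, together make up the DStream.
So a DStream is essentially a series of RDDs that are consecutive in time, that is, DStream => Seq[RDD].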
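
The micro-batch idea above can be modeled in a few lines of plain Python. This is only an illustrative sketch, not the Spark API: `discretize`, `events`, and `batch_interval` are hypothetical names, each inner list stands in for one RDD, and the outer list stands in for the DStream (Seq[RDD]). It assumes that an interval with no data still yields an (empty) batch.

```python
from collections import defaultdict


def discretize(events, batch_interval):
    """Group (timestamp, value) events into time-ordered micro-batches.

    Each inner list plays the role of one RDD; the outer list plays the
    role of the DStream. This is a toy model, not Spark code.
    """
    if not events:
        return []
    buckets = defaultdict(list)
    for ts, value in events:
        # Events whose timestamps fall in the same interval share a batch.
        buckets[int(ts // batch_interval)].append(value)
    first, last = min(buckets), max(buckets)
    # Emit one batch per interval, including intervals that received no data.
    return [buckets.get(i, []) for i in range(first, last + 1)]


events = [(0.2, "a"), (0.9, "b"), (1.1, "c"), (3.5, "d")]
batches = discretize(events, batch_interval=1.0)
print(batches)  # [['a', 'b'], ['c'], [], ['d']]
```

In real Spark Streaming the batch interval is fixed when the streaming context is created, and the batches are built from arriving data rather than from a pre-collected list.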

An operation on a DStream (for example flatMap, map, or filter) is an operation on its underlying RDDs.
Just as an operation on an RDD returns a new RDD, an operation on a DStream returns a new DStream.
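
A minimal sketch of this behavior, again in plain Python rather than Spark itself: `ToyDStream` is a hypothetical class whose batches are ordinary lists standing in for RDDs. Each operation is applied to every underlying batch and returns a new `ToyDStream`, leaving the original unchanged.

```python
class ToyDStream:
    """Toy stand-in for a DStream: an ordered list of batches (one per interval)."""

    def __init__(self, batches):
        # Copy the batches so that instances never share mutable lists.
        self.batches = [list(b) for b in batches]

    def map(self, f):
        # Apply f to every element of every batch; return a new stream.
        return ToyDStream([[f(x) for x in batch] for batch in self.batches])

    def flat_map(self, f):
        # f returns an iterable per element; flatten the results within each batch.
        return ToyDStream([[y for x in batch for y in f(x)] for batch in self.batches])

    def filter(self, pred):
        # Keep only the elements for which pred is true, batch by batch.
        return ToyDStream([[x for x in batch if pred(x)] for batch in self.batches])


lines = ToyDStream([["hello spark", "hello"], ["spark streaming"]])
words = lines.flat_map(str.split)
pairs = words.filter(lambda w: w != "hello").map(lambda w: (w, 1))
print(pairs.batches)  # [[('spark', 1)], [('spark', 1), ('streaming', 1)]]
```

The number of batches stays the same across the whole chain because every operation works batch by batch, which mirrors how a DStream operation maps onto the RDD of each interval.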


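Notes on the figure above:
1. Each ellipse represents one RDD.
2. Each circle inside an ellipse represents one partition of that RDD.
3. The RDDs in each column together represent one DStream (the figure has three columns, so there are three DStreams).
4. The last RDD in each row represents the intermediate-result RDD produced for each batch interval (Batch Size).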
Most Transformation and Action/Output operations are used in the same way as their RDD counterparts covered earlier. The few that differ are explained through worked examples.

