source link: https://www.codesd.com/item/map-map-reduction-reduction-final-release.html

Map -> Map -> Reduce -> Reduce -> Final Result


Recently I read a paper that proposes an algorithm for mining maximum contiguous patterns from DNA data. The proposed method, which sounds quite interesting, uses the following MapReduce model: map -> map -> reduce -> reduce. That is, the first map phase runs and its output becomes the input to the second map phase. The second map phase's output is the input to the first reduce phase, whose output in turn feeds the second reduce phase; finally, the results are flushed to HDFS. Although it seems like an interesting method, the paper doesn't mention how it was implemented. My question is: how do you implement this sort of MapReduce chaining?
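To make the dataflow concrete, here is a minimal Python simulation of the four phases. The 3-mer counting logic is an illustrative stand-in I made up, not the paper's algorithm; the point is only that each phase's output feeds the next, with a shuffle (group-by-key) before each reduce:

```python
from collections import defaultdict

def map1(seq):
    # First map: emit every 3-mer of a sequence with count 1.
    return [(seq[i:i + 3], 1) for i in range(len(seq) - 2)]

def map2(kv):
    # Second map: normalize keys to uppercase.
    k, v = kv
    return (k.upper(), v)

def reduce1(key, values):
    # First reduce: sum the counts for one key.
    return (key, sum(values))

def reduce2(key, values):
    # Second reduce: keep only k-mers seen at least twice.
    total = sum(values)
    return (key, total) if total >= 2 else None

def shuffle(pairs):
    # Group values by key, as Hadoop does between map and reduce.
    groups = defaultdict(list)
    for k, v in pairs:
        groups[k].append(v)
    return sorted(groups.items())

def pipeline(records):
    pairs = [map2(kv) for r in records for kv in map1(r)]   # map -> map
    mid = [reduce1(k, vs) for k, vs in shuffle(pairs)]      # reduce 1
    out = (reduce2(k, vs) for k, vs in shuffle(mid))        # reduce 2
    return [kv for kv in out if kv is not None]             # -> HDFS

print(pipeline(["acgtac", "ACGT"]))  # [('ACG', 2), ('CGT', 2)]
```

In a real Hadoop job the shuffle steps are done by the framework, not by user code; the sketch only shows where the two grouping barriers sit in the chain.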


I think there are two ways to handle your case:

  1. Merge the two map functions into a single map task that runs them as two internal phases, and do the same with the two reduce functions in a single reduce task.

  2. Split the map-map-reduce-reduce pipeline into two Hadoop jobs: the first job runs the two maps, with the second map task recast as that job's reduce task; the second job runs the two reduces, with the first reduce task recast as that job's map task. If you end up submitting several Hadoop jobs that depend on one another, you could use Oozie to manage the workflow.
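As a concreteness check for option 2, here is a self-contained Python sketch of the two-job split (the k-mer counting functions are an illustrative stand-in, not the paper's code). The second map runs in job 1's reduce slot, and the first reduce runs in job 2's map slot, emitting a partial result per record that job 2's shuffle and reduce then merge; this recasting works here only because the first reduce's aggregation (a sum) is associative:

```python
from collections import defaultdict

def map1(seq):
    # First map: emit every 3-mer of a sequence with count 1.
    return [(seq[i:i + 3], 1) for i in range(len(seq) - 2)]

def map2(kv):
    # Second map: normalize keys to uppercase.
    k, v = kv
    return (k.upper(), v)

def reduce1(key, values):
    # First reduce: sum counts; associative, so partial groups are fine.
    return (key, sum(values))

def reduce2(key, values):
    # Second reduce: merge partial sums, keep k-mers seen at least twice.
    total = sum(values)
    return (key, total) if total >= 2 else None

def shuffle(pairs):
    # Group values by key, as Hadoop's shuffle does.
    groups = defaultdict(list)
    for k, v in pairs:
        groups[k].append(v)
    return sorted(groups.items())

def job1(records):
    # Job 1: map1 in the map slot; map2 recast as the reduce, applied to
    # each pair of every shuffled group. The returned pairs stand in for
    # the intermediate files written to HDFS between the two jobs.
    pairs = [kv for r in records for kv in map1(r)]
    return [map2((k, v)) for k, vs in shuffle(pairs) for v in vs]

def job2(pairs):
    # Job 2: reduce1 recast as the map. A Hadoop map sees one record at
    # a time, so reduce1 emits a partial result per record; the shuffle
    # regroups them and reduce2 merges the partials.
    partials = [reduce1(k, [v]) for k, v in pairs]
    out = (reduce2(k, vs) for k, vs in shuffle(partials))
    return [kv for kv in out if kv is not None]

print(job2(job1(["acgtac", "ACGT"])))  # [('ACG', 2), ('CGT', 2)]
```

On a real cluster the two jobs would be submitted separately, with job 2 reading job 1's HDFS output directory; Oozie (or simply calling `Job.waitForCompletion` on the first job before submitting the second) handles the dependency.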

