Creating a Maven Project to Package and Submit wordcount to a Storm Cluster



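In the previous post, we used Storm.Net.Adapter to create wordcount, a Storm topology written in C#. This post covers how to write the Java side of the program and how to deploy it to a test Storm environment.

If you find this helpful, you are welcome to Star and Fork the project so that more people see it and help improve it.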

STEP1: Clone storm-starter, the official Storm example project

$ git clone https://github.com/apache/storm.git && cd storm/examples/storm-starter

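STEP2: Add multilang support for C#:

Build the project completed in the previous post, "Create your first Storm topology with Csharp", and copy the generated assemblies into the /multilang/resources/ folder.

STEP3: Create the topology in Java:

Add WordCountTopologyCsharp.java under /src/jvm/storm/starter/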

/**
 * Licensed to the Apache Software Foundation (ASF) under one
 * or more contributor license agreements. See the NOTICE file
 * distributed with this work for additional information
 * regarding copyright ownership. The ASF licenses this file
 * to you under the Apache License, Version 2.0 (the
 * "License"); you may not use this file except in compliance
 * with the License. You may obtain a copy of the License at
 *
 * http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */
package storm.starter;

import backtype.storm.Config;
import backtype.storm.LocalCluster;
import backtype.storm.StormSubmitter;
import backtype.storm.spout.ShellSpout;
import backtype.storm.task.ShellBolt;
import backtype.storm.topology.IRichBolt;
import backtype.storm.topology.IRichSpout;
import backtype.storm.topology.OutputFieldsDeclarer;
import backtype.storm.topology.TopologyBuilder;
import backtype.storm.tuple.Fields;

import java.util.Map;

/**
 * This topology demonstrates Storm's stream groupings and multilang capabilities.
 */
public class WordCountTopologyCsharp {

    // Spout backed by StormSimple.exe, started with the "generator" argument.
    public static class Generator extends ShellSpout implements IRichSpout {

        public Generator() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "generator");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            // No component-specific configuration.
            return null;
        }
    }

    // Bolt backed by StormSimple.exe, started with the "splitter" argument.
    public static class Splitter extends ShellBolt implements IRichBolt {

        public Splitter() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "splitter");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word", "count"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }

    // Bolt backed by StormSimple.exe, started with the "counter" argument.
    public static class Counter extends ShellBolt implements IRichBolt {

        public Counter() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "counter");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word", "count"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }

    public static void main(String[] args) throws Exception {

        // Wire the topology: generator -> splitter -> counter.
        TopologyBuilder builder = new TopologyBuilder();

        builder.setSpout("generator", new Generator(), 1);

        builder.setBolt("splitter", new Splitter(), 1)
                .fieldsGrouping("generator", new Fields("word"));

        builder.setBolt("counter", new Counter(), 1)
                .fieldsGrouping("splitter", new Fields("word", "count"));

        Config conf = new Config();
        conf.setDebug(true);

        if (args != null && args.length > 0) {
            // A topology name was given: submit to the cluster under that name.
            conf.setNumWorkers(3);

            StormSubmitter.submitTopologyWithProgressBar(args[0], conf,
                    builder.createTopology());
        } else {
            // No arguments: run in an in-process local cluster for 10 seconds.
            conf.setMaxTaskParallelism(3);

            LocalCluster cluster = new LocalCluster();
            cluster.submitTopology("WordCount", conf, builder.createTopology());

            Thread.sleep(10000);

            cluster.shutdown();
        }
    }
}
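
The topology wires its bolts with fieldsGrouping, which sends tuples that share the same values for the grouping fields to the same task. The following is a minimal sketch of that idea in plain Java, assuming a simple hash-modulo rule; FieldsGroupingSketch and chooseTask are hypothetical names for illustration, and Storm's internal implementation may differ.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative only: a hypothetical class, not a Storm API.
final class FieldsGroupingSketch {
    private FieldsGroupingSketch() {}

    /** Picks a task index in [0, numTasks) from the values of the grouping fields. */
    static int chooseTask(List<?> groupingValues, int numTasks) {
        if (numTasks <= 0) {
            throw new IllegalArgumentException("numTasks must be positive");
        }
        // Equal value lists have equal hash codes, so they map to the same task.
        // floorMod keeps the index non-negative even when hashCode() is negative.
        return Math.floorMod(groupingValues.hashCode(), numTasks);
    }

    public static void main(String[] args) {
        // The same word always yields the same task index.
        System.out.println(chooseTask(Arrays.asList("storm"), 4)
                == chooseTask(Arrays.asList("storm"), 4));
    }
}
```

With a parallelism hint of 1, as in the topology above, every tuple reaches the single task whatever the grouping; the choice of grouping starts to matter once the hint is raised.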

The WordCountTopologyCsharp code above targets Windows with .NET. If you use Mono, or run on another platform through Mono, replace

super("cmd", "/k", "CALL", "StormSimple.exe", "xxxxxx");

with

super("mono", "StormSimple.exe", "xxxxxx");
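
The platform switch described above can also be made in code rather than by editing each constructor. The sketch below assumes a hypothetical helper class, ShellCommands, which is not part of Storm or Storm.Net.Adapter; it builds the same two command lines shown above from the os.name system property.

```java
import java.util.Locale;

// Hypothetical helper for illustration; not part of Storm or Storm.Net.Adapter.
final class ShellCommands {
    private ShellCommands() {}

    /** Builds the launch command for one StormSimple.exe component, given an OS name. */
    static String[] forComponent(String osName, String component) {
        boolean windows = osName.toLowerCase(Locale.ROOT).startsWith("windows");
        if (windows) {
            // Native .NET on Windows: start the executable through cmd.
            return new String[] {"cmd", "/k", "CALL", "StormSimple.exe", component};
        }
        // Any other platform: run the same assembly through Mono.
        return new String[] {"mono", "StormSimple.exe", component};
    }

    /** Same as above, using the OS name reported by the current JVM. */
    static String[] forComponent(String component) {
        return forComponent(System.getProperty("os.name", ""), component);
    }
}
```

A constructor could then read super(ShellCommands.forComponent("generator")); since the ShellSpout and ShellBolt constructors accept a variable-length String command. Note that the constructor runs where the topology is built, so this detects the OS of the submitting machine; if the workers run on a different platform, pass the OS name explicitly instead.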

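STEP4: Build and submit the topology:

  • Install the dependencies Storm needs: $ mvn clean install -DskipTests=true
  • Package the Storm topology with Maven: $ mvn package
  • Set up the runtime environment and submit the topology: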

    $ storm jar storm-starter-*-jar-with-dependencies.jar storm.starter.WordCountTopologyCsharp wordcount

    For setting up a Storm cluster, see the first post in this series, "Setting up an environment for developing Storm topologies with dotNet".
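
The trailing wordcount in the storm jar command is passed to main as args[0], which is what switches the topology from local mode to cluster submission. The sketch below isolates that decision in plain Java; SubmitPlan is a hypothetical class introduced only for illustration and mirrors the if/else in main.

```java
// Hypothetical value class for illustration; it is not part of Storm.
final class SubmitPlan {
    final boolean cluster;      // true: submit with StormSubmitter; false: run in a LocalCluster
    final String topologyName;  // name the topology is registered under

    private SubmitPlan(boolean cluster, String topologyName) {
        this.cluster = cluster;
        this.topologyName = topologyName;
    }

    /** Mirrors the branch in WordCountTopologyCsharp.main. */
    static SubmitPlan fromArgs(String[] args) {
        if (args != null && args.length > 0) {
            // First command-line argument becomes the topology name on the cluster.
            return new SubmitPlan(true, args[0]);
        }
        // No arguments: local run under the fixed name used in main.
        return new SubmitPlan(false, "WordCount");
    }

    public static void main(String[] args) {
        SubmitPlan plan = fromArgs(args);
        System.out.println((plan.cluster ? "cluster" : "local") + " " + plan.topologyName);
    }
}
```

Running the class without arguments therefore corresponds to the 10-second local test run, while any first argument selects cluster submission under that name.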

This post is from The NewIdea, written by Carey Tzou.
Permalink: https://www.tnidea.com/deploy-wordcount-topology.html
Reproduction of the full text or of excerpts without authorization is not permitted!