ceph/qa/workunits/hadoop/wordcount.sh

#!/bin/bash

set -e
set -x
# fail a pipeline if any stage fails (e.g. the curl in "curl | tar" below)
set -o pipefail

WC_INPUT=/wc_input
WC_OUTPUT=/wc_output
DATA_INPUT=$(mktemp -d)

echo "starting hadoop-wordcount test"

# bail if $TESTDIR is not set as this test will fail in that scenario
[ -z "$TESTDIR" ] && { echo "\$TESTDIR needs to be set, but is not. Exiting." >&2; exit 1; }

# if HADOOP_PREFIX is not set, use default
[ -z "$HADOOP_PREFIX" ] && { HADOOP_PREFIX="$TESTDIR/hadoop"; }

export JAVA_HOME=/usr/lib/jvm/default-java

# Nuke hadoop directories
"$HADOOP_PREFIX/bin/hadoop" fs -rm -r "$WC_INPUT" "$WC_OUTPUT" || true

# Fetch and import testing data set
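# --fail makes curl exit non-zero on an HTTP error instead of piping an error page to tar
curl --fail http://ceph.com/qa/hadoop_input_files.tar | tar xf - -C "$DATA_INPUT"
"$HADOOP_PREFIX/bin/hadoop" fs -copyFromLocal "$DATA_INPUT" "$WC_INPUT"
rm -rf "$DATA_INPUT"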

# Run the job
# the glob stays outside the quotes so the shell can expand it
"$HADOOP_PREFIX/bin/hadoop" jar \
  "$HADOOP_PREFIX"/share/hadoop/mapreduce/hadoop-mapreduce-examples-*.jar \
  wordcount "$WC_INPUT" "$WC_OUTPUT"

# Cleanup
"$HADOOP_PREFIX/bin/hadoop" fs -rm -r "$WC_INPUT" "$WC_OUTPUT" || true

echo "completed hadoop-wordcount test"
exit 0

# Commit history (oldest first):
#
# 2013-02-18 23:46:20 +00:00  Joe Buck <jbbuck@gmail.com>
#   testing: adding a Hadoop wordcount test
#   Reviewed-by: Sam Lang <sam.lang@inktank.com>
#
# 2014-07-15 11:39:32 +00:00  rootfs <hchen@redhat.com>
#   update hadoop-wordcount test to be able to run on hadoop 2.x.
#   The hadoop and mapreduce libraries are no longer hard-coded, so they can be
#   pointed at the right path. The relative HDFS paths are changed to absolute paths.
#   A sample command to run the test on hadoop 2.x is
#     TESTDIR=/home/test HADOOP_HOME=/usr/lib/hadoop HADOOP_MR_HOME=/usr/lib/hadoop-mapreduce sh workunits/hadoop-wordcount/test.sh
#     starting hadoop-wordcount test
#
# 2014-08-18 15:37:38 +00:00  Sage Weil <sage@redhat.com>
#   qa/workunits/hadoop-wordcount: use -x
#
# 2014-08-29 12:29:22 +00:00  John Spray <john.spray@redhat.com>
#   qa: fix+cleanup hadoop wordcount test
#   The glob for the examples jar was wrong. Fixes: #9260
#
# 2015-02-12 00:02:49 +00:00  Noah Watkins <noahwatkins@gmail.com>
#   qa: hadoop plays nice with new teuthology task
#   This brings the hadoop wordcount up-to-date with the new teuthology hadoop task.
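
The "Run the job" step passes a glob (hadoop-mapreduce-examples-*.jar) straight to `hadoop jar`, and the history notes that this glob was wrong at one point. One possible guard, sketched below, is to resolve the glob to exactly one file before running the job. The function name `resolve_single_jar` is hypothetical and not part of the original script, and the sketch assumes the pattern contains no whitespace.

```shell
#!/bin/bash

# Hypothetical helper (not in the original script): expand a glob pattern
# and succeed only if it matches exactly one existing file.
# Assumption: the pattern contains no whitespace, because it is expanded unquoted.
resolve_single_jar() {
  local pattern=$1
  local matches=()
  local f
  # $pattern is deliberately unquoted so the shell performs glob expansion
  for f in $pattern; do
    # an unmatched glob is left as the literal pattern, so check existence
    if [ -e "$f" ]; then
      matches+=("$f")
    fi
  done
  if [ "${#matches[@]}" -ne 1 ]; then
    echo "expected exactly one match for '$pattern', found ${#matches[@]}" >&2
    return 1
  fi
  printf '%s\n' "${matches[0]}"
}

# Possible use in the "Run the job" step (sketch only):
#   EXAMPLES_JAR=$(resolve_single_jar "$HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-examples-*.jar")
#   "$HADOOP_PREFIX/bin/hadoop" jar "$EXAMPLES_JAR" wordcount "$WC_INPUT" "$WC_OUTPUT"
```

Because the script runs under `set -e`, a failed command substitution in the assignment would stop the test before `hadoop jar` is invoked with a missing or ambiguous jar path.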