Sunday, 2 November 2014

How To Install Apache Mahout on Ubuntu


  1.  Hadoop Cluster
  2.  Maven

STEP 1: Download mahout latest source code from

Make sure you download .src zipped file.

STEP 2: Unzip the file to a named folder “mahout”

unzip -a

STEP 3: Move mahout to /usr/local

mv mahout /usr/local

STEP 4: Build Mahout

unmesha@client:~$ cd /usr/local/mahout/mahout-distribution-0.9
unmesha@client:/usr/local/mahout/mahout-distribution-0.9$ ls
bin         core          examples     LICENSE.txt  math-scala  pom.xml     src buildtools  distribution  integration  math         NOTICE.txt  README.txt  target
unmesha@client:/usr/local/mahout/mahout-distribution-0.9$mvn install

Wait untill mahout is build. It would perform some tests also.It is recommended to complete the test for the first time.Later you can skip the test using

mvn install -Dmaven.test.skip=true

Once the tests are done and the mahout is built , we get a success message.

Congratz Apache Mahout is installed...

If you are using Cloudera(CDH) package , you can install Mahout in just 1 step.
apt-get install mahout

You can use mahout commands in /usr/bin and if you want to run mahout in hadoop cluster go to /usr/lib and reference mahout-cdhx-core-job.jar and full class path.


  1. Can we implement using this on OSGI framework..?

    1. I think they dont use OSGI at this point. They are primarily producing a library rather than standalone programs.Can you say something about how using OSGi would help ?

  2. Hi,
    Thanks For Sharing Info. If someone want to learn Online (Virtual) instructor lead live training in Apache Mahout

  3. We at COEPD provides finest Data Science and R-Language courses in Hyderabad. Your search to learn Data Science ends here at COEPD. Here, we are an established training institute who have trained more than 10,000 participants in all streams. We will help you to convert your passion to learn into an enriched learning process. We will accelerate your career in data science by mastering concepts of Data Management, Statistics, Machine Learning and Big Data.