An online course that blends Machine Learning and Big Data and shows where Mahout fits in the Hadoop ecosystem.


Learning Objectives: This module gives you an insight into what 'Machine Learning' is and how Apache Mahout algorithms are used in building intelligent applications.

Topics: Machine Learning Fundamentals, Apache Mahout Basics, History of Mahout, Supervised and Unsupervised Learning techniques, Mahout and Hadoop, Introduction to Clustering, Classification.

Learning Objectives: In this module you will learn how to set up Mahout on Apache Hadoop. You will also get an understanding of the Myrrix Machine Learning Platform.

Topics: Mahout on Apache Hadoop setup, Mahout and Myrrix.

Learning Objectives: In this module you will get an understanding of recommendation systems in Mahout and the different filtering methods.

Topics: Recommendations using Mahout, Introduction to Recommendation systems, Content-based filtering, Collaborative filtering (User based, Nearest N Users, Threshold, Item based), Mahout Optimizations.

Learning Objectives: In this module you will learn about recommendation platforms and implement a recommender using MapReduce.

Topics: User based recommendation, User Neighbourhood, Item based Recommendation, Implementing a Recommender using MapReduce, Platforms: Similarity Measures, Manhattan Distance, Euclidean Distance, Cosine Similarity, Pearson's Correlation Similarity, Log-likelihood Similarity, Tanimoto, Evaluating Recommendation Engines (Online and Offline), Recommenders in Production.
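
To give a feel for the similarity measures listed in this module, here is a minimal Python sketch of a few of them. It is illustrative only and is not Mahout's implementation: the function names are hypothetical, the inputs are plain lists of ratings (or sets of item IDs for Tanimoto), and zero-length or constant vectors are assumed not to occur.

```python
import math


def manhattan_distance(a, b):
    """Sum of absolute differences between two rating vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean_distance(a, b):
    """Straight-line distance between two rating vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (assumes neither is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity of the mean-centred vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centred_a = [x - mean_a for x in a]
    centred_b = [y - mean_b for y in b]
    return cosine_similarity(centred_a, centred_b)


def tanimoto_coefficient(items_a, items_b):
    """Size of the intersection divided by size of the union of two item sets."""
    set_a, set_b = set(items_a), set(items_b)
    return len(set_a & set_b) / len(set_a | set_b)
```

Distances grow as users or items become less alike, while the similarity scores grow as they become more alike, so a recommender typically converts one into the other before ranking neighbours.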

Learning Objectives: This module helps you understand 'Clustering' in Mahout and gives an overview of common Clustering Algorithms.

Topics: Clustering, Common Clustering Algorithms, K-means, Canopy Clustering, Fuzzy K-means and Mean Shift etc., Representing Data, Feature Selection, Vectorization, Representing Vectors, Clustering documents through example, TF-IDF, Implementing clustering in Hadoop, Classification.
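
As a small illustration of the vectorization and TF-IDF topics above, the following Python sketch weights the terms of a few short documents. It uses a common textbook form (term frequency times the natural log of inverse document frequency) with whitespace tokenization; the function name is hypothetical and the exact weighting used by Mahout may differ.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per document.

    Assumed weighting: tf = count / document length,
    idf = ln(number of documents / documents containing the term).
    """
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Count how many documents each term appears in.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights
```

A term that appears in every document gets a weight of zero, which is why TF-IDF vectors tend to emphasise the words that distinguish one document from another before clustering.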

Learning Objectives: In this module you will get a clear understanding of classifiers and the common classification algorithms.

Topics: Examples, Basics, Predictor variables and Target variables, Common Algorithms, SGD, SVM, Naive Bayes, Random Forests, Training and evaluating a Classifier, Developing a Classifier.

Learning Objectives: At the end of this module, you will understand how Mahout can be used on the Amazon EMR Hadoop distribution.

Topics: Mahout on Amazon EMR, Mahout vs R, Introduction to tools like Weka, Octave, Matlab, SAS.

Learning Objectives: In this module you will develop an intelligent application using Mahout on Hadoop.

Topics: A complete recommendation engine built on application logs and transactions.

Course Description

This course covers the fundamentals of machine learning, from algorithms such as Support Vector Machines, k-means clustering, Random Forests and collaborative filtering to recommendation systems and running Mahout on Hadoop and Amazon EMR.

After completing the Apache Mahout course at Brighter Connect, you should be able to:

1. Gain an insight into machine learning techniques
2. Understand algorithms such as SVM, Naive Bayes and Random Forests
3. Implement these using 'Apache Mahout'
4. Understand recommendation systems
5. Learn Collaborative filtering, Clustering and Categorization
6. Analyse Big Data using Hadoop and Mahout
7. Implement a recommender using MapReduce
8. Get an introduction to tools like Weka, Octave, Matlab, SAS

This course is designed for anyone interested in learning machine learning techniques in the big data domain and writing intelligent applications using Apache Mahout. The following professionals may benefit from this course:

1. Analytics Professionals
2. Data Scientists looking to hone their machine learning skills
3. Software Developers and Architects
4. Business Analysts wanting to learn Mahout for ML implementation
5. Professionals working with R, Matlab, Python, etc.
6. Statisticians looking to learn machine learning techniques
7. Graduates aspiring to take a leap in the analytics domain

Basic Java and Hadoop knowledge is recommended but not mandatory, as these concepts will also be covered during the course.

In the modern information age of exponential data growth, the success of companies and enterprises depends on how quickly and efficiently they turn vast amounts of data into actionable information. Whether it's for processing hundreds or thousands of personal e-mail messages a day or divining user intent from petabytes of weblogs, the need for tools that can organise and enhance data has never been greater. Therein lies the premise and the promise of the field of machine learning and Apache Mahout.


For your practical work, we will help you set up Brighter Connect's Virtual Machine on your system, giving you local access. The installation guide is available in the LMS.

Your Online (Machine Learning with Mahout Certification Training) Package
Upon purchase, you will receive a password at the email address you used to purchase the course.

You will then be able to log in to our online learning portal with your email and password.

You will have access to the portal for 12 months to complete your course.

£450 £250 + VAT