Overview

Extreme Learning Machines (ELM) - Learning without iterative tuning

Neural networks (NN) and support vector machines (SVM) play key roles in machine learning and data analysis. Feedforward neural networks and support vector machines are usually considered different learning techniques in computational intelligence community. Both popular learning techniques face some challenging issues such as: intensive human intervene, slow learning speed, poor learning scalability.

It is clear that the learning speed of feedforward neural networks is in general far slower than required and it has been a major bottleneck in their applications for past decades. Two key reasons behind may be: 1) the slow gradient-based learning algorithms are extensively used to train neural networks, and 2) all the parameters of the networks are tuned iteratively by using such learning algorithms. On the other hand, due to their outstanding classification capability, support vector machine and its variants such as least square support vector machine (LS-SVM) have been widely used in binary classification applications. The conventional SVM and LS-SVM cannot be used in regression and multi-class classification applications directly although different SVM/LS-SVM variants have been proposed to handle such cases.

ELM Theories

ELM works for the “generalized” single-hidden layer feedforward networks (SLFNs) but the hidden layer (or called feature mapping) in ELM need not be tuned. Such SLFNs include but are not limited to support vector machine, polynomial network, RBF networks, and the conventional (both single-hidden-layer and multi-hidden-layer) feedforward neural networks. Different from the tenet in neural networks that all the hidden nodes in SLFNs need to be tuned, ELM learning theory shows that the hidden nodes of generalized feedforward networks needn’t be tuned and these hidden nodes can be randomly generated. All the hidden node parameters are independent from the target functions or the training datasets. All the parameters of ELMs can be analytically determined instead of being tuned.

According to ELM theory:

The hidden node parameters are not only independent of the training data but also of each other.

Unlike conventional learning methods which MUST see the training data before generating the hidden node parameters, ELM could generate the hidden node parameters before seeing the training data.

ELM Unified Platform

ELM was originally proposed for standard single hidden layer feedforward neural networks (with random hidden nodes (random features)), and has recently been extended to kernel learning as well:

  1. ELM provides a unified learning platform with widespread type of feature mappings and can be applied in regression and multi-class classification applications directly;
  2. From the optimization method point of view ELM has milder optimization constraints compared to SVM, LS-SVM and PSVM;
  3. In theory ELM can approximate any target continuous function and classify any disjoint regions;
  4. In theory compared to ELM, SVM, LS-SVM and PSVM achieve suboptimal solutions and require higher computational complexity.

ELM Variants

ELM is efficient in:

  1. Batch learning
  2. Sequential learning
  3. Incremental learning

ELM Applications

ELM has been successfully used in the following applications:

  1. Biometrics
  2. Bioinformatics
  3. Image processing (image segmentation, image quality assessment, image super-resolution)
  4. Signal processing
  5. Human action recognition
  6. Disease prediction and eHealthCare
  7. Location positioning system
  8. Brain computer interface
  9. Human computer interface
  10. Feature selection
  11. Time-series
  12. Real-time learning and prediction
  13. Security and data privacy

ELM Impact

Due to the demand on ELM solutions, ELM may help drive R&D in the following areas and make some applications which seem impossible in the past become true in the future:

  1. Machine learning and artificial intelligence
  2. Matrix theory and optimization theory
  3. Functioning artificial “brain”
  4. Robot and automation
  5. Data and knowledge discovery
  6. Cognitive and reasoning system
  7. Big data analytics
  8. Internet of Things (IoT)

Job Openings

The BMW-NTU Joint Future Mobility Research Lab has been recently setup. Several PhD scholarships and Project leaders positions are avaiable. Applicants can submit their application to D-ERIAN@ntu.edu.sg

Postdoctoral research fellow position is available at Agent-based Modeling and Simulation. Applicants can submit their applications to Professor Zhang Jie, SCE/NTU, Singapore.

Two research positions on ELM research (one for realtime clustering and one for Internet data analytics) are available. Students who wish to pursue PhD degree or researchers who are interested in the two positions are welcome to write to elm201x@gmail.com with detail CV. Only shortlisted will be informed for further discussion.

 

Coming Events

International Conference on Extreme Learning Machines (ELM2013)

Beijing, October 15 - 17 2013

Organized by:

Tsinghua University, China
Nanyang Technological University, Singapore
Northeastern University, China

 

Useful links:

The 14th International Conference on Intelligent Data Engineering and Automated Learning (IDEAL'2013)

Hefei, China 20-23 October 2013

Organzied by The USTC-Birmingham Joint Research Institute in Intelligent Computation and Its Applications

Nedstat Basic - Free web site statistics
Personal homepage website counter
Free counter Last update :