The random ferns classifier is a variation of the random forest classifier, first introduced in [24] and developed further in [25, 26]. It has been applied to object recognition in [27], image classification in [28], and fast keypoint recognition in [16]. An advantage of the random ferns classifier is that it allows different cues, such as appearance and shape, to be combined effortlessly, as introduced above.
A common method for human detection is to use a sliding-window approach that searches for humans at all possible positions and scales. In this way, the detection problem is transformed into a classification problem: given a candidate patch in an image, the task is to assign it to the most likely class. Let c_i, i = 0, 1, be the set of classes (human or non-human), and let f_j, j = 1, 2, ⋯, N, be the set of binary features computed over the patch to be classified. Formally, we are looking for

ĉ = argmax_{c_i} P(C = c_i | f_1, f_2, ⋯, f_N),
where C refers to the class variable. Our goal is thus to model the posterior probability of the human class given the set of N features, which by Bayes' rule can be expressed as

P(C = c_i | f_1, f_2, ⋯, f_N) = P(f_1, f_2, ⋯, f_N | C = c_i) P(C = c_i) / P(f_1, f_2, ⋯, f_N).
Similarly, an equivalent expression may be written for the non-human class. Removing the term P(f_1, f_2, ⋯, f_N), which is common to all classes, and assuming uniform prior probabilities P(C), the problem reduces to finding

ĉ = argmax_{c_i} P(f_1, f_2, ⋯, f_N | C = c_i).
But learning the joint likelihood distribution over all features is intractable, since for N binary features it would require estimating on the order of 2^N parameters per class. Naive Bayes makes the simplifying assumption that the features are conditionally independent given the class label,

P(f_1, f_2, ⋯, f_N | C = c_i) = ∏_{j=1}^{N} P(f_j | C = c_i).
However, this independence assumption is usually false and tends to grossly underestimate the true posterior probabilities. To make the problem tractable while still accounting for some of these dependencies, a good compromise is to partition the features into M groups of size S = N/M. These groups are what we define as ferns, and we compute the joint probability of the features within each fern. The class-conditional probability then becomes

P(f_1, f_2, ⋯, f_N | C = c_i) ≈ ∏_{k=1}^{M} P(F_k | C = c_i),
where F_k = {f_σ(k,1), f_σ(k,2), ⋯, f_σ(k,S)}, k = 1, 2, ⋯, M, represents the k-th fern and σ(k, j) is a random permutation function. Hence, we follow a semi-naive Bayesian approach by modeling only some of the dependencies between features.
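To make the fern grouping concrete, the sketch below shows one common way to assign features to ferns via a random permutation and to encode each fern's S binary features as a single integer value in [0, 2^S). It is a minimal illustration rather than code from the original work; the NumPy-based helpers make_ferns and fern_values are our own names.

```python
import numpy as np

def make_ferns(num_features, fern_size, seed=0):
    """Randomly permute the feature indices and split them into ferns.

    Returns an (M, S) array whose k-th row holds the indices
    sigma(k, 1), ..., sigma(k, S) of the features assigned to fern k.
    """
    rng = np.random.default_rng(seed)
    perm = rng.permutation(num_features)        # random permutation sigma
    num_ferns = num_features // fern_size       # M = N / S
    return perm[:num_ferns * fern_size].reshape(num_ferns, fern_size)

def fern_values(binary_features, fern_indices):
    """Encode each fern's S binary features as one integer in [0, 2^S).

    binary_features : (N,) array of 0/1 responses for a single patch.
    fern_indices    : (M, S) index array produced by make_ferns().
    """
    bits = binary_features[fern_indices]        # (M, S) array of 0/1 values
    weights = 1 << np.arange(bits.shape[1])     # 1, 2, 4, ..., 2^(S-1)
    return bits @ weights                       # (M,) integer fern values
```

For example, with N = 300 binary features and S = 10, this would yield M = 30 ferns, each taking one of 2^10 = 1024 possible values.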
Furthermore, the class-conditional probabilities P(F_m | C = c_i) are estimated for each fern F_m and each class c_i during the training phase. For each fern F_m, these terms are written as

P(F_m = k | C = c_i) = N_{k, c_i} / N_{c_i},
where N_{k, c_i} is the number of training samples of class c_i whose fern F_m evaluates to value k, k = 1, 2, ⋯, 2^S, and N_{c_i} is the total number of samples of class c_i. However, when the number of samples is not infinitely large, the count N_{k, c_i}, and hence the estimated probability P(F_m = k | C = c_i), will be zero for some fern values k. To overcome this problem, the estimate is instead taken as

P(F_m = k | C = c_i) = (N_{k, c_i} + N_r) / (N_{c_i} + 2^S · N_r),
where N_r represents a regularization term, which behaves as a uniform Dirichlet prior over the fern values. In the following experiment, the parameter N_r is set to 1.
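As a minimal end-to-end sketch of the training and classification steps described above (again illustrative rather than the authors' implementation, and reusing the hypothetical fern_values() and make_ferns() helpers from the previous sketch), the per-class fern histograms can be accumulated with the Dirichlet regularizer N_r and a patch classified by summing log-probabilities over ferns:

```python
import numpy as np

class RandomFernsClassifier:
    """Semi-naive Bayes classifier over M ferns of S binary features each."""

    def __init__(self, fern_indices, num_classes=2, n_r=1.0):
        self.fern_indices = fern_indices          # (M, S) array from make_ferns()
        self.num_classes = num_classes
        self.n_r = n_r                            # Dirichlet regularizer N_r
        self.num_ferns, fern_size = fern_indices.shape
        self.num_values = 2 ** fern_size          # 2^S possible values per fern
        self.log_prob = None                      # log P(F_m = k | C = c_i)

    def fit(self, features, labels):
        """features: (num_samples, N) 0/1 array, labels: (num_samples,) class ids."""
        # Start every count at N_r, i.e. a uniform Dirichlet prior over fern values.
        counts = np.full((self.num_classes, self.num_ferns, self.num_values), self.n_r)
        for x, c in zip(features, labels):
            values = fern_values(x, self.fern_indices)            # (M,) fern values
            counts[c, np.arange(self.num_ferns), values] += 1.0   # N_{k, c_i} + N_r
        # Normalizing each fern's histogram gives (N_{k,c} + N_r) / (N_c + 2^S N_r).
        self.log_prob = np.log(counts / counts.sum(axis=2, keepdims=True))

    def predict(self, binary_features):
        """Return argmax_{c_i} of sum_m log P(F_m | C = c_i) for one patch."""
        values = fern_values(binary_features, self.fern_indices)
        scores = self.log_prob[:, np.arange(self.num_ferns), values].sum(axis=1)
        return int(np.argmax(scores))
```

In a sliding-window detector, predict() would be evaluated on the binary feature vector extracted from every candidate window position and scale, and windows assigned to the human class would be kept as detections.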