Learning Vector Quantization (LVQ)
The Learning Vector Quantization algorithm belongs to the field of Artificial Neural Networks and Neural Computation, and more broadly to the field of Computational Intelligence. The Learning Vector Quantization algorithm is a supervised neural network that uses a competitive (winner-take-all) learning strategy. It is related to other supervised neural networks such as the Perceptron and the Back-propagation algorithm. It is also related to other competitive learning neural networks such as the Self-Organizing Map algorithm, a similar method for unsupervised learning with the addition of connections between the neurons. Additionally, LVQ is a baseline technique that was defined with a few variants (LVQ1, LVQ2, LVQ2.1, LVQ3, OLVQ1, and OLVQ3) as well as many third-party extensions and refinements too numerous to list.
The Learning Vector Quantization algorithm is related to the Self-Organizing Map which is in turn inspired by the self-organizing capabilities of neurons in the visual cortex.
The information processing objective of the algorithm is to prepare a set of codebook (or prototype) vectors in the domain of the observed input data samples and to use these vectors to classify unseen examples. An initially random pool of vectors is prepared and then exposed to training samples. A winner-take-all strategy is employed in which one or more of the vectors most similar to a given input pattern are selected and adjusted to be closer to the input vector, and in some variants the runners-up are adjusted to be further away from it. The repetition of this process results in a distribution of codebook vectors in the input space that approximates the underlying distribution of samples from the training dataset.
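As a rough illustration of this adjustment step, the following sketch (the function name, fixed learning rate, and example values are illustrative only, not part of the original listing) moves a winning prototype toward an input vector when their classes match and away from it when they differ.

# Illustrative sketch: adjust a winning codebook vector relative to an input
# pattern. Vectors are plain arrays; lrate is a fixed example learning rate.
def adjust_winner(winner, pattern, lrate, same_class)
  winner.each_index do |i|
    error = pattern[i] - winner[i]
    # move toward the input if the classes match, away from it otherwise
    winner[i] += same_class ? (lrate * error) : (-lrate * error)
  end
  return winner
end

winner  = [0.2, 0.8]
pattern = [0.4, 0.6]
adjust_winner(winner, pattern, 0.3, true) # => approximately [0.26, 0.74], closer to the pattern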
Vector Quantization is a technique from signal processing where density functions are approximated with prototype vectors for applications such as compression. Learning Vector Quantization is similar in principle, although the prototype vectors are learned through a supervised winner-take-all method.
Algorithm (below) provides high-level pseudocode for preparing codebook vectors using the Learning Vector Quantization method. Codebook vectors are initialized to small floating point values, or sampled from an available dataset. The Best Matching Unit (BMU) is the codebook vector from the pool that has the minimum distance to an input vector. A distance measure between input patterns must be defined. For real-valued vectors, this is commonly the Euclidean distance:
$dist(x,c) = \sqrt{\sum_{i=1}^{n} (x_i - c_i)^2}$

where $n$ is the number of attributes, $x$ is the input vector and $c$ is a given codebook vector.
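For example, with an input vector $x=(0.2, 0.3)$ and a codebook vector $c=(0.5, 0.7)$ (values chosen purely for illustration), the distance is:

$dist(x,c) = \sqrt{(0.2-0.5)^2 + (0.3-0.7)^2} = \sqrt{0.09 + 0.16} = 0.5$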
Input: ProblemSize, InputPatterns, $iterations_{max}$, $CodebookVectors_{num}$, $learn_{rate}$
Output: CodebookVectors

CodebookVectors $\leftarrow$ InitializeCodebookVectors($CodebookVectors_{num}$, ProblemSize)
For ($i=1$ To $iterations_{max}$)
    $Pattern_i$ $\leftarrow$ SelectInputPattern(InputPatterns)
    $Bmu_i$ $\leftarrow$ SelectBestMatchingUnit($Pattern_i$, CodebookVectors)
    For ($Bmu_{i}^{attribute}$ $\in$ $Bmu_i$)
        If ($Bmu_{i}^{class}$ $\equiv$ $Pattern_{i}^{class}$)
            $Bmu_{i}^{attribute}$ $\leftarrow$ $Bmu_{i}^{attribute}$ + $learn_{rate}$ $\times$ ($Pattern_{i}^{attribute}$ - $Bmu_{i}^{attribute}$)
        Else
            $Bmu_{i}^{attribute}$ $\leftarrow$ $Bmu_{i}^{attribute}$ - $learn_{rate}$ $\times$ ($Pattern_{i}^{attribute}$ - $Bmu_{i}^{attribute}$)
        End
    End
End
Return (CodebookVectors)
Listing (below) provides an example of the Learning Vector Quantization algorithm implemented in the Ruby Programming Language. The problem is a contrived classification problem in a 2-dimensional domain $x\in[0,1], y\in[0,1]$ with two classes: 'A' ($x\in[0,0.4999999], y\in[0,0.4999999]$) and 'B' ($x\in[0.5,1], y\in[0.5,1]$).
The algorithm was implemented using the LVQ1 variant, in which the best matching codebook vector is located and moved toward the input vector if it has the same class, or away from it if the classes differ. A linear decay was used for the learning rate, $lrate_i = learn_{rate} \times (1 - \frac{i}{iterations_{max}})$, updated after each pattern was exposed to the model. The implementation can easily be extended to the other variants of the method.
def random_vector(minmax)
  return Array.new(minmax.size) do |i|
    minmax[i][0] + ((minmax[i][1] - minmax[i][0]) * rand())
  end
end

def generate_random_pattern(domain)
  classes = domain.keys
  selected_class = rand(classes.size)
  pattern = {:label=>classes[selected_class]}
  pattern[:vector] = random_vector(domain[classes[selected_class]])
  return pattern
end

def initialize_vectors(domain, num_vectors)
  classes = domain.keys
  codebook_vectors = []
  num_vectors.times do
    selected_class = rand(classes.size)
    codebook = {}
    codebook[:label] = classes[selected_class]
    codebook[:vector] = random_vector([[0,1],[0,1]])
    codebook_vectors << codebook
  end
  return codebook_vectors
end

def euclidean_distance(c1, c2)
  sum = 0.0
  c1.each_index {|i| sum += (c1[i]-c2[i])**2.0}
  return Math.sqrt(sum)
end

def get_best_matching_unit(codebook_vectors, pattern)
  best, b_dist = nil, nil
  codebook_vectors.each do |codebook|
    dist = euclidean_distance(codebook[:vector], pattern[:vector])
    best, b_dist = codebook, dist if b_dist.nil? or dist < b_dist
  end
  return best
end

def update_codebook_vector(bmu, pattern, lrate)
  bmu[:vector].each_with_index do |v,i|
    error = pattern[:vector][i] - bmu[:vector][i]
    if bmu[:label] == pattern[:label]
      bmu[:vector][i] += lrate * error
    else
      bmu[:vector][i] -= lrate * error
    end
  end
end

def train_network(codebook_vectors, domain, iterations, learning_rate)
  iterations.times do |iter|
    pat = generate_random_pattern(domain)
    bmu = get_best_matching_unit(codebook_vectors, pat)
    lrate = learning_rate * (1.0-(iter.to_f/iterations.to_f))
    if iter.modulo(10)==0
      puts "> iter=#{iter}, got=#{bmu[:label]}, exp=#{pat[:label]}"
    end
    update_codebook_vector(bmu, pat, lrate)
  end
end

def test_network(codebook_vectors, domain, num_trials=100)
  correct = 0
  num_trials.times do
    pattern = generate_random_pattern(domain)
    bmu = get_best_matching_unit(codebook_vectors, pattern)
    correct += 1 if bmu[:label] == pattern[:label]
  end
  puts "Done. Score: #{correct}/#{num_trials}"
  return correct
end

def execute(domain, iterations, num_vectors, learning_rate)
  codebook_vectors = initialize_vectors(domain, num_vectors)
  train_network(codebook_vectors, domain, iterations, learning_rate)
  test_network(codebook_vectors, domain)
  return codebook_vectors
end

if __FILE__ == $0
  # problem configuration
  domain = {"A"=>[[0,0.4999999],[0,0.4999999]],"B"=>[[0.5,1],[0.5,1]]}
  # algorithm configuration
  learning_rate = 0.3
  iterations = 1000
  num_vectors = 20
  # execute the algorithm
  execute(domain, iterations, num_vectors, learning_rate)
end
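As a hint of how the listing above might be extended to another variant, the sketch below outlines an LVQ2.1-style update that adjusts the two nearest codebook vectors when exactly one of them has the correct class and the input falls within a window around the decision boundary. The helper get_two_best_matching_units and the window parameter are illustrative additions, not part of the original listing; the sketch reuses euclidean_distance and update_codebook_vector defined above.

# Illustrative LVQ2.1-style update (a sketch, not part of the original listing).
def get_two_best_matching_units(codebook_vectors, pattern)
  # sort codebook vectors by distance to the pattern and take the two closest
  sorted = codebook_vectors.sort_by do |codebook|
    euclidean_distance(codebook[:vector], pattern[:vector])
  end
  return sorted[0], sorted[1]
end

def update_lvq21(codebook_vectors, pattern, lrate, window=0.3)
  bmu1, bmu2 = get_two_best_matching_units(codebook_vectors, pattern)
  d1 = euclidean_distance(bmu1[:vector], pattern[:vector])
  d2 = euclidean_distance(bmu2[:vector], pattern[:vector])
  # only update when exactly one of the two units has the correct class
  return if (bmu1[:label]==pattern[:label]) == (bmu2[:label]==pattern[:label])
  # window rule: the pattern must fall near the midplane between the two units
  return if d1 == 0.0 or d2 == 0.0
  s = (1.0-window) / (1.0+window)
  return unless [d1/d2, d2/d1].min > s
  # the correct-class unit is moved toward the pattern, the other away from it
  update_codebook_vector(bmu1, pattern, lrate)
  update_codebook_vector(bmu2, pattern, lrate)
end

In a training loop this update would take the place of the call to update_codebook_vector in train_network; a window of roughly 0.2 to 0.3 is commonly suggested in the LVQ literature.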
The Learning Vector Quantization algorithm was described by Kohonen in 1988 [Kohonen1988], and was further described in the same year by Kohonen [Kohonen1988a] and benchmarked by Kohonen, Barna, and Chrisley [Kohonen1988b].
Kohonen provides a detailed overview of the state of LVQ algorithms and variants (LVQ1, LVQ2, and LVQ2.1) [Kohonen1990]. The technical report that comes with the LVQ_PAK software (written by Kohonen and his students) provides both an excellent summary of the technique and its main variants, as well as summarizing the important considerations when applying the approach [Kohonen1996]. The seminal book on Learning Vector Quantization and the Self-Organizing Map is "Self-Organizing Maps" by Kohonen, which includes a chapter (Chapter 6) dedicated to LVQ and its variants [Kohonen1995].
[Kohonen1988] T. Kohonen, "Learning Vector Quantization", Neural Networks, 1988.
[Kohonen1988a] T. Kohonen, "An introduction to neural computing", Neural Networks, 1988.
[Kohonen1988b] T. Kohonen, G. Barna, and R. Chrisley, "Statistical pattern recognition with neural networks: benchmarking studies", in IEEE International Conference on Neural Networks, 1988.
[Kohonen1990] T. Kohonen, "Improved versions of learning vector quantization", in IJCNN International Joint Conference on Neural Networks, 1990.
[Kohonen1995] T. Kohonen, "Self-Organizing Maps", Springer, 1995.
[Kohonen1996] T. Kohonen, J. Hynninen, J. Kangas, J. Laaksonen, and K. Torkkola, "LVQ_PAK: The Learning Vector Quantization Program Package", Technical Report A30, Helsinki University of Technology, Laboratory of Computer and Information Science, Rakentajanaukio, 1996.