Kohonen Neural Network Problem, ideas please

Lothia · 2009-04-06T07:31:27

Hello, I am currently using the Kohonen Neural Network for my Character Recognition problem. I have a degrading Neighbor function and learn rate function. I use Euclidean for my distance function. Both of these functions are linear. Now the problem comes when I try and recognize 26 different patterns (7 x 5 matrix), I will get a result such as A B C D E F G H I K L M N P Q R S T T U V V W X Y Z I am missing J and O but have 2 V's and 2 T's, yet neither J or O look anything like V or T. (Edit: The J and T look very similar) So my question is if anyone might have any idea where the problem lies. I know that is a vague question but this might be a common occurrence in KNN's. Also my neighbor function for 1 neighbor just looks at the matrix to the left and right of the winner. My one solution or change might be to find the 2 closest matrix's (based on Euclidean distance) and have those be the neighbors instead of the actual matrix's right by. As well both my neighbor and learn rate functions are linear, does anyone have any suggestions on any functions to use that are not, that may help? Any help, suggestions or anything would be greatly appreciated.

Artificial Intelligence Programming

Started by Lothia March 10, 2009 02:39 AM

9 comments, last by Predictor 15 years, 7 months ago

Predictor

198

April 06, 2009 07:31 AM

Quote: Right now, you have 35 input variables (the 5x7 raster array). Using the horizontal and vertical projection profiles (row and column averages), though, would get you down to 12 input variables (5 vertical averages + 7 horizontal ones). Even throwing in a few more features, you could probably cut the number of input variables in half and still retain the most important information in the images.

Quote: I had not thought of doing something like this. I am wondering how I could define each section differently. How would I add up the values so that if we are looking at a horizontal line it is 01001 it would look different from 10010. As well my actual goal was to make it have 112 or 138 inputs with 28 outputs, would what you are saying still work?

The horizontal sum of 01001 and 10010 would be identical: summaries always discard information. The idea is that, given enough features, you can still tell classes apart. Consider the following letter 'F':

11111
10000
10000
11100
10000
10000
10000

Row sums would not be sufficient to distinguish this from a 'reverse F':

11111
00001
00001
00111
00001
00001
00001

Column sums, however, would be very different (especially those on the extreme left and right).

Naturally, other summaries could be used, such as the center of gravity of the '1' pixels, sums of other pixel regions, counts of 0-1 transitions, etc.

Another route to consider is simple pixel selection. Likely, some pixels provide more information about class than others. In my experience with character/shape recognition in raster images, it is often the case that some pixel locations provide no information at all!

-Will Dwinnell
Data Mining in MATLAB

Kohonen Neural Network Problem, ideas please

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Kohonen Neural Network Problem, ideas please

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Reticulating splines