Genetic Algorithm

sharpnova · 2006-09-20T23:32:28

I'm interested in writing a genetic algorithm. I'm sure my idea for it is nothing new or breathtaking, but I'd like to imlement it as follows: There is a world grid of arbitrary size. Food is present distributed more or less randomly. The creatures are placed randomly at the start. They wander around eating food and their fitness will be how many foodstuffs they ate. At the end I'll create a new population of the same size by iterating over the population n times and allowing them to reproduce with a likelihood in proportion to their fitness. I'll have some mutation and crossover. Simple and elementary. Now for my problem.. and I'm sure this is probably the hardest part to figure out for any GA. How will the genetic information work for individuals. More details: The creature can make one of four choices, to move up, down, left, or right. Moving onto a food eats it and causes another food to spawn in a random location (not ontop of a creature or another food) and two creatures can't occupy the same spot, though a creature could elect that his move will be onto a square that another creature occupies. Assuming that the second creature moves away for his move, then the first creature will be able to move there, otherwies he will sit still. The creatures have their genetic code AND are able to see everything in a 5 by 5 grid centered around themself. The world wraps top to bottom and left to right and if they are at an edge, their field of vision is still 5x5 and just wraps around the world to the other side. The problem at last: How do I make some genetic code that can operate on this 5x5 bit of data to yield a result in which direction to choose? I thought at first of having four 5x5 matrices and just multiplying the vision state matrix with each of thoes fuor (one for each direction) and having the creature choose a direction based on which product was largest or smallest or closest to some value. But I realize that with the world being random and there being nothing special about up, down, left, or right from one world to the next, this would not accomplish anything. What I want is for there to be some genetic information that the creature can use to help it decide a direction to take based on it's current 5x5 vision matrix. I'm hoping to see little patterns evolve.. like here are some examples i envision: -creature sees 2 foods to it's lower right, but another creature is in between it and th food. it sees 1 food to it's upper left... and it goes for the food in the upper left.. because evolution wise, those that went for that.. would be more likely to get to get any food at all.. but if there were no creature present.. it would go for the 2 foodstuffs instead of the 1.. or maybe.. if there were 3 or 4 foodstuffs, with another creature in between, and 1 in the upper left, it would go for the 3 or 4, knowing that even though it won't get there first, it would still be more likely to get at least 1 or maybe 2 than if it went for the lone foodstuff well that basically encapsulated the majority of my hopes for the creatures. i realize that this type of behavior is very complex and would probably involve making the genetic code very very long and require hundreds or thousands or millions of generations to evolve.. but i don't care about that. i just want to know how to code some dna that could somehow operate on the 5x5 vision matrix to produce behavior relevant to the current vision state. just this moment i've had a slight idea.. that is probably trash. but maybe.. a vector for each of the 24 non-center squares in the 5x5 vision matrix.. weighted by what it is pointing to.. a foodstuff, an empty square, or an enemy, and by the contents of the few squares near what it's pointing to.. then summing all the vectors up to get a resultant direction.. maybe that's not such a trashy idea after all.. in fact maybe that's a perverse and gross and grossly oversimplified analogue to the way real brains make decisions.. i don't know.. if anyone has a better idea.. or has knowledge of this type of thing and can point me in the right direction, i would appreciate it. thanks. edit: now that i think about it.. my comment about what i figured is probably the hardest part for any GA was probably wrong. i bet it's the fitness function, but since that's so simple in this problem.. i'm just refering to what's the most difficult for this problem.

kirkd

505

September 13, 2006 07:35 PM

Hey! Quit skulking around, fup!! 8^)

BTW, PacMan lives. I have it running currently with the animation working and only need to link up the controls. Once I have that done, I'll start putting NEAT to work. I PROMISE!! My goal is the end of the year (preferably the next month or two - we'll see how much time I have). You still have exclusive license.

Sorry for the aside...

-Kirk

WeirdoFu

205

September 14, 2006 09:16 PM

Actually, it doesn't even have to be a very complex mapping. Here's a simple method.

Instead of thinking binary coded GAs, if you think real coded GA, then it would be more feasible. There are a few ways to approach this once it's real coded. The most straight forward way would be to encode probability values for the 4 directions it can move in. So, you might have N:35%, S:15%, E:45%, and W:5%. That's the simplistic way. Then you expand on that. Since you have a 5x5 grid of visibility, you can then create 4 slightly overlapping sectors that represent each direction. Based on the number of food and other animals in the sector, you multiple those two values by weights. To determine the probability of going in a direction, you simply add the weights and divide the weight of a sector by the total. So, for example, if there's nothing in any given sector, you start out with a base weight of 100. Now if we see that there's 1 food in the N, and the weight for food in the north is 2, then the weight for going north become 102. If there's a food in the south and another animal, and the weight for food and animal are 3 and -5, then the weight for the south becomes 98. So, there will be a total of 8 weights to evolve for the GA, a set of 2 for each direction. Then its just how you want to cap these values to create interesting behavior.

Just a quick idea.

haemonculus

126

September 16, 2006 09:57 AM

I would do a NN with the input neurons as the visibility grid (5x5 or even more). but you should have 1 input for 1 characteristic meaning 1 input for (food there/not there) and 1 for (other creature there/not there). perhaps adding predaters is simply adding another input neuron (per grid field). you could add some static obstacles too...
so you'll have 5x5x2 input neurons (in case of predators added 5x5x3). then you can choose a number of hidden layer neurons which are connected to the every input neuron. and then you have the 4 output neurons which are connected to every hidden layer neuron.
that's your basic NN setup. with the GA you would use the weights on every neuron2neuron connection as the genetic information. you could encode to binary or even let the GA operate on floats (yeah thats possible ;) ).

when you're calculating which instances of creatures(/genetic information) you should use for the next generations, you let the creatures handle with their individual genetic information filled in the NN that's driving their behaviour. simply counting how much food was eaten by each creature after N cycles and then sort by that amount and you get the X first creatures (and their information) for mutation etc...

and if you like somewhat more realistic behaviour, I would add a sort of timestamps to your NN. meaning you could add some input neurons that show the last N states, the creature was in or what the creature did the last time (moved in x direction/didn't move)

let's assume you have a visibility field of 5x5, on the grid tiles there can be (nothing, another creature, food, predator, wall) you would need 4 neurons per field (for example 0100 -> creature; 0000 -> nothing; i think you get the idea), making 5x5x4 input neurons. adding 4 neurons for the last action (nothing(0000), moved north(1000), south(0100),..) you'll have 5x5x4+4=104 neurons in the input layer. then choosing a number for your hidden layer neurons, say 10 (I don't have an idea what number is good, you have to try ;) ). making 104*10 connections from input to hidden layer, plus 10*4 connections from hidden layer to output. you have 1080 connections (floats!) to store in your genetic information per each creature. thats an incredible amount but I think these creatures will behave very good after enough epochs of GA.

ok perhaps this is all overkill *g* but you get the idea... simply choose the characteristics you need and play a little bit, till you get "intelligent" results

sharpnova

Author

108

September 17, 2006 12:38 AM

quick question.

why isn't it just 5x5 inputs? if i'm using floats couldn't the input just be 0 for nothing, 1 for creature 2 for food, 3 for wall, etc.?

actually i already realized the answer to that while typing this... i was going ot have one input per square and use 0 for nothing 1 for creature etc. then in the connection, the weight was going to depend on whether it was 0,1,2,3... in other words.. same amount of overhead as just using 4 input neurons..

and yes i was planning on having predators, walls, (i am going to have the predators evolving as well) and the previous states, i think i mentioned alrady in this thread. i'm going to have all the input neurons necessary to input all the information for the past states (but not past actions.. i'm going to disregard those.. as i think forward planning is behavior.. but basing it on past actions is pointless since a new situation may have a new optimal behavioral response)

i'll be working on this for awhile :p

IMPORTANT QUESTION:

i've read evolution is far slower than back propogation.. but since i have no idea what my target output or target behavior is here.. there is no way for error calculations.. or back propogation.. am i correct in this or not understanding back propogation?

Everyone hates #1.That's why a lot of idiots complain about WoW, the current president, and why they all loved google so much when it was new.Forget the fact that WoW is the greatest game ever created, our president rocks and the brainless buffons of America care more about how articulate you are than your decision making skills, and that google supports adware, spyware, and communism.

kirkd

505

September 17, 2006 08:41 AM

The main reason not to have a single input neuron with 0,1,2,3 for empty, food, wall, creature is that in this situation the various states of a cell have a relative weight with respect to each other. I originally suggested -1,0,1 for creature, empty, and food based on the idea that a creature being present is a bad thing, empty is arbitrary, and food is a good thing. This relative weighting makes sense in this context, but when you add in walls, it doesn't work very well. The neural net will have an extremely hard time trying to decide how to assign weights and may never converge.

The idea propsed by haemonculus sounds like a good one, however, you will end up with a huge number of weights that need to be adjusted and this will take a very long time. The best idea is to simplify the inputs as much as possible without losing information.

You're right about backpropagation. You have to know what your target output is to be able to use it. In this case, all you know is good or bad results. This is an example of reinforcement learning.

-Kirk

Asbestos

169

September 18, 2006 08:20 AM

Quote: Original post by sharpnova
i've read evolution is far slower than back propogation.. but since i have no idea what my target output or target behavior is here.. there is no way for error calculations.. or back propogation.. am i correct in this or not understanding back propogation?

That's right -- this is the exact situation where you should be evolving. If there's no way of creating stimulus-response pairs, you can't really use backprop.

Of course, that's not quite true, as people have tried to find some ways of doing this. You can take a look at a thread I started, http://www.gamedev.net/community/forums/topic.asp?topic_id=411372 which, while it doesn't contain any concrete answers, at least has some good ideas for search terms to do some research on this.

However, that's for a slightly different situation. This is classic A-Life, and thus using a GA is apropriate.

sharpnova

Author

108

September 18, 2006 09:07 AM

Quote: Original post by kirkd
The main reason not to have a single input neuron with 0,1,2,3 for empty, food, wall, creature is that in this situation the various states of a cell have a relative weight with respect to each other.

which is solved by having a different weight for each possible state.. or 4 neurons.

main reason i like 4 neurons is so i can stick to bit operations.. which are probably way faster than floats.

Everyone hates #1.That's why a lot of idiots complain about WoW, the current president, and why they all loved google so much when it was new.Forget the fact that WoW is the greatest game ever created, our president rocks and the brainless buffons of America care more about how articulate you are than your decision making skills, and that google supports adware, spyware, and communism.

kirkd

505

September 18, 2006 10:28 AM

Quote: Original post by sharpnova
Quote: Original post by kirkd
The main reason not to have a single input neuron with 0,1,2,3 for empty, food, wall, creature is that in this situation the various states of a cell have a relative weight with respect to each other.

which is solved by having a different weight for each possible state.. or 4 neurons.

Yes, that was my point. The only down side is the very large number of weights that this method leads to.

Quote:
main reason i like 4 neurons is so i can stick to bit operations.. which are probably way faster than floats.

I'm not sure I'm clear on this one. Even if you have bits as your inputs, you're weights are FP adjustable parameters.

-kirk

sharpnova

Author

108

September 18, 2006 10:57 AM

well i just meant for the very bottom level, the inputs. or should i just use floats for everything?

Everyone hates #1.That's why a lot of idiots complain about WoW, the current president, and why they all loved google so much when it was new.Forget the fact that WoW is the greatest game ever created, our president rocks and the brainless buffons of America care more about how articulate you are than your decision making skills, and that google supports adware, spyware, and communism.

Alrecenk

400

September 18, 2006 11:12 AM

Bits will save memory if that's an issue, but if they are being multiplied by floats than they will probably be changed to floats and then multiplied and that isn't any faster. If you were to do something like: if(input[a].out())node+=weight[a] ;
it might be a little faster than:
node+=input[a].out()*wieght[a] ;
However, I'd still suggest using all floats so the code is more portable. I have a set of neural net and other AI classes I use for lots of things. So if I were doing this I would use all floats just because I already have the code written. My point is: if you plan on doing much with neural nets it's a good idea to make code that is general purpose.

Genetic Algorithm

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Genetic Algorithm

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Reticulating splines