I have a test ID that using "policy evaluation" algo, gives me some decision A (correct one) as the decision with the highest expected utility, but using "find best policy", gives me decision B (last decision always) as the optimal policy.
Not sure why this would be the case.
Is there any condition such that this is correct behavior?
The test ID has 2 decision nodes.
My best guess of what happened is the 'find best policy' algo failed and gives the last decision as the optimal one by default.
Find best policy algo vs policy evaluation algo
-
- Site Admin
- Posts: 1457
- Joined: Mon Nov 26, 2007 5:51 pm
Re: Find best policy algo vs policy evaluation algo
Can you post your influence diagram here?
Here it is.
Sorry it's a little convoluted hehe. Messed around with it. I was trying to see if I could answer the question of VOI by phrasing it as another decision node.
Have another question.
How is value of information calculated? Does it use the policy evaluation algo or the find best policy algo. Because I think it could use either and I suppose the find best policy algo is faster?
Sorry it's a little convoluted hehe. Messed around with it. I was trying to see if I could answer the question of VOI by phrasing it as another decision node.
Have another question.
How is value of information calculated? Does it use the policy evaluation algo or the find best policy algo. Because I think it could use either and I suppose the find best policy algo is faster?
- Attachments
-
- ExampleNetwork1.xdsl
- (6.56 KiB) Downloaded 953 times
-
- Site Admin
- Posts: 1457
- Joined: Mon Nov 26, 2007 5:51 pm
See this thread for more information on VOI:gund wrote:How is value of information calculated? Does it use the policy evaluation algo or the find best policy algo. Because I think it could use either and I suppose the find best policy algo is faster?
http://genie.sis.pitt.edu/forum/viewtop ... hlight=voi
-
- Site Admin
- Posts: 1457
- Joined: Mon Nov 26, 2007 5:51 pm
Re: Find best policy algo vs policy evaluation algo
That's a bug in SMILE; it's likely that it's triggered by unusual structure of your influence diagramgund wrote:I have a test ID that using "policy evaluation" algo, gives me some decision A (correct one) as the decision with the highest expected utility, but using "find best policy", gives me decision B (last decision always) as the optimal policy.
Not sure why this would be the case.

We're investigating the issue.