Ex4.7-A possible bug

In your Python file `Ex4.7-A.py`, I think line 51 should read

`temp[((value_A_Changed, value_B_Changed),reward)] = temp.get( ((value_A_Changed, value_B_Changed),reward), 0 )`

instead of

`temp[((value_A_Changed, value_B_Changed),reward)] = temp.get( (value_A_Changed, value_B_Changed), 0 )`

The `temp.get(...)` call in the second line above will always return 0, because the key `(value_A_Changed, value_B_Changed)` does not exist in `temp`; the keys of `temp` have the form `((value_A_Changed, value_B_Changed), reward)`.
I tried rerunning the script with this change and could not reproduce the book's answer. I am attaching the optimal policy map that I got:

![pi_4](https://user-images.githubusercontent.com/34592480/115499467-7be14e00-a23d-11eb-9ca7-4fb832e5f267.png)
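
To show the key mismatch in isolation, here is a minimal sketch. The numeric values and the pre-stored entry `0.25` are made up for illustration and are not taken from `Ex4.7-A.py`; only the shape of the keys matches the lines quoted above.

```python
# Hypothetical stand-in values; the real ones are computed in Ex4.7-A.py.
value_A_Changed, value_B_Changed, reward = 3, 5, 10

# temp is keyed by ((state_A, state_B), reward), as in the quoted lines.
key = ((value_A_Changed, value_B_Changed), reward)
temp = {key: 0.25}  # pretend an earlier iteration already stored a value

# Lookup as currently written: the bare state tuple is never a key in temp,
# so dict.get falls back to the default.
current_lookup = temp.get((value_A_Changed, value_B_Changed), 0)

# Lookup as proposed: uses the same key that is being written.
proposed_lookup = temp.get(key, 0)

print(current_lookup)   # 0
print(proposed_lookup)  # 0.25
```

With the current lookup, whatever was stored under the full key is ignored every time the line runs.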

