Querying the expert policy, HW1, dagger #2

Open
ludeksvoboda opened this issue Nov 22, 2023 · 0 comments

ludeksvoboda commented Nov 22, 2023

Hi,

This question is about the DAgger part of HW1.

I am curious how querying the expert policy with loaded_gaussian_policy works. Could someone point me to some resources? As far as I can tell, the expert actions are not read directly from a set of expert-labeled actions but are produced by this small net, right? How was it trained? Is the net just a way to store the expert actions, or is the net itself the "expert"?
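
To make the question concrete, here is roughly how I picture the relabeling step. This is only a minimal sketch of my understanding, not the homework's code: `make_expert`, `expert_action`, and `relabel` are hypothetical names, the weights are random placeholders rather than anything loaded from a file, and it assumes the net itself acts as the expert.

```python
import math
import random

def make_expert(obs_dim, hidden_dim, act_dim, seed=0):
    """Hypothetical stand-in for a pretrained expert: a small MLP with fixed weights.
    Here the weights are random placeholders; a real expert would load trained ones."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-1, 1) for _ in range(obs_dim)] for _ in range(hidden_dim)]
    w2 = [[rng.uniform(-1, 1) for _ in range(hidden_dim)] for _ in range(act_dim)]
    return w1, w2

def expert_action(expert, obs):
    """Forward pass: observation -> tanh hidden layer -> action vector."""
    w1, w2 = expert
    hidden = [math.tanh(sum(w * o for w, o in zip(row, obs))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]

def relabel(expert, observations):
    """DAgger relabeling: keep the observations the learner visited,
    but pair each one with the action the expert would have taken there."""
    return [(obs, expert_action(expert, obs)) for obs in observations]

expert = make_expert(obs_dim=3, hidden_dim=4, act_dim=2)
visited = [[0.1, -0.2, 0.3], [0.0, 0.5, -0.5]]  # states reached by the learner's own policy
labeled = relabel(expert, visited)
```

If that picture is right, the expert can label any state the learner reaches, which a fixed table of stored actions could not do.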

Thank you!

@ludeksvoboda ludeksvoboda changed the title Querying the expert policy Querying the expert policy, HW1, dagger Nov 22, 2023