Question about the algorithm's principles #668

Open
elesun2018 opened this issue Dec 4, 2024 · 1 comment

@elesun2018

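What is the dimension order of the data shape in the GLM large model?
Is it [s, b, h] or [b, s, h]?
Why was it designed this way?
hidden_states: the hidden-state tensor passed into this layer, with shape [s, b, h], where s is the sequence length, b is the batch size, and h is the hidden dimension.
Thanks.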
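
For readers comparing the two layouts named in the question: they hold the same values and differ only in axis order, which changes where each element sits in a contiguous buffer. The following is a minimal sketch in plain Python, not code from the GLM repository; the helper names `offset_sbh`, `offset_bsh`, and `sbh_to_bsh` are hypothetical and exist only for illustration.

```python
# Illustrative sketch only: these helpers are hypothetical, not GLM code.

def offset_sbh(s_i, b_i, h_i, S, B, H):
    """Flat offset of element (s_i, b_i, h_i) in a contiguous [s, b, h] buffer."""
    return (s_i * B + b_i) * H + h_i


def offset_bsh(b_i, s_i, h_i, S, B, H):
    """Flat offset of element (b_i, s_i, h_i) in a contiguous [b, s, h] buffer."""
    return (b_i * S + s_i) * H + h_i


def sbh_to_bsh(x):
    """Swap the first two axes of a nested list shaped [s, b, h]."""
    S, B = len(x), len(x[0])
    return [[x[s_i][b_i] for s_i in range(S)] for b_i in range(B)]


S, B, H = 3, 2, 4  # sequence length, batch size, hidden size

# Each value encodes its own (s, b, h) indices so the layout is easy to check.
x_sbh = [
    [[100 * s_i + 10 * b_i + h_i for h_i in range(H)] for b_i in range(B)]
    for s_i in range(S)
]
x_bsh = sbh_to_bsh(x_sbh)

# The same logical element is addressed with the first two indices swapped.
print(x_sbh[2][1][3], x_bsh[1][2][3])

# In [s, b, h], all batch entries for one token position are adjacent in memory;
# in [b, s, h], all token positions for one sample are adjacent.
print(offset_sbh(1, 0, 2, S, B, H), offset_bsh(0, 1, 2, S, B, H))
```

Which layout a given layer expects depends on the implementation, so the shape stated in that layer's docstring (here `[s, b, h]`) is the one to follow.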

zhipuch self-assigned this Dec 5, 2024
@elesun2018
Author

Could you answer this when you have a moment? Thanks!
