Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

shared-state-async not syncing bat-hosts #1135

Open
pony1k opened this issue Oct 25, 2024 · 9 comments
Open

shared-state-async not syncing bat-hosts #1135

pony1k opened this issue Oct 25, 2024 · 9 comments
Assignees
Milestone

Comments

@pony1k
Copy link
Contributor

pony1k commented Oct 25, 2024

I noticed that the file /var/bat-hosts does not exist, even though shared-state shared-state-bat_hosts is installed. Upon investigation I found that when I run

shared-state-async get bat-hosts

I get an error saying

Error relocating /usr/bin/shared-state-async: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEE15_M_replace_coldEPcjPKcjj: symbol not found

Maybe this has something todo with it. I get the same error with get net-stats.
OpenWrt version: 23.05.5
LibreMesh version: master rev. 08a3948 20240924_1212
Some installed packages:

root@kh-bad:~# opkg list-installed | grep -F -e shared -e lime
lime-app - v0.2.26-r2
lime-proto-anygw - 2024-09-24-1727177548
lime-proto-batadv - git-24.268.41548-08a3948
lime-proto-bmx7 - git-24.268.41548-08a3948
lime-system - 2024-09-24-1727177548
shared-state - 2024-09-24-1727177548
shared-state-async - 2024-08-28-1724853226
shared-state-bat_hosts - 2024-09-24-1727177548
shared-state-dnsmasq_hosts - 2024-09-24-1727177548
shared-state-dnsmasq_leases - 2024-09-24-1727177548
shared-state-network_nodes - git-24.268.41548-08a3948-r1
shared-state-nodes_and_links - 2024-09-24-1727177548
ubus-lime-grondrouting - 2024-09-24-1727177548
ubus-lime-location - 2024-09-24-1727177548
ubus-lime-metrics - 2024-09-24-1727177548
ubus-lime-utils - 2024-09-24-1727177548
@javierbrk
Copy link
Collaborator

javierbrk commented Oct 25, 2024 via email

@pony1k
Copy link
Contributor Author

pony1k commented Oct 25, 2024

The images were built using imagebuilder with the https://feed.libremesh.org/arch_packages/master/ repos. Devices are

  • mips_24kc: TP-Link TL-WDR4300 v1 and TP-Link TL-WDR3600 v1.
  • mipsel_24kc: D-Link DAP-X1860 A1, TP-Link RE650 v1, TP-Link Archer C6 v3
  • aarch64_cortex-a53: Linksys E8450 (UBI)

On E8450, the error looks slightly different: ...replace_coldEPcmPKcmm: symbol not found

I can try to reproduce the problem with a self compiled package, if that helps?

@G10h4ck
Copy link
Member

G10h4ck commented Oct 26, 2024

I use the openwrt buildroot and haven't encountered this problem, so I guess it may be happening due to Image builder usage.

I usually use this script https://gitlab.com/librerouter/librerouteros/-/blob/main/librerouteros_build.sh?ref_type=heads to avoid jumping around make menuconfig but should be the same if you select the packages manually on openwrt buildroot

@a-gave
Copy link
Contributor

a-gave commented Oct 26, 2024

Hi, I think I've reproduced it on a litebeam-m5-xw

It doesn't work with openwrt-23.05.5, libremesh feeds master, using imagebuilder

root@LiMe-07d1b5:~# shared-state-async get bat-hosts
Error relocating /usr/bin/shared-state-async: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEE15_M_replace_coldEPcjPKcjj: symbol not found

It works with openwrt snapshots, libremesh feeds master, using imagebuilder

root@LiMe-07d1b5:~# shared-state-async get bat-hosts
D 1729878728.079 std::task<int> SharedState::merge(const std::string&, const std::map<std::__cxx11::basic_string<char>, StateEntry>&, const sockaddr_storage&, std::error_condition*) bat-hosts got 7 significative changes out of 7 input slice size: 7 state size: 7
{
    "02:58:47:07:d1:b5": "LiMe_07d1b5_wlan0_mesh_29",
    "02:95:39:07:d1:b5": "LiMe_07d1b5_eth0_29",
    "0e:1e:c4:cf:27:27": "LiMe_07d1b5_bat0",
    "fa:ec:da:06:d1:b5": "LiMe_07d1b5_wlan0_mesh",
    "fc:ec:da:06:d1:b5": "LiMe_07d1b5_wlan0_ap",
    "fc:ec:da:07:d1:b5": "LiMe_07d1b5_br_lan",
    "fe:ec:da:06:d1:b5": "LiMe_07d1b5_wlan0_apname"
}

@pony1k
Copy link
Contributor Author

pony1k commented Oct 26, 2024

For mt7621 the OpenWrt 23.05.5 repo contains

  • libstdcpp6 - 12.3.0-4
  • libgcc1 - 12.3.0-4
  • musl libc - 1.2.4-4

Whilst snapshot has

  • libstdcpp6 - 13.3.0-r4
  • libgcc1 - 13.3.0
  • musl libc - 1.2.5-r4

shared-state-async uses those libraries and I guess the LibreMesh builtbot links against the snapshot versions and there is some compatibility issue with one or more of these libraries.

@a-gave
Copy link
Contributor

a-gave commented Oct 26, 2024

Just one more note, I'm thinking it is an error with the ci
master feeds are compiled with openwrt-sdk master ARCH: "x86_64" so they are only compatible with openwrt snapshot

https://github.com/libremesh/lime-packages/blob/08a3948a5a80f6e26318c96227e02466bd09345e/.github/workflows/build.yml#L27C11-L27C25

we should probably create a feeds repo with binaries built from openwrt sdk v23.05.5 or branch-23.05 to be compatible with openwrt-23.05.5 and current builds of libremesh-2024.1-rc1
as had been done for openwrt v19 ARCH: "x86_64-19.07.10"

https://github.com/libremesh/lime-packages/blob/08a3948a5a80f6e26318c96227e02466bd09345e/.github/workflows/build_2020.yml#L19C11-L19C34

@G10h4ck
Copy link
Member

G10h4ck commented Oct 26, 2024

Hi, I think I've reproduced it on a litebeam-m5-xw

It doesn't work with openwrt-23.05.5, libremesh feeds master, using imagebuilder

root@LiMe-07d1b5:~# shared-state-async get bat-hosts
Error relocating /usr/bin/shared-state-async: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEE15_M_replace_coldEPcjPKcjj: symbol not found

It works with openwrt snapshots, libremesh feeds master, using imagebuilder

root@LiMe-07d1b5:~# shared-state-async get bat-hosts
D 1729878728.079 std::task<int> SharedState::merge(const std::string&, const std::map<std::__cxx11::basic_string<char>, StateEntry>&, const sockaddr_storage&, std::error_condition*) bat-hosts got 7 significative changes out of 7 input slice size: 7 state size: 7
{
    "02:58:47:07:d1:b5": "LiMe_07d1b5_wlan0_mesh_29",
    "02:95:39:07:d1:b5": "LiMe_07d1b5_eth0_29",
    "0e:1e:c4:cf:27:27": "LiMe_07d1b5_bat0",
    "fa:ec:da:06:d1:b5": "LiMe_07d1b5_wlan0_mesh",
    "fc:ec:da:06:d1:b5": "LiMe_07d1b5_wlan0_ap",
    "fc:ec:da:07:d1:b5": "LiMe_07d1b5_br_lan",
    "fe:ec:da:06:d1:b5": "LiMe_07d1b5_wlan0_apname"
}

I think this is the issue, shared-state-async needs OpenWrt development branch due to a bug present in gcc and libstdcpp in current releases. AFAIR also @javierbrk encountered this problem some months ago and the fix was moving to Openwrt development branch. Hopefully news OpenWrt release will include those fixes soon.

New stuff that I develop like shared-state-async or APuP often depends on cutting edge, so I work almost always on top of OpenWrt development branch.

@javierbrk
Copy link
Collaborator

Just one more note, I'm thinking it is an error with the ci master feeds are compiled with openwrt-sdk master ARCH: "x86_64" so they are only compatible with openwrt snapshot

https://github.com/libremesh/lime-packages/blob/08a3948a5a80f6e26318c96227e02466bd09345e/.github/workflows/build.yml#L27C11-L27C25

we should probably create a feeds repo with binaries built from openwrt sdk v23.05.5 or branch-23.05 to be compatible with openwrt-23.05.5 and current builds of libremesh-2024.1-rc1 as had been done for openwrt v19 ARCH: "x86_64-19.07.10"

https://github.com/libremesh/lime-packages/blob/08a3948a5a80f6e26318c96227e02466bd09345e/.github/workflows/build_2020.yml#L19C11-L19C34

I agree, please let me know if i can help with something !

@a-gave
Copy link
Contributor

a-gave commented Nov 3, 2024

I agree, please let me know if i can help with something !

I did a couple of tests building shared-state-async with openwrt-sdk at branch openwrt-23.05 an installing it:

  • for ath79/generic previously mentioned ubnt_litebeam-m5-xw with libremesh based on openwrt-23.05.5 compiled with imagebuilder ecc.
  • for x86/64 testing with the scripts for qemu_dev_start in tools
    using libremesh-master-ow23.05.0 libremesh-master-ow23.05.2, taking firmwares from repo, that didn't include by default shared-state-async
    and it works as expected installing before the shared-state-async compiled for openwrt-23.05
	root@LiMe-acb001:/# opkg update
	root@LiMe-abc001:/# opkg install /tmp/shared-state-async_2024-09-24-1727177548_x
86_64.ipk
	root@LiMe-abc001:/# opkg install shared-state-bat_hosts
	root@LiMe-abc001:/# /etc/init.d/shared-state-async stop
	root@LiMe-abc001:/# /etc/init.d/shared-state-async start
	root@LiMe-abc001:/# shared-state-async get bat-hosts

Since it seems enough true (checked changelogs on openwrt.org for openwrt-23.05.x releases)
that openwrt doesn't not update these packages (libc++, musl libc, libstdcpp)
within service releases (minors es. 23.05.4 -> 23.05.5)

It should be doable to use the openwrt-sdk based on the branch openwrt-23.05 (and now also openwrt-24.10) -
instead of having one for each release (v23.05.5 ) - to recompile the packages in c++ (shared-state-async only for now) for each architecture

I would run the job multi-arch-build.yml only if packages in c++ (share-state-async) changes
for all openwrt branchs that we would like to keep (eg. main, openwrt-23.05, openwrt-24.10)

To have two new feeds, for example for x86_64:
https://github.com/libremesh/lime-feed/tree/gh-pages/arch_packages/openwrt-23.05/x86_64
https://github.com/libremesh/lime-feed/tree/gh-pages/arch_packages/openwrt-24.10/x86_64

Available then for imagebuilder usage at:
https://feed.libremesh.org/arch_packages/openwrt-23.05/x86_64/
https://feed.libremesh.org/arch_packages/openwrt-24.10/x86_64/

And, for releases or release candidates,
I would keep the same new 3 builds mechanism to support imagebuilder users
and to do so changing the path of feed from
https://github.com/libremesh/lime-feed/tree/gh-pages/arch_packages/2024.1-rc1/x86_64
to
https://github.com/libremesh/lime-feed/tree/gh-pages/arch_packages/2024.1-rc1/<[main|openwrt-23.05|openwrt-24.10]>/x86_64

And adjusting accordingly /packages/lime-system/files/etc/uci-defaults/92_add-lime-repos

I could try in case to take a look at this, probably starting from the next week 10/11

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: No status
Development

No branches or pull requests

5 participants