Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support shared session running mode in Spark Kyuubi &Spark JobServer #2327

Open
crabo opened this issue May 10, 2022 · 1 comment
Open

Support shared session running mode in Spark Kyuubi &Spark JobServer #2327

crabo opened this issue May 10, 2022 · 1 comment

Comments

@crabo
Copy link

crabo commented May 10, 2022

Long Running Spark as a SparkSQL query Service or ETL ad-hoc Service is very common nowadays, the key-point is "Shared Session accross jobs" as Spark JobServer or Kyuubi introduced, NO spark sumit any more.

And we can tell too much conf or cache settings are bound to one-time job only, eg:
spark.conf.set("spark.tispark.stale_read", 1651766410000L)
spark.sql("select * from test.t")

lots of "Region missing" and "EpochNotMatch" errors among jobs:
RegionManager.getRegionByKey()
22/05/09 WARN RegionStoreClient: Re-splitting region task due to region error:EpochNotMatch current epoch of region 3291273 is conf_ver: 3659 version: 50821, but you sent conf_ver: 3659 version: 50819

Please take "Shared Session" in account. and Thanks for your great works!

@shiyuhang0
Copy link
Member

Thank you for your suggestion, we will investigate the "Shared Session".

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants