[WIP] feature/table-from-postgresl-sqlite #1453

Open
wants to merge 18 commits into
base: branch-21.06
bddd445
[feature/table-from-postgresl-sqlite] update parser type names
gcca Apr 16, 2021
d9a3d9c
[feature/table-from-postgresl-sqlite] update provider construction
gcca Apr 16, 2021
5270ab2
[feature/table-from-postgresl-sqlite] update sqlite provider to check…
gcca Apr 16, 2021
243a586
[feature/table-from-postgresl-sqlite] update sqlite provider for dist…
gcca Apr 16, 2021
da42dce
[feature/table-from-postgresl-sqlite] update sqlite provider to load …
gcca Apr 17, 2021
8239b91
[feature/table-from-postgresl-sqlite] update postgresql provider for …
gcca Apr 19, 2021
7a9d717
Merge branch 'feature/predicate-pushdown-providers' of github.com:auc…
gcca Apr 19, 2021
928bf53
[feature/table-from-postgresl-sqlite] update postgresql parser for nu…
gcca Apr 19, 2021
c51b670
[feature/table-from-postgresl-sqlite] update postgresql to manage cha…
gcca Apr 20, 2021
1a0ae28
[feature/table-from-postgresl-sqlite] add order by argumento to selec…
gcca Apr 21, 2021
2956ea2
[feature/table-from-postgresl-sqlite] check has_next using count(*) t…
gcca Apr 21, 2021
7267c9c
[feature/table-from-postgresl-sqlite] update sqlite provider to consi…
gcca Apr 21, 2021
6f1c5fc
[feature/table-from-postgresl-sqlite] remove some tries to fix postgr…
gcca Apr 21, 2021
86a009b
Merge remote-tracking branch 'blazingdb/branch-0.20' into feature/tab…
gcca Apr 21, 2021
ec74ff3
Merge branch 'feature/table-from-postgresl-sqlite' of github.com:gcca…
gcca Apr 26, 2021
62d0cfb
[feature/table-from-postgresl-sqlite] update changelog
gcca May 4, 2021
4099410
Merge remote-tracking branch 'upstream/branch-0.20' into feature/tabl…
gcca May 4, 2021
1ab0d2a
[feature/table-from-postgresl-sqlite] update changelog
gcca May 4, 2021
30 changes: 15 additions & 15 deletions CHANGELOG.md
@@ -2,10 +2,10 @@


## New Features
- #1471 Unbounded partitioned windows
- #1471 Unbounded partitioned windows
- #1445 Support for CURRENT_DATE, CURRENT_TIME and CURRENT_TIMESTAMP
- #1505 Support for right outer join

- #1453 Create tables from SQLite data provider

## Improvements
- #1464 Better Support for unsigned types in C++ side
@@ -24,15 +24,15 @@
## New Features
- #1367 OverlapAccumulator Kernel
- #1364 Implement the concurrent API (bc.sql with token, bc.status, bc.fetch)
- #1426 Window Functions without partitioning
- #1426 Window Functions without partitioning
- #1349 Add e2e test for Hive Partitioned Data
- #1396 Create tables from other RDBMS
- #1427 Support for CONCAT alias operator
- #1424 Add get physical plan with explain
- #1472 Implement predicate pushdown for data providers

## Improvements
- #1325 Refactored CacheMachine.h and CacheMachine.cpp
- #1325 Refactored CacheMachine.h and CacheMachine.cpp
- #1322 Updated and enabled several E2E tests
- #1333 Fixing build due to cudf update
- #1344 Removed GPUCacheDataMetadata class
@@ -41,7 +41,7 @@
- #1331 Added flag to enable null e2e testing
- #1418 Adding support for docker image
- #1434 Added documentation for C++ and Python in Sphinx
- #1419 Added concat cache machine timeout
- #1419 Added concat cache machine timeout
- #1444 Updating GCP to >= version
- #1349 Add e2e test for Hive Partitioned Data
- #1447 Improve getting estimated output num rows
@@ -58,18 +58,18 @@
- #1350 Fixed bug where there are no projects in a bindable table scan
- #1359 Avoid cuda issues when free pinned memory
- #1365 Fixed build after sublibs changes on cudf
- #1369 Updated java path for powerpc build
- #1369 Updated java path for powerpc build
- #1371 Fixed e2e settings
- #1372 Recompute `columns_to_hash` in DistributeAggregationKernel
- #1375 Fix empty row_group_ids for parquet
- #1380 Fixed issue with int64 literal values
- #1380 Fixed issue with int64 literal values
- #1379 Remove ProjectRemoveRule
- #1389 Fix issue when CAST a literal
- #1387 Skip getting orc metadata for decimal type
- #1392 Fix substrings with nulls
- #1398 Fix performance regression
- #1401 Fix support for minus unary operation
- #1415 Fixed bug where num_batches was not getting set in BindableTableScan
- #1415 Fixed bug where num_batches was not getting set in BindableTableScan
- #1413 Fix for null tests 13 and 23 of windowFunctionTest
- #1416 Fix full join when both tables contains nulls
- #1423 Fix temporary directory for hive partition test
@@ -82,7 +82,7 @@
- #1504 Fixing some conflicts in Dockerfile

## Deprecated Features
- #1394 Disabled support for outer joins with inequalities
- #1394 Disabled support for outer joins with inequalities

# BlazingSQL 0.18.0 (February 24, 2021)

@@ -95,7 +95,7 @@
- #1238 Implements MergeStramKernel executor model
- #1259 Implements SortAndSamplernel executor model, also avoid setting up num of samples
- #1271 Added Hive utility for partitioned data
- #1289 Multiple concurrent query support
- #1289 Multiple concurrent query support
- #1285 Infer PROTOCOL when Dask client is passed
- #1294 Add config options for logger
- #1301 Added usage of pinned buffers for communication and fixes various UCX related bugs
@@ -104,7 +104,7 @@
- #1303 Add support for INITCAP
- #1313 getting and using ORC metadata
- #1347 Fixing issue when reading orc metadata from DATE dtype
- #1338 Window Function support for LEAD and LAG statements
- #1338 Window Function support for LEAD and LAG statements
- #1362 give useful message when file extension is not recognized
- #1361 Supporting first_value and last_value for Window Function

@@ -127,7 +127,7 @@
- #1308 Improve the engine loggers
- #1314 Added unit tests to verify that OOM error handling works well
- #1320 Revamping cache logger
- #1323 Made progress bar update continuously and stay after query is done
- #1323 Made progress bar update continuously and stay after query is done
- #1336 Improvements for the cache API
- #1483 Improve dependencies script

@@ -141,7 +141,7 @@
- #1277 Support FileSystems (GS, S3) when extension of the files are not provided
- #1300 Fixed issue when creating tables from a local dir relative path
- #1312 Fix progress bar for jupyterlab
- #1318 Disabled require acknowledge
- #1318 Disabled require acknowledge

# BlazingSQL 0.17.0 (December 10, 2020)

@@ -162,7 +162,7 @@
- #1201 Implement string TRIM
- #1216 Add unit test for DAYOFWEEK
- #1205 Implement string REVERSE
- #1220 Implement string LEFT and RIGHT
- #1220 Implement string LEFT and RIGHT
- #1223 Add support for UNION statement
- #1250 updated README.md and CHANGELOG and others preparing for 0.17 release

@@ -221,7 +221,7 @@
- #1203 Changed code back so that parquet is not read a single rowgroup at a time
- #1207 Calcite uses literal as int32 if not explicit CAST was provided
- #1212 Fixed issue when building the thirdpart, cmake version set to 3.18.4
- #1225 Fixed issue due to change in gather API
- #1225 Fixed issue due to change in gather API
- #1254 Fixing support of nightly and stable on localhost
- #1258 Fixing gtest version issue

6 changes: 3 additions & 3 deletions engine/src/cython/engine.cpp
@@ -98,11 +98,11 @@ std::pair<std::vector<ral::io::data_loader>, std::vector<ral::io::Schema>> get_l
#ifdef POSTGRESQL_SUPPORT
parser = std::make_shared<ral::io::postgresql_parser>();
auto sql = ral::io::getSqlInfo(args_map);
provider = std::make_shared<ral::io::postgresql_data_provider>(sql, 0, 0);
provider = std::make_shared<ral::io::postgresql_data_provider>(sql, total_number_of_nodes, self_node_idx);
isSqlProvider = true;
#else
throw std::runtime_error("ERROR: This BlazingSQL version doesn't support PostgreSQL integration");
#endif
isSqlProvider = true;
} else if(fileType == ral::io::DataType::SQLITE) {
#ifdef SQLITE_SUPPORT
parser = std::make_shared<ral::io::sqlite_parser>();
@@ -205,7 +205,7 @@ std::shared_ptr<ral::cache::graph> runGenerateGraph(uint32_t masterIndex,
{
using blazingdb::manager::Context;
using blazingdb::transport::Node;

auto& communicationData = ral::communication::CommunicationData::getInstance();

std::vector<Node> contextNodes;
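
The engine.cpp change above passes `total_number_of_nodes` and `self_node_idx` to the PostgreSQL provider instead of `0, 0`. The provider source is not part of this view, so the following is only a minimal sketch of one way a node could use those two values to read a disjoint slice of a table. `RowSlice`, `slice_for_node` and `build_slice_query` are hypothetical names for illustration, not BlazingSQL APIs, and the `ORDER BY ... LIMIT ... OFFSET` scheme is an assumption suggested by the commit titles ("add order by argumento to selec…", "check has_next using count(*) t…"), not a description of the actual implementation.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>

// Hypothetical helper type (not part of BlazingSQL): the contiguous
// range of rows that a single node is responsible for reading.
struct RowSlice {
    std::size_t offset;  // index of the first row this node reads
    std::size_t limit;   // number of rows this node reads
};

// Split total_rows into one contiguous slice per node. The first
// (total_rows % total_number_of_nodes) nodes each take one extra row,
// so the slices cover every row exactly once.
inline RowSlice slice_for_node(std::size_t total_rows,
                               std::size_t total_number_of_nodes,
                               std::size_t self_node_idx) {
    // Assumption: invalid arguments yield an empty slice.
    if (total_number_of_nodes == 0 || self_node_idx >= total_number_of_nodes) {
        return {0, 0};
    }
    const std::size_t base = total_rows / total_number_of_nodes;
    const std::size_t extra = total_rows % total_number_of_nodes;
    const std::size_t limit = base + (self_node_idx < extra ? 1 : 0);
    const std::size_t offset =
        self_node_idx * base + std::min(self_node_idx, extra);
    return {offset, limit};
}

// Build the per-node query. A stable ORDER BY is needed so that every
// node sees the rows in the same order and the slices do not overlap.
inline std::string build_slice_query(const std::string& table,
                                     const std::string& order_by,
                                     const RowSlice& slice) {
    return "SELECT * FROM " + table + " ORDER BY " + order_by +
           " LIMIT " + std::to_string(slice.limit) +
           " OFFSET " + std::to_string(slice.offset);
}
```

With 10 rows and 3 nodes, this sketch assigns rows 0-3 to node 0, rows 4-6 to node 1 and rows 7-9 to node 2. The total row count would have to come from a separate `count(*)` query, which is again an assumption here.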