1 Known Issues, Limitations, Behavior Changes
- 1.1 Disable pushdown optimization
- 1.2 Avoid curl 8.6 in HTTP API clients
2 Patch History
3 Release Notes for SciDB 23.10.0
4 Supported Operating Systems
5 SciDB Features and Changes
6 Performance enhancements and bug fixes

Known Issues, Limitations, Behavior Changes

Disable pushdown optimization

Versions affected: SciDB 23.10.0–4

Pushdown optimization should be manually disabled in versions 23.10.0 through 23.10.4. If left on, it can lead to various issues including server crashes and high memory use.

To disable it manually, edit the SciDB config file to include the line:

enable-optimize-pushdown=0

It is disabled by default in SciDB 23.10.5.
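
As a rough illustration of the edit described above, the following Python sketch rewrites config text so that the option is set to 0. The function name `disable_pushdown` and the sample section header `[mydb]` are assumptions made for this example; only the option line itself comes from these notes. The sketch appends the line at the end of the file when it is missing, which assumes a single-section config file.

```python
KEY = "enable-optimize-pushdown"  # option name taken from these notes


def disable_pushdown(config_text: str) -> str:
    """Return config_text with enable-optimize-pushdown forced to 0.

    Hypothetical helper: it replaces an existing setting, or appends the
    line at the end when the option is absent.
    """
    lines = config_text.splitlines()
    found = False
    for i, line in enumerate(lines):
        # Compare the text before "=" so commented-out lines are left alone.
        name = line.split("=", 1)[0].strip()
        if name == KEY:
            lines[i] = f"{KEY}=0"
            found = True
    if not found:
        lines.append(f"{KEY}=0")
    return "\n".join(lines) + "\n"


# Example with made-up config content.
print(disable_pushdown("[mydb]\nenable-optimize-pushdown=1\n"))
```

Back up the config file before applying a change like this to a real installation.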

Avoid curl 8.6 in HTTP API clients

Versions affected: SciDB 21.x–

There is a bug in version 8.6 of the “curl” library that causes HTTP clients to fail with this error when connecting to SciDB’s Client API over HTTPS:

curl: (56) OpenSSL SSL_read: SSL_ERROR_SYSCALL, errno 0

(See SDB-8459 for details.)

A fix for this bug was merged into the curl source code on GitHub within days of the ship date of curl 8.6; however, it hasn’t been included in a curl release so far (as of 2024-03-21). The fix should be included in curl 8.7 when that is released.

If you have a client using curl 8.6, you should upgrade to curl 8.7 if available; if not, downgrade to curl 8.5.

You should instruct package managers and dependency frameworks (aptitude, conda, dnf, yum, etc.) to exclude version 8.6 when installing curl/libcurl.
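
A client deployment could check for the affected version before connecting. The sketch below parses the first line of `curl --version` output; the function names are hypothetical, and it assumes the usual `curl X.Y.Z (...)` output format. Note that a program linked against libcurl may use a different version from the `curl` command-line tool.

```python
import re


def curl_version(version_output: str):
    """Extract (major, minor, patch) from `curl --version` output, or None."""
    match = re.search(r"curl (\d+)\.(\d+)\.(\d+)", version_output)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def is_affected(version_output: str) -> bool:
    """True when the reported curl version is in the 8.6 series."""
    version = curl_version(version_output)
    return version is not None and version[:2] == (8, 6)


# Example with an illustrative version banner.
print(is_affected("curl 8.6.0 (x86_64-pc-linux-gnu) libcurl/8.6.0"))
```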

Patch History

Patch notes for SciDB 23.10.8

Release date: July 18, 2024

SciDB SHA: 11b236b

Changes include:

Fixed an issue where the SciDB Client API could hang when too many parallel uploads were attempted.
https://paradigm4.atlassian.net/browse/SDB-8488
Please note that there is a companion fix in SciDBR version 4.2.0 which detects the “Too Many Requests” response issued by this fix and retries the upload.
Fixed an issue where the SciDB Client API could sometimes encounter an assertion when canceling an upload query.
https://paradigm4.atlassian.net/browse/SDB-8519
Adjusted the tokens used for the SciDB Client API’s auth cookies so that they now use a standard name for the signing algorithm.
https://paradigm4.atlassian.net/browse/SDB-8520
Starting SciDB as a service will no longer erroneously treat config file backups as usable config files.
https://paradigm4.atlassian.net/browse/SDB-8495
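
Clients other than SciDBR may want similar handling for the “Too Many Requests” response mentioned above. The following is a minimal sketch, not SciDBR's actual logic: it assumes the response arrives as HTTP status 429, and `send` is a hypothetical callable that performs one upload attempt and returns the status code. The attempt count and delays are illustrative defaults.

```python
import time

TOO_MANY_REQUESTS = 429  # standard HTTP status code for "Too Many Requests"


def upload_with_retry(send, max_attempts=5, base_delay=0.5, sleep=time.sleep):
    """Call send() until it returns a status other than 429.

    Waits with exponential backoff between attempts and returns the last
    status code seen.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status = send()
        if status != TOO_MANY_REQUESTS:
            return status
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)  # 0.5 s, 1 s, 2 s, ...
    return status
```

Passing `sleep` as a parameter keeps the function easy to exercise without real delays.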

Patch notes for SciDB 23.10.7

Release date: June 12, 2024

SciDB SHA: 0769f38

Changes include:

In the SciDB Client API, the setting http-threads now defaults to the number of query execution threads.
https://paradigm4.atlassian.net/browse/SDB-8504
Enable SciDB to use more than 1024 file descriptors when permitted by the rlimit max.
https://paradigm4.atlassian.net/browse/SDB-8447
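
To see what the rlimit max permits on a given host, the open-file limits of a process can be inspected with Python's standard `resource` module (Unix only). This sketch reports the limits of the Python process that runs it, not of SciDB itself; the helper names are made up for this example.

```python
import resource


def nofile_limits():
    """Return the (soft, hard) limits on open file descriptors."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def hard_limit_allows(needed: int, hard: int) -> bool:
    """True if the hard limit (the rlimit max) permits `needed` descriptors."""
    return hard == resource.RLIM_INFINITY or hard >= needed


soft, hard = nofile_limits()
print("soft:", soft, "hard:", hard)
print("more than 1024 permitted:", hard_limit_allows(1025, hard))
```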

Patch notes for SciDB 23.10.6

Release date: April 29, 2024

SciDB SHA: cc615d4

Changes include:

Fixed cases in which the op_count() operator could return different results from equivalent calls using summarize() and aggregate().
https://paradigm4.atlassian.net/browse/SDB-8463
Tweaks to subarray(): https://paradigm4.atlassian.net/browse/SDB-8469
- Empty pick arrays when using subarray() will now yield an empty result instead of an error.
- Also fixed a problem with cases in which subarray() is called with an empty pick array while inverse:true is set.
- Please note that subdelete() will still maintain the same behavior as before.
Eliminated an unnecessary use of a file descriptor.
https://paradigm4.atlassian.net/browse/SDB-8447
Fixed an issue when using filter(, is_null(attr)) with empty chunks.
https://paradigm4.atlassian.net/browse/SDB-8474
Fixed a race condition in the SciDB Client API which could lead to a crash when canceling queries under high query load.
https://paradigm4.atlassian.net/browse/SDB-8435
The default log rotation frequency has been changed from hourly to daily. This will only affect new SciDB deployments.
https://paradigm4.atlassian.net/browse/SDB-8416

Patch notes for SciDB 23.10.5

Release date: March 27, 2024

SciDB SHA: 56b2ad5

Changes include:

Mitigation to avoid potential crashes in RocksDB when initially storing data into an array, and the addition of diagnostic tools for RocksDB, specifically tailored to its use in SciDB.
https://paradigm4.atlassian.net/browse/SDB-8451
Addition of a new config option http-max-json-bytes to the Client API for the maximum JSON payload length that the server will accept from a client.
https://paradigm4.atlassian.net/browse/SDB-8457
Pushdown optimization now defaults to OFF in new installations. If the SciDB config on an existing installation has enable-optimize-pushdown=1, this will not change the setting that is currently in use.
https://paradigm4.atlassian.net/browse/SDB-8453
Added new scripts to help diagnose and debug “too many open files” errors.
https://paradigm4.atlassian.net/browse/SDB-8447
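
For the http-max-json-bytes option above, a client can estimate the size of a JSON payload before sending it. This is only a sketch: the helper names are hypothetical, the limit must be supplied by the caller because these notes do not state the default, and whether the server treats the limit as inclusive is an assumption here.

```python
import json


def json_payload_size(payload) -> int:
    """Size in bytes of the payload when serialized as UTF-8 JSON."""
    return len(json.dumps(payload).encode("utf-8"))


def fits_limit(payload, max_json_bytes: int) -> bool:
    """True if the serialized payload is no larger than max_json_bytes."""
    return json_payload_size(payload) <= max_json_bytes


# Example with a tiny, made-up payload: '{"a": 1}' is 8 bytes.
print(json_payload_size({"a": 1}), fits_limit({"a": 1}, 8))
```

The byte count depends on how the client actually serializes the request, so treat the result as an estimate.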

Patch notes for SciDB 23.10.4

Release date: March 6, 2024

SciDB SHA: bf30ffc

Fixes include:

Additional fixes for memory and index management:
Support for excluding arrays matching a given regex when running the scidb_backup.py utility
https://paradigm4.atlassian.net/browse/SDB-8425
Additional logging for the Client API when a client and server get out of sync while resuming an interrupted query
https://paradigm4.atlassian.net/browse/SDB-8436

Patch notes for SciDB 23.10.3

Release date: February 14, 2024

SciDB SHA: 1a41e7e

Major changes:

https://paradigm4.atlassian.net/browse/SDB-8360
- The namespaces library is always loaded by default. load_library('namespaces') still works, but does nothing.
- You can now start up scidb directly in security=password mode — you no longer have to start it in trust mode, load the namespaces library, and restart in password mode.
- The old security=trust mode is deprecated.

Other fixes include:

Client API now has dedicated admin and non-admin thread pools
https://paradigm4.atlassian.net/browse/SDB-8371
Fix for versions() giving an incorrect result if the most recent version had been removed
https://paradigm4.atlassian.net/browse/SDB-8352
Fix for pushdown optimization with builtin_equi_join() when attributes and dimensions have the same name
https://paradigm4.atlassian.net/browse/SDB-8403
Fix for init-syscat using deprecated Postgres calls for setting up triggers
https://paradigm4.atlassian.net/browse/SDB-8396
Fixes for crashes and deadlocks while managing buffers
https://paradigm4.atlassian.net/browse/SDB-8414

Patch notes for SciDB 23.10.2

This release was rejected and should not be used.

Release date: January 29, 2024

SciDB SHA: 2603c1c

Patch notes for SciDB 23.10.1

Release date: November 21, 2023

SciDB SHA: 3fb47aa

Fixes include:

Update Intel MKL installation to only install necessary components. Depends on 2019 version of Intel MKL.
https://paradigm4.atlassian.net/browse/SDB-8311
Add ns and vmax options to versions() operator
https://paradigm4.atlassian.net/browse/SDB-8335
Fixed general protection fault when programs forked by stream() plugin wrote to stderr
https://paradigm4.atlassian.net/browse/SDB-8331

Release Notes for SciDB 23.10.0

Release date: October 31, 2023

SciDB commit SHA: 5d0895e

These release notes apply to version 23.10.0 of SciDB; they cover all features and changes since version 21.8.

Release notes for version 21.8 cover all features and changes since version 20.10 and can be found here.

Supported Operating Systems

SciDB 23.10 supports the following operating systems:

RedHat8
Rocky8

Support for CentOS7 and RedHat7 has been discontinued in this release.

SciDB Features and Changes

HTTP API

SciDB now has an HTTP/HTTPS interface allowing full querying and data transfer support. When SciDB has security mode enabled, only secure HTTPS connections are allowed and you must configure an X.509 certificate to enable the new interface. See https://paradigm4.atlassian.net/wiki/spaces/scidb/pages/3395882601 for instructions.

Log file rotation behavior and new log file location

In previous versions, log files were written to each instance’s data directory. Now, by default, they are in a logs/ subdirectory of the data directory (i.e., ${base_path}/${server_number}/${instance_number}/logs/scidb.log, where ${base_path} is defined in the configuration file).

The log configuration file is now named log4cxx-conf.xml instead of log4cxx.properties; it uses an XML configuration format. See the XML examples on https://logging.apache.org/log4cxx/latest_stable/configuration-samples.html for guidance with using these settings.

By default, log files are rotated hourly and compressed. The current log file has the name scidb.log; rotated logs are in the same directory and are named scidb.log.<timestamp>.gz. To change this behavior, edit the log configuration file. Examples of different settings can be found in /opt/scidb/${SCIDB_VER}/share/scidb/logconf-examples/*.xml.

All timestamps in log files and filenames now use the UTC timezone and are formatted according to ISO-8601, with a Z suffix to indicate UTC (e.g., 2023-05-04T15:40:03Z). This does not affect how dates and times are printed within SciDB.
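
Scripts that collect or parse SciDB logs may need to follow the new location and timestamp format. The sketch below builds the default log path from the pattern given in this section, and formats and parses timestamps in the ISO-8601 form with a Z suffix. The function names and the sample base path are assumptions for this example.

```python
from datetime import datetime, timezone
from pathlib import PurePosixPath

LOG_TIME_FORMAT = "%Y-%m-%dT%H:%M:%SZ"  # e.g. 2023-05-04T15:40:03Z


def instance_log_path(base_path, server_number, instance_number):
    """Default log file location for one instance, per the pattern above."""
    return (PurePosixPath(base_path) / str(server_number)
            / str(instance_number) / "logs" / "scidb.log")


def format_log_timestamp(moment: datetime) -> str:
    """Format a timezone-aware datetime as UTC with a Z suffix."""
    return moment.astimezone(timezone.utc).strftime(LOG_TIME_FORMAT)


def parse_log_timestamp(text: str) -> datetime:
    """Parse a timestamp such as 2023-05-04T15:40:03Z into an aware datetime."""
    return datetime.strptime(text, LOG_TIME_FORMAT).replace(tzinfo=timezone.utc)


print(instance_log_path("/data/scidb", 0, 1))  # sample base path, not a default
print(format_log_timestamp(datetime(2023, 5, 4, 15, 40, 3, tzinfo=timezone.utc)))
```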

The `builtin_equi_join` operator

The CS-developed plugin equi_join has been ported into core SciDB with minor changes:

- support for filter-pushdown and projection-pushdown optimizations.
- the output is a SciDB dataframe.
- keys are specified as “left_keys:” and “right_keys:” using attribute/dimension names interpreted in the context of the left or right input array only, reducing the need for array aliases and casts.
- the “out_names:” keyword is not allowed, as it relies on the positions of attributes and thus interacts confusingly with projection-pushdown, which eliminates unused attributes.
- the implementation uses a different operator name and different symbol names so that it can coexist with the plugin equi_join() without confusion.

The `builtin_grouped_aggregate` operator

The CS-developed plugin grouped_aggregate has been ported into core SciDB with minor changes:

- support for filter-pushdown and projection-pushdown optimizations.
- the output is a SciDB dataframe.
- the implementation uses a different operator name and different symbol names so that it can coexist with the plugin grouped_aggregate() without confusion.

Support for `subarray` operator

The subarray() operator selects elements of an input array according to coordinates specified by the contents of one or more secondary input arrays, called pick arrays.

This operator is intended to add support for these use cases:

- to produce a sparse subset of cells from an input array for feeding to downstream linear algebra operations, and
- to produce a sparse or dense subset of cells, sometimes with attached pick array attributes, for conversion to R or NumPy arrays within REVEAL™ applications.

Please refer to the SciDB Reference Guide for details of how to use subarray(), including syntax and examples, here: https://paradigm4.atlassian.net/wiki/spaces/scidb/pages/3395881545 .

The `subdelete` operator

The subdelete() operator deletes cells from an array using the same cell selection criteria as subarray().

Please refer to the SciDB Reference Guide for details of how to use subdelete(), including syntax and examples, here: https://paradigm4.atlassian.net/wiki/spaces/scidb/pages/3395881600 .

Deprecation of `mquery()` operator

The mquery() operator was deprecated in release 22.5 and has been removed in this release. Instead of mquery(), we recommend using transactions via the begin(), commit(), and rollback() operators. Please refer to the section “Transactions and Transaction Operators” in the SciDB Reference Guide for details of how to use these operators.

Added `dimension` keyword to some operators

Several operators which output an array with a new single dimension now allow the user to override the default dimension name. These operators include:

aggregate() (for grand aggregate only)
help()
list()
show()
uniq()

Example:

AFL% limit(list('operators', dimension: MyDimName), 5);
{MyDimName} name,library
{0} 'add_attributes','scidb'
{1} 'add_instances','system'
{2} 'aggregate','scidb'
{3} 'apply','scidb'
{4} 'attributes','scidb'

Improvement to the `remove_versions()` operator

The remove_versions() operator can now remove an arbitrary half-open interval [first, last) of array versions. Here last can be max_version, allowing the most recent version(s) to be removed. Please refer to the SciDB Reference Guide for details of how to use remove_versions(), including syntax and examples.
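
The half-open interval convention can be illustrated with a small Python model: `first` is included in the removal and `last` is excluded. This is a sketch of the interval semantics only, using plain integers for version numbers; it does not call SciDB and does not model the max_version keyword.

```python
def versions_after_removal(versions, first, last):
    """Return the versions that remain after removing [first, last)."""
    return [v for v in versions if not (first <= v < last)]


# Removing [2, 5) from versions 1..6 drops 2, 3 and 4 but keeps 5.
print(versions_after_removal([1, 2, 3, 4, 5, 6], 2, 5))
```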

Low disk space warnings

This version of SciDB introduces a new configuration parameter, low-disk-space-threshold-mb. Units are MiB, and the default is 1024 MiB == 1 GiB.

Before SciDB enlarges any datastore file, it will check the available free space on the device hosting the data store. If there is less than low-disk-space-threshold-mb mebibytes available on the device, SciDB will prevent further WRITE queries by taking the global array lock (GAL). This is the same catalog lock used to implement the lock_arrays operator, but unlike lock_arrays, the lock will be taken using a flag that causes subsequent WRITE queries to abort rather than block. When the low disk space condition has been addressed by a system administrator, WRITE queries can be re-enabled using lock_arrays(false).
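
The threshold comparison described above can be approximated from outside SciDB, for example in a monitoring script. This sketch is not SciDB's implementation: it uses Python's `shutil.disk_usage` to read free space for a path and applies the same "less than the threshold, in MiB" rule with the default of 1024. The function names are hypothetical.

```python
import shutil

MIB = 1024 * 1024  # bytes per mebibyte


def is_low_on_space(free_bytes: int, threshold_mb: int = 1024) -> bool:
    """True when free space is strictly below the threshold (in MiB)."""
    return free_bytes < threshold_mb * MIB


def device_is_low(path: str, threshold_mb: int = 1024) -> bool:
    """Apply the check to the device that hosts `path`."""
    return is_low_on_space(shutil.disk_usage(path).free, threshold_mb)


print(device_is_low("."))
```

Alerting at a somewhat higher threshold than the one configured in SciDB gives administrators time to act before WRITE queries are disabled.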

By “WRITE query” we mean store, insert, delete, subdelete, add_attributes, and the like. However, remove and remove_versions are still permitted, since these operators can free up disk space.

From a user’s perspective, the first WRITE query to cross the threshold receives this error (edited for width):

SystemException in file: src/util/DataStore.cpp function: _throwOnLowDiskSpace \
  line: 922 instance: s0-i0 (0)
Error id: scidb::SCIDB_SE_IO::SCIDB_LE_LOW_DISK_SPACE
Error description: I/O error. Under 1024 MiB (low-disk-space-threshold-mb) free on \
  device /sys/devices/pci0000:00/0000:00:01.3/0000:04:00.0/virtio2/block/vda/vda1, \
  WRITE queries are disabled.  SciDB is in read-only mode.  Ask your system \
  administrator to (a) free up or provision more disk space, and (b) re-enable \
  WRITE queries using lock_arrays(false).
Failed query id: 0.1680820326868675802

Other in-progress WRITE queries may receive the same error, or they may complete successfully so long as they do not try to grow a datastore file.

Once the condition is detected, subsequent WRITE queries fail immediately with this error:

SystemException in file: src/system/catalog/SystemCatalog.cpp function: _lockArray \
  line: 4767 instance: s0-i0 (0)
Error id: scidb::SCIDB_SE_SYSCAT::SCIDB_LE_LOW_SPACE_READ_ONLY_MODE
Error description: System catalog error. The database is in read-only mode because \
  low-disk-space-threshold-mb was reached.  Ask your system administrator to \
  (a) free up or provision more disk space, and (b) re-enable WRITE queries using \
  lock_arrays(false).
Failed query id: 0.1681396258853237596

WRITE queries will continue to fail in this way until an administrative user re-enables them with lock_arrays(false). Ideally, before that the problem storage device will have been reprovisioned with more space, or existing space will have been freed.
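
Client code that wants to surface this condition clearly (for example, to tell users to contact an administrator rather than retry) can look for the two error ids shown above in the error text. A minimal sketch, assuming the client receives the error message as a string; the function name is hypothetical.

```python
# Error ids copied from the two messages shown above.
LOW_SPACE_ERROR_IDS = (
    "SCIDB_LE_LOW_DISK_SPACE",
    "SCIDB_LE_LOW_SPACE_READ_ONLY_MODE",
)


def is_low_disk_space_error(message: str) -> bool:
    """True if the error text reports the low-disk-space read-only condition."""
    return any(error_id in message for error_id in LOW_SPACE_ERROR_IDS)


sample = "Error id: scidb::SCIDB_SE_IO::SCIDB_LE_LOW_DISK_SPACE"
print(is_low_disk_space_error(sample))
```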

Discontinued the `de_rle` plugin

Starting in 23.10, SciDB no longer ships with the de_rle plugin.

Performance enhancements and bug fixes

Filter pushdown optimization of logical query plan.

Operators which filter down to a subset of the input cells, known as “cell-filters”, are eligible for pushdown optimization, to be executed early in a query. The optimization is enabled by default, but can be globally disabled by the boolean configuration option “enable-optimize-pushdown”.

Due to outstanding issues we have uncovered with filter pushdown, we recommend that this feature be disabled wherever possible when deploying at customer sites.

Edit the config file (/opt/scidb/23.10/service/config-0-mydb if running SciDB as a service, /opt/scidb/23.10/etc/config.ini otherwise) and make sure there is a line enable-optimize-pushdown=0. If you change this value, you may need to restart SciDB for it to take effect. This feature is disabled by default starting in SciDB 23.10.5.

Filter pushdown applies to the following operators:

between()
cross_between()
filter()
subarray() without the join keyword

Projection pushdown optimization of logical query plan.

With knowledge of the flow of dimension/attribute data through each operator, the optimizer can do a top-down analysis to find unused attributes, and then can rewrite the plan to eliminate those attributes early in the query. Ideally, this will reduce the quantity of data used in relatively expensive operators such as join, redimension, and aggregate which require data shuffling.
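
The top-down analysis can be pictured with a toy model. The sketch below is not SciDB's optimizer: it represents a linear plan as a list of hypothetical (name, reads, adds) tuples, walks it from the final output back to the scan, and reports which scanned attributes are never needed. It is deliberately conservative, keeping everything an operator reads even if that operator's own output turns out to be unused.

```python
def attributes_needed_at_scan(pipeline, output_attrs):
    """Walk the plan top-down; return the attributes the scan must supply.

    pipeline lists operators from the scan upward. Each entry is a
    (name, reads, adds) tuple whose last two items are sets of attribute names.
    """
    needed = set(output_attrs)
    for _name, reads, adds in reversed(pipeline):
        # Attributes created here need not come from below, but whatever
        # this operator reads must be available from its input.
        needed = (needed - adds) | reads
    return needed


def unused_attributes(scan_attrs, pipeline, output_attrs):
    """Attributes that could be dropped right after the scan."""
    return set(scan_attrs) - attributes_needed_at_scan(pipeline, output_attrs)


# Toy plan: scan(a, b, c, d) -> filter on a -> apply e from b -> output e.
plan = [("filter", {"a"}, set()), ("apply", {"b"}, {"e"})]
print(sorted(unused_attributes({"a", "b", "c", "d"}, plan, {"e"})))
```

In this toy plan, attributes c and d are never read, so eliminating them early reduces the data carried through the later operators.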

Changes to LogicalOperator API to support pushdown optimizations.

The pushdown optimization framework relies on an abstract model of dataflow through each operator. Operators with the default behavior will be treated pessimistically, inhibiting opportunities for optimization. The commonly-used builtin operators fully support optimization; plugin operators can also be enhanced to support optimization, but this requires detailed operator-specific knowledge about the relationships between input attrs/dims and output attrs/dims for the operator.

Relaxed restrictions for creating `io-paths-list` subdirectories

Non-privileged users can now create subdirectories of the io-paths-list directories when saving files there. Formerly only users with admin privilege could do this.

Miscellaneous performance improvements

The secure_scan() operator now caches “permissions array” information, resulting in better performance for sites with large permissions arrays.
