Release 16.5.0

Prev Next

Date of Release: 6 February 2025

General Remarks

  • Performance optimisations introduced since release 16 which achieved significant improvements to Export/Import memory usage and processing times are now marked as GA.
    • Therefore enable_xml_streaming and enable_sisl_streaming flags now default to 'true'.

Component / Services Released

  • Glasswall Embedded Engine (Glasswall_core2): 2.1394.0
  • SDK Editor Wrappers: 20250110.1
  • Word Search and Redact (libglasswall.word.search): 1.233.0
  • Word Search Wrappers: 0.33
  • Rebuild: 1.955.0

New Features

  • Licence keys can now be assigned to a session via a path for integrations using the Glasswall Python wrapper. Previously a licence key needed to be present in the same directory as the engine libraries. See Python tab in GW2RegisterLicenceFile for API documentation.
    • For online installations of the wrapper, pip install --upgrade glasswall for the latest wrapper version. For offline installs, refer to the 'README' in the Python wrapper folder of the release package for Python whl installations.

Fixes and Improvements

  • Fixed issue in PDF that could lead to an internal buffer issue reported for complex files.
  • Significant performance improvements where processing XLSX files containing a large number of worksheets.
  • Connections.xml content management policies are now applied ('sanitise' by default) where running content export, content import and Word Search.
  • Fixed issue affecting some PDFs that led to the failure 'Cross reference stream entry type not recognised. Found(0x03)'.
  • Fixed issue affecting the successful round trip of content export/import where export streams are greater than 2GiB.
  • Fixed issue in OOXML where blip elements were not removed under hyperlink content management policies.
  • Fixed issue affecting some OOXML files that could lead to a segmentation fault during content import.
  • Fixed processing issue affecting some OOXML files where in-text comments are enabled.
  • Updates to the Text Dump feature to ensure an empty output file is written out for scenarios where no text content or metadata is present in the input file. The -17error code is now reserved for write errors only.
  • Fixed file determination issue that led to a gw2ret_unexpectedEndOfFile error being reported rather than a gw2ret_fileTypeUnknown error being reported, as some xml files were being misidentified as office files.
  • Updates to gwConfigurationManagement.xsd and example config.xml policy file.

Defects and Limitations

  • The Alpine library is currently excluded from the release package and will be reintroduced in a future release.
  • The JavaScript wrapper is currently excluded from the release package and will be reintroduced in a future release.
  • The Word Search Java wrapper compatibility is currently limited to JRE8.
  • PDFs generated via the MS Outlook Adobe plugin are not supported.
  • Word Search analysis XML no longer reports structural content or changes.
  • Processing times may be impacted by changes made to licensing functionality.
  • Processing files using multiple threads is not supported. Spawning multiple processes is advised in order to facilitate parallel processing.
  • Word Search functionality is provided as Beta:
    • 'text' or 'regex' cannot be the same for 'require' or 'allow'/'disallow' as this will regenerate the file and ignore the 'disallow' or 'require' setting
    • | symbol at the start or end of 'require' can potentially cause the Engine to hang where using regex search policies