File Type Specifications
Specification Validated Against:
- PDF Reference (third edition) Adobe Portable Document Format Version 1.7.
Note: Glasswall supports PDF Reference (third edition) Adobe Portable Document Format from Version 1.3.
Images
There are no content management switches for any of the image file formats. Images embedded in a document which do not conform to the specification would result in the original document being quarantined.
JPG
Specification Validated Against:
- JPEG File Interchange Format, Version 1.02 September 1, 1992
- JP2 for ISO/IEC 15444-1 (JPEG 2000)
- JPX for ISO/IEC 15444-2 (JPEG 2000)
GIF
Specification Validated Against:
- Graphics Interchange Format, Version GIF 87
- Graphics Interchange Format, Version GIF 89a
PNG
Specification Validated Against:
- Portable Network Graphics (PNG): Functional specification. ISO/IEC 15948:2003 (E)
EMF
Specification Validated Against:
- [MS-EMF]: Enhanced Metafile Format: Open Specifications Documentation. v20151016
- [MS-EMFPLUS]: Enhanced Metafile Format Plus Extensions: : Open Specifications Documentation. V20121017
WMF
Specification Validated Against:
- [MS-WMF]: Windows Metafile Format: Open Specifications Documentation. v20121017
TIFF and GeoTIFF
Specification Validated Against:
- TIFF 6.0 Specification: Final โ June 3, 1992
- GeoTIFF Format Specification - Specification Version: 1.8.1
BMP
Specification Validated Against:
- BMP is covered by the WMF file specification.
Binary Office
Processing Specific to Binary Office in Protect Mode
The content management switch GWFILE_MANAGE_REMOVE_MACROS when set to sanitise in Protect mode removes the VBA script from the regenerated file but copies over the macro container. This can give the impression that the macros have not been deleted.
The content management switch GWFILE_MANAGE_REMOVE_EMBEDDED_FILES when set to sanitise in Protect mode do one of two options.
- If the embedded file is one that Glasswall supports it will process that file as if it were a standalone file, including its own configuration management polices if appropriate. If the file can be regenerated the regenerated file will be re-embedded, if it cannot the file it is part of will be marked as non-conforming.
- If the embedded file is one that Glasswall cannot process like a .mdb or .dat file then the embedded file is removed.
Embedded files are validated to five levels deep.
DOC
Specifications Validated Against:
The following specifications are used to validate a doc file type:
- From [MS-DOC]: Word (.doc) Binary File Format - Microsoft Corporation, 2011, v20121003 to [MS-DOC]: Word (.doc) Binary File Format - Corporation, 2017, v20170112
- Office Common Data Types and Objects Structure Specification- Microsoft Corporation, 2011, v20110608
- Office Document Cryptography Structure Specification- Microsoft Corporation, 2011, v20110608
- Office VBA File Format Structure Specification- Microsoft Corporation, 2011, v20110608
Non Supported Specifications:
Word 95 and earlier versions (Word 1 to Word 6) are not supported and if these file types are processed they will cause an Issue to be reported in the XML report in Analysis mode and the file not to be regenerated in Protect mode.
Word files containing Apple Mac or LibreOffice specific file features will be marked as non conformant.
XLS
Specifications Validated Against:
The following specifications are used to validate an xls file type:
- From [MS-XLS]: Excel Binary File Format (.xls) Structure Specification - Microsoft Corporation, v20141018 to [MS-XLS]: Excel Binary File Format (.xls) Structure Specification - Microsoft Corporation, v20161017.
- Spreadsheet Data Model File Format
- Office Graph Binary File Format v20141019
Non Supported Specifications:
Excel 95 and earlier versions are not supported and these file types are processed cause an Issue to be reported in the XML report in Analysis mode and the file not to be regenerated in Protect mode.
Excel files containing Apple Mac or LibreOffice specific file features will be marked as non conformant.
PPT
Specifications Validated Against:
The following specifications are used to validate a PowerPoint file type against:
- From [MS-PPT]: PowerPoint (.ppt) Binary File Format - Microsoft Corporation, v20121003 to [MS-PPT]: PowerPoint (.ppt) Binary File Format - Microsoft Corporation, v20160922.
Non Supported Specifications:
PowerPoint 95 and earlier versions are not supported and if these file types are processed it will cause an Issue to be reported in the XML report in Analysis mode and the file not to be regenerated in Protect mode.
PowerPoint files containing Apple Mac or LibreOffice specific file features will be marked as non conformant.
XML Office
Processing Specific to XML Office in Protect Mode
If a XML Office file has another XML Office file embedded inside it and the GWFILE_MANAGE_REMOVE_EMBEDDED_FILES is set to sanitise, the embedded file will be removed when a new file is regenerated.
Office XML files are stored as zipped archive files and although the zipped file may be less than 10Mb when the streams are extracted and uncompressed an individual stream may be greater than 10MB. When using the 32-bit DLL, streams that are larger than that 10 MB cause the file to be marked as nonโconforming and are not processed any further. When using the 64-bit DLL, streams of up to 50MB are processed.
Office 2010
The following specifications are used to validate docx/xlsx/pptx file types:
- ECMA-376 5th edition Part 1
- ECMA-376 5th edition Part 2
- ECMA-376 5th edition Part 3
- ECMA-376 5th edition Part 4
Office 2013
The following specifications are used to validate docx/xlsx/pptx file types:
- Word (.docx) Extensions to the Office Open XML SpreadsheetML File Format
- Excel (.xlsx) Extensions to the Office Open XML SpreadsheetML File Format
- PowerPoint (.pptx) Extensions to the Office Open XML SpreadsheetML File Format
- Office Drawing Extensions to the Office Open XML Structure
Non Supported Specifications:
Office 2010 or Office 2013 files containing Apple Mac or LibreOffice specific file features may be marked as non conformant or the components may be removed from the file.
Other
WAV
Specifications Validated Against:
- Multimedia Programming Interface and Data Specifications 1.0 August 1991.
MPG
Specifications Validated Against:
- ISO/IEC 13818-1:2000(E). Second edition 2000-12-01
MP3
Specifications Validated Against:
MPEG Layer III audio encoding is defined in two ISO/IEC specification families:
- MPEG-1: ISO/IEC 11172-3. Information technology -- Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s -- Part 3: Audio.
- MPEG-2: ISO/IEC 13818-3. Information technology -- Generic coding of moving pictures and associated audio information -- Part 3: Audio.
MP4
Specification Validated Against:
- ISO/IEC 14496-12:2005(E). Part 12: Corrected version 2005-10-01.
PE and COFF
Specifications Validated Against:
- Microsoft Portabole Executable and Common Object File Format Spcecification Revision 6.0 โ February 1999.
ELF
Specifications Validated Against:
- Tool Interface Standard (TIS) Executable and Linking Format (ELF) Specifiaction Version 1.2. May 1995
- ELF-64 Object File Format Version 1.2 Draft 2. May 27, 1998.
Mach-O
Specifications Validated Against:
- OS X ABI Mach-O File Format Reference. 2009-02-04.