File Type Specifications

Prev Next

File Type Specifications

PDF

Specification Validated Against:

  • PDF Reference (third edition) Adobe Portable Document Format Version 1.7.

    Note: Glasswall supports PDF Reference (third edition) Adobe Portable Document Format from Version 1.3.

Images

There are no content management switches for any of the image file formats. Images embedded in a document which do not conform to the specification would result in the original document being quarantined.

JPG

Specification Validated Against:

  • JPEG File Interchange Format, Version 1.02 September 1, 1992
  • JP2 for ISO/IEC 15444-1 (JPEG 2000)
  • JPX for ISO/IEC 15444-2 (JPEG 2000)

GIF

Specification Validated Against:

  • Graphics Interchange Format, Version GIF 87
  • Graphics Interchange Format, Version GIF 89a

PNG

Specification Validated Against:

  • Portable Network Graphics (PNG): Functional specification. ISO/IEC 15948:2003 (E)

EMF

Specification Validated Against:

  • [MS-EMF]: Enhanced Metafile Format: Open Specifications Documentation. v20151016
  • [MS-EMFPLUS]: Enhanced Metafile Format Plus Extensions: : Open Specifications Documentation. V20121017

WMF

Specification Validated Against:

  • [MS-WMF]: Windows Metafile Format: Open Specifications Documentation. v20121017

TIFF and GeoTIFF

Specification Validated Against:

  • TIFF 6.0 Specification: Final โ€“ June 3, 1992
  • GeoTIFF Format Specification - Specification Version: 1.8.1

BMP

Specification Validated Against:

  • BMP is covered by the WMF file specification.

Binary Office

Processing Specific to Binary Office in Protect Mode

The content management switch GWFILE_MANAGE_REMOVE_MACROS when set to sanitise in Protect mode removes the VBA script from the regenerated file but copies over the macro container. This can give the impression that the macros have not been deleted.

The content management switch GWFILE_MANAGE_REMOVE_EMBEDDED_FILES when set to sanitise in Protect mode do one of two options.

  • If the embedded file is one that Glasswall supports it will process that file as if it were a standalone file, including its own configuration management polices if appropriate. If the file can be regenerated the regenerated file will be re-embedded, if it cannot the file it is part of will be marked as non-conforming.
  • If the embedded file is one that Glasswall cannot process like a .mdb or .dat file then the embedded file is removed.

Embedded files are validated to five levels deep.

DOC

Specifications Validated Against:

The following specifications are used to validate a doc file type:

  • From [MS-DOC]: Word (.doc) Binary File Format - Microsoft Corporation, 2011, v20121003 to [MS-DOC]: Word (.doc) Binary File Format - Corporation, 2017, v20170112
  • Office Common Data Types and Objects Structure Specification- Microsoft Corporation, 2011, v20110608
  • Office Document Cryptography Structure Specification- Microsoft Corporation, 2011, v20110608
  • Office VBA File Format Structure Specification- Microsoft Corporation, 2011, v20110608

Non Supported Specifications:

Word 95 and earlier versions (Word 1 to Word 6) are not supported and if these file types are processed they will cause an Issue to be reported in the XML report  in Analysis mode and the file not to be regenerated in Protect mode.

Word files containing Apple Mac or LibreOffice specific file features will be marked as non conformant.

XLS

Specifications Validated Against:

The following specifications are used to validate an xls file type:

  • From [MS-XLS]: Excel Binary File Format (.xls) Structure Specification - Microsoft Corporation, v20141018 to [MS-XLS]: Excel Binary File Format (.xls) Structure Specification - Microsoft Corporation, v20161017.
  • Spreadsheet Data Model File Format
  • Office Graph Binary File Format v20141019

Non Supported Specifications:

Excel 95 and earlier versions are not supported and these file types are processed cause an Issue to be reported in the XML report  in Analysis mode and the file not to be regenerated in Protect mode.

Excel files containing Apple Mac or LibreOffice specific file features will be marked as non conformant.

PPT

Specifications Validated Against:

The following specifications are used to validate a PowerPoint file type against:

  • From [MS-PPT]: PowerPoint (.ppt) Binary File Format - Microsoft Corporation, v20121003 to [MS-PPT]: PowerPoint (.ppt) Binary File Format - Microsoft Corporation, v20160922.

Non Supported Specifications:

PowerPoint 95 and earlier versions are not supported and if these file types are processed it will cause an Issue to be reported in the XML report  in Analysis mode and the file not to be regenerated in Protect mode.

PowerPoint files containing Apple Mac or LibreOffice specific file features will be marked as non conformant.

XML Office

Processing Specific to XML Office in Protect Mode

If a XML Office file has another XML Office file embedded inside it and the GWFILE_MANAGE_REMOVE_EMBEDDED_FILES is set to sanitise, the embedded file will be removed when a new file is regenerated.

Office XML files are stored as zipped archive files and although the zipped file may be less than 10Mb when the streams are extracted and uncompressed an individual stream may be greater than 10MB. When using the 32-bit DLL, streams that are larger than that 10 MB cause the file to be marked as nonโ€“conforming and are not processed any further. When using the 64-bit DLL, streams of up to 50MB are processed.

Office 2010

The following specifications are used to validate docx/xlsx/pptx file types:

Office 2013

The following specifications are used to validate docx/xlsx/pptx file types:

  • Word (.docx) Extensions to the Office Open XML SpreadsheetML File Format
  • Excel (.xlsx) Extensions to the Office Open XML SpreadsheetML File Format
  • PowerPoint  (.pptx) Extensions to the Office Open XML SpreadsheetML File Format
  • Office Drawing Extensions to the Office Open XML Structure

Non Supported Specifications:

Office 2010 or Office 2013 files containing Apple Mac or LibreOffice specific file features may be marked as non conformant or the components may be removed from the file.

Other

WAV

Specifications Validated Against:

  • Multimedia Programming Interface and Data Specifications 1.0 August 1991.

MPG

Specifications Validated Against:

  • ISO/IEC 13818-1:2000(E). Second edition 2000-12-01

MP3

Specifications Validated Against:

MPEG Layer III audio encoding is defined in two ISO/IEC specification families:

  • MPEG-1: ISO/IEC 11172-3. Information technology -- Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s -- Part 3: Audio.
  • MPEG-2: ISO/IEC 13818-3. Information technology -- Generic coding of moving pictures and associated audio information -- Part 3: Audio.

MP4

Specification Validated Against:

  • ISO/IEC 14496-12:2005(E). Part 12: Corrected version 2005-10-01.

PE and COFF

Specifications Validated Against:

  • Microsoft Portabole Executable and Common Object File Format Spcecification Revision 6.0 โ€“ February 1999.

ELF

Specifications Validated Against:

  • Tool Interface Standard (TIS) Executable and Linking Format (ELF) Specifiaction Version 1.2. May 1995
  • ELF-64 Object File Format Version 1.2 Draft 2. May 27, 1998.

Mach-O

Specifications Validated Against:

  • OS X ABI Mach-O File Format Reference. 2009-02-04.