Change Hachoir can't support PY2 so backport their PY3 to prevent a need for system dependant external binaries like mediainfo.
Backported 400 revisions from rev 1de4961-8897c5b (2018-2014).
Move core/benchmark, core/cmd_line, core/memory, core/profiler and core/timeout to core/optional/*
Remove metadata/qt*
PORT: Version 2.0a3 (inline with 3.0a3 @ f80c7d5).
Basic Support for XMP Packets.
tga: improvements to adhere more closely to the spec.
pdf: slightly improved parsing.
rar: fix TypeError on unknown block types.
Add MacRoman win32 codepage.
tiff/exif: support SubIFDs and tiled images.
Add method to export metadata in dictionary.
mpeg_video: don't attempt to parse Stream past length.
mpeg_video: parse ESCR correctly, add SCR value.
Change centralise CustomFragments.
field: don't set parser class if class is None, to enable autodetect.
field: add value/display for CustomFragment.
parser: inline warning to enable tracebacks in debug mode.
Fix empty bytestrings in makePrintable.
Fix contentSize in jpeg.py to account for image_data blocks.
Fix the ELF parser.
Enhance the AR archive parser.
elf parser: fix wrong wrong fields order in parsing little endian section flags.
elf parser: add s390 as a machine type.
Flesh out mp4 parser.
PORT: Version 2.0a1 (inline with 3.0a1).
Major refactoring and PEP8.
Fix ResourceWarning warnings on files. Add a close() method and support for the context manager protocol ("with obj: ...") to parsers, input and output streams.
metadata: get comment from ZIP.
Support for InputIOStream.read(0).
Fix sizeGe when size is None.
Remove unused new_seekable_field_set file.
Remove parser Mapsforge .map.
Remove parser Parallel Realities Starfighter .pak files.
sevenzip: fix for newer archives.
java: update access flags and modifiers for Java 1.7 and update description text for most recent Java.
Support ustar prefix field in tar archives.
Remove file_system* parsers.
Remove misc parsers 3d0, 3ds, gnome_keyring, msoffice*, mstask, ole*, word*.
Remove program parsers macho, nds, prc.
Support non-8bit Character subclasses.
Python parser supports Python 3.7.
Enhance mpeg_ts parser to support MTS/M2TS.
Support for creation date in tiff.
Change don't hardcode errno constant.
PORT: 1.9.1
Internal Only: The following are legacy reference to upstream commit messages.
Relevant changes up to b0a115f8.
Use integer division.
Replace HACHOIR_ERRORS with Exception.
Fix metadata.Data: make it sortable.
Import fixes from e7de492.
PORT: Version 2.0a1 (inline with 3.0a1 @ e9f8fad).
Replace hachoir.core.field with hachoir.field
Replace hachoir.core.stream with hachoir.stream
Remove the compatibility module for PY1.5 to PY2.5.
metadata: support TIFF picture.
metadata: fix string normalization.
metadata: fix datetime regex Fix hachoir bug #57.
FileFromInputStream: fix comparison between None and an int.
InputIOStream: open the file in binary mode.
2018-03-24 14:04:15 +00:00
|
|
|
from hachoir.metadata.metadata import RootMetadata, registerExtractor
|
|
|
|
from hachoir.parser.program import ExeFile
|
|
|
|
from hachoir.metadata.safe import fault_tolerant, getValue
|
|
|
|
|
2014-03-10 05:18:05 +00:00
|
|
|
|
|
|
|
class ExeMetadata(RootMetadata):
|
|
|
|
KEY_TO_ATTR = {
|
|
|
|
u"ProductName": "title",
|
|
|
|
u"LegalCopyright": "copyright",
|
|
|
|
u"LegalTrademarks": "copyright",
|
|
|
|
u"LegalTrademarks1": "copyright",
|
|
|
|
u"LegalTrademarks2": "copyright",
|
|
|
|
u"CompanyName": "author",
|
|
|
|
u"BuildDate": "creation_date",
|
|
|
|
u"FileDescription": "title",
|
|
|
|
u"ProductVersion": "version",
|
|
|
|
}
|
|
|
|
SKIP_KEY = set((u"InternalName", u"OriginalFilename", u"FileVersion", u"BuildVersion"))
|
|
|
|
|
|
|
|
def extract(self, exe):
|
|
|
|
if exe.isPE():
|
|
|
|
self.extractPE(exe)
|
|
|
|
elif exe.isNE():
|
|
|
|
self.extractNE(exe)
|
|
|
|
|
|
|
|
def extractNE(self, exe):
|
|
|
|
if "ne_header" in exe:
|
|
|
|
self.useNE_Header(exe["ne_header"])
|
|
|
|
if "info" in exe:
|
|
|
|
self.useNEInfo(exe["info"])
|
|
|
|
|
|
|
|
@fault_tolerant
|
|
|
|
def useNEInfo(self, info):
|
|
|
|
for node in info.array("node"):
|
|
|
|
if node["name"].value == "StringFileInfo":
|
|
|
|
self.readVersionInfo(node["node[0]"])
|
|
|
|
|
|
|
|
def extractPE(self, exe):
|
|
|
|
# Read information from headers
|
|
|
|
if "pe_header" in exe:
|
|
|
|
self.usePE_Header(exe["pe_header"])
|
|
|
|
if "pe_opt_header" in exe:
|
|
|
|
self.usePE_OptHeader(exe["pe_opt_header"])
|
|
|
|
|
|
|
|
# Use PE resource
|
|
|
|
resource = exe.getResource()
|
|
|
|
if resource and "version_info/node[0]" in resource:
|
|
|
|
for node in resource.array("version_info/node[0]/node"):
|
|
|
|
if getValue(node, "name") == "StringFileInfo" \
|
Change Hachoir can't support PY2 so backport their PY3 to prevent a need for system dependant external binaries like mediainfo.
Backported 400 revisions from rev 1de4961-8897c5b (2018-2014).
Move core/benchmark, core/cmd_line, core/memory, core/profiler and core/timeout to core/optional/*
Remove metadata/qt*
PORT: Version 2.0a3 (inline with 3.0a3 @ f80c7d5).
Basic Support for XMP Packets.
tga: improvements to adhere more closely to the spec.
pdf: slightly improved parsing.
rar: fix TypeError on unknown block types.
Add MacRoman win32 codepage.
tiff/exif: support SubIFDs and tiled images.
Add method to export metadata in dictionary.
mpeg_video: don't attempt to parse Stream past length.
mpeg_video: parse ESCR correctly, add SCR value.
Change centralise CustomFragments.
field: don't set parser class if class is None, to enable autodetect.
field: add value/display for CustomFragment.
parser: inline warning to enable tracebacks in debug mode.
Fix empty bytestrings in makePrintable.
Fix contentSize in jpeg.py to account for image_data blocks.
Fix the ELF parser.
Enhance the AR archive parser.
elf parser: fix wrong wrong fields order in parsing little endian section flags.
elf parser: add s390 as a machine type.
Flesh out mp4 parser.
PORT: Version 2.0a1 (inline with 3.0a1).
Major refactoring and PEP8.
Fix ResourceWarning warnings on files. Add a close() method and support for the context manager protocol ("with obj: ...") to parsers, input and output streams.
metadata: get comment from ZIP.
Support for InputIOStream.read(0).
Fix sizeGe when size is None.
Remove unused new_seekable_field_set file.
Remove parser Mapsforge .map.
Remove parser Parallel Realities Starfighter .pak files.
sevenzip: fix for newer archives.
java: update access flags and modifiers for Java 1.7 and update description text for most recent Java.
Support ustar prefix field in tar archives.
Remove file_system* parsers.
Remove misc parsers 3d0, 3ds, gnome_keyring, msoffice*, mstask, ole*, word*.
Remove program parsers macho, nds, prc.
Support non-8bit Character subclasses.
Python parser supports Python 3.7.
Enhance mpeg_ts parser to support MTS/M2TS.
Support for creation date in tiff.
Change don't hardcode errno constant.
PORT: 1.9.1
Internal Only: The following are legacy reference to upstream commit messages.
Relevant changes up to b0a115f8.
Use integer division.
Replace HACHOIR_ERRORS with Exception.
Fix metadata.Data: make it sortable.
Import fixes from e7de492.
PORT: Version 2.0a1 (inline with 3.0a1 @ e9f8fad).
Replace hachoir.core.field with hachoir.field
Replace hachoir.core.stream with hachoir.stream
Remove the compatibility module for PY1.5 to PY2.5.
metadata: support TIFF picture.
metadata: fix string normalization.
metadata: fix datetime regex Fix hachoir bug #57.
FileFromInputStream: fix comparison between None and an int.
InputIOStream: open the file in binary mode.
2018-03-24 14:04:15 +00:00
|
|
|
and "node[0]" in node:
|
2014-03-10 05:18:05 +00:00
|
|
|
self.readVersionInfo(node["node[0]"])
|
|
|
|
|
|
|
|
@fault_tolerant
|
|
|
|
def useNE_Header(self, hdr):
|
|
|
|
if hdr["is_dll"].value:
|
|
|
|
self.format_version = u"New-style executable: Dynamic-link library (DLL)"
|
|
|
|
elif hdr["is_win_app"].value:
|
|
|
|
self.format_version = u"New-style executable: Windows 3.x application"
|
|
|
|
else:
|
|
|
|
self.format_version = u"New-style executable for Windows 3.x"
|
|
|
|
|
|
|
|
@fault_tolerant
|
|
|
|
def usePE_Header(self, hdr):
|
|
|
|
self.creation_date = hdr["creation_date"].value
|
|
|
|
self.comment = "CPU: %s" % hdr["cpu"].display
|
|
|
|
if hdr["is_dll"].value:
|
|
|
|
self.format_version = u"Portable Executable: Dynamic-link library (DLL)"
|
|
|
|
else:
|
|
|
|
self.format_version = u"Portable Executable: Windows application"
|
|
|
|
|
|
|
|
@fault_tolerant
|
|
|
|
def usePE_OptHeader(self, hdr):
|
|
|
|
self.comment = "Subsystem: %s" % hdr["subsystem"].display
|
|
|
|
|
|
|
|
def readVersionInfo(self, info):
|
|
|
|
values = {}
|
|
|
|
for node in info.array("node"):
|
|
|
|
if "value" not in node or "name" not in node:
|
|
|
|
continue
|
|
|
|
value = node["value"].value.strip(" \0")
|
|
|
|
if not value:
|
|
|
|
continue
|
|
|
|
key = node["name"].value
|
|
|
|
values[key] = value
|
|
|
|
|
|
|
|
if "ProductName" in values and "FileDescription" in values:
|
|
|
|
# Make sure that FileDescription is set before ProductName
|
|
|
|
# as title value
|
|
|
|
self.title = values["FileDescription"]
|
|
|
|
self.title = values["ProductName"]
|
|
|
|
del values["FileDescription"]
|
|
|
|
del values["ProductName"]
|
|
|
|
|
|
|
|
for key, value in values.iteritems():
|
|
|
|
if key in self.KEY_TO_ATTR:
|
|
|
|
setattr(self, self.KEY_TO_ATTR[key], value)
|
|
|
|
elif key not in self.SKIP_KEY:
|
|
|
|
self.comment = "%s=%s" % (key, value)
|
|
|
|
|
|
|
|
|
Change Hachoir can't support PY2 so backport their PY3 to prevent a need for system dependant external binaries like mediainfo.
Backported 400 revisions from rev 1de4961-8897c5b (2018-2014).
Move core/benchmark, core/cmd_line, core/memory, core/profiler and core/timeout to core/optional/*
Remove metadata/qt*
PORT: Version 2.0a3 (inline with 3.0a3 @ f80c7d5).
Basic Support for XMP Packets.
tga: improvements to adhere more closely to the spec.
pdf: slightly improved parsing.
rar: fix TypeError on unknown block types.
Add MacRoman win32 codepage.
tiff/exif: support SubIFDs and tiled images.
Add method to export metadata in dictionary.
mpeg_video: don't attempt to parse Stream past length.
mpeg_video: parse ESCR correctly, add SCR value.
Change centralise CustomFragments.
field: don't set parser class if class is None, to enable autodetect.
field: add value/display for CustomFragment.
parser: inline warning to enable tracebacks in debug mode.
Fix empty bytestrings in makePrintable.
Fix contentSize in jpeg.py to account for image_data blocks.
Fix the ELF parser.
Enhance the AR archive parser.
elf parser: fix wrong wrong fields order in parsing little endian section flags.
elf parser: add s390 as a machine type.
Flesh out mp4 parser.
PORT: Version 2.0a1 (inline with 3.0a1).
Major refactoring and PEP8.
Fix ResourceWarning warnings on files. Add a close() method and support for the context manager protocol ("with obj: ...") to parsers, input and output streams.
metadata: get comment from ZIP.
Support for InputIOStream.read(0).
Fix sizeGe when size is None.
Remove unused new_seekable_field_set file.
Remove parser Mapsforge .map.
Remove parser Parallel Realities Starfighter .pak files.
sevenzip: fix for newer archives.
java: update access flags and modifiers for Java 1.7 and update description text for most recent Java.
Support ustar prefix field in tar archives.
Remove file_system* parsers.
Remove misc parsers 3d0, 3ds, gnome_keyring, msoffice*, mstask, ole*, word*.
Remove program parsers macho, nds, prc.
Support non-8bit Character subclasses.
Python parser supports Python 3.7.
Enhance mpeg_ts parser to support MTS/M2TS.
Support for creation date in tiff.
Change don't hardcode errno constant.
PORT: 1.9.1
Internal Only: The following are legacy reference to upstream commit messages.
Relevant changes up to b0a115f8.
Use integer division.
Replace HACHOIR_ERRORS with Exception.
Fix metadata.Data: make it sortable.
Import fixes from e7de492.
PORT: Version 2.0a1 (inline with 3.0a1 @ e9f8fad).
Replace hachoir.core.field with hachoir.field
Replace hachoir.core.stream with hachoir.stream
Remove the compatibility module for PY1.5 to PY2.5.
metadata: support TIFF picture.
metadata: fix string normalization.
metadata: fix datetime regex Fix hachoir bug #57.
FileFromInputStream: fix comparison between None and an int.
InputIOStream: open the file in binary mode.
2018-03-24 14:04:15 +00:00
|
|
|
registerExtractor(ExeFile, ExeMetadata)
|