: Research discussed in Computerphile explores using compression algorithms to determine the similarity between different data sets, such as genomes.

: This more recent paper (published in Forensic Science International: Digital Investigation ) discusses how the internal structure of a ZIP file can be analyzed to identify the operating system or application that created it.

: This seminal paper by Abraham Lempel and Jacob Ziv introduced the LZ77 algorithm. It is the cornerstone of lossless compression and the basis for the DEFLATE method used in almost all ZIP files.