|
|
||||||||
infoRAD |
1 From the Department of Diagnostic Radiology, University of Utah School of Medicine, 50 N Medical Dr, Salt Lake City, UT 84132 (R.H.W., H.R.H.); the Electronic Medical Education Resource Group, Center for Advanced Medical Technologies, University of Utah School of Medicine, Salt Lake City (R.H.W., H.C.D., H.R.H., J.R.L., P.A.G.); and the Department of Diagnostic Radiology, Salt Lake City VA Medical Center, Salt Lake City, Utah (H.C.D.). Received July 17, 2000; revision requested September 27 and final revision received February 2, 2001; accepted February 20. Address correspondence to R.H.W. (e-mail: richard.wiggins@hsc.utah.edu).
| Abstract |
|---|
|
|
|---|
Index Terms: Data compression Digital imaging and communications in medicine (DICOM) Education Images, digitization Images, display Images, quality Images, storage and retrieval Images, transmission Internet Picture archiving and communication system (PACS)
| Introduction |
|---|
|
|
|---|
In this article, we will define the three principal bit-mapped image file formats in widespread use todaygraphic interchange format (GIF [pronounced "jiff," although "giff" is more often used]), JPEG, and TIFFand will discuss their history as well as their strengths and weaknesses. In addition, we will discuss DICOM with respect to PACS and as a possible image format in the electronic teaching environment. In addition, we will review the three major digital image compression techniques (JPEG method, fractal compression, wavelet compression) and discuss the novel features of the PNG file format.
| Bit-mapped Images: Past and Present |
|---|
|
|
|---|
Bit-mapped images with higher resolution contain more pixels and more detail but are consequently large files that move slowly in the electronic environment. Compression algorithms help overcome these problems by decreasing the volume of data needed to reconstruct an image, thereby decreasing the storage space needed and increasing the speed of transmission over a network. With JPEG and GIF, these schemes can be used to maintain detail in a smaller file size. This gives these formats a theoretic advantage for use in the electronic teaching environment.
Graphic Interchange Format
GIF was created in 1987 (4). As one of the first solutions to the problem of electronic image storage, it is the oldest and most widely supported and used Web-based graphic file format (1,4,5). The primary strength of GIF is its lossless compression algorithm (46), known as the Lempel-Ziv-Welch (LZW) linear compression routine. This algorithm provides up to 4:1 lossless compression of images with similar compression and decompression times (5,6). GIF also offers an interlace option that allows a rough preliminary version of an image to be displayed before the entire image is transmitted (6).
However, GIF also has several weaknesses. First, it is limited to 256 colors or shades of gray in an image. GIF is a bit-mapped (raster) image format with pixel values stored in image files. It uses the classic red-green-blue (RGB) method for defining pixel values, in which values from 0 to 255 are defined for red, green, and blue. This allows several million color definitions. To limit the size of files, however, GIF allows only 256 different colors from a palette of several million possibilities and keeps these 256 color values indexed in a look-up table. Because of this limitation, complex images may lose some detail when reformatted into GIF. The practical implication of this weakness is the inability to display an image with its full color range (5,6).
Second, the initial GIF, known as GIF87a, had the small but real disadvantage of not allowing transparency creation. The newer GIF89a, released in 1989, does allow one level of transparency in an image, a partial solution to this limitation (5,6). However, for complex image annotation, one level of transparency may not be enough. Both versions of GIF have been supported in most Web browsers since 1990, including the two most widely used, Netscape Navigator (Netscape Communications, Mountain View, Calif) since version 2.0 and Internet Explorer (Microsoft, Redmond, Wash) since version 3.0 (5).
The third and greatest weakness of GIF is the proprietary nature of its compression technique. Unisys Corporation (Blue Bell, Pa) patented the LZW linear compression routine in 1995 (7,8). However, when the company attempted to collect royalties from users of this compression algorithm, GIF fell out of favor in the graphics and education community. Although this patent issue does not affect the ownership or transmission of a GIF file, the graphical application that compresses or decodes with use of the LZW routine may be affected. In response to the commercialization of this compression technique, a small group of graphics and compression users joined to create the PNG file format (4,8), which will be discussed later.
Joint Photographic Experts Group
JPEG was created in the early 1990s as the leader in the next generation of image file format compression schemes (9,10). JPEG is not actually a file format, but the name of the compression algorithm developed by the Independent JPEG Group. The JPEG file format is referred to as the JPEG file interchange format (JIFF) and was created specifically for the storage and transmission of photographic images (5). Like GIF, JPEG also uses bit-mapped information stored in pixel files; however, it does not use indexed color. The 24-bit color scheme of JPEG files displays each pixel on the screen with 24 bits of data encoding, allowing more color and contrast resolution than is possible with GIF (16.7 million colors as opposed to 256 with GIF) (5,911).
The strength of the JPEG format is its capacity to significantly compress larger image files, allowing faster movement in the electronic environment. As opposed to LZW compression, which is a type of linear or "file" compression, the more complicated JPEG algorithms allow genuine "image" compression. The JPEG discrete cosine transform compression scheme divides an image into 8 x 8-pixel sections and compresses each section separately in three stages (10,11). The discrete cosine transform algorithm is used in many common image and video standards, including the Motion Picture Expert Group. With JPEG, the algorithm mathematically compares each pixel with adjacent pixels, allowing the user to adjust the compression level (9,10). Image compression to as little as 1/20 of the original file size can be achieved (5,10). Such compressed images can obviously be moved considerably faster in the electronic environment and require significantly less storage space per image. The basic format of JPEG is not patented (9,10).
Another useful feature of JPEG is progressive display, although it is only available with newer Web browsers (being first implemented in Netscape Navigator 3.0 and Internet Explorer 4.0) and requires more memory. Progressive display allows the user to see a preliminary albeit low-quality version of an image before the full image is transmitted (5,9).
The primary weakness of JPEG is that it is a lossy compression technique that results in a loss of data with each compression, which may translate into image degradation. This decoding process also requires more time before the image can be viewed (5). An additional weakness of JPEG is the possibility of distortion created by the compression technique. This can affect image files that are composed of only a few colors or those with large areas of the same color, such as a radiologic image background. Another possible distortion is Gibbs phenomenon, which can be seen on high-resolution images as well as high-bandwidth images (eg, line art) (5,12).
Tagged Image File Format
TIFF was developed by Microsoft and Aldus in 1986 (13). TIFF is a trademark that was originally registered to Aldus, which subsequently merged with Adobe Systems (San Jose, Calif). Adobe now controls the TIFF specifications copyright. TIFF was created primarily by imaging developers of input and output devices such as printers, monitors, and scanners; as a result, it is specifically designed to be compatible with different image processing devices (13).
The word "Tagged" in "TIFF" refers to this formats complicated file structure. The initial header of the file data is followed by "chunks" of data called "tags," which convey the image information to the program displaying the file. The actual TIFF specifications define over 70 different tag types. This level of complexity allows great flexibility between viewers; however, programs that interpret TIFF images must contain all the different data for the tags. Although many programs simplify this by implementing only certain tags, skipping some tags could theoretically affect image quality, and private tags may limit the use of TIFF files for some applications.
The greatest strength of TIFF is that it can support the full range of image sizes, resolutions, and color depths. Another strength is its use of different compression techniques. Lossless compression allows TIFF files to maintain image resolution without loss of detail. TIFF 5.0 was released in 1988 and incorporated support for the LZW compression technique (13). Although the LZW technique is one of the most popular compression algorithms, its use may be restricted due to proprietary limitations as discussed earlier. Another useful feature of TIFF files is that each file can contain more than one image.
The primary weakness of TIFF is the large file size that results from the use of (a) lossless compression techniques and (b) tags for conveying image data. This larger file size limits the use of TIFF in the Internet educational environment and slows overall performance. The newest specification, TIFF 6.0 (released in 1992), incorporates the JPEG lossy method of compression, thereby allowing a decreased file size. This version has raised some issues concerning compatibility because images that make use of the new compression schemes cannot be successfully decoded by older software. However, this shortcoming is expected to be corrected in TIFF 7.0 (13).
Digital Imaging and Communications in Medicine
DICOM is the third and most comprehensive version of an imaging communications standard developed by the American College of Radiology in conjunction with the National Electrical Manufacturers Association. The purpose of DICOM is to provide platform-independent methods of interconnecting all types of digital medical imaging devices by means of standard computer networks. DICOM offers a wide variety of functions for use in PACS; these functions range from detailed technical and demographic data to methods for generating worklists and interacting with hospital information systems (14).
The strength of DICOM is its comprehensive set of specifications for medical images in PACS. These specifications allow detailed descriptions of bit-mapped digital images (including 12-bit gray-scale images and color images), intensity mapping functions, annotations, and other elements that are used for digital medical images. In addition to complete image specifications, DICOM also prescribes multilayer communications and network protocols that are specific for use in PACS (14).
Unfortunately, the additional data in the DICOM specification that relate to communication protocols make this file format unwieldy for use in multimedia teaching tools outside PACS. Although it is now widely accepted in the radiology community, DICOM is not necessarily supported by medical computer systems outside the field of radiology and is relatively obscure in the nonmedical digital imaging community.
Digital Image Compression Techniques
The performance of PACS has been greatly enhanced by the use of image compression algorithms. The ability to compress and decompress images without loss of diagnostic data has allowed PACS to become a reality by greatly decreasing transmission times and storage requirements and permitting shorter retrieval times. Whereas lossless compression maintains the full integrity of the original image file, the goal of lossy compression in PACS is to reach these objectives without compromising diagnostic image quality. This can be accomplished partly because the unaided human eye cannot perceive minimal changes in degradation; consequently, the data can be described as "diagnostically lossless" (15,16). The three most commonly used image compression techniques are the JPEG method, fractal compression, and wavelet compression.
The JPEG method described earlier can provide lossless compression at low compression ratios up to 4:1. Image data degradation can be minimized while still maintaining a significant amount of integrity if compression ratios below 30:1 are used. JPEG has been the most popular image file format on the Internet primarily because of its compression characteristics and the JPEG standards. These JPEG standards allow a Web browser or graphics program to compress and decompress any JPEG image with ease. The more complicated "tag" format of TIFF files prevents ready compression of TIFF images (5,10, 1517).
Fractal image compression makes use of complex mathematics in analyzing patterns within an image, as opposed to the discrete cosine transform algorithms used in JPEG, which compare individual pixels with surrounding pixels. This is a patented lossy type of compression that permits rapid image decompression. The compression algorithm is complex and therefore slow compared with the JPEG algorithm. The ability to store a series of equations instead of pixel data allows compressed image files to be stored in a much smaller space while allowing some change in the size of the image displayed. Relatively little image degradation can be detected, even at compression ratios up to 100:1. Unlike the JPEG method, however, fractal image compression lacks strict standards (1517).
Wavelet compression is the oldest of the three digital image compression techniques in common use. Wavelet concepts were first described almost 100 years ago (1517). The original Fourier mathematical series theory has been adapted and modified, branching into several different versions that are not compatible with each other. No set standards for the wavelet compression technique have been adopted. Although lossless wavelet compression techniques usually do not exceed compression ratios of 4:1, commercialized lossy techniques have been reported to preserve diagnostic quality at ratios up to 150:1 (15). This multiresolution analysis usually involves filtering the data based on contrast and frequency, followed by removal of data that are unimportant to image quality before image compression encoding is completed. High-contrast data are preferentially retained, allowing preservation of image detail. Consequently, wavelet compression is an attractive method for use in medical imaging. The wavelet algorithms also allow image scalability, similar to fractal compression techniques (1517).
| Bit-mapped Images: The Future Is PNG |
|---|
|
|
|---|
Chunk Architecture
A PNG file has a chunk architecture that allows flexible descriptions of each attribute (Fig 1) (12,18). It is this chunk architecture that gives the PNG file format its greatest theoretic advantage over other image file formats. A wide range of image information as well as supporting alphanumeric data may be housed within the same PNG file. A PNG image file consists of an initial "signature" (the initial eight bytes) identifying the file as a PNG image, followed by a series of chunks of data that encode the image information to the decoder program (eg, Web browser). The information for the image is stored in groups (chunks) marked with four-character names, each ending in a "cyclical redundancy check" (CRC) that verifies the integrity of the data. The initial signature and CRC, together with the Adler-32 Checksum (an algorithm that is similar to the CRC for the uncompressed data), allow three types of integrity checks on a PNG file (12,18,19).
|
Pixel Support
The PNG chunk architecture supports three pixel types: palette-mapped, gray-scale, and true-color (RGB) (12,18,23). PNG supports true color with 48-bit color images, as opposed to the 24-bit color scheme of JPEG. PNG also supports 16-bit gray-scale images, which becomes important in detailed radiologic studies (8,10,12).
JPEG supports only true-color and gray-scale images, whereas GIF supports only palette-based mapping (5,6,10,17,18). PNGs variable alpha-channel transparency support, similar to that of TIFF, allows the creation of an image with differing levels of transparency for each pixel such that multicolored images can be fully or only partially transparent (8,12,13,18,23). This creates true-color pixel data with four bytes for each pixel (red, green, blue, and alpha [RGBA]), predominantly used in transitions and drop shadow effects (18). JPEG does not support alpha-channel transparency or gamma correction, whereas GIF supports only a single layer of transparency (1012).
Interlacing Scheme
The algorithm known as Adam7 is an interlacing method that displays PNG images in seven stages in less time than is required for the four-stage display of GIF. The GIF interlacing scheme remembers which lines of the file are even- or odd-numbered and progressively displays the file with the odd-numbered lines displayed first, so that the image appears to be displayed twice as fast. The PNG format makes use of a two-dimensional (2D) interlacing scheme that progressively displays the image, not only from top to bottom, but also from right to left, displaying the image even faster. However, this progressive display technique may increase the size of the image file (12,18,23).
The interlacing progressive display technique has been supported by GIF and TIFF essentially since 1989 and widely implemented by the JPEG file interchange format since 1996 (18). In a direct comparison of PNG and GIF interlacing schemes, only about the top one-eighth of the GIF image is visible by the time the PNG scheme has completed the first pass of the entire image, and four PNG passes have been completed by the time the first full pass of the GIF image has been completed. As a result, the PNG file image demonstrates better resolution in less time (12,18). Therefore, a PNG file can usually be recognized after approximately 25% of the file is transmitted, compared with the necessary transmission of 50% of a similar GIF file before it is recognized (10).
Gamma Correction
The inherent coding of gamma correction, which refers to the brightness reproduction curve of an image, allows the values displayed on any platform to be the same as those of the original. PNGs platform-independent gamma correction for independent brightness controls is a powerful tool for viewing radiologic images. This inherent features allows PNG-formatted images to have standard colors and contrast settings without appearing too dark on Windows-based personal computers (Microsoft) or too light on Macintosh computers (Apple Computer, Cupertino, Calif) (10,12,18). This problem exists because Macintosh computers, Windows-based computers, and SGI workstations (Silicon Graphics, Mountain View, Calif) all have different gamma values, so there is no Internet gamma value standard. GIF and JPEG files have no inherent gamma correction (10,12,24). The PNG chromicity correction feature (for precise specification of color independent of brightness) allows similar standardization of RGB strength in the individual pixels of an image (10,12,18). These gamma and chromicity controls allow more accurate depiction of pixel image levels in terms of color and gray-scale imaging. The PNG specifications even allow use of a complete color management system for increased color control (18).
Compression Options
Although the data chunks are capable of supporting multiple compression formats, the "deflate" compression method is the only one defined for PNG images (12,25). This algorithm, which is used with .zip files, is a version of LZ77 (so named because it was first published by Lempel and Ziv in 1977), a predecessor of the LZW technique (12). LZ77 is a platform-independent lossless compression technique that actually has 10%30% better compression than the LZW technique used with GIF files (12,18). Unlike the LZW algorithm, however, LZ77 is not patented (4,7).
The PNG deflate compression method is essentially a combination of LZ77 and Huffman encoding algorithms. Lossless encoding with the PNG method was intended for use in applications such as medical imaging and detailed image editing in which strict preservation of all data is mandatory. Because of its chunk architecture, the PNG format is able to adapt and incorporate compression techniques common to PACS, allowing easier transition from DICOM to PNG for incorporation into digital teaching files and more complex multimedia projects (12,25).
The PNG specifications also define a compression filtering technique. There are five different options for this filter (including "None"), which significantly increases decoder efficiency. These options compare specific pixels with surrounding pixels to increase the degree of compression. As an extreme example, a 24-bit, 48-Mbyte image file can be losslessly compressed to 0.1 Mbytes but to only 36 Mbytes without the filtering option (12,18,25).
Browser Support
Perhaps the most significant limitation of the use of PNG in Internet digital teaching files is relatively limited support with earlier browser versions. The creation of PNG, with its complex interlacing scheme, alpha-channel support, and automatic gamma correction, was somewhat focused on Internet use. Although there are over 30 2D Web browsers with at least some PNG support, the two most commonly used browsers, Netscape Navigator and Internet Explorer, have had limited support with early versions and native support since versions 4.04 and 4.0, respectively (23). Both JPEG and GIF have essentially been natively supported by Netscape Navigator and Internet Explorer since version 2.0 (24,26).
Netscape Navigator has limited support for PNG files since version 2.0 with multiple available plug-ins (helper applications such as QuickTime [Apple]) and support for progressive display since version 4.04. There is also full alpha-channel and gamma correction support in the latest version (6.0). This latest version is based on Mozilla, another browser developed with the support of Netscape, which has widely supported PNG (23,26,27).
Microsoft Internet Explorer and the new America Online Internet Explorer (Dulles, Va) have had read-only support of PNG files since version 4.0 and full alpha-channel support since version 5.0 for the Macintosh community. Windows Internet Explorer version 4.0 is able to display PNG files with plug-ins, similar to the earlier Netscape browsers, but full gamma correction and interlacing capabilities are not fully supported in this version. The more recent Internet Explorer version 5.5 for Windows does fix some of these problems but has not addressed full alpha-channel support (26,27). The desire to display Web pages that can easily be viewed by a majority of Internet users may limit PNGs use in digital teaching file creation, and issues concerning random access memory and storage space limit users to early versions of these browsers.
Acceptance
The PNG file format is by design free of the patent issues that have hindered further GIF development. In 1996, the World Wide Web Consortium issued PNG specification 1.0 as a Consortium recommendation, the first recommendation to be issued by the group. The adoption of PNG by the Virtual Reality Modeling Language (VRML) Architecture Group for VRML 2.0 and the formal approval of "image/png" as an official Internet media type by the Internet Assigned Numbers Authority will ensure widespread use of this complex yet flexible graphic file format in the future (4). Both of the leading Web browsers, Netscape Navigator and (Microsoft) Internet Explorer, now have some native PNG support (23,26,27). Adobe PhotoShop 5.5 and Macromedia Fireworks 2.0 (Macromedia, San Jose, Calif) also contain PNG support (8). Photoshop offers easy "save" options for PNG 8-bit and 24-bit images. PNG support already exists or is being planned in over 125 applications. Animation can be achieved with a closely related format known as multiple-image networks graphics (28).
The gamma and color correction features inherent in PNG are important for developing digital teaching files, and the added alpha channels for variable transparency support and lossless compression make PNG an attractive alternative for multimedia projects. The 2D interlacing scheme works up to eight times faster than GIF one-dimensional interlacing schemes and makes PNG graphics an attractive alternative for Web-based teaching files. PNG images can have crisp detail on varied backgrounds for transparency work (if needed) and demonstrate better detail than GIF images while having a similar file size (10,12).
| Conclusions |
|---|
|
|
|---|
|
|
|
|
|
Although TIFF is a good choice for bit-mapped data file transfer for image input and output devices, it is not an option for Web-based teaching tools. Although it would theoretically be possible to encode a new fully-supported (private) tag for TIFF files for Internet use, this would merely complicate an already very complex format. The multiple compression schemes already in use and the countless number of existing tags would make the creation of a Web-based image editor for radiologic case input over the Internet nearly impossible. Moreover, TIFF files have no existing tags for the progressive display that is so popular with the other formats and are not designed for stream processing. As discussed earlier, certain legal issues also limit the use of TIFF for such projects.
The capacity of PNG to integrate private chunks of data should allow seamless conversion of DICOM images into the PNG format. The integration of alphanumeric text in metadata (eg, patient information) will assist in incorporating case material into a database. The acceptance of PNG by the World Wide Web Consortium and by the most popular Web browsers and graphic manipulation software companies ensures its future place in Web-based and imaging editing projects. In conclusion, the PNG file format provides a network-friendly, patent-free, lossless compression scheme that is truly cross-platform and has many new features that are useful for multimedia and Web-based radiologic teaching.
| Acknowledgments |
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. Hoa, A. Micheau, and G. Gahide Creating an Interactive Web-based e-Learning Course: A Practical Introduction for Radiologists RadioGraphics, November 1, 2006; 26(6): e25 - e25. [Abstract] [Full Text] |
||||
![]() |
A. Rosset, L. Spadola, L. Pysher, and O. Ratib Informatics in Radiology (infoRAD): Navigating the Fifth Dimension: Innovative Interface for Multidimensional Multimodality Image Navigation RadioGraphics, January 1, 2006; 26(1): 299 - 308. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. V. HarshaRani, S. J. Vayttaden, and U. S. Bhalla Electronic Data Sources for Kinetic Models of Cell Signaling J. Biochem., June 1, 2005; 137(6): 653 - 657. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Goede, J. R. Lauman, C. Cochella, G. L. Katzman, D. A. Morton, and K. H. Albertine A Methodology and Implementation for Annotating Digital Images for Context-appropriate Use in an Academic Health Care Environment J. Am. Med. Inform. Assoc., January 1, 2004; 11(1): 29 - 41. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Escott and D. Rubinstein Free DICOM Image Viewing and Processing Software for Your Desktop Computer: What's Available and What It Can Do for You RadioGraphics, September 1, 2003; 23(5): 1341 - 1357. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Rosset, O. Ratib, A. Geissbuhler, and J.-P. Vallee Integration of a Multimedia Teaching and Reference Database in a PACS Environment RadioGraphics, November 1, 2002; 22(6): 1567 - 1577. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOGRAPHICS | RADIOLOGY | RSNA JOURNALS ONLINE |