RadioGraphics
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wang, C.
Right arrow Articles by Kahn, C. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wang, C.
Right arrow Articles by Kahn, C. E., Jr
Related Collections
Right arrow Informatics
(Radiographics. 2000;20:287-293.)
© RSNA, 2000


infoRAD

Potential Use of Extensible Markup Language for Radiology Reporting: A Tutorial1

Cun Wang, PhD and Charles E. Kahn, Jr, MD

1 From the Office of Clinical Informatics, Medical College of Wisconsin, 9200 W Wisconsin Ave, Milwaukee, WI 53226-3522. Recipient of a Certificate of Merit award for an infoRAD exhibit at the 1998 RSNA scientific assembly. Received April 13, 1999; revision requested July 6 and final revision received September 20; accepted October 1. Supported in part by The Whitaker Foundation. Address reprint requests to C.E.K.


    Abstract
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
One of the main goals of radiology is to communicate imaging information to aid in patient management. Information standards can facilitate communication and help realize this goal. Extensible Markup Language (XML) is a new information transmission standard that was developed to meet the growing need for robust, large-scale World Wide Web applications. XML notation provides a compact document representation scheme that allows radiology reports to be transmitted over the Web as universally understandable, self-defining documents. XML documents can include a report-specific document type definition (DTD) that defines the allowable data fields and values. XML may also be used to generate data entry forms for radiology reporting and help physicians improve the efficiency of the reporting process. XML documents can be used to store reporting results directly, thus allowing pertinent data to be shared on the Web. An XML-based approach can allow users to link information to entities outside the information systems of a given institution. XML-based methods and applications have the potential to promote development of robust radiology reporting systems and integration with broader, enterprise-wide information systems.

Index Terms: Computers • Internet • Radiology and radiologists


    Introduction
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
A central goal of the practice of radiology is to communicate imaging information to aid in making decisions regarding patient care. This information goes not only to physicians but also to a growing number of automated systems that assist clinicians in delivering care. Such systems archive, retrieve, and present patient-oriented information; mediate between information sources and users; and integrate patient-specific information and general medical knowledge. Information standards allow information systems from different vendors, at different sites, and on different platforms to communicate with one another.

The explosive growth of the World Wide Web demonstrates the power of a simple standard—Hypertext Markup Language (HTML)—to promote interaction among information systems. HTML is a language for transmitting multimedia hypertext documents (1). HTML-based documents have been used to capture and display radiology results (2,3).

Despite its simplicity and ubiquity, HTML has several limitations. It allows only a single, fixed type of document. Users cannot specify their own tags or attributes to parameterize or otherwise semantically qualify their data. HTML does not support the specification of deep structures needed to represent database schemas or object-oriented data for structural validity (4). Extensions to HTML often result in incompatibilities among vendors (5).

HTML is based on Standard Generalized Markup Language (SGML), an international standard for document interchange endorsed by the International Standards Organization (ISO 8879:1986) that has been used in industrial, governmental, and academic applications for 13 years (6). Unfortunately, the complexity of SGML makes it ill-suited for use on the Web. Extensible Markup Language (XML), which is also based on SGML, was created to bring the power of SGML to the Web. In this article, we introduce the general features of XML, describe the use of XML for radiology report information, and analyze the potential of XML-based systems for radiology results reporting.


    What Is XML?
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
XML is a proper subset of SGML and preserves its key features— extensibility, structure, and data-checking—to create robust, large-scale Web applications (4,7). XML has been adopted by the World Wide Web Consortium (5,8) and has received support from major software vendors such as Microsoft (Redmond, Wash), Sun Microsystems (Mountain View, Calif), Netscape Communications (Mountain View, Calif), Adobe Systems (San Jose, Calif), and International Business Machines (IBM) (Armonk, NY) (9). It is also being incorporated into health information standards such as Health Level Seven (HL7) (10,11).

As with SGML, XML allows authors to define the structure of documents to represent database schemas or object-oriented hierarchies. XML omits the more complex and less-used parts of SGML such as the SGML declaration and tag minimization (12). Because XML is a subset of SGML, existing SGML "parsers" and tools can be applied to XML documents. The use of XML is particularly attractive in light of the ability to incorporate distributed software modules (eg, Java applets) that embed powerful data manipulation capabilities into Web clients (4). XML documents can serve as the objects upon which such distributed software components operate (13).

The structure of an XML document can be defined in a Document Type Definition (DTD) included within the document or referenced from an external source. Simply stated, a DTD defines the elements contained in the document and the relationships among the elements. The content of the document (document instance) follows the DTD. In a sense, the DTD specifies a grammar for the documents to which it applies.

The DTD specifies the names of the allowable elements in the document, how often a given element may appear, and the order in which the elements appear. An element consists of an opening tag (eg, <NAME>), specific contents, and a closing tag (</NAME>). For "empty" elements that contain no other elements, the closing tag can be omitted by including a forward-slash character at the end of the opening tag (eg, <ITEM/>). Elements can be nested to arbitrary depth. Attributes that further describe the element can appear within the angle brackets of the opening tag.

Figure 1 shows how an author can incorporate the definition of data elements within the document itself.



View larger version (32K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1.   Incorporation of the definition of data elements within an XML document. In this document, programs that can be used to interpret XML (parsers) have specified the patient's name (John Harrison) and identification number (5021029413) within the <Name> and <PID> elements, respectively.

 

    Structured Reporting
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
Diagnostic imaging information can be used for population-based health policy decisions, and it is becoming increasingly important for radiologists to provide information that can be aggregated locally, regionally, and nationally for outcomes analysis. The desire to analyze outcomes and other measures across institutions has led to the development of standardized reporting lexicons such as the Breast Imaging Reporting and Data System (14,15) and the Fleischner Society's thoracic imaging vocabulary (16,17). These vocabularies can form the basis for structured reporting applications.

Unlike conventional "free-text" reports, structured reports incorporate a standardized set of concepts in a predefined format. A practical goal of structured reporting applications is to capture most of the information in structured format and allow free-text comments as needed. The major advantages of this approach include reduced transcription costs and turn-around times, increased report completeness, greater usefulness of cases for teaching and research, and improved quality assurance, utilization review, and health services research efforts. Efforts to apply structured reporting to radiology date from the 1960s and have included the UltraSTAR (Ultrasound Structured Attribute Reporting) system at the Brigham and Women's Hospital (18) and the chest radiography reporting system at Johns Hopkins Medical Institutions (19).


    XML in Radiology Structured Reporting
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
Report Definition
XML documents can include a report-specific DTD that defines the allowable data fields and values; the resulting report is both portable and self-defining (20,21). Thus, the DTD explicitly describes the structure of a report and the relationships of the incorporated concepts to each other and to other medical concepts. An alternative to reporting with predefined structured formats is to try to impose structure onto the report after the fact. For example, the MedLEE system provides a tool for parsing natural-language reports that encodes the parsed report into XML (22). Natural-language processing may inform model building, but algorithms that would replace manual modeling have not yet been discovered (23). An ongoing challenge is the development of formalisms and tools to help build appropriate models such as DTDs to represent the content of radiology reports. Such DTDs could include a conventional set of modifiers (eg, degree of certainty, chronicity, change from previous examination) for each reporting concept.

Data Entry
Because XML is created for use on the Web directly, it may also be used to generate data entry forms for radiology reporting. XML parsers and some programs may generate specific data entry forms according to user definitions. These forms are very important in radiology structured reporting systems. A well-defined data entry form can help physicians improve the efficiency of the reporting process. To generate these data entry forms, different report templates are necessary to record different findings for a variety of examinations and diseases. XML-based report templates can be easily and effectively mapped to Web browsers through XML parsers or some specific programs. For example, binary (yes/no) reporting elements can be rendered as form objects (eg, checkboxes), and physicians can enter information about a specific patient by simply clicking on these form objects.

Structured Platform-Independent Data Entry and Reporting, or SPIDER, is a developmental system for radiology reporting that can accommodate a wide variety of structured reporting applications (2). It was initially developed for use with HTML and has since been generalized for use with SGML and XML (20). With SPIDER, hierarchically organized reporting concepts are represented as familiar graphical objects such as text windows, checkboxes, and radio buttons in Web data entry forms. Each reporting concept may be a single data element, a collection of data elements (GROUP), or a block of repeated data elements (REPEAT). Three types of data items—binary, numeric, and textual—can be defined. Binary items are indicated by the <BIN> tag and have a Boolean value of either 1 (true) or 0 (false). Numeric items (<NUM>) allow entry of uni- or multidimensional numeric values. Textual items (<TXT>) allow entry of one or more lines of narrative text data. Figure 2 shows a report definition for aortic US that is written in SPIDER's Data-entry and Reporting Markup Language (3). Figure 3 shows a report data entry form for aortic US generated with the SPIDER system.



View larger version (44K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2.   Report definition for aortic ultrasound (US) written in Data-entry and Reporting Markup Language. The Concept Identifier (CID) attributes refer to the unique identifier of the concept in the UMLS Metathesaurus. Concept relationship (CREL) attributes indicate the relationship between the reporting concept and that in the external vocabulary; for example, LT indicates that the concept in the report is narrower in scope than the externally referenced concept.

 


View larger version (38K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3.   Graphical data entry form for structured aortic US reporting. This user interface is defined with the report definition in Figure 2.

 
Report Generation
XML documents can be used to store reporting results directly, thus allowing pertinent data to be shared on the Web. SPIDER can be used to create an XML document with a report-specific DTD and document instance. The DTD defines a given reporting application (eg, aortic US) and can be reused for all similar reports. The document instance encodes the values entered for a specific individual. Figures 4 and 5 show the DTD and the document instance generated with the SPIDER system in structured reporting for aortic US. The elements and their contents, such as the element Aorta–Ultrasound and its four sub-elements—Clinical–Information, Aorta, Common–Iliac–Arteries, and Comments—are defined in the DTD.



View larger version (54K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4.   XML DTD for structured reporting of aortic US.

 


View larger version (31K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 5.   XML document instance displays the results of aortic US as defined by the DTD in Figure 4.

 
Vocabulary and Data Linking
One potential advantage of an XML-based approach is the ability to link information to entities outside the information systems of a given institution. For example, SPIDER allows reporting concepts to be linked to external vocabularies such as the National Library of Medicine's Unified Medical Language System (UMLS) Metathesaurus. UMLS allows concepts to be communicated across disparate sources of biomedical information such as bibliographic databases, diagnostic coding schemes, and other medical vocabularies (24,25). Even if a reporting domain does not have a standardized vocabulary, the reporting terms can be mapped to the UMLS to allow interchange with other information systems.

In addition, reporting applications in SPIDER can specify that certain input fields receive data directly from a local or remote computer system. For example, a radiology report could incorporate specific laboratory test results for a given patient by querying the laboratory information system for results of the patient's most recent serum creatinine test. The World Wide Web Consortium is developing an XML Linking Language and is specifying constructs that can be inserted into XML resources to further describe links between objects (26).


    Discussion
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
XML offers a number of advantages for radiology reporting. Although XML is a relatively new standard, it derives a strong foundation from prior work with SGML and can take advantage of SGML-based reporting platforms. The newest generation of Web browsers (eg, Microsoft Internet Explorer 5.0, Netscape Communicator 5.0) will be able to interpret XML documents. Many database management systems are moving rapidly to support XML (27). The use of XML affords opportunities to link radiology reporting information with relational or object-oriented databases.

XML has close relationships with other interchange protocols for radiology and health care data. The Digital Imaging and Communications in Medicine (DICOM) standard specifies a nonproprietary data interchange protocol, digital image format, and file structure for biomedical images and image-related information (28). DICOM includes a proposed standard for structured reporting and has a syntax that can readily be expressed in SGML (29). Efforts are underway to use XML as the document interchange protocol for the HL7 standard. As a result, HL7 will use XML as a format for structured reporting as well. Users can select XML, DICOM, HL7, and so on to transfer patient data to other systems or institutions according to the requirements of a particular implementation (29).

Distributed object computing protocols such as CORBA (Common Object Request Broker Architecture) or DCOM (Distributed Component Object Model) allow fast, flexible implementation of Web-based software systems. Client-side software components such as Java applets and "ActiveX" objects can readily be incorporated into Web documents and can add functionality to Web browsers. XML can facilitate communication among such distributed software objects and can play an important role in the exchange of data between client and server software components. For example, a Web-based applet could allow a user to drag a patient's folder icon from the Web application of a hospital to the internal database application of a home health care agency to transfer information into that database (4).


    Conclusions
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 
XML notation provides a compact document representation scheme that allows radiology reports to be transmitted over the Web as universally understandable, self-defining documents. XML is poised to play an important role in many aspects of radiology reporting. It can be used to build robust radiology reporting systems and integrate them with broader, enterprise-wide information systems.


    Footnotes
 
Abbreviations: DICOM = Digital Imaging and Communications in Medicine HL7 = Health Level Seven HTML = Hypertext Markup Language SGML = Standard Generalized Markup Language SPIDER = Structured Platform-Independent Data Entry and Reporting UMLS = Unified Medical Language System XML = Extensible Markup Language


    References
 Top
 Abstract
 Introduction
 What Is XML?
 Structured Reporting
 XML in Radiology Structured...
 Discussion
 Conclusions
 References
 

  1. McEnery KW. The Internet, World-Wide Web, and Mosaic: an overview. AJR Am J Roentgenol 1995; 164:469-473.[Abstract/Free Full Text]
  2. Kahn CE, Jr, Wang K, Bell DS. Structured entry of radiology reports using World Wide Web technology. RadioGraphics 1996; 16:683-691.[Abstract]
  3. Kahn CE, Jr. A generalized language for platform-independent structured reporting. Methods Inf Med 1997; 36:163-171.[Medline]
  4. Bosak J. XML, Java, and the future of the Web. In: Connolly D, eds. XML: principles, tools, and techniques. Sebastopol, Calif: O'Reilly, 1997; 219-228.
  5. Flynn P, Allen T, Borgman T, et al. Frequently asked questions about the Extensible Markup Language (the XML FAQ); Available at: http://www.ucc.ie/xml/. Accessed July 1, 1999..
  6. International Standards Organization. ISO 8879: 1986 information processing: text and office systems—standard generalized markup language (SGML) Geneva, Switzerland: International Standards Organization, 1986.
  7. Bray T, Paoli J, Sperberg-McQueen CM. Extensible markup language (XML). In: Connolly D, eds. XML: principles, tools, and techniques. Sebastopol, Calif: O'Reilly, 1997; 29-66.
  8. World Wide Web Consortium. Extensible Markup Language (XML) 1.0: W3C recommendation 10 February 1998; Available at: http://www.w3.org/TR/1998/REC-xml-19980210. Accessed July 1, 1999..
  9. EDI News. XML touted as cure for EDI ills; Available at: http://www.commerce.net/members/portfolios/technology/edi/xml-edi/0804edi.html. Accessed July 1, 1999..
  10. Dolin RH, Rishel W, Biron PV, Spinosa J, Mattison JE. SGML and XML as interchange formats for HL7 messages. Proc AMIA Symp 1998; 720-724.
  11. HL7 SGML/XML Special Interest Group. The HL7 patient record proposal; Available at: http://www.mcis.duke.edu/standards/HL7/committees/sgml/WhitePapers/Prap/. Accessed July 1, 1999..
  12. Clark J. Comparison of SGML and XML: World Wide Web Consortium note; Available at: http://www .w3.org/TR/NOTE-sgml-xml.html. Accessed July 1, 1999..
  13. Kahn CE, Jr, de la Cruz NB. Extensible markup language (XML) in health care: integration of structured reporting and decision support. Proc AMIA Symp 1998; 725-729.
  14. American College of Radiology. Breast Imaging Reporting and Data System (BI-RADS) Reston, Va: American College of Radiology, 1993.
  15. D'Orsi CJ, Kopans DB. Mammographic feature analysis. Semin Roentgenol 1993; 28:204-230.[Medline]
  16. Tuddenham WJ. Glossary of terms for thoracic radiology: recommendations of the nomenclature committee of the Fleischner Society. AJR Am J Roentgenol 1984; 143:509-517.[Free Full Text]
  17. Austin JH, Muller NL, Friedman PJ, et al. Glossary of terms for CT of the lungs: recommendations of the Nomenclature Committee of the Fleischner Society. Radiology 1996; 200:327-331.[Free Full Text]
  18. Bell DS, Greenes RA, Doubilet P. Form-based clinical input from a structured vocabulary: initial application in ultrasound reporting. Proc Ann Symp Comput Appl Med Care 1992; 789-790.
  19. Wheeler PS, Simborg DW, Gitlin JN. The Johns Hopkins Radiology Reporting System. Radiology 1976; 119:315-319.[Abstract]
  20. Kahn CE, Jr.. Self-documenting structured reports using open information standards. Medinfo 1998; 403-407.
  21. Kahn CE, Jr. Standard generalized markup language for self-defining structured reports. Int J Med Informatics 1999; 53:203-211.[Medline]
  22. Friedman C, Hripcsak G, Shagina L, Liu H. Representing information in patient reports using natural language processing and the extensible markup language. J Am Med Informatics Assoc 1999; 6:76-87.[Abstract/Free Full Text]
  23. Bell DS, Pattison-Gordon E, Greenes RA. Experiments in concept modeling for radiographic image reports. J Am Med Informatics Assoc 1994; 1:249-262.[Abstract/Free Full Text]
  24. Lindberg DAB, Humphreys BL, McCray AT. The Unified Medical Language System. Methods Inf Med 1993; 32:281-291.[Medline]
  25. Humphreys BL, Lindberg DAB, Schoolman HM, Barnett GO. The Unified Medical Language System: an informatics research collaboration. J Am Med Informatics Assoc 1998; 5:1-11.[Abstract/Free Full Text]
  26. World Wide Web Consortium. XML linking language (XLink): World Wide Web Consortium working draft 3 March 1998; Available at: http://www .w3.org/TR/1998/WD-xlink-19980303. Accessed July 1, 1999..
  27. Oracle Corporation. XML support in Oracle8i and beyond: Oracle Technical White Paper; Available at: http://www.oracle.com/xml/documents/xml–twp. Accessed November 9, 1998..
  28. Bidgood WJ, Horii S, Prior F, Van Syckle D. Understanding and using DICOM, the medical image communication standard. J Am Med Informatics Assoc 1997; 4:199-212.[Abstract/Free Full Text]
  29. Bidgood WD, Jr, alSafadi Y, Tucker M, Prior F, Hagan G, Mattison J. The role of digital imaging and communications in medicine in an evolving healthcare computing environment: the model is the message. J Digit Imaging 1998; 11:1-9.



This article has been cited by other articles:


Home page
Clin. Chem.Home page
G. M. Saadawi and J. H. Harrison Jr
Definition of an XML Markup Language for Clinical Laboratory Procedures and Comparison with Generic XML Markup
Clin. Chem., October 1, 2006; 52(10): 1943 - 1951.
[Abstract] [Full Text] [PDF]


Home page
RadioGraphicsHome page
T. Hacklander, J. Martin, and K. Kleber
Informatics in Radiology (infoRAD): An Open Source Framework for Modification and Communication of DICOM Objects
RadioGraphics, November 1, 2005; 25(6): 1709 - 1721.
[Abstract] [Full Text] [PDF]


Home page
RadioGraphicsHome page
R. K. Taira, S. G. Soderland, and R. M. Jakobovits
Automatic Structuring of Radiology Free-Text Reports
RadioGraphics, January 1, 2001; 21(1): 237 - 245.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wang, C.
Right arrow Articles by Kahn, C. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wang, C.
Right arrow Articles by Kahn, C. E., Jr
Related Collections
Right arrow Informatics


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
RADIOGRAPHICS RADIOLOGY RSNA JOURNALS ONLINE