Misplaced Pages

Office Open XML

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

This is an old revision of this page, as edited by 69.73.191.92 (talk) at 22:55, 19 August 2007 (OOXML criticism). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Revision as of 22:55, 19 August 2007 by 69.73.191.92 (talk) (OOXML criticism)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)
The neutrality of this article is disputed. Relevant discussion may be found on the talk page. Please do not remove this message until conditions to do so are met. (Learn how and when to remove this message)
This article may contain excessive or inappropriate references to self-published sources. Please help improve it by removing references to unreliable sources where they are used inappropriately. (Learn how and when to remove this message)

Template:Distinguish2

Office Open XML (commonly referred to as OOXML or Open XML) is an XML-based file format specification for electronic documents such as memos, reports, books, spreadsheets, charts, presentations and word processing documents. The specification has been developed by Microsoft as a successor of its binary office file formats and was published by Ecma International as the Ecma 376 standard in December 2006. The format specification is available for free at Ecma International.

Office Open XML uses a number of dedicated XML markup languages in fileparts that are placed in an Open Packaging Convention file container. The format specification includes XML schemas that can be used to validate the XML syntax.

The format is currently undergoing a standardization process within the International Organization for Standardization (ISO).

Background

Microsoft Office has long been using binary file formats for electronic office documents. These formats are the most used file formats for Office documents due to the dominant market position of Microsoft Office. Microsoft used to limit licensing to these file formats only for governments and noncommercial use. Currently Microsoft offers the binary format specifications to everyone under a royalty free covenant not to sue . Because of this the formats were fairly difficult to interpret by other commercial or free software developers and never fully interoperable although a very good level of support was achieved. In 2002 Microsoft released a file format for MS Word that was based on XML instead, WordProcessingML, and in 2003 it released the full Office 2003 XML formats and made them available under royalty-free licensing.

The 2003 formats did not use a package file format but still used a single file format with embedded items like pictures as binary encoded within the XML. A new version of WordprocessingML is used in Office Open XML.

In 2004 governments and mainly the European Union recommended both OASIS and Microsoft to standardize their XML office file formats through an official standardisations organization. OASIS then based on these recommendations decided to submit their Open Office XML format for ISO standardization renamed as Open Document Format the term used in EU recommendations. Following this in December 2005 Microsoft decided to standardize the new versions of their Microsoft Office XML formats under development for MS Office 12 through Ecma (renamed as Ecma Office Open XML).

The development and standardization of Office Open XML takes place against a groundswell of interest in open, standards-based technologies by commercial and government organizations.

File format and structure

The Office Open XML file is an Open Packaging Convention package containing the individual files that form the basis of the document. In addition to XML files with Office markup data, the ZIP package can also include embedded (binary) files in formats such as PNG, BMP, AVI or PDF.

According to Microsoft, Office Open XML is backward compatible with Microsoft Office versions 2000, XP and 2003 using Microsoft Office Compatibility Pack.

Document markup languages

An Office Open XML file may contain several documents encoded in specialized markup languages corresponding to applications within the Microsoft Office product line. Office Open XML defines multiple vocabularies (using 27 namespaces and 89 schema modules.) The primary markup languages are:

  • WordprocessingML - Wordprocessing
  • SpreadsheetML - Spreadsheets
  • PresentationML - Presentation

For drawing

  • DrawingML
  • VML (deprecated)

Shared markup language materials include:

  • OMML (Office Math Markup Language)
  • Extended properties
  • Custom properties
  • Variant Types
  • Custom XML data properties
  • Bibliography

In addition to the above markup languages custom XML schema's can be used to extend Office Open XML.

The XML Schema of OOXML can be characterized as being highly generic, highly systematic and with an emphasis on reducing load time and improving parsing speed. In an a test with current implementations XML based office documents still were to be a lot slower than binary formats.. For speed, OOXML uses very short element names for common elements and spreadsheets save dates as index numbers (starting from 1899 or from 1904). In order to be systematic and generic, OOXML typically uses separate child elements for data and metadata (element names ending in Pr for properties) rather than using multiple attributes, which allows structured properties. OOXML does not use mixed content but uses elements to put a series of text runs (element name r) into paragraphs (element name p). The result is terse and highly nested in contrast to HTML, for example, which is fairly flat, designed for humans to write in text editors and is more or less congenial for humans to read.

OMML

Included with Office Open XML is Office Math ML (OMML). This is a mathematical markup language which can integrate with the WordprocessingML markup. This means that the math zones can also include word processing markup like revision markings, footnotes, comments, images and elaborate formatting and styles. The format is different from the World Wide Web Consortium (W3C) MathML recommendation but compatible through relatively simple XSL Transformations. For example Microsoft Office 2007 ships with such XSL transformation files allowing MathML to by copied from a clipboard with XSL transformation into OMML. During XSL transformation from OMML to MathML any WordprocessingML related markup should be lost because MathML does not allow for other markup in math zones.

Container structure

Office Open XML files conform to the Open Packaging Convention and different applications have characteristic directory structures and file names within these packages. An OPC-aware application will use the relationships files rather than directory names and file names to locate individual files. In OPC terminology, a file is a part. A part also has accompanying metadata, in particular MIME metadata.

Office Open XML format uses a ZIP container for packaging XML and other data files.

A basic Office Open XML file contains an XML file called .xml at the root level of the ZIP package, along with three folders: _rels, docProps, and a directory specific for the document type (for example, in a .docx word processing file that would be a word directory). The word directory contains the document.xml file which is the core content of the document.

.xml file
This file describes the content of the ZIP package. It also contains a mapping for file extensions and overrides for specific URIs.
_rels Folder
The _rels folders are where one goes to find the relationships for any given part within the package. To find the relationships for a specific part, one looks for the _rels folder that is a sibling of one's part. If the part has relationships, the _rels folder will contain a file that has one's original part name with a .rels appended to it. For example, if the content types part had any relationships, there would be a file called inside the _rels folder.
_rels/.rel
The root level _rels folder always contains a part called .rels. This URI (/_rels/.rels) and /.xml are the only two reserved URIs for parts in files that adhere to Office Open XML conventions. This is where the "package relationships" are located. Whenever one opens a file using these conventions, one always starts by going to the _rels/.rels file. All relationship files are represented with XML. If one opens it in a text editor, one will see a bunch of XML that outlines each relationship for that part. In a minimal word document containing only the basic document.xml, the top level parts are two metadata parts, and the document.xml part.
word/document.xml
This is the main part for any Word document. If one views it in an XML editor, one will see a pretty basic XML file. The body of the word processing document is contained in this part.

Relationships

Relationship files in Office Open XML

An example relationship file in Office Open XML (for example word/_rels/document.xml.rels)

<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<Relationships
  xmlns="http://schemas.microsoft.com/package/2005/06/relationships">
  <Relationship Id="rId1"
     Type="http://schemas.microsoft.com/office/2006/relationships/image"
     Target="http://en.wikipedia.org/images/wiki-en.png"
     TargetMode="External" />
  <Relationship Id="rId2"
     Type="http://schemas.microsoft.com/office/2006/relationships/hyperlink"
     Target="http://www.wikipedia.org"
     TargetMode="External" />
</Relationships>

Relationship files allow navigation of the package without having to open up each part. For example, images that are referenced in a Word document can be found in the relationship file by looking for all relationships that are of type http://schemas.microsoft.com/office/2006/relationships/image. To point to a different image, you just edit the relationship.

Hyperlink relations

The following code shows an example of inline markup for a hyperlink:

<w:hyperlink w:rel="rId2" w:history="1"> 

In this example, the URL is represented by "rId2". The actual URL is located by the corresponding "rId2" item in the accompanying relationships file. Linked images, templates, and other items are referenced in the same way. The locations of referenced items can be updated by editing the relationships file.

Embedded or linked media file relations

Pictures can be embedded or linked in the XML files using a tag:

<v:imagedata w:rel="rId1" o:title="example" />

This is the reference to the image file. In Office Open XML, all references are done via relationships. For example a document.xml part has a relationship to the image part. The actual URI is located by the corresponding "rId1" item in the accompanying relationships file. There is a _rels folder in the ZIP package, in the same directory as document.xml. Inside _rels is a file called document.xml.rels. In this file there will be a relationship definition that contains a type, an ID and a location. The ID is the referenced ID used in the XML document. The type will be a reference schema definition for the media type and the location will be an internal location within the ZIP package or an external location defined with an URL.

Standardization

Creation of Ecma 376

Microsoft stated that Office Open XML would be an open standard, and submitted it to the Ecma standardization process. On 2005-12-08 Ecma created technical committee 45 (TC45) in order to "produce a formal standard for office productivity applications that is fully compatible with the Office Open XML Formats, submitted by Microsoft". The proposal was co-sponsored by Apple Inc., Barclays Capital, BP, the British Library, Essilor, Intel, Microsoft, NextPage, Statoil ASA and Toshiba.

The TC45 committee is co-chaired by two Microsoft employees; it also includes members from Apple, Canon, Intel, NextPage, Novell, Pioneer, Statoil ASA, Toshiba and The United States Library of Congress.

At the General Assembly meeting on 2006-12-07, Ecma International approved Office Open XML as an Ecma standard (Ecma 376). The General Assembly also approved submission of the text to the Fast Track mechanism of ISO/IEC JTC 1, which — if successful — would result in it becoming an ISO standard.

A full copy of Ecma 376 or a copy in bits can be downloaded from Ecma international.

Submission to ISO

As an ISO external Category A liaison, Ecma have submitted Ecma 376 to the ISO Fast Track process, the same process available to National Standard Organizations. To meet the requirements of this process, Ecma have submitted the documents "Explanatory report on Office Open XML Standard (Ecma-376) submitted to JTC 1 for fast-track" and "Licensing conditions that Microsoft offers for Office Open XML".

Contradiction phase

The fast track process allows a 30-day review period by national standardizing bodies (NBs), during this period NBs may identify to the JTC 1 Secretariat any perceived contradiction with other JTC 1, ISO or IEC standards. If such a contradiction is alleged, "the JTC 1 Secretariat and ITTF shall make a best effort to resolve the matter". At the end of the 30-day review, 20 countries submitted responses, the majority of which were either negative or equivocal. The full text of the national bodies' submissions is available from the ISO/IEC JTC1 SC32 website.

Contradiction phase voting

The following table attempts to summarise the positions of the ISO member countries that voted in the contradiction phase. Some countries (including the United States, represented by ANSI) did not respond in the contradictions phase — member countries are not required to respond if they perceive no contradictions. However, some national bodies, such as ANSI, took an extremely narrow view of what constituted a contradiction. The large number of submissions received was unprecedented.

Country Supportive? Equivocal? Against? Abstain? Member comments
Australia x More investigation should precede fast tracking
Canada x Perceived contradictions must be resolved before any ballot
Czech Republic x DIS 29500 should be placed on the normal standards track
Denmark x A ballot will clarify the suitability of DIS 29500
Finland x DIS 29500 should be placed on the normal standards track
France x It is difficult to interpret the JTC 1 Directives
Germany x More investigation should precede fast tracking
Hungary x We abstain because of disagreement in our committee
India x We need more time
Italy x It is difficult to interpret the JTC 1 Directives
Japan x We hope IPR and technical issues are resolved in the ballot
Kenya x DIS 29500 is too large to be fast tracked
Malaysia x We have identified many contradictions
Netherlands x We abstain
New Zealand x Contradictions mean DIS 29500 should be withdrawn from fast track
Norway x More investigation should precede fast tracking
Romania x "We agree with the project as it is"
Singapore x We hope the contradictions can be resolved
Sweden x We are not in a position to have a view
UK x DIS 29500 should be placed on the normal standards track
TOTALS 1 8 6 5

Ecma responded to the issues raised during the contradiction period with a document reviewing the national bodies' comments. This document cites the objections raised by the national bodies, and shows the comments overlap considerably with material on the Web created by opponents of Office Open XML, particularly from the Grokdoc site. Microsoft employees and others have suggested that the national bodies' documents were not written by the bodies, but by Microsoft competitors; these suggestions are supported by the author metadata in Kenya's PDF submission, which contains the names of an IBM Germany employee and a Malaysian ODF supporter. However, national bodies are permitted to source technical skills as required.

Five-month ballot

The JTC 1 directives state that regardless of whether or not resolution is reached on the question of contradiction, a five-month ballot commences immediately. So, on April 2, 2007 the ISO JTC 1 Secretariat duly informed Ecma International that the five-month DIS 29500 (Office Open XML) ballot period had started and would close on September 2, 2007.

At the end of the five-month letter ballot, all the technical comments that have been made are consolidated and redistributed so that the voting nations may form a view on them in their totality. The SC34 secretariat then may decide to arrange for a special ballot resolution meeting (BRM) to take place no sooner than two and one-half months after the ballot has closed.

Response to the ballot

After the five-month letter ballot closes, the proposer (Ecma) has a chance to respond to the comments made by the national bodies that voted. They may combine, de-duplicate, label and group the comments and then attempt to liaise informally with national bodies, to try and arrive at a set of revisions that are acceptable.

The result of Ecma's activity will be a "Disposition of Comments" document — effectively a set of proposed revisions to the DIS 29500 text designed to be acceptable to all the national bodies who disapproved of the text in the letter ballot. It is this series of edits which effectively forms the agenda for the subsequent Ballot Resolution Meeting, and on which the attendees of that meeting will be asked to form opinions.

Ballot resolution process

During the six-month period (of a one-month contradiction phase, and five-month letter ballot) the national bodies are able to cast a vote of approval, disapproval or abstention. P-members are required to vote. The ballot resolution process is the process which follows this vote.

In the event there is not 100% support, or overwhelming disapproval, of DIS 29500 there can be a Ballot Resolution Meeting (BRM) if needed, in which comments submitted with ballot votes can be resolved. The BRM will have been called at the discretion of the SC34 secretariat at the end of the five-month ballot voting. The outcome of this meeting effectively decides whether DIS 29500 succeeds or fails in its bid to become a full International Standard. The DIS 29500 BRM is provisionally scheduled for Q1 2008.

Who attends the BRM?

The participants in the BRM are representatives of the national bodies, the proposer (Ecma), and support and administrative staff.

The national bodies who attend this meeting are:

  • SC34 members (who have to send representatives);
  • those that voted "disapprove" during the five-month ballot, who have a duty to send a delegation to this meeting. (JTC 1 Directives clause 13.7);
  • optionally, those who voted to "approve" DIS 29500.

The meeting is expected to attract between 40 and 100 participants.

Meeting process

During this meeting, the participants consider each of Ecma's responses to the comments gathered in the preceding process. Each comment is thus effectively "resolved" by the meeting's participants either:

  • agreeing to a proposed alteration of the text by Ecma
  • agreeing to withdraw that comment (if, for example, it is incorrect)
  • otherwise agreeing to amend the text or ignore a comment following discussion

In this way, a set of editorial changes to the text is agreed that, collectively, implies a new revised document. The meeting then agrees whether this final "implied" document is acceptable for publication as a full International Standard.

Voting procedures

JTC 1 states that decisions at the BRM should be reached preferably by consensus, but that any unavoidable votes should be taken according to normal JTC 1 procedures (JTC 1 Directives clause 13.8). According to these procedures a vote at the meeting is passed if:

  • At least two-thirds of the P-members voting shall have approved
  • Not more than one-quarter of the total number of votes cast are negative

Votes are taken only of those present at the meeting, and "P-members" for the purposes of this clause are P-members of JTC 1 (not SC 34); votes from the five-month ballot are not carried forward to the BRM. .

Abstentions are excluded from the count. (JTC 1 Directives clause 9.6)

Eligibility to vote

Any country that voted (i.e. either "approve with comments", "disapprove with comments" or "abstain") in the five-month ballot may vote at the BRM. Countries that did not vote in that letter ballot may not vote at the BRM.

Countries may change their position to any of "approve", "disapprove" or "abstain" during the course of the BRM.

The arithmetic of the votes

Voting is on a simple, one-country/one-vote basis, but the vote counting is complicated by the fact that there are two classes of voting country. Those with most voting power are the 33 JTC 1 P-Members. For any vote to pass at the BRM, at least two-thirds of these countries (that are present at the meeting), must vote positively.

If that hurdle is cleared, the votes of all countries present (including the JTC 1 P-Members) are counted. For a vote to pass, the total negative votes for this combined group must not exceed one-quarter of those countries eligible to vote.

Final outcome

If this meeting fails to agree on a final text, the proposal of OOXML for fast-tracking fails and the procedure is terminated: if the meeting does agree on a final text, any required changes are applied by the editor and OOXML is passed for publication as an ISO standard.

ISO maintenance regime

The maintenance regime for OOXML (should it become an ISO Standard) is yet to be determined. Ecma have however tabled a maintenance proposal for discussion by SC34 at a meeting scheduled to take place in December 2007 in Kyoto.

National body activity

Some countries opened their scrutiny procedure to public view during the five-month ballot:

  • The American National Standards Institute (ANSI) is publishing comments they receive on DIS 29500 here. An archive of email exchanged between members of INCITS V1 (who provide recommendations on the U.S. position) is available here. When voting on a position, the INCITS V1 committee divided, with Microsoft and Ecma TC45 members Nextpage and BP voting for "approve with comments" on the one hand, and a group including IBM, Sun and Red Hat voting for "disapprove with comments" on the other hand. The committee thus failed to reach agreement on a recommendation to its parent committee.. Subsequently the INCITS Executive Board held a ballot for a position of "Approval with comments" and to include all comments processed or non-processed by the V1 committee. This ballot did not achieve the necessary two-thirds majority which makes it likely ANSI (on behalf of the US) will abstain in the ISO ballot.
  • The British Standards Institute (BSI) used an open Wiki (read-only to the public; read/write for BSI technical committee members) to help coordinate the UK's input into the ballot. While active, the site gathered 630 comments on the text. The main page of the wiki is here, and specific comments on DIS 29500 are here. Beyond this, the deliberations and voting of BSI are confidential.
  • The Standards Council of Canada (SCC) is seeking comments on a proposal to adopt Office Open XML (Open XML) as an international open standard. The forum soliciting comments from Canadians only is here.

Licensing

Ecma International provides its standard specifications for free without copyright restrictions and under the Ecma code of conduct in patent matters which requires participating and approving member organisations to make available their patent rights under a reasonable and non-discriminatory basis (see Reasonable_and_Non_Discriminatory_Licensing).

Microsoft which is a main contributor to the Ecma standard provided a covenant not to sue. The covenant received a mixed reception, with some like Groklaw identifying problems and others (such as Lawrence Rosen) endorsing it.

Microsoft also added the Office Open XML format to their Microsoft Open Specification Promise in which Microsoft irrevocably promises not to assert any Microsoft Necessary Claims against you for making, using, selling, offering for sale, importing or distributing any implementation to the extent it conforms to a Covered Specification ("Covered Implementation"). The Office Open XML 1.0 - Ecma 376 and its predecessor Office 2003 XML format are among the covered specifications.

The Office Open XML format therefore can be used under the Covenant not to Sue or the Open Specification Promise.

In support of the licensing arrangements Microsoft commissioned an analysis from the London legal firm Baker & Mckenzie.

The Microsoft Open Specification Promise was included in documents submitted to ISO in support of the Ecma 376 fast track submission.. In response to criticism of the licensing, ECMA provided the following statements:"

  • Contributions to Ecma were made under the Ecma Code of Conduct in Patent Matters...
  • The OSP enables both open source and commercial software to implement DIS 29500."

EU defintion of an Open Standard

- With Ecma International publishing the specification for free and patents made irrevocably available on a royalty-free basis, Office Open XML conforms to the European Union definition of an open standard.

  • The standard is adopted and will be maintained by a not-for-profit organisation, and its ongoing development occurs on the basis of an open decision-making procedure available to all interested parties (consensus or majority decision etc.).
  • The standard has been published and the standard specification document is available either freely or at a nominal charge. It must be permissible to all to copy, distribute and use it for no fee or at a nominal fee.
  • The intellectual property — i.e. patents possibly present — of (parts of) the standard is made irrevocably available on a royalty-free basis.
  • There are no constraints on the re-use of the standard. .

Adoption

Office Open XML is the default Office 2007 format. For older versions such as Office 2000, Office XP and Office 2003 a compatibility pack is provided. The compatibility pack can also be used as a stand alone converter with Office 97.

  • Microsoft Office Open XML File Format Converter for Mac 0.2 (Beta)Microsoft had previously advised users of Office 2007 to save their files in the old Office binary format until a file converter is released.
  • Beta testing has started on Microsoft Office 2008 for Mac, which will support the format. The final version is scheduled to release in the mid-January 2008.
  • Corel has announced that by mid-2007 its WordPerfect Office suite will support Office Open XML as well as OpenDocument.
  • Gnumeric has limited SpreadsheetML markup language support.
  • Novell has created an Office Open XML plugin for the Novell edition of OpenOffice.org. An Office Open XML import filter is slated for inclusion in version 2.3 of the OpenOffice.org project.
  • docXConverter by Panergy Ltd. converts from WordprocessingML to Rich Text Format (RTF). DocXConverter can be used to transfer WordprocessingML data to other applications that read RTF data such as Word 97.
  • Datavis' DocumentsToGo for PalmOS supports Office Open XML documents.
  • Datawatch supports Office Open XML spreadsheets in its report mining tool Monarch v9.0

Arguments in support and criticism of OOXML standard

Arguments in support

Organizations and individuals supporting Office Open XML have provided arguments for standardization.

User base argument

The most widely used office productivity packages have previously relied on various proprietary binary file formats. For users of the binary formats there could an advantage to migrating to an open XML standard that is backward compatible with the features of previous binary file formats. Office Open XML for this purpose explicitly states as a goal of the format to preserve investments in existing files and applications

Key benefits arguments

Microsoft provided an overview of benefits in using Office open XML

  • Integration of business information with documents
  • Open and royalty-free specification
  • Compact, robust file format

Key benefits and functionality

  • Compact file format
  • Safer documents
  • Easier integration
  • Transparency and improved information security
  • Compatibility

Policy arguments

ECMA has provided the following policy arguments in favor of standardization with respect to Overlap in Scope with ISO/IEC 26300:2006 (ODF): Overlap in Scope of ISO/IEC standards is common and can serve a practical purpose, OpenXML addresses distinct user requirements, ODF and OpenXML are Structured to Meet Different User Requirements, OpenXML and ODF can and do coexist.

Microsoft attacked IBM's fundamental opposition to the Open XML standardization process

  • "Ecma almost unanimously agreed to submit Open XML as a standard for ratification by ISO/IEC JTC1 with only IBM dissenting."
  • "IBM led a global campaign urging national bodies to ... not even consider Open XML, because ODF had made it through ISO/IEC JTC1 first – in other words, that Open XML should not even be considered on its technical merits because a competing standard had already been adopted. This campaign to stop even the consideration of Open XML in ISO/IEC JTC1 is a blatant attempt to use the standards process to limit choice in the marketplace for ulterior commercial motives – and without regard for the negative impact on consumer choice and technological innovation."

Technical arguments

  • The use of the Open Packaging specification which allows for Indirection, Chunking and Relative indirection.
  • The Open Packaging Convention in OOXML specifies the ZIP format (See format spec part II The Open Packaging Conventions) making ZIP a part of a standard.
  • Due to ZIP compression files are smaller than the currently widely used binary formats
  • It supports custom data elements for integration of data specific to an application or an organisation that wants to use the format.
  • It is currently the only open document standard to define spreadsheet formulae.
  • Office Open XML contains alternate representations for the XML schemas and extensibility mechanisms using RELAX NG (ISO/IEC 19757-2) and NVDL (ISO/IEC 19757-4)
  • OpenXML contains no restriction on image, audio or video types. For example, images can be in Microsoft WMF, GIF, PNG, TIFF, PICT, JPEG or any other image type (§1:14.2.12).
  • Embedded controls can be of any type, such as Java or ActiveX (§1:15.2.8).
  • WordprocessingML font specifications can include font metrics and PANOSE information to assist in finding a substitution font if the original is not available (§3:2.10.5).
  • Alternate Content Block (§3:2.18.4) A solution to define alternate content (like an image) which can be used in various situations where a consuming application might not be capable of interpreting what a producing application wrote.
  • Internationalization supporting all kind of features needed for support by multiple nations. For example date representation: In WordprocessingML (§4:2.18.7) and SpreadsheetML (§4:3.18.5), calendar dates can be written using Gregorian (three variants), Hebrew, Hijri, Japanese (Emperor Era), Korean (Tangun Era), Saka, Taiwanese, and Thai formats and for example several internationalization related spreadsheet conversion functions.
  • Custom XML schema extensibility allowing implementations to the format with features. That can for instance facilitate conversion from other formats and future features that are not part of the official specification yet.

Criticism

The Office Open XML standard has been the subject of wide and varied debate in the software industry. Many of the participants in the approval process are generally supportive of eventual ISO standardization, but are unwilling to support the ISO fast track process until their issues are resolved. At 6000 pages long, the specification is difficult to quickly evaluate. A core point of concern is the existence an ISO document format ISO 26300:6000 and a potential overlap of the new Office Open XML format. Most critics suggest Microsoft to adopt ISO 26300:6000 as its default format for future versions of Microsoft Office. Objectors also complain that user confusion regarding the two standards would be even greater because of the similarity of the "Office Open XML" name to both "OpenDocument" and "OpenOffice".

Criticism by government bodies

The US Department of Defense cites several potential problems with the format, including :

  1. Binary information in the standard that would lead to security concerns.
  2. The referencing of unexplained backward compatibility modes that might pose a problem for third party implementers.
  3. The use of proprietary file formats within the open standard appears to cause potential intellectual property ownership concerns.

Criticism by competitors and free software and open source

The parties contributing in this debate include organizations supporting OpenDocument and advocates of free software and open source, and companies such as IBM. They have lobbied heavily against the standardization of OOXML in its current form due to technical and legal issues in the format. In addition to several specific issues noted below, an overall premise is that the format was inherently closed in many respects and thus a poor candidate for a global standard. They criticize the duplication of, overlap with, and inability to merge with the OpenDocument Format.

Various competitors of Microsoft have criticized the ISO standardization of Office Open XML, e.g., Preliminary Google reply to DIS 29500 : the consideration of ECMA-376 OOXML for ISO standardization] and IBM Comments on INCITS LB 2212 - DIS 29500.

Legal uncertainty

  • Scope of the Patent Licensing does cover only required features of the standard, but not the entire standard. Specifically Microsoft's Covenant not to sue grants patent use "that are necessary to implement only the required portions of the Covered Specification that are described in detail and not merely referenced in such Specification." Also Microsoft's Open Specification Promise only protects what is explicitly specified in the standard.

Technical criticisms

  • Reliance on application-defined behaviors to support important functionality that should be documented or supported via existing standards. For example, book 4 section 6.1.2.19 defines the "equationxml" attribute of "shape" elements, "used to rehydrate an equation using the Office Open XML Math syntax"; however, the "actual format of the contents of this attribute are application-defined".
  • Decimal time, a serial date format, is used in SpreadsheetML. The format incorrectly treats 1900 as a leap year in order to remain backward compatible with previous versions of Microsoft Excel and in Lotus 1-2-3. The criticism is twofold; dates before 1/3/1900 cannot be used and the ISO 8601 standard for the representation of time and date should have been used instead of the serial date format.
  • Use of DrawingML and VML instead of SVG, and of a new mathematical format instead of MathML. MathML and SVG are W3C recommendations. VML is deprecated and should only occur in uncommon files converted from the MS Office WordprocessingML 2003 format.
  • Internal inconsistencies and omissions. For example, book 4 section 2.18.4 lists styles such as "apples", "scaredCat", and "heebieJeebies", but does not fully define these styles. Missing properties include height, width, color depth, and orientation.
  • Inconsistent notations for percentage units. In book 4, section 2.18.85 uses predefined symbols (like "pct15" for 15%) in 5 or 2.5 percent increments, section 2.15.1.95 uses a decimal number giving the percentage, section 2.18.97 uses a number in fiftieths of a percent, and section 5.1.12.41 uses a number in thousandths of a percent.
  • Inflexible numbering format. For example, book 4 section 2.18.66 describes a numbering format that is fixed to a few countries and contradicts both the W3C XSLT recommendation and Unicode ISO 10646 standard.
  • Non-standard, inflexible paper size naming. For example, book 4 sections 3.3.1.61 define a "paperSize" attribute for which values 1 through 68 are predefined standard paper sizes such as A4 paper.
  • Non-standard language codes and color names.
  • Non-extensible bitmasks, some elements attributes are defined as bitmasks. For example, book 4 section 2.8.2.16 "sig (Supported Unicode Subranges and Code Pages)" describes the <w:sig> element, the attributes of which are all bitmasks.
  • Legacy document rendering compatibility is identified using (deprecated) tags. For example, book 4 section 2.15.3.6, "autoSpaceLikeWord95", “useWord97LineBreakRules”, “useWord2002TableStyleRules", and book 4 section 2.15.3.31, "lineWrapLikeWord6", and "suppressTopSpacingWP" for a 16-year-old version of WordPerfect.. These items should only occur in OOXML documents that were converted from predecessor Microsoft Office documents.
  • Errors in the spreadsheet formula specifications confirmed by Microsoft .

OpenOffice.org in Denmark have submitted objections to ECMA 376 to the Danish National Body (Dansk Standard) . The 22 page submission goes into some detail.

References

  1. ^ "Ecma International approves Office Open XML standard" (Press release). Ecma International. December 7 2006. Retrieved 2006-12-08. {{cite press release}}: Check date values in: |date= (help)
  2. "How to extract information from Office files by using Office file formats and schemas". Microsoft. 2007-03-27. Retrieved 2007-07-10.
  3. Brian Jones (2007-01-25). "History of office XML formats (1998-2006)". {{cite web}}: Unknown parameter |Publisher= ignored (|publisher= suggested) (help)
  4. Telematics between Administrations Committee based on IDA expert group on open document formats (2004-05-25). "TAC approval on conclusions and recommendations on open document formats". IDABC - European eGovernment Services. {{cite web}}: Unknown parameter |accesdate= ignored (|access-date= suggested) (help)
  5. Micheal Brauer(Sun) (2007-09-01). "News from IDA, ISO and TC roadmap". OASIS.
  6. "How to use earlier versions of Excel, PowerPoint, and Word to open and save files from 2007 Office programs". Microsoft. Retrieved 2007-02-09.
  7. George Ou (2007-04-27). "MS Office 2007 versus Open Office 2.2 shootout". ZDnet.com. Retrieved 2007-04-27.
  8. Murray Sargent (2007-06-05). "Science and Nature have difficulties with Word 2007 mathematics". MSDN blogs. Retrieved 2007-07-31.
  9. Tom Ngo (December 11 2006). "Office Open XML Overview" (PDF). Ecma International. p. 6. Retrieved 2007-01-23. {{cite web}}: Check date values in: |date= (help)
  10. "The new open standard safeguards the continued use of billions of existing documents". Ecma International. Retrieved 2007-01-28.
  11. "TC45 - Office Open XML Formats". Ecma International. Retrieved 2007-02-08.
  12. ^ "ISO/IEC JTC 1 Directives, 5th Edition, Version 2.0". iso. Retrieved 2007-01-28.
  13. Explanatory report on Office Open XML Standard (Ecma-376) submitted to JTC 1 for fast-track
  14. ^ Licensing conditions that Microsoft offers for Office Open XML
  15. "Response Document: National Body Comments from 30-Day Review of the Fast Track Ballot for ISO/IEC DIS 29500 (ECMA-376) "Office Open XML File Formats"" (PDF). Ecma International. 2007-02-28. Retrieved 2007-04-03.
  16. ^ "EOOXML objections". grokdoc. Retrieved 2007-01-02.
  17. Brian Jones. "A few updates on the OpenXML formats". Retrieved 2007-05-04.
  18. Stephen McGibbon. ""There is no reason to be browbeaten into thinking that there should only be one document format."". Retrieved 2007-06-22.
  19. "Office Open XML reaches next step in ISO/IEC process". Ecma International. 2007-04-02. Retrieved 2007-04-03.
  20. "Email with appended notes from INCITS/V1 meeting on 2007-07-13".
  21. Doug Mahugh (2007-07-19). "INCITS Executive Board to vote on "approve with comments". MSDN blogs.
  22. INCITS (2008-08-14). "Vote Tally for INCITSLB2212". INCITS.
  23. "What is Ecma International". {{cite web}}: Unknown parameter |Publisher= ignored (|publisher= suggested) (help)
  24. "Microsoft Covenant Regarding Office 2003 XML Reference Schemas". Microsoft. Retrieved 2006-07-11.
  25. "2 Escape Hatches in MS's Covenant Not to Sue". Groklaw. Retrieved 2007-01-29.
  26. Berlind, David (November 28 2005). "Top open source lawyer blesses new terms on Microsoft's XML file format". ZDNet. Retrieved 2007-01-27. {{cite web}}: Check date values in: |date= (help)
  27. "Microsoft Open Specification Promise". Microsoft. 2006-09-12. Retrieved 2007-04-22. {{cite web}}: Cite has empty unknown parameter: |1= (help)
  28. Baker & McKenzie (2006). "Standardization and Licensing of Microsoft's Office Open XML Reference Schema" (PDF). Baker & Mckenzie. Retrieved 2007-02-01. {{cite web}}: Unknown parameter |month= ignored (help)
  29. -Response Document- National Body Comments from 30-Day Review of the Fast Track Ballot for ISO/IEC DIS 29500 (ECMA-376) Office Open XML File Formats
  30. ^ IDABC - European eGovernment Services (2004). "European Interoperability Framework for pan-European eGovernment Services". Retrieved 2007-07-30.
  31. "Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats". Microsoft. 2006-11-06. Retrieved 2007-11-18.
  32. "Microsoft Office Open XML File Format Converter for Mac 0.2 (Beta)". Microsoft. July 31 2007. {{cite web}}: Check date values in: |date= (help)
  33. sherjo (2006-12-6). "Converters Coming! Free and (Fairly) Fast". The Office for Mac Team Blog. Retrieved 2007-03-18. {{cite web}}: Check date values in: |date= (help)
  34. Forbes (August 2, 2007). "Microsoft Delays Office for Mac Release".
  35. "Apple - iWork - Pages". Retrieved 2007-07-08.
  36. "Apple - iWork - Numbers". Retrieved 2007-07-08.
  37. "Apple - iWork - Keynote". Retrieved 2007-07-08.
  38. "OS X leopard Text Edit to Support Office 2007?". uneasysilence. {{cite web}}: Unknown parameter |acessdate= ignored (|access-date= suggested) (help)
  39. ""iPhone User's Guide"" (PDF). Apple, Inc.
  40. "Corel WordPerfect Office To Support Open Document Format and Microsoft Office Open XML". corel. Retrieved 2007-01-30.
  41. "GNOME Office / Gnumeric". GNOME.org. Retrieved 2006-07-28.
  42. "Download OpenOffice.org–OpenXML translator". Novell. Retrieved 2007-03-02.
  43. "Issue 79123 - Integrate a first version of the import filter for ooxml wordprocessing documents". OpenOffice.org. Retrieved 2007-07-09.
  44. "docXConverter - Features". panergy. Retrieved 2007-01-31.
  45. ""DocumentsToGo for PalmOS Premium Edition"". Datavis.
  46. "Datawatch Announces Availability of Monarch V.9.0; Supports Microsoft® Windows Vista™ and Extends Excel Capabilities". 2007-02-27. {{cite web}}: Unknown parameter |Author= ignored (|author= suggested) (help); Unknown parameter |Publisher= ignored (|publisher= suggested) (help)
  47. Open XML community. "Hear what Ecma has to say about Open XML (paragraph: Key benefits of Open XML)". OpenXMLcommunity.org.
  48. "Ecma Office Open XML File Formats overview".
  49. -Response Document- National Body Comments from 30-Day Review of the Fast Track Ballot for ISO/IEC DIS 29500 (ECMA-376) Office Open XML File Formats
  50. Interoperability, Choice and Open XML
  51. Rick Jeliffe (2007-07-29). "(comment on) Can a file be ODF and Open XML at the same time ?". O'Reilly XML.com. Retrieved 2007-08-06.
  52. ^ Cite error: The named reference ecma_tc45_white paper was invoked but never defined (see the help page).
  53. "Six thousand pages, one month, no chance..." Retrieved 2007-02-03.
  54. Response of US Department of Defense to Letter Ballot INCITS 2212
  55. ODF Alliance. "Office Open XML factsheet" (PDF). {{cite web}}: Unknown parameter |accesdate= ignored (|access-date= suggested) (help)
  56. Sam Hiser (June 14 2007). "Achieving Openness: A Closer Look at ODF and OOXML" (HTML). ONLamp.com. p. 1. Retrieved 2007-07-12. {{cite web}}: Check date values in: |date= (help)
  57. Achieving Openness: A Closer Look at ODF and OOXML
  58. Spolsky, Joel (2006-06-16). "My First BillG Review". Joel on Software. Retrieved 2007-01-31.
  59. Brian Jones. "Spreadsheet formula bugs". MSDN blogs.
  60. "Objections to Ecma 376 from OpenOffice.org in Denmark" (pdf). OpenOffice.org in Denmark. 2007-06-25. Retrieved 2007-07-03. {{cite journal}}: Cite journal requires |journal= (help)

See also

External links

General Office Open XML

OOXML criticism

OOXML support

Converters and tools

Categories:
Office Open XML Add topic