Microsoft Office Online
Sign in to My Office Online (What's this?) | Sign in

 
 
Products
Search
Search
 
Check for updates: (c) Microsoft
Microsoft Update
 
 
 
Warning: You are viewing this page with an unsupported Web browser. This Web site works best with Microsoft Internet Explorer 6.0 or later, Firefox 1.5, or Netscape Navigator 8.0 or later. Learn more about supported browsers.

Email this linkEmail this link Printer-Friendly VersionPrinter-Friendly Version Bookmark and ShareShare
Ecma Office Open XML Formats frequently asked questions
 
In this article


Overview

ShowWhat are the benefits of Office XML Formats?

The new Office XML Formats introduce a number of benefits not only for developers and the solutions they build, but also for individual users and organizations of all sizes.

Integration of business information with documents
Office XML Formats enable rapid creation of documents from disparate data sources, accelerating document assembly, data mining, and content reuse. Exchanging data between Microsoft Office applications and enterprise business systems is simplified?alter information inside a Microsoft Office document or create a document from scratch using standard tools and technologies; access to Microsoft Office applications is not required. Employees can improve productivity by publishing, searching, and reusing information more quickly and accurately in the environment they choose.

Open and royalty-free specification
Office XML Formats are based on industry standard XML and ZIP technologies, support full integration by any technology provider, and are available via a royalty-free license. Format specifications will be published and made available under the same royalty-free license that exists for the Microsoft Office 2003 Reference Schemas?openly offered and available for broad industry use.

Compact, robust file format
Smaller file sizes and improved recovery of corrupted documents enable Microsoft Office users to operate efficiently and confidently and reduces the risk of lost information. Office XML Formats use ZIP compression technology to store documents offering potential cost savings as it reduces the disk space required to store files and decreases the bandwidth needed to transport files by e-mail, over networks, and across the Web.

ShowHow is this different from the binary (.doc, .xls, and .ppt) file formats?

The new defaults are different because they will be based on XML. As a result, you'll be able to open and automatically edit Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint files with any XML processing program without having to "Save As" XML. The new formats build on Microsoft's earlier commitment to XML by improving file and data management, reducing file sizes, and improving recovery of damaged files. It will be easier than ever for users to access data from their Word, Excel, and PowerPoint files and integrate that information with back-end systems and processes.

ShowWhy is this an improvement over the current binary file formats?

The Office XML Formats will offer some key improvements over the current binary file formats in use within Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint today. Because these file formats are compressed, the resulting document sizes will be much smaller, in-between 50 and 75 percent in some cases. The file formats architecture also improves recovery of damaged files. Developers gain more granular control of the content within the files allowing them to index, remove sensitive information, and dynamically assemble documents.

ShowWhy did Microsoft change the file formats for Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint?

Microsoft developed the Office XML Formats to respond to customer requests for more rapid document creation from disparate data sources and to extend our commitment to industry-standard XML. By creating a new robust, yet compact, structure based on industry standards such as XML and ZIP, the new default file formats speed document creation while reducing the size of Word, Excel, and PowerPoint files and improving data recovery in corrupted files.

ShowIs this the same thing as Microsoft Windows Metro?

No. Office XML Formats use some of the conventions described in the Windows Metro Specification, however, the formats are different in several important ways. Metro is a paginated, fixed document format introduced for Microsoft Windows Server 2008. The Office XML Formats are fully editable file formats for Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint. While they share similarities in their use of XML and ZIP compression, they are different in file format design and intended usage.

ShowDo Windows Metro and Office XML Formats have the same archiving features?

No. While both formats utilize XML and ZIP, they are designed with different purposes in mind. Metro is a layout-focused document format being introduced by Windows Server 2008 to allow customers to effortlessly create, share, print, and archive documents. Office XML Formats are new default file formats that will be used specifically in Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint to enable more rapid creation of documents from disparate data sources, facilitating document assembly as well as data-mining scenarios.

ShowHow will the file format be different from the XML file formats used in Microsoft Office 2003?

The new file formats differ from the current Office 2003 XML Reference Schemas in two ways:

  • XML files are ZIP compressed to reduce document sizes

  • Data within a document (for example, comments, charts, images, document metadata) is segmented and stored in different components with the file. This modular, compact design is small but easy for developers and other programs to access.

When posted publicly, the new specifications will have an open, royalty-free license.

ShowWhere can I find more information about the Office XML Formats and the 2007 Microsoft Office system?

For the latest announcements leading up to the release of the 2007 Microsoft Office system, formerly code-named Office "12", including feature comparisons, developer tools, and deployment resources, visit Future of Microsoft Office Web site.

Brian Jones, a Program Manager and 6-year veteran of the Microsoft Office team, maintains a blog on the Microsoft Developer Network (MSDN) devoted to the XML file formats in Office. Brian's blog includes discussion, links to developer resources, and insights into the 2007 Microsoft Office release decision making processes.

Top of Page Top of Page

Microsoft Office programs and Office XML Formats

ShowWhat applications will use the new file formats?

Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint will use the new file formats in the next release of Microsoft Office products.

ShowWill the old .doc, .ppt, and .xls file formats be available for future releases?

Yes. The current file formats will be available with the next release of the Microsoft Office system of products.

ShowWhat about Microsoft Office 2003 programs such as InfoPath, Access, Project, OneNote, Publisher, FrontPage, Visio, and Outlook? Will they incorporate the new Office XML Formats?

Currently, only Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint will use Office XML Formats. This does not prevent other applications from doing so in the future. We chose to introduce Office XML Formats with Word, Excel, and PowerPoint because over fifty percent of developers participating in our Microsoft Office 2003 Reference Schema program use Word and Excel's Reference Schemas in their solutions?PowerPoint was added due to customer request.

InfoPath currently uses a native XML file format, called XSN. We are looking at ways to leverage this new file format for other Microsoft Office System technologies such as Visio, Access, Project, OneNote, Publisher, FrontPage, and Outlook based on feedback we hear from our customers.

ShowIs this the first time Microsoft Office will support XML?

No. While the new Office XML Formats will mark the introduction of XML as the default file formats in Microsoft Office Word, Microsoft Office Excel and Microsoft Office PowerPoint, this is actually an evolution of our existing Office XML strategy. Microsoft Office introduced XML with Microsoft Office 2000, using it as part of the HTML file format to represent some features that did not exist in the HTML standard at that time, such as meta data and vector graphics. We introduced SpreadsheetML as an optional file format in Microsoft Office Excel 2002, and WordprocessingML as an optional format in Microsoft Office Word 2003. Microsoft Office 2003 Editions also marked a big step forward by introducing the new XML-based forms application?Microsoft Office InfoPath 2003?as well as support for custom-defined XML schemas in Word, Excel, and InfoPath. All of the XML formats available in previous releases of Microsoft Office will be fully supported in the 2007 Microsoft Office release.

ShowWill there be an XML file converter for people using previous releases that don't support XML?

Yes. Microsoft will release converters for Microsoft Office XP and Microsoft Office 2003, so customers using previous releases of Microsoft Office will be able to open files created using the new file formats.

ShowWill the converter be able to read/interpret/apply information rights management (IRM) policies?

The converters we are creating for prior releases of Microsoft Office will read, honor, and apply IRM protection to documents. A document that has been IRM protected will continue to be protected regardless of whether the recipient is using the 2007 Microsoft Office system, Microsoft Office 2003, or a previous release.

ShowWhy didn't you keep the old file formats as the default?

To maximize the benefit of reduced file size and deliver out-of-the-box data recovery benefits, the new file formats were set to be the default. This will provide information workers with the benefit of the new file formats without impacting their daily work. Office XML Formats enable the rapid creation of documents from disparate data sources, easier data mining, and content reuse in Microsoft Office.

ShowCan users of older Microsoft Office releases view the new file formats?

Yes. We will release updates for Microsoft Office 2000, Microsoft Office XP, and Microsoft Office 2003 to enable customers using older releases of Microsoft Office to read, edit, and save files using the new Office XML Formats.

Top of Page Top of Page

Office XML Formats usage benefits

ShowDoes the new format enable greater interoperability than the previous Microsoft Office 2003 formats?

The new file formats will improve file and data management by reducing file sizes and improving the recovery of corrupt or damaged files. Interoperability with servers and other applications will also improve because other software tools will be able to more easily access the XML document structure to identify or interact with specific document components, such as comments, document metadata, or embedded code. Because the file formats will have a published specification with a royalty-free license, any customer or technology provider will be able to use the file formats in their own systems.

ShowHow will these new formats enable integration with back-office systems?

Similar to Microsoft Office 2003's Reference Schemas, the new Office XML Formats are open, published document formats. XML-based file formats enable developers to access specific contents within files without having to parse entire documents. Users can open and automatically edit Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint files with any XML processing program, without having to "Save As" XML.

Microsoft developed the Office XML Formats to respond to customer requests for more rapid document creation from disparate data sources and to extend our commitment to industry-standard XML. By creating a new robust, yet compact, structure based on industry standards such as XML and ZIP, the new default file formats speed document creation while reducing the size of Word, Excel, and PowerPoint files and improving data recovery in corrupted files.

ShowHow do the Office XML Formats aid in simplifying management of personally identifiable information?

Personally identifiable information is data in files that can be used to identify a specific user or author of a document. Typically, this information is found within the "Properties" dialog of the authoring application and may include information such as the authors name, location of the document, and creation date. Office XML Formats store this type of data separately within a document making it easy for programs to remove this information cleanly, with no impact to the remainder of the document.

ShowWould comments and tracked changes be considered personally identifiable information?

Yes. Tracked changes and comments are stored as a separate data component within the file. This enables users to easily identify the presence of comments within documents and eliminate those comments prior to sharing the document.

ShowCan I can remove tracked changes and comments without changing the rest of the file?

Yes. Users can access and remove the comments and tracked changes within a file without impacting other components in the document.

ShowWhat are document assembly scenarios and how will the file format enable such scenarios? Why isn't this possible today?

Document assembly is the ability to construct documents from existing, commonly-used data. These processes can be done manually, by a user, or on a server with no user intervention. Office XML Formats modular file format architecture enables the addition, modification, or subtraction of document contents more easily than is possible today. For example, an updated corporate logo can be added to every document in a repository by a program that understands how to access the images inside each document. This could be done manually, by a user, or programmatically, on a server.

ShowHow do Office XML Formats help ensure consistency, so unintended or corrupt data is not loaded? Is there the risk that valuable data could be lost if it a file gets corrupted?

The new file formats helps ensure greater data consistency by allowing users to:

  • More easily include data from back-end databases, so all users working with the document have accurate, current information.

  • Use the format's modular architecture to preserve and open undamaged components of a file

  • Open only the code their IT departments deems safe. IT departments can set policies for their organizations to prevent unintended code from being loaded when a document is opened by their employees.

  • Potentially quell the spread of viruses. Developers can decide whether or not to allow embedded code to execute based on file type?applications will be able to read at-a-glance whether there is a macro embedded in the document.

ShowHow will these new file formats impact data-recovery efforts?

Because the data within the file is segmented and stored separately, documents are at much lower risk for loss. Any individual component that is damaged will not prevent the rest of the document from loading successfully. For example, if a document has a damaged chart, only the chart will fail to load, and the rest of the document will remain intact. Additionally, file components that do become damaged can be more successfully repaired.

ShowWill customers be able to "Save As" to different versions of the formats?

Yes. Similar to the capability offered in current Microsoft Office products, users will have the ability to select a different document format when saving documents from within Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint.

ShowWhy do these file formats produce smaller document sizes?

Internal data storage of the Office XML Formats is segmented into discreet components?for example, embedded code is stored separately from the document content inside the file. Each component within a single file is compressed using ZIP compression technology. In addition to the compression of each document segment, the entire document is also compressed using ZIP compression. As a result, the sizes of Microsoft Office Word documents, Microsoft Office Excel spreadsheets, and Microsoft Office PowerPoint presentations are reduced. The amount of compression and size savings varies per file.

ShowDoes this mean I will have to unzip my files before I open them in Microsoft Office applications?

No. When opening the documents in Microsoft Office, the compression and de-compression happens automatically; users are not required to manually apply compression, and they are never asked to unzip a document.

ShowWill the file compression reduce the quality of images stored in my documents?

No. The ZIP compression technology will not compress images.

ShowCan the file format be modified or customized?

The new Office XML Formats support the inclusion of custom-defined schemas; these facilitate storage of XML data inside Microsoft Office Word documents and Microsoft Office Excel spreadsheets without converting to Word or Excel format. Custom-defined data is stored within its native structure within the file.

ShowDo Office XML Formats have implications for business workflow? For example, is there a way to prevent data from Microsoft Office documents from flowing based on rules I can create?

Workflow is definitely a key issue for the 2007 Microsoft Office release, but it is too early in the development process to discuss any details?we'll be in touch as we have more to share.

ShowWhat impact will the file formats have in enabling better security? Isn't there still a big risk for XML-based viruses being built?

The new Office XML Formats improve security in the sense that files with embedded code and macros are easy to identify and isolate. Files that are not enabled to run code or macros will ignore any code that is embedded. This will improve security by isolating instances of embedded code down to those situations where it is enabled by an administrator.

ShowDo the Office XML Formats offer password protection benefits?

The new Office XML Formats can be password protected using similar functionality to what the current Microsoft Office products use. This will enables users to add a password to a document to prevent opening or modifying a file.

ShowIs there a potential security benefit in helping to manage macro-enabled files?

Yes. The new Office XML Formats will improve security in the sense that files with embedded code and macros are easy to identify and isolate. Files will not be enabled to run code or macros and will ignore any code that is embedded. This will improve security by isolating instances of embedded code down to those situations where it is enabled by an administrator.

ShowCan companies prevent macros from being run inside Microsoft Office applications with the new file formats?

Yes. With Office XML Formats, any embedded code or macros stored within the document will not execute.

Top of Page Top of Page

Preparation and deployment

ShowWhat is Microsoft doing to help prepare customers and partners for the file format change?

Microsoft is working hard to ensure the new file formats are compatible with prior Microsoft Office releases and that the current file formats are fully supported in the next release of the Microsoft Office system of products. We are creating deployment tools, resources, and assistance for our customers and partners. To ensure a smooth transition, we are publishing the file formats specifications prior to product availability enabling customers, partners, and developers to examine the formats before they begin receiving or creating files.

ShowCan the default file formats be changed in the 2007 Microsoft Office release?

Yes. The default file formats can be changed during the installation of Microsoft Office by an individual user or by an IT professional during broader group deployments.

ShowCan a person choose his/her default file format?

Yes. After installation of the software, a user can elect to change which file format is used as the default. Users will also have the ability to choose which format is used when any individual document is saved from within Microsoft Office Word, Microsoft Office Excel, or Microsoft Office PowerPoint.

ShowCan IT professionals opt not to have XML as the default file format?

Yes. The default file format can be changed to a different default during deployment by the IT department, or, an individual can elect to change which file format is used as the default after installation.

ShowCan the file format default be set during deployment for multiple users?

Yes. Organizations will be able to set the file format for their employees during the installation process.

ShowCan the default ever be changed after the 2007 Microsoft Office release is installed?

Yes. After the 2007 release is installed, users will be able to select which file format is used as the default file format within Microsoft Office Word, Microsoft Office Excel, and Microsoft Office PowerPoint.

ShowWill third-party party software vendors be able to use these new formats?

Yes. We will publish the specification publicly prior to product availability. As a result, any solution provider or developer will be able to obtain the Office XML Formats specification via a royalty-free license. This enables third-party developers to integrate the file formats into their solutions without financial consideration to Microsoft.

Top of Page Top of Page

Microsoft Office 2003 historical XML FAQ

ShowHow do users and businesses benefit from XML in the Microsoft Office System currently?

XML enables companies to capture information in such a way that it can be repurposed and reused however and whenever the organization needs to use it, regardless of platform. Building on the XML support in the Microsoft Office System, customers can improve data flow throughout organizations and build customized business process and productivity solutions that help information workers make a greater impact on their business.

For example, information individuals create or capture on their desktops can now be connected directly to key business processes via XML, streamlining the management of those processes and reducing the need to re-key information in disparate systems. Likewise, XML can unlock information currently stored in back-end systems, which can then be processed and re-purposed on the desktop in the Microsoft Office applications with which people are already very familiar.

ShowBut aren't the file formats used by Microsoft Office 2003 just Microsoft's own flavor of XML?

The XML support in Office 2003 is based on industry standards. We support these industry standard XML in two ways:

  • In the presentation of data, with the work we've done with our Office 2003 Reference Schema program, and

  • In the underlying structure of the data itself, with our support for custom defined schema. What this means is that we support any XML schema based on the W3C XSD standard. So customers can tap into industry schemas that are being developed and used now in healthcare, insurance, manufacturing, and more.

ShowDo other people need to be using Microsoft Office 2003 in order to see this XML-based information?

Not at all. The XML-based information is 100 percent based on industry-standard XML, and any application that supports XML and custom-defined schemas will be able to process the documents.

ShowWhat is the difference between reference schemas and custom-defined schemas and what is the benefit of Microsoft supporting both?

The difference is in what the schemas are intended to portray. The reference schemas we released for Microsoft Office Excel 2003, Microsoft Office Word 2003, Microsoft Office InfoPath 2003, and Microsoft Office Visio 2003 earlier are what we refer to as "display oriented" schema. That is, they are focused on describing how information would look and be displayed (such as how a Visio diagram would appear to a user).

Custom schema, on the other hand, represents what are commonly referred to as data-oriented schemas. Like other markup languages, XML uses tags to define specific elements within a document. XML tags define the document's structural elements and the meaning of those elements. As a result, XML can be used to define the document structure and content?not just the look and feel. Within the Office System XML infrastructure, Microsoft supports "custom-defined" schemas which customers can define or create for themselves or that can be defined through an industry process.

Top of Page Top of Page

advertisement