Schemas microsoft com office word 2003 wordml

  • Remove From My Forums
  • Question

  • I have exported a simple word document as XML. When I try to validate the following XML file with a professional XML Editor I get the error «reference to schema not found». When I try to open http://schemas.microsoft.com/office/word/2003/wordml
    with my browser it says «resource not available».

    I could not find the resource. Where can I get the schema? I would like to transform the XML file with XSL but it has to be valid.

    <?xml version=»1.0″ encoding=»UTF-8″ standalone=»yes»?>
    <?mso-application progid=»Word.Document»?>
    <w:wordDocument xmlns:w=»http://schemas.microsoft.com/office/word/2003/wordml» xmlns:v=»urn:schemas-microsoft-com:vml» xmlns:w10=»urn:schemas-microsoft-com:office:word» xmlns:sl=»http://schemas.microsoft.com/schemaLibrary/2003/core»
    xmlns:aml=»http://schemas.microsoft.com/aml/2001/core» xmlns:wx=»http://schemas.microsoft.com/office/word/2003/auxHint» xmlns:o=»urn:schemas-microsoft-com:office:office» xmlns:dt=»uuid:C2F41010-65B3-11d1-A29F-00AA00C14882″
    w:macrosPresent=»no» w:embeddedObjPresent=»no» w:ocxPresent=»no» xml:space=»preserve»>
    <o:DocumentProperties>
    <o:Title>I am just a simple word document :-)</o:Title>
    <o:Author></o:Author>
    <o:LastAuthor></o:LastAuthor>
    <o:Revision>1</o:Revision>
    <o:TotalTime>0</o:TotalTime>
    <o:Created>2014-05-09T07:42:00Z</o:Created>
    <o:LastSaved>2014-05-09T07:42:00Z</o:LastSaved>
    <o:Pages>1</o:Pages>
    <o:Words>5</o:Words>
    <o:Characters>32</o:Characters>
    <o:Lines>1</o:Lines>
    <o:Paragraphs>1</o:Paragraphs>
    <o:CharactersWithSpaces>36</o:CharactersWithSpaces>
    <o:Version></o:Version>
    </o:DocumentProperties>
    <w:fonts>
    <w:defaultFonts w:ascii=»Times New Roman» w:fareast=»Times New Roman» w:h-ansi=»Times New Roman» w:cs=»Times New Roman»/>
    </w:fonts>
    <w:styles>
    <w:versionOfBuiltInStylenames w:val=»4″/>
    <w:latentStyles w:defLockedState=»off» w:latentStyleCount=»156″/>
    <w:style w:type=»paragraph» w:default=»on» w:styleId=»Standard»>
    <w:name w:val=»Normal»/>
    <wx:uiName wx:val=»Standard»/>
    <w:rPr>
    <wx:font wx:val=»Times New Roman»/>
    <w:sz w:val=»24″/>
    <w:sz-cs w:val=»24″/>
    <w:lang w:val=»DE» w:fareast=»DE» w:bidi=»AR-SA»/>
    </w:rPr>
    </w:style>
    <w:style w:type=»character» w:default=»on» w:styleId=»Absatz-Standardschriftart»>
    <w:name w:val=»Default Paragraph Font»/>
    <wx:uiName wx:val=»Absatz-Standardschriftart»/>
    <w:semiHidden/>
    </w:style>
    <w:style w:type=»table» w:default=»on» w:styleId=»NormaleTabelle»>
    <w:name w:val=»Normal Table»/>
    <wx:uiName wx:val=»Normale Tabelle»/>
    <w:semiHidden/>
    <w:rPr>
    <wx:font wx:val=»Times New Roman»/>
    </w:rPr>
    <w:tblPr>
    <w:tblInd w:w=»0″ w:type=»dxa»/>
    <w:tblCellMar>
    <w:top w:w=»0″ w:type=»dxa»/>
    <w:left w:w=»108″ w:type=»dxa»/>
    <w:bottom w:w=»0″ w:type=»dxa»/>
    <w:right w:w=»108″ w:type=»dxa»/>
    </w:tblCellMar>
    </w:tblPr>
    </w:style>
    <w:style w:type=»list» w:default=»on» w:styleId=»KeineListe»>
    <w:name w:val=»No List»/>
    <wx:uiName wx:val=»Keine Liste»/>
    <w:semiHidden/>
    </w:style>
    </w:styles>
    <w:docPr>
    <w:view w:val=»web»/>
    <w:zoom w:percent=»100″/>
    <w:dontDisplayPageBoundaries/>
    <w:doNotEmbedSystemFonts/>
    <w:proofState w:spelling=»clean» w:grammar=»clean»/>
    <w:attachedTemplate w:val=»»/>
    <w:defaultTabStop w:val=»708″/>
    <w:hyphenationZone w:val=»425″/>
    <w:punctuationKerning/>
    <w:characterSpacingControl w:val=»DontCompress»/>
    <w:optimizeForBrowser/>
    <w:validateAgainstSchema/>
    <w:saveInvalidXML w:val=»off»/>
    <w:ignoreMixedContent w:val=»off»/>
    <w:alwaysShowPlaceholderText w:val=»off»/>
    <w:compat>
    <w:breakWrappedTables/>
    <w:snapToGridInCell/>
    <w:wrapTextWithPunct/>
    <w:useAsianBreakRules/>
    <w:dontGrowAutofit/>
    </w:compat>
    </w:docPr>
    <w:body>
    <wx:sect>
    <w:p>
    <w:pPr>
    <w:rPr>
    <w:lang w:val=»EN-GB»/>
    </w:rPr>
    </w:pPr>
    <w:r>
    <w:rPr>
    <w:lang w:val=»EN-GB»/>
    </w:rPr>
    <w:t>I am just a simple word document :-)</w:t>
    </w:r>
    </w:p>
    <w:sectPr>
    <w:pgSz w:w=»11906″ w:h=»16838″/>
    <w:pgMar w:top=»1417″ w:right=»1417″ w:bottom=»1134″ w:left=»1417″ w:header=»708″ w:footer=»708″ w:gutter=»0″/>
    <w:cols w:space=»708″/>
    <w:docGrid w:line-pitch=»360″/>
    </w:sectPr>
    </wx:sect>
    </w:body>
    </w:wordDocument>

    • Edited by

      Friday, May 9, 2014 7:58 AM

Answers

  • You should find OOXML schemas in the ECMA documents at 

    http://www.ecma-international.org/publications/standards/Ecma-376.htm

    The current «Normative» ISO equivalent (29500:2012) is listed at the bottom of this document: 

    http://standards.iso.org/ittf/PubliclyAvailableStandards/index.html

    There are actually two schemas: transitional and strict. At present, Word 2013 is the only Microsoft product that can save/open «strict».

     
    If you go and get the ISO files/documents, one of them is called c61798_ISO_IEC_29500-4_2012_Electronic_inserts.zip. Unzip that, and you will find a .zip called OfficeOpenXML-XMLSchema-Transitional.zip. Unzip that. and you will find a schema file called
    wml.xsd, but you will probably need other schemas from this folder as well. 

    The strict schema is inside OfficeOpenXML-XMLSchema-Strict.zip, inside c061750_ISO_IEC_29500-1_2012_Electronic_inserts.zip


    Peter Jamieson

    • Marked as answer by
      Steve Fan
      Tuesday, May 20, 2014 9:11 AM

I am transforming word documents to xml to compare them using the following code:

word = win32com.client.Dispatch('Word.Application')
wd = word.Documents.Open(inFile)
# Converts the word infile to xml outfile
wd.SaveAs(outFile,11)
wd.Close()
dom=parse(outFile)

The xml file I get looks like:

<?xml version="1.0" encoding="utf-8"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument w:embeddedObjPresent="no" w:macrosPresent="no" w:ocxPresent="no" xml:space="preserve" xmlns:aml="http://schemas.microsoft.com/aml/2001/core" xmlns:dt="uuid:C2F41010-65B3-11d1-A29F-00AA00C14882" xmlns:mc="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:sl="http://schemas.microsoft.com/schemaLibrary/2003/core" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml" xmlns:wpc="http://schemas.microsoft.com/office/word/2010/wordprocessingCanvas" xmlns:wsp="http://schemas.microsoft.com/office/word/2003/wordml/sp2" xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint">
    <w:ignoreSubtree w:val="http://schemas.microsoft.com/office/word/2003/wordml/sp2"/>
    <w:shapeDefaults>
        <o:shapedefaults spidmax="1027" v:ext="edit"/>
        <o:shapelayout v:ext="edit">
            <o:idmap data="1" v:ext="edit"/>
        </o:shapelayout>
    </w:shapeDefaults>
    <w:body>
        <wx:sect>
            <w:tbl>

            <w:tblGrid>
                <w:gridCol w:w="200"/>
                                       ...
            </w:tblGrid>

                <w:pict>
                        <v:shapetype coordsize="21600,21600" filled="f" id="_x0000_t75" o:preferrelative="t" o:spt="75" path="m@4@5l@4@11@9@11@9@5xe" stroked="f">
                            <v:stroke joinstyle="miter"/>
                            <v:formulas>
                                <v:f eqn="if lineDrawn pixelLineWidth 0"/>
                                ...
                            </v:formulas>
                            <v:path gradientshapeok="t" o:connecttype="rect" o:extrusionok="f"/>
                            <o:lock aspectratio="t" v:ext="edit"/>
                        </v:shapetype>
                        <v:shape id="Picture" o:spid="_x0000_s1026" style="position:absolute;left:0;text-align:left;margin-left:0;margin-top:0;width:400pt;height:40pt;z-index:1;visibility:visible;mso-wrap-style:square;mso-wrap-distance-left:0;mso-wrap-distance-top:0;mso-wrap-distance-right:0;mso-wrap-distance-bottom:0;mso-position-horizontal:left;mso-position-horizontal-relative:text;mso-position-vertical:absolute;mso-position-vertical-relative:line" type="#_x0000_t75">
                            <v:imagedata o:title="" src="wordml://03000001.png"/>
                            <w10:wrap anchory="line"/>
                            <w10:anchorlock/>
                        </v:shape>
                </w:pict> 
                                      ...

I can’t use xpath function (lxml library)
when I try for example :

import lxml.etree as et
tree = et.parse(xmlFile)
for elt in tree.xpath("//w:gridCol"):
     elt.getparent().remove(elt)

I get the following error:

 for elt in tree.xpath("//w:gridCol"):
  File "lxml.etree.pyx", line 2029, in lxml.etree._ElementTree.xpath (src/lxml/lxml.etree.c:45934)
  File "xpath.pxi", line 379, in lxml.etree.XPathDocumentEvaluator.__call__ (src/lxml/lxml.etree.c:114389)
  File "xpath.pxi", line 242, in lxml.etree._XPathEvaluatorBase._handle_result (src/lxml/lxml.etree.c:113063)
  File "xpath.pxi", line 227, in lxml.etree._XPathEvaluatorBase._raise_eval_error (src/lxml/lxml.etree.c:112894)
XPathEvalError: Undefined namespace prefix

I did some research and I guess it’s a namespace matter, but I don’t know how to fix it?

From Wikipedia, the free encyclopedia

WordProcessingML

Filename extension .XML (XML document)
Developed by Microsoft
Type of format Document file format
Extended from XML, DOC
DataDiagramingML

Filename extension .VDX (XML Drawing),
.VSX (XML Stencil),
.VTX (XML Template)
Developed by Microsoft
Type of format Diagramming vector graphics
Extended from XML, VSD, VSS, VST
SpreadsheetML

Filename extension .XML (XML Spreadsheet)
Developed by Microsoft
Type of format Spreadsheet
Extended from XML, XLS

The Microsoft Office XML formats are XML-based document formats (or XML schemas) introduced in versions of Microsoft Office prior to Office 2007. Microsoft Office XP introduced a new XML format for storing Excel spreadsheets and Office 2003 added an XML-based format for Word documents.

These formats were succeeded by Office Open XML (ECMA-376) in Microsoft Office 2007.

File formats[edit]

  • Microsoft Office Word 2003 XML Format — WordProcessingML or WordML (.XML)
  • Microsoft Office Excel 2002 and Excel 2003 XML Format — SpreadsheetML (.XML)
  • Microsoft Office Visio 2003 XML Format — DataDiagramingML (.VDX, .VSX, .VTX)
  • Microsoft Office InfoPath 2003 XML Format — XML FormTemplate (.XSN) (Compressed XML templates in a Cabinet file)
  • Microsoft Office InfoPath 2003 XML Format — XMLS FormTemplate (.XSN) (Compressed XML templates in a Cabinet file)

Limitations and differences with Office Open XML[edit]

Besides differences in the schema, there are several other differences between the earlier Office XML schema formats and Office Open XML.

  • Whereas the data in Office Open XML documents is stored in multiple parts and compressed in a ZIP file conforming to the Open Packaging Conventions, Microsoft Office XML formats are stored as plain single monolithic XML files (making them quite large, compared to OOXML and the Microsoft Office legacy binary formats). Also, embedded items like pictures are stored as binary encoded blocks within the XML. In case of Office Open XML, the header, footer, comments of a document etc. are all stored separately.
  • XML Spreadsheet documents cannot store Visual Basic for Applications macros, auditing tracer arrows, chart and other graphic objects, custom views, drawing object layers, outlining, scenarios, shared workbook information and user-defined function categories.[1] In contrast, the newer Office Open XML formats support full document fidelity.
  • Poor backward compatibility with the version of Word/Excel prior to the one in which they were introduced. For example, Word 2002 cannot open Word 2003 XML files unless a third-party converter add-in is installed.[2] Microsoft has released a Word 2003 XML Viewer which allows WordProcessingML files saved by Word 2003 to be viewed as HTML from within Internet Explorer.[3] For Office Open XML, Microsoft provides converters for Office 2003, Office XP and Office 2000.
  • Office Open XML formats are also defined for PowerPoint 2007, equation editing (Office MathML), vector drawing, charts and text art (DrawingML).

Word XML format example[edit]

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument
   xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
   xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint"
   xmlns:o="urn:schemas-microsoft-com:office:office"
   w:macrosPresent="no"
   w:embeddedObjPresent="no"
   w:ocxPresent="no"
   xml:space="preserve">
  <o:DocumentProperties>
    <o:Title>This is the title</o:Title>
    <o:Author>Darl McBride</o:Author>
    <o:LastAuthor>Bill Gates</o:LastAuthor>
    <o:Revision>1</o:Revision>
    <o:TotalTime>0</o:TotalTime>
    <o:Created>2007-03-15T23:05:00Z</o:Created>
    <o:LastSaved>2007-03-15T23:05:00Z</o:LastSaved>
    <o:Pages>1</o:Pages>
    <o:Words>6</o:Words>
    <o:Characters>40</o:Characters>
    <o:Company>SCO Group, Inc.</o:Company>
    <o:Lines>1</o:Lines>
    <o:Paragraphs>1</o:Paragraphs>
    <o:CharactersWithSpaces>45</o:CharactersWithSpaces>
    <o:Version>11.6359</o:Version>
  </o:DocumentProperties>
  <w:fonts>
    <w:defaultFonts
       w:ascii="Times New Roman"
       w:fareast="Times New Roman"
       w:h-ansi="Times New Roman"
       w:cs="Times New Roman" />
  </w:fonts>

  <w:styles>
    <w:versionOfBuiltInStylenames w:val="4" />
    <w:latentStyles w:defLockedState="off" w:latentStyleCount="156" />
    <w:style w:type="paragraph" w:default="on" w:styleId="Normal">
      <w:name w:val="Normal" />
      <w:rPr>
        <wx:font wx:val="Times New Roman" />
        <w:sz w:val="24" />
        <w:sz-cs w:val="24" />
        <w:lang w:val="EN-US" w:fareast="EN-US" w:bidi="AR-SA" />
      </w:rPr>
    </w:style>
    <w:style w:type="paragraph" w:styleId="Heading1">
      <w:name w:val="heading 1" />
      <wx:uiName wx:val="Heading 1" />
      <w:basedOn w:val="Normal" />
      <w:next w:val="Normal" />
      <w:rsid w:val="00D93B94" />
      <w:pPr>
        <w:pStyle w:val="Heading1" />
        <w:keepNext />
        <w:spacing w:before="240" w:after="60" />
        <w:outlineLvl w:val="0" />
      </w:pPr>
      <w:rPr>
        <w:rFonts w:ascii="Arial" w:h-ansi="Arial" w:cs="Arial" />
        <wx:font wx:val="Arial" />
        <w:b />
        <w:b-cs />
        <w:kern w:val="32" />
        <w:sz w:val="32" />
        <w:sz-cs w:val="32" />
      </w:rPr>
    </w:style>
    <w:style w:type="character" w:default="on" w:styleId="DefaultParagraphFont">
      <w:name w:val="Default Paragraph Font" />
      <w:semiHidden />
    </w:style>
    <w:style w:type="table" w:default="on" w:styleId="TableNormal">
      <w:name w:val="Normal Table" />
      <wx:uiName wx:val="Table Normal" />
      <w:semiHidden />
      <w:rPr>
        <wx:font wx:val="Times New Roman" />
      </w:rPr>
      <w:tblPr>
        <w:tblInd w:w="0" w:type="dxa" />
        <w:tblCellMar>
          <w:top w:w="0" w:type="dxa" />
          <w:left w:w="108" w:type="dxa" />
          <w:bottom w:w="0" w:type="dxa" />
          <w:right w:w="108" w:type="dxa" />
        </w:tblCellMar>
      </w:tblPr>
    </w:style>
    <w:style w:type="list" w:default="on" w:styleId="NoList">
      <w:name w:val="No List" />
      <w:semiHidden />
    </w:style>
  </w:styles>
  <w:docPr>
    <w:view w:val="print" />
    <w:zoom w:percent="100" />
    <w:doNotEmbedSystemFonts />
    <w:proofState w:spelling="clean" w:grammar="clean" />
    <w:attachedTemplate w:val="" />
    <w:defaultTabStop w:val="720" />
    <w:punctuationKerning />
    <w:characterSpacingControl w:val="DontCompress" />
    <w:optimizeForBrowser />
    <w:validateAgainstSchema />
    <w:saveInvalidXML w:val="off" />
    <w:ignoreMixedContent w:val="off" />
    <w:alwaysShowPlaceholderText w:val="off" />
    <w:compat>
      <w:breakWrappedTables />
      <w:snapToGridInCell />
      <w:wrapTextWithPunct />
      <w:useAsianBreakRules />
      <w:dontGrowAutofit />
    </w:compat>
  </w:docPr>
  <w:body>
    <wx:sect>
      <w:p>
        <w:r>
          <w:t>This is the first paragraph</w:t>
        </w:r>
      </w:p>
      <wx:sub-section>
        <w:p>
          <w:pPr>
            <w:pStyle w:val="Heading1" />
          </w:pPr>
          <w:r>
            <w:t>This is a heading</w:t>
          </w:r>
        </w:p>
        <w:sectPr>
          <w:pgSz w:w="12240" w:h="15840" />
          <w:pgMar w:top="1440"
		   w:right="1800"
		   w:bottom="1440"
		   w:left="1800"
		   w:header="720"
		   w:footer="720"
		   w:gutter="0" />
          <w:cols w:space="720" />
          <w:docGrid w:line-pitch="360" />
        </w:sectPr>
      </wx:sub-section>
    </wx:sect>
  </w:body>
</w:wordDocument>

Excel XML spreadsheet example[edit]

<?xml version="1.0" encoding="UTF-8"?>
<?mso-application progid="Excel.Sheet"?>
<Workbook xmlns="urn:schemas-microsoft-com:office:spreadsheet" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet" xmlns:html="https://www.w3.org/TR/html401/">
<Worksheet ss:Name="CognaLearn+Intedashboard">
<Table>
<Column ss:Index="1" ss:AutoFitWidth="0" ss:Width="110"/>
<Row>
<Cell><Data ss:Type="String">ID</Data></Cell>
<Cell><Data ss:Type="String">Project</Data></Cell>
<Cell><Data ss:Type="String">Reporter</Data></Cell>
<Cell><Data ss:Type="String">Assigned To</Data></Cell>
<Cell><Data ss:Type="String">Priority</Data></Cell>
<Cell><Data ss:Type="String">Severity</Data></Cell>
<Cell><Data ss:Type="String">Reproducibility</Data></Cell>
<Cell><Data ss:Type="String">Product Version</Data></Cell>
<Cell><Data ss:Type="String">Category</Data></Cell>
<Cell><Data ss:Type="String">Date Submitted</Data></Cell>
<Cell><Data ss:Type="String">OS</Data></Cell>
<Cell><Data ss:Type="String">OS Version</Data></Cell>
<Cell><Data ss:Type="String">Platform</Data></Cell>
<Cell><Data ss:Type="String">View Status</Data></Cell>
<Cell><Data ss:Type="String">Updated</Data></Cell>
<Cell><Data ss:Type="String">Summary</Data></Cell>
<Cell><Data ss:Type="String">Status</Data></Cell>
<Cell><Data ss:Type="String">Resolution</Data></Cell>
<Cell><Data ss:Type="String">Fixed in Version</Data></Cell>
</Row>
<Row>
<Cell><Data ss:Type="Number">0000033</Data></Cell>
<Cell><Data ss:Type="String">CognaLearn Intedashboard</Data></Cell>
<Cell><Data ss:Type="String">janardhana.l</Data></Cell>
<Cell><Data ss:Type="String"></Data></Cell>
<Cell><Data ss:Type="String">normal</Data></Cell>
<Cell><Data ss:Type="String">text</Data></Cell>
<Cell><Data ss:Type="String">always</Data></Cell>
<Cell><Data ss:Type="String"></Data></Cell>
<Cell><Data ss:Type="String">GUI</Data></Cell>
<Cell><Data ss:Type="String">2016-10-14</Data></Cell>
<Cell><Data ss:Type="String"></Data></Cell>
<Cell><Data ss:Type="String"></Data></Cell>
<Cell><Data ss:Type="String"></Data></Cell>
<Cell><Data ss:Type="String">public</Data></Cell>
<Cell><Data ss:Type="String">2016-10-14</Data></Cell>
<Cell><Data ss:Type="String">IE8 browser_Modules screen tool tip text is shown twice</Data></Cell>
<Cell><Data ss:Type="String">new</Data></Cell>
<Cell><Data ss:Type="String">open</Data></Cell>
<Cell><Data ss:Type="String"></Data></Cell>
</Row>
</Table>
</Worksheet>
</Workbook>

See also[edit]

  • List of document markup languages
  • Comparison of document markup languages

References[edit]

  1. ^ «Features and limitations of XML Spreadsheet format (broken)». Archived from the original on 2007-10-09. Retrieved 2007-11-01.
  2. ^ «Polar WordML add-in (broken)». Archived from the original on 2009-04-11. Retrieved 2007-11-01.
  3. ^ Word 2003 XML Viewer
  • Overview of Office 2003 Developer Technologies
  • Office 2003 XML. ISBN 0-596-00538-5

External links[edit]

  • MSDN: XML Spreadsheet Reference
  • MSDN: Word 2003 XML Reference
  • Lawsuit about XML patent

RRS feed

  • Remove From My Forums
  • Question

    • Edited by
      Ben Aminnia
      Friday, March 21, 2014 2:43 AM

All replies

  • Hi,

    >> What is the equivalent of this xml sample which will use a later version of xmlns schemas and will open it in Office 2007 or 2010? <<

    I think it is still supported in Office 2007 system and later versions.
    Continued support for Office 2003 XML schemas
    What about Word 2003’s XML format?

    Hope it is helpful.

    Best regards

    Fei


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.

    Click
    HERE to participate the survey.

  • Hello Ben,

    Consider moving your code to the
    Open XML Format SDK. Starting from Office 2007 the open file format is used as a default file format. However, old binary file formats are still supported.

  • Thanks for the comments.

    We currently have several hundred templates in 2003 format and our management is concerned that Microsoft has announced to retire Office 2003 on April 8, 2014.  Therefore, they are worried that, at some point sooner or later, we may not be able to use
    our 2003 WordML templates anymore.

    Is there a sample «Hello World» xml file which will do the same thing using newer versions of xmlns name spaces(either in 2006 WordML format or Open XML for Office 2007)?


    DB Architect

  • Hi Ben,

    >>Is there a sample «Hello World» xml file which will do the same thing using newer versions of xmlns name spaces(either in 2006 WordML format or Open XML for Office 2007)?<<

    There is no new name spaces for it and I think it is still suppoted in the later version of Office. As far as I test, it is supported in Word 2013. It is just same as you provided.

    And Open XML is different with WordML format. It contains sevral XML file in the pack, you can rename a docx file to zip. Then you can open it and see the content like figure below:

    And also you can open the docx file using OpenXML productivity tools like below:

    Also you may be insterted in article below:
    Word processing (Open XML SDK)

    Best regards

    Fei


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.

    Click
    HERE to participate the survey.

  • Thank you Fei,

    I understand that 2003 namespaces are still supported (and we are currently using them as well). 

    Our concern is the possibility of Microsoft discontinuing to support them sometime in the future. In which case, hundreds of our templates which are using xmlns:w=»http://schemas.microsoft.com/office/word/2003/wordml»
    will no longer work.

    Is there a document that explains the differences between the above series of 2003 schemas and the ones which reference 2006, such as the following?

    xmlns:wne=»http://schemas.microsoft.com/office/word/2006/wordml»


    DB Architect

  • Hi Ben,

    There is no such document that explains the differences between the 2003 schemas and «2006». Because the WordML and Open XML are two differnent standards.

    >>Our concern is the possibility of Microsoft discontinuing to support them sometime in the future. In which case, hundreds of our templates which are using xmlns:w=»http://schemas.microsoft.com/office/word/2003/wordml»
    will no longer work.<< 

    If Word discontinuing to support the WordML, as a workaround we can write a macro to save the XML file to new format file using the Word which support it( Word 2007, Word 2010, Word 2013).

    Hope it is helpful.

    Best regards

    Fei


    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.

    Click
    HERE to participate the survey.

  • The .docx format mentioned by the others has two representations. One is the .docx file, which is a ZIP file containing multiple XML files (and potentially other files such as .jpg files). The other is a single-file .xml format called «Flat OPC» format.
    OPC stands for «Open Packaging Conventions», which are part of the ECMA and ISO standards that define .docx format.

    An example of the beginning of a Flat OPC format fie is as follows:

    <?xml version=»1.0″ encoding=»UTF-8″ standalone=»yes»?>
    <?mso-application progid=»Word.Document»?>
    <pkg:package xmlns:pkg=»http://schemas.microsoft.com/office/2006/xmlPackage»>
        <pkg:part pkg:name=»/_rels/.rels» pkg:contentType=»application/vnd.openxmlformats-package.relationships+xml» pkg:padding=»512″>
            <pkg:xmlData>
                <Relationships xmlns=»http://schemas.openxmlformats.org/package/2006/relationships»>
                    <Relationship Id=»rId3″ Type=»http://schemas.openxmlformats.org/package/2006/relationships/metadata/core-properties» Target=»docProps/core.xml»/>
                    <Relationship Id=»rId1″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/officeDocument» Target=»word/document.xml»/>
                    <Relationship Id=»rId4″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/extended-properties» Target=»docProps/app.xml»/>
                </Relationships>
            </pkg:xmlData>
        </pkg:part>
        <pkg:part pkg:name=»/word/_rels/document.xml.rels» pkg:contentType=»application/vnd.openxmlformats-package.relationships+xml» pkg:padding=»256″>
            <pkg:xmlData>
                <Relationships xmlns=»http://schemas.openxmlformats.org/package/2006/relationships»>
                    <Relationship Id=»rId3″ Type=»http://schemas.microsoft.com/office/2007/relationships/stylesWithEffects» Target=»stylesWithEffects.xml»/>
                    <Relationship Id=»rId2″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/styles» Target=»styles.xml»/>
                    <Relationship Id=»rId1″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/numbering» Target=»numbering.xml»/>
                    <Relationship Id=»rId6″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/fontTable» Target=»fontTable.xml»/>
                    <Relationship Id=»rId5″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/webSettings» Target=»webSettings.xml»/>
                    <Relationship Id=»rId4″ Type=»http://schemas.openxmlformats.org/officeDocument/2006/relationships/settings» Target=»settings.xml»/>
                </Relationships>
            </pkg:xmlData>
        </pkg:part>
        <pkg:part pkg:name=»/word/document.xml» pkg:contentType=»application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml»>
            <pkg:xmlData>
                <w:document
                    mc:Ignorable=»w14 wp14″
                    xmlns:wpc=»http://schemas.microsoft.com/office/word/2010/wordprocessingCanvas»
                    xmlns:mc=»http://schemas.openxmlformats.org/markup-compatibility/2006″
                    xmlns:o=»urn:schemas-microsoft-com:office:office»
                    xmlns:r=»http://schemas.openxmlformats.org/officeDocument/2006/relationships»
                    xmlns:m=»http://schemas.openxmlformats.org/officeDocument/2006/math» xmlns:v=»urn:schemas-microsoft-com:vml»
                    xmlns:wp14=»http://schemas.microsoft.com/office/word/2010/wordprocessingDrawing»
                    xmlns:wp=»http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing»
                    xmlns:w10=»urn:schemas-microsoft-com:office:word»
                    xmlns:w=»http://schemas.openxmlformats.org/wordprocessingml/2006/main»
                    xmlns:w14=»http://schemas.microsoft.com/office/word/2010/wordml»
                    xmlns:wpg=»http://schemas.microsoft.com/office/word/2010/wordprocessingGroup»
                    xmlns:wpi=»http://schemas.microsoft.com/office/word/2010/wordprocessingInk»
                    xmlns:wne=»http://schemas.microsoft.com/office/word/2006/wordml»
                    xmlns:wps=»http://schemas.microsoft.com/office/word/2010/wordprocessingShape»>
                    <w:body>
                        <w:p>
                            <w:r>
                                <w:t>Hello World 2010!</w:t>
                            </w:r>
                        </w:p>
                        <w:sectPr>
                            <w:pgSz w:w=»11906″ w:h=»16838″/>
                            <w:pgMar w:top=»1440″ w:right=»1440″ w:bottom=»1440″ w:left=»1440″ w:header=»708″ w:footer=»708″ w:gutter=»0″/>
                            <w:cols w:space=»708″/>
                            <w:docGrid w:linePitch=»360″/>
                        </w:sectPr>
                    </w:body>
                </w:document>
            </pkg:xmlData>
        </pkg:part>
    …and so on

    (there is more)

    As far as I know, only the .zip representation is formally supported, although all current supported desktop versions of Microsoft Word after 2003 (2007, 2010, 2011 (mac) and 2013) can read and save Flat OPC format. The web apps (e.g. on OneDrive) cannot open
    either of the .xml formats as far as I know.

    Flat OPC format is also understood internally by some features in the Windows Word version of VBA and by the Open XML SDK. As far as I know, Microsoft has never formally documented the namespace it uses (xmlns:pkg=»http://schemas.microsoft.com/office/2006/xmlPackage»),
    unless they have done so in the Open XML SDK, although it is fairly obvious what most of the elements and attributes are for.


    Peter Jamieson

У меня есть .doc-файл, содержащий код форматирования XML. Если я открываю его в Windows с помощью MS Word, он отлично работает, показывая таблицы, тексты и т. Д.

Так выглядит MS Word:

Если я открою его в LibreOffice, (только для заголовка):

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument xmlns:aml="http://schemas.microsoft.com/aml/2001/core" xmlns:wpc="http://schemas.microsoft.com/office/word/2010/wordprocessingCanvas" xmlns:dt="uuid:C2F41010-65B3-11d1-A29F-00AA00C14882" xmlns:mc="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml" xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint" xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml" xmlns:wsp="http://schemas.microsoft.com/office/word/2003/wordml/sp2" xmlns:sl="http://schemas.microsoft.com/schemaLibrary/2003/core" w:macrosPresent="no" w:embeddedObjPresent="no" w:ocxPresent="no" xml:space="preserve">

Существует некоторый способ открыть его в ubuntu?

Обновить [ ! d7]

Я уже пробовал LibreOffice Calc, загружая источник XML, но он не работает. Точнее, когда я выбираю файл, нет ничего, чтобы отображать в поле «Карта» для документа, и я не могу нажать кнопку «Импорт». Я также попытался изменить расширение на .xml, все равно ничего.

задан
27 April 2017 в 15:07

поделиться

Понравилась статья? Поделить с друзьями:
  • Scan files to word
  • Scheduling templates for excel
  • Scan document with word
  • Scheduling template for excel
  • Scan document for word