Convert PDF to XLSX or XML

Convert PDF to XLSX or XML

Convert PDF to Microsoft Excel

Convert PDF to Microsoft Excel (XLSX)

  1. 1. Open the PDF in Acrobat.

  2. 2. From the global bar, select Convert.

  3. 3. From the Microsoft Excel drop-down menu, select XLSX. 

    A PDF is opened and the convert panel is open on the left. The option to convert PDF to Microsoft Excel (XLSX) is highlighted.

  4. 4. Select Convert to XLSX.

  5. 5. In the Save as dialog, select a location where you want to save the file and then select Save.

Convert PDF to XML

  1. 1. Open the PDF in Acrobat.

  2. 2. From the global bar, select Convert.

  3. 3. From the Microsoft Excel drop-down menu, select XML

    A PDF is opened and the convert panel is open on the left. The option to convert PDF to Microsoft Excel (XML) is highlighted.

  4. 4. Select Convert to XML.

  5. 5. In the Save as dialog, select a location where you want to save the file and then select Save.

PDF to Microsoft Excel (XLSX or XML) conversion settings

In Acrobat, select All tools > Export a PDF and select Microsoft Excel. From the drop down menu next to Microsoft Excel, select the format you want your Excel to be. You get XLSX and XML options. Select the settings icon to change the export preferences. 

Excel Workbook Settings Specifies whether to create a Worksheet for each table or page, or the entire document.

Numeric Settings Specifies the decimal and thousands separators for numeric data. Select one of the following:

  • Detect decimal and thousands separators using regional settings

  • Treat the following as decimal and thousands separators. Then enter or selecct separators in the respective fields.

Text Recognition Settings

Recognize Text If Needed Recognizes text if the PDF contains images that contain text.

Set Language Specifies the language setting for OCR.

Convert PDF to PostScript or Encapsulated PostScript (EPS)

You can export a PDF to PostScript for printing and prepress applications. The PostScript file includes full DSC (Document Structuring Conventions) comments and other advanced information preserved by Adobe Acrobat Distiller. You can also create an EPS file from any PDF for placement or opening in other applications. The options available depend on whether you're converting a document to PostScript or EPS. To convert PDF to PS or EPS format:

  1. 1. Open the PDF in Acrobat.

  2. 2. From the global bar, select Convert.

  3. 3. Select the drop-down menu next to Other format, as required, select EPS or PS

    A PDF is opened in Acrobat. Convert menu is open in the left pane and convert to PS option is highlighted.
    Convert to PS

    A PDF is opened in Acrobat. Convert menu is open in the left pane and convert to EPS option is highlighted.
    Convert to EPS

  4. 4. Select Convert to EPS or Convert to PS. The options will be displayed based on your choice in the previous step. 

  5. 5. In the Save as dialog, select a location where you want to save the file and then select Save.

Note

If you create EPS files for separations in Acrobat Pro, all image color spaces should be CMYK.

PDF to PS and EPS conversion settings

In Acrobat, select All tools > Export a PDF and select Other format. From the drop-down menu next to Other format, select PS or EPS. Select the settings icon to change the export preferences. 

Printer Description File The PostScript Printer Description (PPD) provides the necessary information to format a PostScript file correctly for a particular output device. Device Independent creates only composite (not color-separated) PostScript or EPS files. Acrobat Default provides a starting point and reference for creating all types of PostScript and restores all default settings for the conversion. Adobe PDF 7.0 is compatible with most devices. This option is available only in PostScript (PS) format.

ASCII or Binary Specifies the output format of image data. Binary output yields smaller files, but not all workflows can accommodate binary output.

PostScript Specifies the level of PostScript compatibility. Use Language Level 3 only if the target output device supports it. Language Level 2 is suitable for EPS files placed in another document and color-separated as part of that document. Use Language Level 2 for EPS files that you import into Microsoft applications.

Font Inclusion Specifies the fonts to be included in the PostScript. Embedded fonts are taken from the PDF; the referenced fonts are taken from the computer.

Include Comments Preserves the appearance of comments in the resulting PostScript file.

Convert True Type to Type 1 Converts TrueType fonts to Type 1 fonts in the resulting PostScript file.

Include Preview Specifies if a TIFF preview is created for the resulting EPS file. This option isn't available when saving as PostScript.

Page Range Specifies the pages you want to export. When you export files to EPS output, each page in the range is saved as a separate EPS file.

Convert PDF to Text and XML 1.0

Convert PDF to XML 1.0

To export a file in XML format, go to All Tools Export a PDF.  Select other format, then the drop-down next to other format, and then select XML 1.0.

Convert PDF to Text

To export a file in text format, go to All Tools Export a PDF.  Select other format, then the drop-down next to other format, and then select TXT.

The convert toolbar in Acrobat is shown and other option is highlighted

PDF to Text and XML 1.0 conversion settings

In Acrobat, select All tools > Export a PDF and select Other format. From the drop-down menu next to Other format, select TXT or XML 1.0. Select the settings icon to change the export preferences. 

Output File Settings

Encoding Refers to the binary values, based on international standards, used to represent the text characters. UTF-8 is a Unicode representation of characters using one or more 8-bit bytes per character. UTF-16 represents characters using 16-bit bytes. ISO-Latin-1 is an 8-bit representation of characters that is a superset of ASCII. UCS-4 is a Universal Character Set coded in four octets. HTML/ASCII is a 7-bit representation of characters developed by ANSI.

Use mapping table default uses the default character encoding defined in mapping tables, which appear in the Plug-ins/SaveAsXML/MappingTables folder. These mapping tables specify many characteristics of how the data is output, including the following default character encodings: UTF-8 (Save as XML or HTML 4.0.1) and HTML/ASCII (Save as HTML 3.2).

Generate Bookmarks Generates bookmark links to content for HTML or XML documents. Links are placed at the beginning of the resulting HTML or XML document.

Generate Tags For Untagged Files Generates tags for files not already tagged, such as PDFs created using Acrobat 4.0 or earlier. If this option isn't selected, untagged files are not converted.

Note

Tags are applied only as part of the conversion process and are discarded after the conversion. This is not a method for creating tagged PDFs from legacy files

Image File Settings

Generate Images Controls how images are converted. Converted image files are referenced from within XML and HTML documents.

Use Sub-Folder Specifies the folder in which to store generated images. The default is Images.

Use Prefix Specifies the prefix added to the image filenames if you have several versions of the same image file. Filenames assigned to images have the format filename_img_#.

Output Format Specifies the final format of images. The default is JPG.

Downsample To Downsamples image files to the specified resolution. If you don't select this option, image files have the same resolution as the source file. Image files are never upsampled.

    • Related Articles

    • Convert or export PDFs to other file formats

      Export images from PDFs Besides exporting the entire document or each page to a different format, you can export each image in a PDF to a separate image file. Note You can export raster images, but not vector objects. 1. Open the PDF in Acrobat and ...
    • Convert PDF to Word

      Convert PDF to DOCX 1. Open the PDF in Acrobat. 2. From the global bar, select Convert. 3. From the left pane, Select DOCX from the drop-down menu next to Microsoft Word. Then select Convert to DOCX. 4. In the Save as dialog, select a location where ...
    • Convert PDF to JPEG

      Convert PDF to JPEG or JPEG 2000 1. Open the PDF in Acrobat. 2. Select Convert from the global bar. 3. From the Convert menu in the left pane, select Image format, and then from the drop-down menu, select JPEG or JPEG 2000. 4. Select Convert to JPEG ...
    • Convert PDF to TIFF

      Convert PDF to TIFF 1. Open the PDF in Acrobat. 2. Select Convert from the global bar. 3. From the Convert menu in the left pane, select Image format, and then from the drop-down menu, select TIFF. 4. Select Convert to TIFF. 5. (Windows) Select the ...
    • Convert PDF to PNG

      Convert PDF to PNG 1. Open the PDF in Acrobat. 2. Select Convert from the global bar. 3. From the Convert menu in the left pane, select Image format, and then from the drop-down menu, select PNG. 4. Select Convert to PNG. 5. (Windows) Select the ...