-
Isolated field format losses with Acrobat PDF to XML exports
ReadMe3-9-2019Isolated fieldformatlosseswithAcrobatPDFtoXML_exports.txt
Adobe Acrobat Pro XML Export Bug Report
by Rich Hartness, rhartness@mabnc.org
1 Title: Isolated field format losses with Acrobat PDF to XML exports
2 Intent
I'm writing to report there seems to be a very small but significant bug in the Acrobat 2017 and Pro DC PDF to XML exporter, to share my compelling evidence, and hopefully to gain your support to authorize a code fix that will correct this issue in a future update.
3 Problem description and scope
The problem seems to exist only in XML exports. In these cases, individual fields of data are dumped into a single XML record without preserving their individual field identities in the XML code. The problem does not exist in any of the other export formats, such as Word, Excel, rich text, HTML, and spreadsheet XML.
4 Steps to reproduce the issue: Perform PDF to XML exports from Acrobat Pro 2017 or Acrobat Pro DC. When you find multiple fields in the original file showing up in a single XML record, then export the same file to Word, Excel, Rich text, HTML or spreadsheet XML, and you will likely find the problem does not exist in those other formats. I have included a sample PDF file and corresponding XML export file that demonstrates the problem in several sections.
5 OS is Windows 8.1 and Windows 10. Applications are latest Acrobat Pro 2017 and Pro DC.
6 Expected result: All individual fields of information in the source PDF should be preserved as distinct units in the XML file. Separate lines of information in the source file, like name, address, and city/state/zip, as well as column headers and individual cells in a table, should appear as distinct elements in the XML file.
7 Observed result: In multiple isolated cases, several fields of information that were separate and distinct in the source PDF file became indistinguishable members of a single XML record, without any delimiters in the XML file.
8 Attached zip containing important file samples
Zip name: Isolatedfield-formatlosseswithAcrobatPDFtoXMLexports.zip
The zip contains 7 files. They are as follows:
Source document: SampleBank1.pdf
Problematic file: SampleBank1.xml
Other file exports demonstrating greater format integrity:
SampleBank1.docx
SampleBank1.html
SampleBank1.rtf
SampleBank1.xlsx
SampleBank1_SpreadsheetXML.xml
9 Documented examples of problematic XML export behavior
The attached zip contains a redacted bank statement PDF, and 6 different file formats I exported it to.
There are 3 instances in this XML sample that illustrate this ambiguous XML field formatting export behavior. Strikingly, this bad export behavior does not exist in the other 5 file export formats. I cite them all clearly below.
•9A Instance 1 Name/address/CityStateZip
Open the Word formatted file in Word. From the top of the document, search for the text string, "JANE DOE". (not including the quotes) The cursor will land at the beginning of three consecutive lines of name and address information:
JANE DOE
1234 SESAME ST
ANYWHERE NC 12345-6789
This information is also presented as 3 distinct lines (name, address, and city/state/zip) in rich text, HTML, and spreadsheet XML.
In Excel, the line containing JANE DOE almost appears to be in the same record as the street and city/state/zip, but when that cell is pasted into WordPad, it breaks out into 3 separate lines.
By contrast, in the XML file, these same three lines of data are placed in a single XML record, separated only by a space character. The 3 fields are indistinguishable, as follows:
JANE DOE 1234 SESAME ST ANYWHERE NC 12345-6789
The record containing the exact XML code taken directly from the XML file is as follows:
<TH>JANE DOE 1234 SESAME ST ANYWHERE NC 12345-6789 </TH>
There is no way to determine where the name ends and where the street address begins. The same is true for city/state/zip. Instead of a discernible delimiter between those fields, there is only an ambiguous space character.
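To make the ambiguity concrete, here is a minimal Python sketch (not part of the original report) that parses the quoted record with the standard library's `xml.etree.ElementTree`. The variable names are illustrative assumptions; only the record string comes from the report.

```python
import xml.etree.ElementTree as ET
from math import comb

# Record quoted in this report from SampleBank1.xml.
record = "<TH>JANE DOE 1234 SESAME ST ANYWHERE NC 12345-6789 </TH>"

element = ET.fromstring(record)

# No child elements: the three source lines arrive as one text node.
child_count = len(element)

# Splitting on whitespace is the only option left, and it yields
# word tokens rather than the three original fields.
tokens = element.text.split()

# Number of ways to cut the tokens into three non-empty consecutive
# fields (name, street, city/state/zip): choose 2 cut points.
candidate_splits = comb(len(tokens) - 1, 2)

print(child_count)       # 0
print(len(tokens))       # 8
print(candidate_splits)  # 21
```

With 21 equally plausible ways to divide the text into three fields, a consumer of the XML has to guess, which is the loss of field identity described above.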
•9B Instance 2 Column headers inside the "Summary of checks written" table
Open the Word formatted file in Word, and from the top of document, search for the text string, "Amount" (not including the quotes) The cursor lands in the 3rd column of the row of table headers inside the "Summary of checks written" table. The row of table headers appears in the Word file as:
Number(tab)Date(tab)Amount(tab)(tab)Number(tab)Date(tab)Amount(tab)(tab)Number(tab)Date(tab)Amount
The row is perfectly delimited, with a single tab character between Number, Date and Amount, and two consecutive tab characters between Amount and the next Number.
The same exact presentation with one and two tab characters is also found in the rich text file.
In HTML, spreadsheet XML, and Excel, there's great format integrity, each column header is in its own column, and there is an empty column between AMOUNT and NUMBER when the table repeats further to the right.
By contrast, in the XML file, all 9 column header fields run together and appear in a single XML record as:
<P>Number Date Amount Number Date Amount Number Date Amount </P>
All 9 fields appear in that single XML record, and each column header is separated by a single space character. There's no way to tell where one column header ends and the next one begins. Similarly, there's no way to tell when the sequence of 3 columns is repeated.
•9C Instance 3 Column data inside the "Summary of checks written" table
Open the Word formatted file in Word, from the top of the document, search for the second instance of the text string, "299" (not including the quotes). The cursor is placed in the first column of the first row of table data inside the "Summary of checks written" table. The complete first row of table data appears as:
299(tab)11/30(tab)114.38(tab)(tab)305(tab)12/11(tab)30.00(tab)(tab)307(tab)12/20(tab)40.40
Perfect format integrity again. Just like the row containing the column headers, there is a single tab character between adjacent columns containing 299, 11/30, and 114.38. There are also two consecutive tab characters between the Amount and Number columns, 114.38 and 305. Individual field identification is clear and concise.
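As a rough illustration of why the tab-delimited row is unambiguous, the following Python sketch rebuilds the three check entries from it. It assumes each "(tab)" shown above is a literal tab character in the Word export; the variable names are made up for illustration.

```python
# First data row as described above, with "(tab)" written as "\t"
# (an assumption about the actual characters in the Word export).
row = "299\t11/30\t114.38\t\t305\t12/11\t30.00\t\t307\t12/20\t40.40"

# Two consecutive tabs separate the repeated column groups;
# a single tab separates Number, Date, and Amount within a group.
checks = [tuple(group.split("\t")) for group in row.split("\t\t")]

for number, date, amount in checks:
    print(number, date, amount)
```

Because the delimiters survive, the fields come back intact even if an entry were to contain a space.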
The same exact superb presentation with one and two tab characters is also true for rich text.
In HTML, spreadsheet XML, and Excel, format integrity is preserved. Each column data entry is in its own column, and there is an empty column between AMOUNT and NUMBER when the table repeats further to the right.
By contrast, and even surprisingly, the XML file contains all 36 cells of data, the entire table, in a single record, with single spaces separating each field. Without meaningful field delimiters, there's no way to distinguish any of the data fields. It is not safe to assume that table entries will never contain spaces.
Record from XML file with exact XML coding:
<P>299 11/30 114.38 305 12/11 30.00 307 12/20 40.40 298 12/3 215.00 302 12/17 15.00 311 12/26 15.00 301 12/3 50.00 306 12/17 39.70 312 12/28 5.00 300 12/4 38.10 309 12/19 43.67 310 12/28 46.70 </P>
10 Conclusion:
Adobe has done a terrific job creating PDF content and format export functionality to many other popular file formats. I have cited only the parts of the PDF to XML file export that appear broken. That's a small part of the whole. The exporter correctly identifies and passes along most of the other content and formatting. There was another table earlier in the document entitled "Transaction history", where every column header and table data field is properly exported and conveyed as a distinct element in the XML file, so the problem is not the entire approach of the XML exporter, only a couple of small instances.
The problematic formatting of the ambiguous records cited above is unique to the XML file export. Since all 5 other export formats (Word, Excel, rich text, HTML and spreadsheet XML) export those records and maintain the format integrity of each contributing field, I believe the XML export coding is broken or simply misbehaving. I hope you will please join me in recognizing this rogue XML file export behavior as a bug or flaw, and pass it along to your change control process so that it may be corrected in updates coming soon.
1 vote -
Adobe Pro fails to open this PDF, reports bad stream. Other readers open it fine
- Open this PDF in Adobe Pro
- See "bad stream" error.
Any other PDF reader (Chrome, Evince, Mac Preview, GhostScript) opens it just fine, but Adobe Pro is unhappy about it.
Is it a bug in Adobe Pro?
2 votes -
a "button" to delete/clear the contents of the last box or all the boxes on the fill form
Add a Delete / Clear "button".
Sometimes, after the details have been filled out, we want to print it or just save, but we want to leave the form open for the next customer.
Please add a visible "button" option, which will have the option to choose whether to delete the data filled out in the last box or in the entire form, so that the form is ready to be refilled.
1 vote
If you are the author of the PDF form, you can add a button with “Reset Form” using the following steps:
1. Prepare Form > Add a button.
2. Open Properties and go to the Actions tab.
3. Select Action – Reset a form and choose the fields you want to reset.
4. Close and save to have the changes applied.
Alternatively, you can also add a reset form button with JavaScript:
this.resetForm();
However, this will not allow you to choose the fields.
I hope this helps.
-
Resize Button Properties window
It would be helpful to be able to resize the Button Properties window in Adobe Acrobat Pro. Currently the window barely has enough room to display the contents. Resizing would help allow for much quicker form preparation workflows, for example, making the window larger could help minimize the need to constantly use the scrollbars in the Actions tab.
10 votes
Thank you for your suggestion. We will inform our team about this.
-
What happened to document re-size option in toolbar? Can no longer change % size of doc within window.
My biggest complaint about the latest update to Adobe is the percentage option for re-sizing documents within an open window has now been removed. I review and edit PDFs from countless customers all day long. Now the only way to change the size of the document within the window is to expand or decrease the size of the window itself, which can still be extremely limiting. Having the ability to change the size of any given document for review and edit is essential to my job. Why was the document percentage option removed from the toolbar and how can I get it back?
2 votes -
WTF . . . what happened to the "save as" feature in my Acrobat DC program????
WTF . . . what happened to the "save as" feature in my Acrobat DC program????
3 votes
Can you please provide a detailed description of the issue you faced, along with the steps to reproduce it?
Please also share the Acrobat version and OS you are using.
-
Enhance Scan
Hi,
When I include Enhance Scan in an Action Wizard action and run it against a folder with multiple files in it, it repeatedly prompts and asks for the next file. I don't want to see that prompt; when I am OCRing, for example, I can uncheck the Prompt check box.
2 votes -
print secured PDF fails in latest Acrobat Pro build
New issue with Acrobat Pro DC 2019.010.20098.
I am on Mac Mojave, 16.16.7
Attempting to print a college transcript from Parchment, a Secured PDF. Document Properties, Security tab says Printing should be allowed, but printing fails with "There were no pages selected to print" - see attached screenshot. My office mate has version 2019.010.20071 and can print this document no problem. This is the second bug I have found in this build.
1 vote -
URLs to JPEGs or .psd files
Hi, I have hundreds of URLs and need to save them as JPEGs. Each URL belongs to our catalogue, and I am assuming there is a straightforward way of converting them with some batch action.
To be clear, I am converting multiple URLs to JPEGs or .psd files and am looking to do it in a batch process.
There must be a way.
1 vote -
Prevent Adobe Acrobat Reader DC Downloads in Updates
I have purchased the full license for Adobe Acrobat Pro DC, and I am requesting that Adobe create a way to always block Reader DC from downloading during scheduled updates. Reader DC always commandeers/overrides the default settings by file type, and since I PAID for the full software product, I shouldn't have to reset my default program by file type literally every other day. Please change this. Please.
2 votes
To disable the updater, you can set the following registry key:
HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\Policies\Adobe\Acrobat Reader\DC\FeatureLockDown
"bUpdater"=dword:00000000
You can refer to the following document:
https://www.adobe.com/devnet-docs/acrobatetk/tools/PrefRef/Windows/Updater-Win.html#idkeyname_1_24776
Thanks
Atir -
Table of Contents
I would like to be able to generate a table of contents using bookmarks within a PDF. While there are some third party plugins that will do this, it would be helpful to have this feature within Acrobat.
19 votes -
Text-to-speech SELECTED text only
I don't know how Acrobat has not allowed for a Text-To-Speech option of SELECTED text only yet. It's an amazing tool for people with disabilities... especially those who need text read to them.
It is extremely frustrating having to listen to the text of a document from the beginning if you only need the last paragraph read aloud. Extremely disappointed Adobe hasn't stepped up their game.
4 votes
Hi,
We understand your problem here. We are continuously working on making a better experience for our customers.
We have kept it on our radar and will implement it in the near future.
Thanks
Rachit -
Acrobat DC process hangs
Acrobat processes hang in task manager. Clicking on a PDF file or the application shortcut will not launch the program but task manager shows Acrobat running. Killing the process and trying again still does not work. Users have to sign out or reboot several times a day to get Acrobat to launch.
Acrobat DC Pro
Windows 10 1709 build
7 votes -
Ability to rename a scanned document while in Preview Pane
I would love the ability to rename my scanned documents while viewing them in Preview Pane in my Explorer folders. I currently need to preview them, remember detailed information, close that view, then rename the document -- hoping I remembered my information correctly. It would save a TON of time and make us more efficient. Thanks!
5 votes -
Cannot open embedded pdfs in word
working on windows but I am on Mac
1 vote -
Request: Pressure Sensitive Draw Tool
Requesting pressure sensitivity on the "Draw Free Form" tool, or a separate draw tool with pressure sensitivity.
6 votes -
Loop video
When inserting a video with Acrobat Pro, allow an option to have the video loop (by default when played).
The video players should also be updated with a loop option that can be changed from the default by the PDF reader.
5 votes -
signing in every time I open program
We paid for a subscription to Adobe Acrobat DC. Why do we have to sign in each time? Adobe needs to fix this. Oh, the days when we had a CD to install and never had these problems. Also, my boss has the same problem with our Photoshop and InDesign.
1 vote -
2 votes
I am sorry for the inconvenience you are facing.
Can you share with us the acrobat version and the operating system you are using and also the details which selected options -
Acrobat CC excessive Ram usage
Acrobat CC has suddenly started using excessive amounts of RAM to the point that it is paused and has to be quit and restarted. This also happens if left alone with no files open.
Having the PitStop plugin open or not makes no difference. This is on an iMac Retina 5K (late 2014) with 16 GB of RAM, and it only started happening on Sept 1 2017. Acrobat CC has been installed for at least a year with no issues (and other versions before that have not had this issue).
The RAM usage seems to increase much more quickly if it is an image heavy file open but still happens after a time even if a small vector file is opened, and happens over a couple of hours if no files are open.
Currently I have to restart acrobat at least hourly (usually more often) or all of the software open will be paused and I have to restart the computer.
There is a thread about this on the Adobe forums with several others reporting the same issue.
33 votes
Thanks for the update Jan. Glad to know that your problem is resolved.