Losing German umlauts (äöü) and other letters in the Structure Tree when using "Add Tags to Document" in Adobe Acrobat DC
- Brief Title
Losing German umlauts (äöü) and other letters in the Structure Tree when using "Add Tags to Document" in Adobe Acrobat DC (previous operation number ADB-3207012-T1M7). German language problem?
- Steps to reproduce the issue
Export a text document with German words that contain umlauts (äöü), e.g. "für, können, öfter, mögen", as a PDF without tags from InDesign or Word.
Open the file in Acrobat DC.
Go to "Tags", open the small options menu and choose "Add Tags to Document".
After the Structure Tree has been created, save and close the file.
When you open the document again in Acrobat DC and look at the tags in the Structure Tree, you can see that letters are missing from the tags: not only the umlauts, but sometimes adjacent letters as well.
Examples: "für" -> "fr"; "können" -> "knen"
Conclusion: screen readers can no longer read these words correctly, so the document is no longer accessible to blind people!
The problem doesn't exist in Acrobat X and Acrobat XI.
The problem occurs with any font and all operating systems.
The problem has already been documented with a video and example documents under the operation number ADB-3207012-T1M7.
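To check a larger set of words for this kind of damage, the comparison can be automated. The following is only a minimal sketch in Python, not an Adobe tool: it assumes you have already copied the text out of the tags by hand (no PDF library is used), and the function names `is_subsequence` and `find_damaged_words` are made up for this illustration. A word counts as damaged if the tag text differs from the source word but can be obtained from it by only dropping characters, as in "für" -> "fr".

```python
import unicodedata


def is_subsequence(short, full):
    """Return True if `short` can be made from `full` by only deleting characters."""
    remaining = iter(full)
    # `ch in remaining` advances the iterator, so character order is respected.
    return all(ch in remaining for ch in short)


def find_damaged_words(expected_words, tag_words):
    """Pair source words with tag words by position and report lost characters.

    Hypothetical helper: both lists are assumed to have been typed or copied
    in by hand, in the same order.
    """
    damaged = []
    for expected, actual in zip(expected_words, tag_words):
        # Normalize so that "ä" as one code point and "a" + combining
        # diaeresis compare as equal.
        expected = unicodedata.normalize("NFC", expected)
        actual = unicodedata.normalize("NFC", actual)
        if actual != expected and is_subsequence(actual, expected):
            damaged.append((expected, actual))
    return damaged


# The first two pairs are the examples from this report; "öfter" is included
# as an illustrative intact word.
source_words = ["für", "können", "öfter"]
words_in_tags = ["fr", "knen", "öfter"]

print(find_damaged_words(source_words, words_in_tags))
# prints [('für', 'fr'), ('können', 'knen')]
```

The check only detects dropped characters. Substituted characters would need a different comparison.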
- Operating system
The error occurs in all operating systems.
- Expected results
Lossless text in the Structure Tree, including umlauts and all other letters, so that the document can be read by screen readers.
- Observed result
Missing letters in the Structure Tree after using "Add Tags to Document", mainly in combination with the umlauts äöü and umlaut ligatures.
Hello, I know this really "annoying" problem as well. Creating accessible PDF files is becoming more and more important these days, especially for my clients in the social sector. I have been using Acrobat CC for a few years (currently on macOS Big Sur 11.2.1), but fortunately I didn't scrap my old iMac, so I can still use my old Acrobat 9 (!!!) for the first step of creating a structure tree. Otherwise I would also have the problem with the German umlauts »öäü«. I can see in this forum that Adobe has known about this problem since at least 2018.
Looking forward to a solution. This is, sorry for the blunt words, kind of embarrassing!!!
I told you in 2018 about the problem with the umlauts in the structure tree and nothing has happened until now! Last week I had Adobe Acrobat DC release 2021.001.20138, but the error is still there.
Please take it seriously! My colleagues and I cannot update our Macs to Big Sur, because you no longer support Adobe Acrobat XI. Otherwise we would have to switch between platforms on which Adobe Acrobat XI can still run. You can't work like that!
Thank you in advance.
I can confirm the issue with PDFs from MS Word. French glyphs (éàè) work, even Arabic does. But umlauts do not.
The issue does not occur when the document was created with LuaLaTeX.
In pdfLaTeX documents, äöü are converted to ŁoŁ in the tag tree. In reflow view you get ¨a etc.
We are still waiting for the release!
It is incredibly important, because the new macOS Catalina doesn't support Acrobat X and XI.
What should we do?
I'm excited. Can't wait till release.
Anette Banck commented
By law, every official publication has to be accessible.
So for city councils and organisations, accessible PDF documents are essential to comply with the law!
And I have to produce these PDFs for them!
But WITH äöü !!!
Anette Banck commented
Hello admin of Acrobat DC,
the same thing as above happens to me!!
When converting to PDF, tagged and all, all German umlauts
ä ö ü are gone!! Horrible for us Germans, and surely also for Turkish, Finnish or Hungarian.
Are you only programming Acrobat DC for English- and Spanish-speaking users?
Thank you for QUICK help!!
Anette Banck, Southern Germany
Matthias Klaus commented
Hello, I work for the German radio/TV station (Deutsche Welle) that Nicole mentioned in her comments. As long as the problem is not fixed, we are not able to produce accessible PDF documents using Acrobat.
Hello Team, I am still waiting for feedback from you. Can you tell me how this will proceed? This problem is very urgent. Two weeks ago, I gave a workshop on accessibility at a German television and radio institution, »Deutsche Welle«, and they have the problem, too. The biggest problem is that their Acrobat XI license is ending and they now have to use Acrobat DC. And now they are unable to generate the structure tree automatically without losing the German umlauts. All the documents are unusable!!! (They work with Microsoft.) Thank you for a fast response.
Phillip Roth commented
Hello Team, the exact same thing is also happening with my Acrobat DC on OS X 10.13.6.
Yes, I can confirm that it happens with almost all files containing the German umlauts »öäü«. But the problem doesn't exist when you create the file with Acrobat.
We have the problem on different OS X and Windows versions.
I create a text file in InDesign with words like »für, können, ähnlich« etc. and export a PDF without tags.
Now I open the PDF file in Acrobat DC and create a structure tree with the function »Add Tags to Document«.
When I open the structure tree and look at the text fragments in the tags, I can see that »für« becomes »fr«.
Sometimes this phenomenon occurs after saving this file.
It's a shame, because we don't have the problem with Acrobat X and XI!
I am sending you a PDF with this problem enclosed. Please delete the structure tree in that file and create a new one with »Add Tags to Document«.
Or is it possible to send you a screen recording? It is a big problem for our accessibility work!
I am currently working with macOS High Sierra 10.13.6.