PDF conversion
Thread poster: Louise Mawbey
Louise Mawbey
Germany
Local time: 12:54
Member (2006)
German to English
May 17, 2022

There are a few threads on this subject in the various forums but they are quite old. Maybe there are some better options now.

What is the best tool for converting PDFs into Word so that I can translate using Studio? Some of the PDFs I have to translate are scans of certificates etc. that contain tricky formatting, such as columns, tables etc.

I've tried using the option in Word itself and the option in Studio but there are so many formatting issues that I really need something better.

Any tips would be gratefully received.

[Edited at 2022-05-18 07:01 GMT]


 
Samuel Murray
Netherlands
Local time: 12:54
Member (2006)
English to Afrikaans
Studio itself, or manually May 17, 2022

Louise Mawbey wrote:
What is the best tool for converting PDFs into Word so that I can translate using Studio?

In my experience, Studio's own conversion is better than that of any OCR program I've tried.

Some of the PDFs I have to translate are scans of certificates etc. that contain tricky formatting, such as columns, tables etc.

There comes a point at which the PDF is so unconvertible that you just have to recreate it manually in Word. When I translate diplomas etc., I take a screenshot of the file, add it as a watermark in Word, retype the source text and position it over the watermark, and then remove the watermark.


 
neilmac
Spain
Local time: 12:54
Spanish to English
Nitro Pro May 17, 2022

I use Nitro Pro, which works for most PDFs, but not for the worst, terribly clunky and incompatible kind.
And I can't speak for Studio, which is anathema to me.


 
Andriy Yasharov
Ukraine
Local time: 13:54
Member (2008)
English to Russian
Online tools May 17, 2022

Convert Scanned PDF to Word

Image to text converter using OCR online


 
Stepan Konev
Russian Federation
Local time: 13:54
English to Russian
Solid Documents Technology May 17, 2022

Studio uses Solid Converter blindly. This means that you can OCR a document with Solid Converter and then import the output as is into Studio; the result will be the same. A better option could be to use a stand-alone OCR app, then tidy up your document manually (or build it from scratch), and only then import it into Studio. This is what was recommended on the RWS Community for better OCR output.

 
Jorge Payan
United States
Local time: 05:54
Member (2002)
German to Spanish
My work flow for scanned PDFs May 17, 2022

ABBYY Finereader -> Transtools -> Studio

 
John Fossey
Canada
Local time: 06:54
Member (2008)
French to English
ABBYY Finereader May 17, 2022

It's quite expensive, but I use ABBYY Finereader, which can make outstanding conversions of most PDFs to Word. Its system of manual zoning of text, table and image areas, together with the ability to place text over an image, makes it very versatile.

 
expressisverbis
Portugal
Local time: 11:54
Member (2015)
English to Portuguese
Two more: May 17, 2022

Abbyy, already mentioned by others, and PDFelement:

https://pdf.wondershare.net/thankyou/install-pdfelement-pro-windows.html

A reasonable free tool too:

https://www.onlineocr.net/pt/


 
Louise Mawbey
Germany
Local time: 12:54
Member (2006)
German to English
TOPIC STARTER
Thanks May 19, 2022

Thanks for all the input. I'll try those solutions out and report back.

 
Radian Yazynin
Local time: 13:54
Member (2004)
English to Russian
Foxit PhantomPDF is the best May 19, 2022

In my experience, it creates Word documents very carefully, with much better results than many other brands.

 
Mario Cerutti
Japan
Local time: 19:54
Italian to Japanese
Abbyy vs Online OCR May 22, 2022

expressisverbis wrote:
https://www.onlineocr.net/pt/

Abbyy Finereader is very good at isolating the various parts of a document, but it tends to get complex tables and combinations of text and images wrong (a mix of tables and overlapping boxes, especially when there are too many independent boxes spread all over the place).

Online OCR has been giving me the best results overall, plus it's free. I haven't read their Terms of Service and Privacy Policy, but I would be very careful when submitting sensitive documents.

[Edited at 2022-05-22 00:14 GMT]


 
expressisverbis
Portugal
Local time: 11:54
Member (2015)
English to Portuguese
Privacy Sep 20, 2022

Mario Cerutti wrote:
I haven't read their Terms of Service and Privacy Policy, but I would be very careful when submitting sensitive documents.



"Secure conversion
All documents uploaded under the free "Guest" account will be deleted automatically after conversion. Output files for registered users are stored one month"
https://www.onlineocr.net/

Privacy Policy
We will not view the files that you upload using the OnlineOCR.net service. We may view your file's information (file extensions, sizes etc. but not your file contents) to provide technical support.
https://www.onlineocr.net/service/privacypolicy

In the past, I used it only rarely, as a guest, and I wasn't registered with OnlineOCR.net.
And, yes, I am very careful. The software I use is Abbyy, and I know Foxit and PDFElement also deliver good results.


 

