Convert PDF (.pdf) format to Excel (.xls)

RE: Convert PDF (.pdf) format to Excel (.xls)

Unix/Linux has no native support for the Excel worksheet format, so a direct conversion is not practical.

You could probably use PDFEdit to export the PDF as text, then import the text file into OpenOffice or LibreOffice.

Chris.

Indifference will be the downfall of mankind, but who cares?
Time flies like an arrow, however, fruit flies like a banana.

Never mind this jesus character, stars had to die for me to live.

RE: Convert PDF (.pdf) format to Excel (.xls)

(OP)
Hi Chris,

I tried the command below on CentOS 6.5, but it gave me the wrong layout: each record is split across three lines.

# /usr/bin/pdftotext -layout file.pdf file.txt

Cooperativa Tilza S.C. de A.P. de R.L. de 13 de septiembre de
Tilzapotla, Morelos
C.V. 2013
Cooperativa De Fomento Regional S.C. de 13 de septiembre de
Mérida, Yucatán
A.P. de R.L. de C.V. 2013
Cooperativa Sofic, S.C. de A.P. de R.L. de 06 de septiembre de
Oaxaca, Oax
C.V.- 2013
05 de septiembre de
Caja Popular Independencia, S.C.L.- Leon, Gto
2013
=================================================================================================

I need output like the following, tab-delimited:

Cooperativa Tilza S.C. de A.P. de R.L. de C.V. Tilzapotla, Morelos 13 de septiembre de 2013
Cooperativa De Fomento Regional S.C. de A.P. de R.L. de C.V. Mérida, Yucatán 13 de septiembre de 2013
Cooperativa Sofic, S.C. de A.P. de R.L. de C.V.- Oaxaca, Oax 06 de septiembre de 2013
Caja Popular Independencia, S.C.L.- Leon, Gto 05 de septiembre de 2013

Regards,
FPalero

RE: Convert PDF (.pdf) format to Excel (.xls)

Hi

Unless some values contain line wraps (and so spread over extra lines in the pdftotext output), you can pipe the output to sed, for example:

CODE

# "-" as the output file makes pdftotext write to stdout, so sed has something to read
/usr/bin/pdftotext -layout file.pdf - | sed 'N;N;y/\n/\t/'
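
To see what the sed expression does without a PDF at hand, here is a minimal sketch that feeds it a few of the lines quoted earlier in the thread. The function name join_triples is only an illustrative label, and GNU sed is assumed (it accepts \t in the y command). Note that it only glues every three lines together in their original order; it does not move the place or date fragments into separate columns.

```shell
# Sample lines copied from the pdftotext output quoted earlier in the thread.
sample='Cooperativa Tilza S.C. de A.P. de R.L. de 13 de septiembre de
Tilzapotla, Morelos
C.V. 2013
Cooperativa Sofic, S.C. de A.P. de R.L. de 06 de septiembre de
Oaxaca, Oax
C.V.- 2013'

# join_triples is a hypothetical helper name: it reads stdin and glues
# each run of three lines into one line (GNU sed assumed).
#   N;N        append the next two input lines to the pattern space
#   y/\n/\t/   turn the embedded newlines into tabs
join_triples() {
    sed 'N;N;y/\n/\t/'
}

# Prints one tab-separated line per three-line record.
printf '%s\n' "$sample" | join_triples
```

If GNU sed is not available, `paste - - -` should give the same three-lines-per-row, tab-separated result using only POSIX tools.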

Feherke.
feherke.ga
