SED / AWK

abovebrd · Jun 14, 2000

I am in need of some help. On a routine basis I have large database files that I need to extract data from. I then need to put the extracted data into new files. The data is delimited. Example:

****
Joe Smith
408-123-4567
Sample Co Inc.
(followed by several more lines)
****

The common delimiter between each set of data is ****. Each set is moved to a new file, and I repeat this 100 or so more times until the source file is empty. I have the ability to change the delimiter. The new files can be named anything; the only requirement is that they have a .tag extension.

My question: does anyone know how I can automate this process through a script? I think SED would accomplish this, but I am not familiar enough with SED to do it. Any thoughts on automating this task would be helpful.

Danny
dannyd@aboveboardelectronics.com

Annihilannic · Jun 22, 2000

Danny, It sounds like awk might be the tool for the job.  You'll have to describe the output format you need more explicitly for a more detailed answer though... do you need it to be comma delimited rows?  SQL insertion statements? Annihilannic.

abovebrd · Jun 23, 2000

Here is a sample of my source file:

cvr=none tfn=408-573-5542 fll=<<EOF
ABC Company
00402932 IN 05/26/00     16.00      0.00     16.00      0.00      0.00      0.00
00403326 IN 05/31/00    197.32      0.00    197.32      0.00      0.00      0.00
                    ------------------------------------------------------------
                        213.32      0.00    213.32      0.00      0.00      0.00
EOF
cvr=none tfn=408-573-5542 fll=<<EOF
XYZ Company
00405461 IN 06/13/00    144.29      0.00    144.29      0.00      0.00      0.00
                    ------------------------------------------------------------
                        144.29      0.00    144.29      0.00      0.00      0.00
EOF

I need to grab all of the information between cvr=none and EOF and move it to a new file. I then need to repeat the process on the next occurrence, and so on, until all occurrences have been moved to new files. My source file has around 30,000 lines, and around 500 files would be generated from this process. If you have any ideas I would love to hear them.

Danny Daniels

AndyBo · Jun 26, 2000

Is Perl available on this server? If it is, I have something I've used in the past that would probably work for you as well.
--
0 1 - Just my two bits

Annihilannic · Jun 26, 2000

This awk script does the trick:

-------------------- 8< -----------------------
#!/usr/bin/awk -f
BEGIN {
        # Lines seen before the first 'cvr=' line go to record.0
        FILEINDEX=0
        OUTFILENAME="record." FILEINDEX
}
/^cvr=/ {
        # New record: close the previous file and switch to the next name
        close(OUTFILENAME)
        FILEINDEX++
        OUTFILENAME="record." FILEINDEX
}
{
        # Every line, including the 'cvr=' line, goes to the current file
        print $0 > OUTFILENAME
}
-------------------- 8< -----------------------

Basically, every time it encounters a line beginning with 'cvr=' it opens a new output file. I tried to do it using '\nEOF\n' as a record separator, but it didn't seem to work... if anyone can tell me why I'd appreciate it!

Annihilannic.
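The original question asked for output files with a .tag extension, which the script above does not add. A minimal sketch of one way to do that, run from a shell: it assumes every record starts on a line beginning with cvr=, and the record.N.tag naming, the temporary directory, and the shortened sample data are all made up for illustration.

```sh
#!/bin/sh
# Sketch only: split a report into one .tag file per record.
# Assumption: each record starts on a line beginning with "cvr=".
# The names record.N.tag and source.txt are hypothetical.

workdir=$(mktemp -d)

# Small stand-in for the real 30,000-line source file.
cat > "$workdir/source.txt" <<'DATA'
cvr=none tfn=408-573-5542 fll=<<EOF
ABC Company
00402932 IN 05/26/00     16.00
EOF
cvr=none tfn=408-573-5542 fll=<<EOF
XYZ Company
00405461 IN 06/13/00    144.29
EOF
DATA

awk -v prefix="$workdir/record." '
/^cvr=/ {
    # Close the previous output so the number of open files stays small.
    if (out != "") close(out)
    out = prefix (++n) ".tag"
}
# Copy lines only once a record has started; earlier lines are skipped.
out != "" { print > out }
' "$workdir/source.txt"

ls "$workdir"
```

Closing each file before opening the next matters with around 500 outputs, since some awk implementations allow only a limited number of files open at once.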

abovebrd · Jun 26, 2000

Annihilannic, I can change the syntax of the record separator if that helps.

AndyBo, Perl is not loaded on this server, but installing Perl is not out of the question.

Danny Daniels

AndyBo · Jun 26, 2000

Annihilannic: "\nEOF\n" probably wouldn't work because what you've actually got is "^EOF\n", i.e. from the beginning of the line ("^"), look for "EOF" followed by a newline ("\n").

Danny: Installing a full-blown Perl might be overkill to solve a single problem. I'd go with the awk solution first, as awk will already be installed. If you're having problems getting the awk script to run, post back and we'll get into some Perl hacking.
--
0 1 - Just my two bits
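The idea of matching EOF at the start of a line can also be applied as an ordinary line pattern instead of a record separator. That sidesteps a portability issue as well: POSIX only specifies RS values of a single character (plus the empty string for paragraph mode), and treating a longer RS as a regular expression is an extension found in gawk and mawk. A minimal sketch, assuming each record runs from a cvr= line to a line containing only EOF; the part.N.tag names and the stray line in the sample are made up for illustration.

```sh
#!/bin/sh
# Sketch only: use line patterns for both the start and the end of a record.
# Assumption: records run from a "cvr=" line to a line that is exactly "EOF".

workdir2=$(mktemp -d)

# Tiny sample with one line that falls between two records.
printf '%s\n' \
    'cvr=none' 'ABC Company' 'EOF' \
    'stray line' \
    'cvr=none' 'XYZ Company' 'EOF' > "$workdir2/source.txt"

awk -v prefix="$workdir2/part." '
# Record start: pick the next output name.
/^cvr=/ { out = prefix (++n) ".tag" }

# Inside a record: copy the line, including the cvr= and EOF lines.
out != "" { print > out }

# Record end: close the file and stop copying until the next cvr= line.
/^EOF$/ && out != "" { close(out); out = "" }
' "$workdir2/source.txt"

ls "$workdir2"
```

Unlike a script that only watches for cvr=, this variant drops anything that appears between an EOF line and the next cvr= line, which may or may not be what is wanted.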

abovebrd · Jun 26, 2000

Thanks AndyBo, Annihilannic: I ran the awk script earlier today and it seemed to do the trick. In fact it worked great! Thanks guys.

Just one more question: can you recommend a good source for learning AWK? Maybe a good book?

Danny Daniels

AndyBo · Jun 27, 2000

I would say the "classic" Sed & Awk reference is "Sed & Awk" from O'Reilly. There are condensed highlights taken from this book in the "Unix in a Nutshell" and "Unix Power Tools" books, also from O'Reilly. It's probably better to go for the original, though, if you want to really get down and learn some Awk.
--
0 1 - Just my two bits
