PDF format

Do you know if WM can read and parse data from a structured PDF (Acrobat) file? We receive invoices from the supplier in PDF format and would like to process them with WM.
Thanks for all comments.

I would recommend writing a custom content handler. PDF, like the Excel or MS Word file formats, is not supported out of the box in WM. But perhaps somebody has created a handler or adapter that can do that.


Have you managed to sort out your problem? I’m facing a
similar situation.

Hi Gabor - IIRC, PDF is a page description format closely
related to PostScript, and it can be parsed by the open-source
Ghostscript package. Check out: http://www.cs.wisc.edu/~ghost/
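
As a rough illustration of the Ghostscript route, here is a minimal Python sketch that shells out to Ghostscript and captures the text of a PDF. It assumes a reasonably recent Ghostscript build that includes the `txtwrite` output device and that the executable is on the PATH as `gs` (on Windows it is usually `gswin64c`). The function names `build_gs_command` and `extract_text` are made up for this example, and nothing here is specific to WM: it only gets plain text out of the file, and picking invoice fields out of that text is still up to you.

```python
import subprocess


def build_gs_command(pdf_path, gs_binary="gs"):
    """Return the Ghostscript argument list that dumps a PDF's text to stdout.

    gs_binary is an assumption: "gs" on Unix-like systems, typically
    "gswin64c" on Windows.
    """
    return [
        gs_binary,
        "-q",                 # suppress the startup banner so stdout holds only text
        "-dNOPAUSE",          # do not prompt between pages
        "-dBATCH",            # exit once the input file has been processed
        "-dSAFER",            # restrict file access while interpreting the PDF
        "-sDEVICE=txtwrite",  # text-extraction output device
        "-sOutputFile=-",     # write the extracted text to stdout
        pdf_path,
    ]


def extract_text(pdf_path, gs_binary="gs"):
    """Run Ghostscript on pdf_path and return the extracted text as a string.

    Raises subprocess.CalledProcessError if Ghostscript exits with an error,
    and FileNotFoundError if the Ghostscript executable is not installed.
    """
    result = subprocess.run(
        build_gs_command(pdf_path, gs_binary),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling `extract_text("invoice.pdf")` (a hypothetical file name) would return the text in roughly page layout order. The result could then be saved as a flat file or passed on to whatever service does the actual invoice mapping.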

Hi Gabor,

The problem you have just described can be easily solved using Itemfield’s ContentMaster. ContentMaster is fully equipped to parse and serialize various binary and textual formats, both structured and unstructured.

Itemfield is a new webMethods partner. Its unique “Example-Based Parsing” technology offers an intuitive graphical development environment for easy creation of parser scripts.

ContentMaster is easily integrated into WM. See http://www.itemfield.com/solutions/sol_webmeth.shtml for more details.

Please feel free to contact me with any questions.

Meitav Harpaz