A
AutomationHelpinSF
I have several hundred e-mail messages a day coming in, each in one of
roughly 17-20 different Word "format" files. My goal is to extract the same
data from each of these files so I can put them into a database for further
business processes.
For example, from each file I want to extract a Name, a numerical Score, and
a character Code. In each of the format files, the data is in a different
place in the document. Some have them in a sentence structure in the first
paragraph, some have a list format, etc. As it stands now a human has to
read it an extract the data.
What I would like is a way for a user to take a sample file that's in the
first format, make a selection and somehow "tag" it as if to say "this area
I've selected should be the Score". Do that for all the data items in the
file. Then, have a tool / macros which can then process any file in that
format, extract those selected areas, and generate some other file with
name/value pairs (XML, CSV, etc.)
I could instead write code which could extract the data, however if we get a
new file format that means writing new code. We'd like to delegate that to a
data entry person who can create a "mask" for the new format (they change
frequently as we're processing data from many vendors and there are no
standards in this field).
I am not a hardcore Outlook / Word / Office automation person and so I'm
looking for suggestions as to approach, tools which could be purchased which
would accomplish this, suggestions, etc.
If people have suggestions about more appropriate places to ask, I'd
appreciate that as well.
I appreciate the help, Steve.
roughly 17-20 different Word "format" files. My goal is to extract the same
data from each of these files so I can put them into a database for further
business processes.
For example, from each file I want to extract a Name, a numerical Score, and
a character Code. In each of the format files, the data is in a different
place in the document. Some have them in a sentence structure in the first
paragraph, some have a list format, etc. As it stands now a human has to
read it an extract the data.
What I would like is a way for a user to take a sample file that's in the
first format, make a selection and somehow "tag" it as if to say "this area
I've selected should be the Score". Do that for all the data items in the
file. Then, have a tool / macros which can then process any file in that
format, extract those selected areas, and generate some other file with
name/value pairs (XML, CSV, etc.)
I could instead write code which could extract the data, however if we get a
new file format that means writing new code. We'd like to delegate that to a
data entry person who can create a "mask" for the new format (they change
frequently as we're processing data from many vendors and there are no
standards in this field).
I am not a hardcore Outlook / Word / Office automation person and so I'm
looking for suggestions as to approach, tools which could be purchased which
would accomplish this, suggestions, etc.
If people have suggestions about more appropriate places to ask, I'd
appreciate that as well.
I appreciate the help, Steve.