P
pietlinden
I have the onerous task of querying horribly denormalized databases.
I'm trying to convince the boss that normalization is their friend...
The databases at present are basically one column per question/
demographic etc. So they're all for the most part flat files. But not
always (just to screw me up). If not, I could pretty much union the
sets into summarizable form.
What everyone is afraid of is that the records won't be so easy to
look at... (They have NO forms or reports in their "databases") Also
no control over data entry, either, so querying is a HUGE nightmare.
Basically, I have to re-do query after query because some field names
change or the table structures do...
Okay... how would I show that I can from normalized to repeating
fields? For example,
PatientID WeekNumber TestName Result
To
Columns like this:
(PatientID, TestName_WeekNumber)
Then the value under TestName_WeekNumber
they basically have "sideways" tables (wide and shallow) instead of
(narrow and deep).
For me, there's just no contest.queries are portable etc etc...
I can build a single front end with queries that can look at different
databases and run the same routines... I get all that. The problem is
selling this to the people that have to look at this. I think I might
just build a properly normalized database with some sample data that
mimics what kind of data they're collecting and show them how easy and
portable the queries are (well, assuming that the structures remain
nearly the same)
The problem is that it's treatment research, so time is a significant
dimension.
Any suggestions are welcome.
Thanks!
Pieter
I'm trying to convince the boss that normalization is their friend...
The databases at present are basically one column per question/
demographic etc. So they're all for the most part flat files. But not
always (just to screw me up). If not, I could pretty much union the
sets into summarizable form.
What everyone is afraid of is that the records won't be so easy to
look at... (They have NO forms or reports in their "databases") Also
no control over data entry, either, so querying is a HUGE nightmare.
Basically, I have to re-do query after query because some field names
change or the table structures do...
Okay... how would I show that I can from normalized to repeating
fields? For example,
PatientID WeekNumber TestName Result
To
Columns like this:
(PatientID, TestName_WeekNumber)
Then the value under TestName_WeekNumber
they basically have "sideways" tables (wide and shallow) instead of
(narrow and deep).
For me, there's just no contest.queries are portable etc etc...
I can build a single front end with queries that can look at different
databases and run the same routines... I get all that. The problem is
selling this to the people that have to look at this. I think I might
just build a properly normalized database with some sample data that
mimics what kind of data they're collecting and show them how easy and
portable the queries are (well, assuming that the structures remain
nearly the same)
The problem is that it's treatment research, so time is a significant
dimension.
Any suggestions are welcome.
Thanks!
Pieter