A
Anthony
Hi
Hoping someone might be able to assist.
I have a csv file with incident records, which has been extracted from
a large database and provided to me. Each incident has a number of
details including date, time, location etc etc. They also have some
details on person/s involved, such as age, sex etc. Each incident has a
unique number (Report Number). There may be a number of rows for each
incident, depending if there was more than one person involved. In
these cases there may be 2 or more rows with the same report number,
but which will differ in just one or more field (ie second record might
be identical, but have a different age, sex etc indicating more than
one person was involved).
I am trying to work out the easiest way of working with this data. I am
not used to working with such a large dataset (about 400 000 records)
so usually I can use a formula to count unique records in Excel and use
a pivot table to manipulate the data. I figure that I should import the
data into an Access database and then use queries to manipulate, but am
wondering whether it is possible to use a similar function to that in
excel. (ie in excel I would create an additional column, and input a
formula so that for the first occurrence of a report number 1 is the
result, but thereafter it produces a zero. I then use this column as
the count field). That way I can for example count the number of unique
incidents that occurred in each day, in each time period etc etc.
I hope this explanation is reasonably clear, I can provide more
information, such as field names etc if this is necessary.
Any ideas? Thanks in advance.
Cheers, Anthony
Hoping someone might be able to assist.
I have a csv file with incident records, which has been extracted from
a large database and provided to me. Each incident has a number of
details including date, time, location etc etc. They also have some
details on person/s involved, such as age, sex etc. Each incident has a
unique number (Report Number). There may be a number of rows for each
incident, depending if there was more than one person involved. In
these cases there may be 2 or more rows with the same report number,
but which will differ in just one or more field (ie second record might
be identical, but have a different age, sex etc indicating more than
one person was involved).
I am trying to work out the easiest way of working with this data. I am
not used to working with such a large dataset (about 400 000 records)
so usually I can use a formula to count unique records in Excel and use
a pivot table to manipulate the data. I figure that I should import the
data into an Access database and then use queries to manipulate, but am
wondering whether it is possible to use a similar function to that in
excel. (ie in excel I would create an additional column, and input a
formula so that for the first occurrence of a report number 1 is the
result, but thereafter it produces a zero. I then use this column as
the count field). That way I can for example count the number of unique
incidents that occurred in each day, in each time period etc etc.
I hope this explanation is reasonably clear, I can provide more
information, such as field names etc if this is necessary.
Any ideas? Thanks in advance.
Cheers, Anthony