L
Luciano Paulino da Silva
Dear All,
I would like to split one text file (example bellow) containing
several packs of data with the head "MW-....." into one excel file
for which each sheet should contain the data bellow the "MW... "
separated into four columns. After that, I will have to count the
number of times that apear each one of the strings present in the
columns (For example, to the sheet MW-
Silks_1_10-071f53284b36f9841994574243ecb063, it appears on column 3
the string coil 4 times.
Finally, the output should be something like:
For column 3:
coil turn helix bend bridge sheet
MW-Silks_1_10-071f53284b36f9841994574243ecb063 4
1 5 0 0 0
MW-Silks_1_10-07ab959b2314cae1f575921c5b0f7bce 6
0 0 6 1 0
MW-Silks_1_10-0c045e18accfb4d3c78af87e0fbda543 4
1 0 3 0 13
File example (simplified since they have until 500 kb of data):
MW-Silks_1_10-071f53284b36f9841994574243ecb063
255 ALA coil
256 ALA coil
257 ALA coil
258 ALA coil
259 GLY turn
260 GLY helix (helix_alpha, helix1)
261 ALA helix (helix_alpha, helix1)
262 GLY helix (helix_alpha, helix1)
263 GLN helix (helix_alpha, helix1)
264 GLY helix (helix_alpha, helix1)
MW-Silks_1_10-07ab959b2314cae1f575921c5b0f7bce
60 ALA coil
61 GLY coil
62 GLN coil
63 GLY coil
64 GLY bend
65 TYR bend
66 GLU coil
67 GLY bend
68 PRO bend
69 GLY bend
70 ALA bend
71 GLY coil
72 GLN bridge
MW-Silks_1_10-0c045e18accfb4d3c78af87e0fbda543
36 GLY sheet (sheet1, strand1_1)
37 GLY sheet (sheet1, strand1_1)
38 ALA sheet (sheet1, strand1_1)
39 GLY sheet (sheet1, strand1_1)
40 GLN sheet (sheet1, strand1_1)
41 GLY sheet (sheet1, strand1_1)
42 GLY sheet (sheet1, strand1_1)
43 TYR sheet (sheet1, strand1_1)
44 GLY sheet (sheet1, strand1_1)
45 GLY coil
46 GLN coil
47 GLY turn
48 ALA bend
49 GLY coil
50 GLN coil
51 GLY bend
52 ALA bend
53 ALA sheet (sheet1, strand1_2)
54 ALA sheet (sheet1, strand1_2)
55 ALA sheet (sheet1, strand1_2)
56 ALA sheet (sheet1, strand1_2)
Somebody could help me?
I would like to split one text file (example bellow) containing
several packs of data with the head "MW-....." into one excel file
for which each sheet should contain the data bellow the "MW... "
separated into four columns. After that, I will have to count the
number of times that apear each one of the strings present in the
columns (For example, to the sheet MW-
Silks_1_10-071f53284b36f9841994574243ecb063, it appears on column 3
the string coil 4 times.
Finally, the output should be something like:
For column 3:
coil turn helix bend bridge sheet
MW-Silks_1_10-071f53284b36f9841994574243ecb063 4
1 5 0 0 0
MW-Silks_1_10-07ab959b2314cae1f575921c5b0f7bce 6
0 0 6 1 0
MW-Silks_1_10-0c045e18accfb4d3c78af87e0fbda543 4
1 0 3 0 13
File example (simplified since they have until 500 kb of data):
MW-Silks_1_10-071f53284b36f9841994574243ecb063
255 ALA coil
256 ALA coil
257 ALA coil
258 ALA coil
259 GLY turn
260 GLY helix (helix_alpha, helix1)
261 ALA helix (helix_alpha, helix1)
262 GLY helix (helix_alpha, helix1)
263 GLN helix (helix_alpha, helix1)
264 GLY helix (helix_alpha, helix1)
MW-Silks_1_10-07ab959b2314cae1f575921c5b0f7bce
60 ALA coil
61 GLY coil
62 GLN coil
63 GLY coil
64 GLY bend
65 TYR bend
66 GLU coil
67 GLY bend
68 PRO bend
69 GLY bend
70 ALA bend
71 GLY coil
72 GLN bridge
MW-Silks_1_10-0c045e18accfb4d3c78af87e0fbda543
36 GLY sheet (sheet1, strand1_1)
37 GLY sheet (sheet1, strand1_1)
38 ALA sheet (sheet1, strand1_1)
39 GLY sheet (sheet1, strand1_1)
40 GLN sheet (sheet1, strand1_1)
41 GLY sheet (sheet1, strand1_1)
42 GLY sheet (sheet1, strand1_1)
43 TYR sheet (sheet1, strand1_1)
44 GLY sheet (sheet1, strand1_1)
45 GLY coil
46 GLN coil
47 GLY turn
48 ALA bend
49 GLY coil
50 GLN coil
51 GLY bend
52 ALA bend
53 ALA sheet (sheet1, strand1_2)
54 ALA sheet (sheet1, strand1_2)
55 ALA sheet (sheet1, strand1_2)
56 ALA sheet (sheet1, strand1_2)
Somebody could help me?