| 10 | | 1. Pick a pathway of a specific organism, say Glycolysis / Gluconeogenesis pathway of Homo sapiens (human). |
| 11 | | 1. Get all the amino acid sequences of that pathway component. |
| 12 | | 1. Run blast for each sequence against UniProt(SwissProt / TrEMBL). |
| 13 | | 1. Group the blast results according to biological species (s1, s2, ..., sn). |
| 14 | | |
| | 12 | === Part I === |
| | 13 | 1. Pick a pathway of a specific organism, say Glycolysis / Gluconeogenesis pathway of Homo sapiens (human). |
| | 14 | 1. Get all the amino acid sequences of that pathway component (c''1'', c''2'', ..., c''m''). |
| | 15 | === Part II === |
| | 16 | 1. Run blast for each sequence against !UniProt(!SwissProt / TrEMBL). |
| | 17 | 1. Group the blast results according to biological species (s''1'', s''2'', ..., s''n''). |
| | 18 | 1. For each pathway component (c''i''), pick the top hit from each species. |
| | 19 | * At this step, you have (at most) an ''m'' compoents X ''n'' species array of protein sequences: '''Homologous Compoenents Matrix''' (HCM). |
| | 20 | === Part III === |
| | 21 | 1. Pick one column (species) of the HCM (s''j'') For each component of that column, run blast against PDB sequences. |
| | 22 | * You will have a list of PDBID and Chain ID as a result of this blast search. |
| | 23 | 1. If the blast results of two components share the same PDB ID but do not share the same Chain ID, they are judged to be in contact. |
| | 24 | * By regarding each pathway component as a node, the inferred physical contacts between two components correspond to an edge in the network. |
| | 25 | 1. Iterate the above two steps for each column of the HCM. |
| | 26 | 1. Finally, show the list of inferred PPI of the pathway for each species. |
| | 27 | |