DDBJ-KEGG-PDBj/Results

Version 1 (modified by akinjo, 15 years ago)

--

A DDBJ-KEGG-PDBj workflow: from pathways to protein-protein interactions

Members

  • Yasumasa Shigemoto (WABI/SABI; DDBJ, Japan)
  • Akira Kinjo (PDBj, Japan)
  • Soichi Ogishima (Tokyo Medical and Dental Univ, Japan)
  • Masumi Itoh (Hokkaido Univ)

The objective

Given a KEGG pathway ID of a specific species,

  1. Identify homologous proteins of the pathway.
  2. Infer possible protein-protein interactions between components of the homologous pathways.

How?

  1. Input: KEGG pathway ID
  2. Get amino acid sequences of all the proteins involved in the given pathway. (KEGG API)
  3. For each protein sequence, run BLAST against the UniProt (SwissProt / !TrEMBL) database to identify its homologs. (WABI)
  4. Sort the BLAST results according to species. (a local program)
    • At this point, we will have a phylogenetic profile.
  5. For each BLAST hit (a UniProt entry), run BLAST against PDB. (PDBj SeqNavi API).
  6. Find physical interactions between different components. (a local program)
    • That is, if two (UniProt) sequences are homologous to different chains of the same PDB entry, they are predicted to be in physical contact.
  7. Output the phylogenetic profile and the predicted species-wise PPI.

An example

  1. KEGG pathway: path:eco00020
  2. Phylogenetic profile:

Attachments