Intelligent Computing Lab.
Bioinformatics in NCTU, Taiwan.
Home

ProLoc-GO:
  Utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization

Home | help || Nov 1 2007 

ProLoc-GO: Utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization


Content:

Input format for Proloc-GO

List of locations for human proteins

List of locations for eukaryotic proteins


Input format for Proloc-GO

An example of FASTA format with known Accession Number:
There must "not" be "space" after ">" for Prolog-GO to identify this sequence as sequence with Accession Number
>O60563;
MEGERKNNNKRWYFTREQLENSPSRRFGVDPDKELSYRQQAANLLQDMGQRLNVSQLTIN TAIVYMHRFYMIQSFTQFPGNSVAPAALFLAAKVEEQPKKLEHVIKVAHTCLHPQESLPD TRSEAYLQQVQDLVILESIILQTLGFELTIDHPHTHVVKCTQLVRASKDLAQTSYFMATN SLHLTTFSLQYTPPVVACVCIHLACKWSNWEIPVSTDGKHWWEYVDATVTLELLDELTHE FLQILEKTPNRLKRIWNWRACEAAKKTKADDRGTDEKTSEQTILNMISQSSSDTTIAGLM SMSTSTTSAVPSLPVSEESSSNLTSVEMLPGKRWLSSQPSFKLEPTQGHRTSENLALTGV DHSLPQDGSNAFISQKQNSKSVPSAKVSLKEYRAKHAEELAAQKRQLENMEANVKSQYAY AAQNLLSHHDSHSSVILKMPIEGSENPERPFLEKADKTALKMRIPVAGGDKAASSKPEEI KMRIKVHAAADKHNSVEDSVTKSREHKEKHKTHPSNHHHHHNHHSHKHSHSQLPVGTGNK RPGDPKHSSQTSNLAHKTYSLSSSFSSSSSTRKRGPSEETGGAVFDHPAKIAKSTKSSSL NFSFPSLPTMGQMPGHSSDTSGLSFSQPSCKTRVPHSKLDKGPTGANGHNTTQTIDYQDT VNMLHSLLSAQGVQPTQPTAFEFVRPYSDYLNPRSGGISSRSGNTDKPRPPPLPSEPPPP LPPLPK

An example of FASTA format without known Accession Number:
The "space" after ">" is required for Prolog-GO to identify this sequence as sequence without Accession Number
> seq1
MEGERKNNNKRWYFTREQLENSPSRRFGVDPDKELSYRQQAANLLQDMGQRLNVSQLTIN TAIVYMHRFYMIQSFTQFPGNSVAPAALFLAAKVEEQPKKLEHVIKVAHTCLHPQESLPD TRSEAYLQQVQDLVILESIILQTLGFELTIDHPHTHVVKCTQLVRASKDLAQTSYFMATN SLHLTTFSLQYTPPVVACVCIHLACKWSNWEIPVSTDGKHWWEYVDATVTLELLDELTHE FLQILEKTPNRLKRIWNWRACEAAKKTKADDRGTDEKTSEQTILNMISQSSSDTTIAGLM SMSTSTTSAVPSLPVSEESSSNLTSVEMLPGKRWLSSQPSFKLEPTQGHRTSENLALTGV DHSLPQDGSNAFISQKQNSKSVPSAKVSLKEYRAKHAEELAAQKRQLENMEANVKSQYAY AAQNLLSHHDSHSSVILKMPIEGSENPERPFLEKADKTALKMRIPVAGGDKAASSKPEEI KMRIKVHAAADKHNSVEDSVTKSREHKEKHKTHPSNHHHHHNHHSHKHSHSQLPVGTGNK RPGDPKHSSQTSNLAHKTYSLSSSFSSSSSTRKRGPSEETGGAVFDHPAKIAKSTKSSSL NFSFPSLPTMGQMPGHSSDTSGLSFSQPSCKTRVPHSKLDKGPTGANGHNTTQTIDYQDT VNMLHSLLSAQGVQPTQPTAFEFVRPYSDYLNPRSGGISSRSGNTDKPRPPPLPSEPPPP LPPLPK

[Back to content]


List of locations for human proteins

"Centriole", "Cytoplasm", "Cytoskeleton", "Endoplasmic reticulum", "Extracellular", "Golgi apparatus", "Lysosome", "Microsome", "Mitochondrion", "Nucleus", "Peroxisome", "Plasma membrane"

[Back to content]


List of locations for eukaryotic proteins

"Centriole", "Cytoplasm", "Cytoskeleton", "Endoplasmic reticulum", "Extracellular", "Golgi apparatus", "Lysosome", "Chloroplast", "Mitochondrion", "Nucleus", "Peroxisome", "Plasma membrane", "Cell wall", "Cyanelle", "Vacuole", "Plastid"

[Back to content]

-->