Skip to main content
padlock icon - secure page this page is secure

pLoc_bal-mPlant: Predict Subcellular Localization of Plant Proteins by General PseAAC and Balancing Training Dataset

Buy Article:

$68.00 + tax (Refund Policy)

Knowledge of protein subcellular localization is vitally important for both basic research and drug development. With the avalanche of protein sequences emerging in the post-genomic age, it is highly desired to develop computational tools for timely and effectively identifying their subcellular localization based on the sequence information alone. Recently, a predictor called “pLoc-mPlant” was developed for identifying the subcellular localization of plant proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with multi-label systems in which some proteins, called “multiplex proteins”, may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mPlant was trained by an extremely skewed dataset in which some subsets (i.e., the protein numbers for some subcellular locations) were more than 10 times larger than the others. Accordingly, it cannot avoid the biased consequence caused by such an uneven training dataset. To overcome such biased consequence, we have developed a new and bias-free predictor called pLoc_bal-mPlant by balancing the training dataset. Cross-validation tests on exactly the same experimentconfirmed dataset have indicated that the proposed new predictor is remarkably superior to pLoc-mPlant, the existing state-of-the-art predictor in identifying the subcellular localization of plant proteins. To maximize the convenience for the majority of experimental scientists, a user-friendly web-server for the new predictor has been established at, by which users can easily get their desired results without the need to go through the detailed mathematics.
No References
No Citations
No Supplementary Data
No Article Media
No Metrics

Keywords: 5-step rules; Chou's intuitive metrics; ML-GKR; Multi-label system; PseAAC; balance treatment; plant proteins

Document Type: Review Article

Publication date: September 1, 2018

This article was made available online on December 27, 2018 as a Fast Track article with title: "pLoc_bal-mPlant: Predict Subcellular Localization of Plant Proteins by General PseAAC and Balancing Training Dataset".

More about this publication?
  • Current Pharmaceutical Design publishes timely in-depth reviews covering all aspects of current research in rational drug design. Each issue is devoted to a single major therapeutic area. A Guest Editor who is an acknowledged authority in a therapeutic field has solicits for each issue comprehensive and timely reviews from leading researchers in the pharmaceutical industry and academia.

    Each thematic issue of Current Pharmaceutical Design covers all subject areas of major importance to modern drug design, including: medicinal chemistry, pharmacology, drug targets and disease mechanism.
  • Editorial Board
  • Information for Authors
  • Subscribe to this Title
  • Ingenta Connect is not responsible for the content or availability of external websites
  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content
Cookie Policy
Cookie Policy
Ingenta Connect website makes use of cookies so as to keep track of data that you have filled in. I am Happy with this Find out more